Speaker
Description
Managing a complex Linux-based production environment of the computing center is a highly challenging operational task. Such tasks require a high level of automation in distributed multi-component systems, which must be applied to complex operational workflows. There are several approaches to achieve this goal, including: creating operational scripts based on Linux shell commands and programming languages, using specialized software for specific tasks (backups, configuration, management) or employing advanced orchestration tools. Among these solutions, the StackStorm automation engine stands out. This paper describes the use of this platform for orchestrating operational workflows such as distributed backups, distributed system upgrades and distributed system administration in the Linux-based computing center at NRC «Kurchatov Institute» – IHEP.