Description
The Meshcheryakov Laboratory of Information Technologies (MLIT) of the Joint Institute for Nuclear Research (JINR) provides the computing infrastructure that serves megascience experiments such as CMS, ATLAS, ALICE, MPD, and BM@N. Keeping this infrastructure stable under constant load requires centralized, timely software maintenance and the rapid commissioning of new compute nodes. To meet this need, a service was created that automates software maintenance and the commissioning of compute nodes.
The report will give an overview of the service (LCMS) and its components: centralized configuration management of the operating system and software installed on the compute nodes of the JINR Tier1 and Tier2 sites; a continuous integration engine that automatically validates Puppet manifests and loads them from the software repository; monitoring of the performance and status of the LCMS components; and a mechanism for detecting, viewing, and comparing security compliance.
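
To illustrate the kind of check a continuous integration step for Puppet manifests might run, the following is a minimal sketch and not the LCMS implementation. It assumes the standard `puppet` command-line tool is available on the PATH and uses its `puppet parser validate` syntax check; the function names (`find_manifests`, `validation_command`, `validate_repository`) and the repository layout are hypothetical.

```python
import subprocess
from pathlib import Path


def find_manifests(repo_root):
    """Return all Puppet manifest files (*.pp) under repo_root, sorted."""
    return sorted(Path(repo_root).rglob("*.pp"))


def validation_command(manifest):
    """Build the syntax-check command for a single manifest."""
    return ["puppet", "parser", "validate", str(manifest)]


def validate_repository(repo_root, run=subprocess.run):
    """Syntax-check every manifest and return the files that failed.

    The `run` parameter exists only so the check can be exercised
    without a Puppet installation (an assumption of this sketch).
    """
    failed = []
    for manifest in find_manifests(repo_root):
        result = run(validation_command(manifest), capture_output=True, text=True)
        if result.returncode != 0:
            failed.append(manifest)
    return failed
```

In such a setup, a CI job could call `validate_repository` on a checkout of the repository and load the manifests only when the returned list is empty.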