Lifecycle Management Service for the compute nodes of Tier1, Tier2 sites (JINR)

5 Jul 2021, 16:30
15m
403 or Online - https://jinr.webex.com/jinr/j.php?MTID=mf93df38c8fbed9d0bbaae27765fc1b0f

403 or Online - https://jinr.webex.com/jinr/j.php?MTID=mf93df38c8fbed9d0bbaae27765fc1b0f

Sectional reports 2. Research infrastructure Distributed computing applications

Speaker

Alexandr Baranov ((JINR))

Description

Megascience experiments, such as CMS, ATLAS, ALICE, MPD, BM@N, etc., are served at the Meshcheryakov Laboratory of Information Technologies (MLIT) of the Joint Institute for Nuclear Research (JINR) using the available computing infrastructure. To ensure the guaranteed and stable operation of the infrastructure under constant load conditions, the centralized and timely maintenance of software and the rapid introduction of new compute nodes are required. As a solution to this task, a service was created; its purpose is to automate the process related to the software maintenance and commissioning of compute nodes.
The report will give an overview of the service (LCMS) and its components: centralized configuration management of the operating system and programs installed on the compute nodes of the Tier1, Tier2 sites (JINR); continuous integration engine for the automatic validation and loading of puppet manifests from the software repository; service (LCMS) component performance and status monitoring; mechanism for detecting, viewing and comparing security compliance.

Primary author

Co-authors

Presentation materials

There are no materials yet.