Comprehensive monitoring, automation, and analisys system for the computing cluster at NRC «Kurchatov Institute» - IHEP

11 Jul 2025, 10:00
30m
MLIT Conference Hall

MLIT Conference Hall

Plenary talk Plenary

Speaker

Viktor Kotliar (Institute for High Energy Physics named by A.A. Logunov of National Research Center “Kurchatov Institute”)

Description

The computing cluster at the NRC «Kurchatov Institute»—IHEP is a complex system integrating multiple diverse components and technologies. These include distributed computing, high-performance computing , highly reliable uninterruptible power supply systems, precision cooling systems, and information and communication technologies. Monitoring the system's status, analyzing its behavior, and managing its operation present a highly challenging task that can be broken down into several subtasks. This paper describes the current state of the system for collecting and analyzing the computing cluster’s parameters and its management.

Author

Viktor Kotliar (Institute for High Energy Physics named by A.A. Logunov of National Research Center “Kurchatov Institute”)

Co-authors

Anna Kotliar (IHEP) Maria Shemeiko (Institute for High Energy Physics named by A.A. Logunov of National Research Center “Kurchatov Institute”) Valerii Morozov (Institute for High Energy Physics named by A.A. Logunov of National Research Center “Kurchatov Institute”) Victor Gusev (Institute for High Energy Physics named by A.A. Logunov of National Research Center “Kurchatov Institute”) Victoria Ezhova (IHEP)

Presentation materials

There are no materials yet.