The large research infrastructure project “Multifunctional Information and Computing Complex (MICC) of JINR” is an integral part of the Seven-Year Plan for the development of JINR for 2024-2030. A large research infrastructure project is justified and timely, given the decisive importance of the continuous development of the information and computing infrastructure, which will allow JINR to...
Quantum many-body control is among the most challenging problems in the field of quantum technologies, yet it is essential for the further development of this vast field. In this work, we propose a novel approach to solving control problems for many-body quantum systems. The key feature of our approach is the ability to run tens of thousands of iterations of a gradient-based optimization...
Building a computer cluster for bioinformatics and biomedical research is a very complex task. Such a cluster has to seamlessly combine the different software stacks used for computations, and it should provide the easiest way of organizing complex workflows for scientific research. On the other hand, it has to be as simple as possible to use, to allow researchers with no or...
Quasiprobability distributions associated with quantum states play the same role as probability distribution functions in classical statistical physics, but with the key difference that their quantum counterparts can take negative values for some states. Because of this, all states are divided into classes, the first one comprising the "classical states", whose quasiprobability distributions are...
The CVE, CWE, and CAPEC databases and their relationships are briefly introduced. The focus of this paper is on formalization, and more specifically on weakness formalization. Software weaknesses are described as formatted text. There is no widely accepted formal notation for weakness specification. This paper shows how Z-notation can be used for the formal specification of CWE-119.
Quantum computing performance depends on the properties of the underlying physical qubits. The depth of an algorithm is limited by the decoherence of the qubits. In this respect, the design of algorithms that quantify the decoherence of qubits is of particular interest. Qubit models are necessary in order to fit the data. We present the performance of our SU2 C++ package for polymorphically...
Queues in modern computing centers are such that a user's task can wait in the queue for weeks before it starts. However, even if the queuing system gives a forecast for the start of a task, the forecast often turns out to be incorrect. This happens because, during the week the task spends in the queue, events occur that change the moments at which tasks start. We know...
The Data Knowledge Base (DKB) project is aimed at knowledge acquisition and metadata integration; it consists of the DKB Python library and an API service. Thanks to its ETL workflow engine, it allows building a single system with integrated and pre-processed information on top of a distributed metadata infrastructure. Such a system provides fast and efficient responses for summary reports, monitoring tasks...
The approach of integrating Cluster Management and Cluster Simulation systems addresses the challenges of High-Performance Computing (HPC) cluster management by leveraging simulation to enhance decision-making in case of failures. Foliage's team has extensive experience in building and managing HPC clusters; however, uncertainties regarding cluster management behaviour during failures...
The physical concept of elementary and composite systems forms the pillar on which our understanding of quantum phenomena stands. The present talk aims to discuss a complementary character of the description of elementary and composite finite-dimensional quantum systems within the modern phase-space formulation of quantum mechanics.
We will give a generic method of constructing the...
Containerization technology for Linux systems appeared many years ago with the OpenVZ and LXC systems. However, wider usage came with the advent of the Docker system and the addition of new container support mechanisms to the Linux kernel itself. Originating as a system for servers, containerization technology has gradually moved into the user space as a universal mechanism for distributing and...
The paper simulates the transfer of entangled states along a chain of tryptophans a) in a cell's microtubule, b) connected by dipole-dipole interaction. The conditions under which migration of the entangled states in the microtubule is possible are obtained.
The results of the work allow us to talk about the signal function of microtubule tryptophans...
UDC 519.6+004.42
HIGH-PERFORMANCE COMPUTING ON MATH COPROCESSORS AND GRAPHICS ACCELERATORS USING PYTHON
S. V. Borzunov, A. V. Romanov, S. D. Kurgalin, K. O. Petrishchev
Voronezh State University
This paper considers the use of Python programming language modules for solving resource-intensive problems. Using the example of multiplication...
Modern computing centers consist of many engineering systems which provide working conditions for complex computing hardware. Some of these systems do not have built-in monitoring components, or such components are very expensive, which makes the systems difficult to monitor. Meanwhile, there are many open source software libraries available for computer vision and image recognition. One of them is...
Division of Computational Physics, MLIT, JINR
At the Meshcheryakov Laboratory of Information Technologies (MLIT) of the Joint Institute for Nuclear Research (JINR), Slurm is used as the batch system for processing data coming from the CMS, ATLAS, ALICE, MPD, BM@N and other experiments. A Slurm job is executed in the operating system (OS) directly on a physical server (a compute node, CN)....
As part of organizing simplified, web-based access to the computing resources of the central IHEP cluster, a system architecture based on the free software Apache Guacamole was developed. Apache Guacamole is a clientless remote desktop gateway supporting protocols such as SSH, VNC and RDP via a web browser. VNC and RDP support is implemented on the server side using native...
The status of the MPDROOT framework for current and future tasks of the MPD experiment is considered. The experience of using the DIRAC interware for mass production and reconstruction of simulated data for the MPD experiment is also reviewed.
As in other large particle collision experiments, the topic of distributed event processing and computing is extremely relevant in the BM@N experiment, the first ongoing experiment of the NICA project, because of the heavy data flow, the sequential processing of which would take hundreds of years. In the last run of the BM@N experiment alone, about half a petabyte of raw data was collected, and when...
The SPD (Spin Physics Detector) is a planned spin physics experiment at the second interaction point of the NICA collider, which is under construction at JINR. The main goal of the experiment is to test the basics of QCD via the study of the polarized structure of the nucleon and spin-related phenomena in the collision of longitudinally and transversely polarized protons and deuterons at the...
For many years, the Worldwide LHC Computing Grid (WLCG) has provided
the distributed computing infrastructure for the CERN Large Hadron Collider
experiments. During that time, it has seen steady evolution in technologies
as well as growth, to deal with ever increasing data rates. Those trends
need to be made to continue, to allow the WLCG to take on High-Luminosity
LHC data volumes as...
The JINR grid infrastructure was created at the Meshcheryakov Laboratory of Information Technologies and successfully developed from year to year in accordance with the rapid development of information technologies, computing equipment and computing technologies, satisfying user needs. Thus, the participation of JINR scientists in the experiments at the Large Hadron Collider (LHC) at CERN...
As part of the PIK nuclear reactor reconstruction project, the PIK Data Centre was commissioned in 2017. After more than five years of successful operation we would like to share our experience in one of the crucial parts of business continuity: monitoring. PIK Data Centre monitoring covers everything from engineering systems such as cooling machines to storage and computing nodes, jobs and...
The WLCG Tier-2 computing center at NRC "Kurchatov Institute" - IHEP has been participating in the Worldwide LHC Computing Grid since its very beginning in 2003. Over twenty years it has become one of the biggest WLCG Tier-2 centers in Russia. The Ru-Protvino-IHEP Grid site provides computing resources for LHC experiments in high energy physics such as ATLAS, ALICE, CMS, LHCb and internal...
Discrete Fourier methods suffer from the well-known Gibbs phenomenon because of their limited time window. One solution to this problem has been apodisation, a truncation of the time window that softens the edges. Still, due to discretisation such methods are imperfect; the Fourier apodisation reported here alleviates this aspect. Although Fourier-space apodisation is known, no consistent approach...
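As a point of reference for the time-window apodisation mentioned in this abstract (not the Fourier-space method it reports), the following minimal Python sketch tapers a signal with a Hann window before a naive DFT and compares the spectral leakage far from the peak. The signal length, the off-bin frequency, the choice of the Hann window and all names are assumptions made for illustration only.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Naive O(N^2) discrete Fourier transform; returns |X[k]| for k = 0..N-1."""
    n = len(samples)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(samples)))
        for k in range(n)
    ]


def hann_window(n):
    """Symmetric Hann window: zero at both edges, one in the middle."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


N = 64            # number of samples (assumed)
FREQ_BINS = 10.5  # frequency between two DFT bins, so leakage is maximal (assumed)

signal = [math.sin(2.0 * math.pi * FREQ_BINS * i / N) for i in range(N)]
window = hann_window(N)
tapered = [w * x for w, x in zip(window, signal)]

spectrum_rect = dft_magnitudes(signal)   # abrupt (rectangular) time window
spectrum_hann = dft_magnitudes(tapered)  # apodised time window

# Total magnitude in bins far away from the spectral peak near bin 10.5.
far_bins = range(20, 32)
leak_rect = sum(spectrum_rect[k] for k in far_bins)
leak_hann = sum(spectrum_hann[k] for k in far_bins)

print(f"leakage, rectangular window: {leak_rect:.4f}")
print(f"leakage, Hann window:        {leak_hann:.4f}")
```

The tapered signal trades a wider main lobe for much weaker far sidelobes, which is the usual motivation for softening the window edges.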
In this work we consider the planar three-body problem with zero angular momentum symmetric initial configuration and bodies with equal masses. We are interested in special periodic orbits called choreographies. A choreography is a periodic orbit in which the three bodies move along one and the same trajectory with a time delay of T/3, where T is the period of the solution. Such an orbit is...
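The choreography condition stated above can be checked numerically for any sampled orbit. The sketch below is only a toy illustration of the definition: it tests whether body k at time t coincides with body 0 at time t + kT/3. The function names, the sign convention of the delay, and the circular test orbit (three bodies equally spaced on one circle, which does not have zero angular momentum and is not one of the orbits studied in this work) are assumptions.

```python
import math


def is_choreography(orbit, period, n_samples=300, tol=1e-9):
    """Check r_k(t) == r_0(t + k*T/3) for k = 1, 2 on a uniform time grid.

    orbit(k, t) must return the planar position (x, y) of body k at time t.
    """
    for i in range(n_samples):
        t = period * i / n_samples
        for k in (1, 2):
            xk, yk = orbit(k, t)
            x0, y0 = orbit(0, t + k * period / 3.0)
            if math.hypot(xk - x0, yk - y0) > tol:
                return False
    return True


def lagrange_circle(k, t, period=2.0 * math.pi):
    """Toy orbit: three bodies equally spaced on the unit circle."""
    phase = 2.0 * math.pi * t / period + 2.0 * math.pi * k / 3.0
    return (math.cos(phase), math.sin(phase))


def nested_circles(k, t, period=2.0 * math.pi):
    """Counterexample: each body moves on its own circle of radius 1 + k."""
    phase = 2.0 * math.pi * t / period + 2.0 * math.pi * k / 3.0
    return ((1 + k) * math.cos(phase), (1 + k) * math.sin(phase))


print(is_choreography(lagrange_circle, 2.0 * math.pi))
print(is_choreography(nested_circles, 2.0 * math.pi))
```

In practice the `orbit` callable would interpolate a numerically integrated three-body solution, and the tolerance would be chosen to match the integration accuracy.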
We present our results on updating the middleware of Russian GRID sites to be able to continue processing ALICE data in the future, including the HL stage of the Large Hadron Collider operation. We will share our experience with one of the GRID sites and discuss some practical cases of scaling the updated middleware to other Russian sites in 2022-2023.
"This work is supported by the ...
Every year the ATLAS experiment produces several billion event records in raw and other formats. The data are spread among hundreds of computing Grid sites around the world. The EventIndex is the complete catalogue of all ATLAS real and simulated events, keeping the references to all permanent files that contain a given event in any processing stage; its implementation has been substantially...
Particle-in-Cell (PIC) simulation of high-beta plasmas in an axisymmetric mirror machine is of interest because of a new proposal for a plasma confinement regime with extremely high pressure, equal to the pressure
of the magnetic field, so-called diamagnetic confinement. The results of simulations can be used for the development of aneutronic fusion.
In this work, we will show our latest...
At the Frank Laboratory of Neutron Physics, the IBR-2M periodic pulsed reactor has been in operation since 2012 as a JINR basic facility, having replaced the IBR-2 reactor after the latter reached the end of its service life. The reactor generates powerful neutron pulses 200 μs wide at a frequency of 5 Hz and an average power of 2 MW. During IBR-2M operation, low-frequency oscillations are amplified...
The CREST project is a new realization of the Conditions DB for the ATLAS experiment, using a REST API and JSON support. This project simplifies the conditions data structure and optimizes data access.
CREST development requires not only a client C++ library (CrestApi) but also various tools for testing software and validating data. A command line client (crest_cmd) was written to get a...
The dynamics of a φ0 Josephson junction and the phenomenon of magnetization reversal under the action of a current pulse are considered. The dynamics of the φ0 junction is described by a closed system of equations consisting of the Landau-Lifshitz-Gilbert equations for the magnetization and the equations of the resistive model for the junction phase difference; this system constitutes a Cauchy problem for a system of ordinary nonlinear...
P-BEAST is a highly scalable, highly available and durable system for archiving monitoring information of the trigger and data acquisition (TDAQ) system of the ATLAS experiment at CERN. The Grafana plugin communicates with P-BEAST via a REST API with JSON support. Grafana, a multi-platform open source analytics and interactive visualization web application, is continuously developed with...
Spherically symmetric localized long-lived pulsating states (oscillons) in the three-dimensional φ$^4$ theory are numerically investigated in a ball of finite radius. These structures are of interest in a number of physical and mathematical applications including several cosmological contexts. The numerical approach is based on numerical continuation of solutions of a boundary value problem for...
The BM@N 8th physics run using Xenon ion beams was successfully completed in February 2023, resulting in the recording of approximately 550 million events. They were recorded in the form of 31306 files, with a combined size exceeding 430TB. However, the reconstruction of these files demands significant computing resources, which is why a distributed infrastructure unified by DIRAC was chosen...
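The figures quoted in this abstract imply some useful per-file and per-event averages. The short Python calculation below is only back-of-the-envelope arithmetic: it assumes decimal units (1 TB = 10^12 bytes) and treats the "approximately 550 million" and "exceeding 430TB" values as exact, so the sizes are rough lower bounds; the variable names are illustrative.

```python
# Figures quoted in the abstract (treated as exact for this estimate).
N_EVENTS = 550e6        # recorded events (approximate)
N_FILES = 31306         # raw data files
TOTAL_BYTES = 430e12    # combined size, assuming 1 TB = 10**12 bytes (lower bound)

avg_file_gb = TOTAL_BYTES / N_FILES / 1e9    # average file size in GB
events_per_file = N_EVENTS / N_FILES         # average number of events per file
mb_per_event = TOTAL_BYTES / N_EVENTS / 1e6  # average raw event size in MB

print(f"average file size : {avg_file_gb:.2f} GB")
print(f"events per file   : {events_per_file:.0f}")
print(f"raw event size    : {mb_per_event:.2f} MB")
```

Under these assumptions a file averages roughly 13.7 GB and holds about 17.6 thousand events of about 0.78 MB each.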
Under external radiation, a constant-voltage step, the so-called Shapiro step, appears on the current-voltage characteristic (CVC) of a Josephson junction. The width of this step depends on the amplitude and frequency of the external radiation, as well as on the model parameters. In numerical simulation of the Josephson junction dynamics and in studying the influence of the model parameters on the steps...
The Configuration Information System (CIS) has been developed for the BM@N experiment to store and provide data on the configuration of the experiment hardware and software systems while collecting data from the detectors in the online mode. The CIS allows loading configuration information into the data acquisition and online processing systems, activating the hardware setups and launching all...
Mathematical modeling and computational experiment serve as an important tool in studying charge transfer processes in biopolymers such as DNA. The relevance of research on charge transfer in DNA is related, in particular, to the development of nanobioelectronics, which is a potential replacement for modern microelectronics based on semiconductor technologies.
The problem of modeling...
One of the modern technologies for obtaining new materials and coatings is the deposition of nanoparticles on a substrate. This process is relevant for many industries and the social sphere. Constantly increasing requirements for the quality and nomenclature of this type of product lead to the need for detailed theoretical and experimental studies of the spraying process in various conditions....
The high-precision coordinate detectors of the tracking system in the BM@N experiment are based on microstrip readout. The complete tracking system designed for the latest xenon physics run (winter of 2023) consists of three parts: an ion-beam tracker and two trackers (inner and outer) for charged particle registration after primary interactions. The report reviews the features and...
Machine learning methods are being proposed for more and more high energy physics tasks, in particular for charged particle identification (PID). This is because machine learning algorithms improve PID in regions where conventional methods fail to provide good identification. This report gives the results of applying gradient boosted decision trees to particle...
This paper presents a mathematical and numerical model of basal melt of Antarctic glaciers. At each point of the continent for which the heights above sea level of the lower and upper ice edges are known, a one-dimensional three-phase Stefan problem with moving phase boundaries is solved along the vertical direction. The model allows calculating the dynamics of the temperature distribution...
Event reconstruction in the SPD (Spin Physics Detector) experiment in the NICA mega-science project presents a significant challenge of processing a high data flow with limited valuable events. To address this, we propose novel approaches for unraveling time slices. With a data rate of 20 GB/sec and a pileup of about 40 events per time slice, our methods focus on efficient event reconstruction...
This study is devoted to modeling the propagation of elastic waves in a heterogeneous medium with explicit account of the inhomogeneities. To implement this approach, an algorithm based on the grid-characteristic method with overlapping grids was developed. The proposed algorithm was parallelized in a distributed cluster environment using MPI technology. The results of the study...
The task of fluid simulation is computationally difficult, both in terms of the required computational costs and in terms of representing the system with a large number of particles. This study considers various methods for solving this problem, such as the use of parallel computing, rendering optimization, and optimizing information transfer between the CPU and GPU. The work was conducted in...
In accordance with the technical design report, the SPD detector, which is being built at the NICA collider at JINR, will produce trillions of physical events per year, estimated at dozens of petabytes of data, which puts it on a par with experiments at the Large Hadron Collider. Although the physical facility is under construction, these figures must be taken into account already now, at the...
Particle tracking is critical in high-energy physics experiments, but traditional methods like the Kalman filter cannot handle the massive amounts of data generated by modern experiments. This is where deep learning comes in, providing a significant boost in efficiency and tracking accuracy.
A new experiment called the SPD is planned for the NICA collider, which is currently under...
Numerical solution of seismic exploration problems plays an important role in the oil and gas industry, helping to determine the presence of oil- and gas-bearing formations and to optimize drilling and oil and gas production.
Accounting for topography is an important aspect of seismic exploration, since the shape of the Earth's surface can significantly affect the propagation of seismic waves and, consequently,...
Detailed theoretical studies of deposition processes, including the interaction of nanoparticles with a substrate, are of particular practical interest. The technology under consideration is used in such critical areas as microelectronics, the creation of protective coatings and new medical materials. Mathematical modeling of each level of this process makes it possible to effectively select...
The NICA mega-science project sets high requirements for computing resources and for data storage and processing systems. Members of the MPD, BM@N and SPD collaborations actively use various JINR computing resources in their calculations: the MICC Tier-2, the Govorun supercomputer, the JINR-Cloud, and the NCX computing cluster. The calculations rely on the classical hierarchy of data storage and processing systems...
We present a new conservative scheme for computation of the Boltzmann collision integral for binary and triple processes in relativistic plasma based on direct integration of exact quantum electrodynamical matrix elements. Parallel evaluation of the collision integral is done within the framework of general-purpose computing on graphics processing units (GPGPU). This approach is important for...
Modern scientific research cannot exist without large computing systems capable of storing large volumes of data and processing them in a relatively short time. Such systems include distributed centers for data acquisition, storage and processing (distributed data centers).
Distributed systems have a complex structure and include many diverse components, so for...
Extensive studies in the field of high-temperature plasma and controlled thermonuclear fusion began in the 1950s. The main goal of these studies was to create a power source that runs on the relatively cheap hydrogen isotope deuterium, heated to a hundred million degrees under conditions in which a thermonuclear reaction can be obtained.
In the beginning, the simple...
In the BM@N experiment, a xenon heavy ion beam with an energy of 2.7 GeV/nucleon interacts with a cesium target, generating many secondary particles π, μ, p, n, γ, e, d, α, K, etc. After computer processing of the data from the detectors used in the experiment, we obtain a series of images of the tracks of emerging particles. We processed four of them using the Gwyddion program and calculated...
We present our polymorphic non-abelian package of 3D vectors and matrices for high-speed algorithms intended for trigger applications in particle physics. The package is part of our "Math-on-Paper" C++ concept of fielding solutions that are as close as possible in code to actual scientific on-paper computations, given that it is often nearly impossible to bring paper equations...
The presentation is devoted to the creation and development of the computing center of the SAPHIR institute (Millennium Institute for Subatomic Physics at the High Energy Frontier, Santiago, Chile).
A description is given of the technology for assembling modules in programming languages by means of the APROP system, built under a state contract with NITsEVT (1975-1976) under the direction of Academician V. M. Glushkov for the OS of ES EVM computers. APROP was deposited in the State Fund (GOSFond). In 1977, APROP was used in the military-industrial complex (MNIIPA, V. V. Lipaev), under a contract for 1978-1985, to implement the software complexes Prometey, Yauza, Ruza and onboard instruments for aviation, space and...
Bioinformatics is the area that develops methods and software tools for understanding biological data, which includes sequence analysis, gene and protein expression, analysis of cellular organization, structural bioinformatics, data centers, etc. A new and more general direction is to consider bioinformatics as informatics on the basis of nanobioelectronics and biocomputer...
In setting up and designing a computational experiment, the issues of interactive control of resource-intensive algorithms appear relevant, with the possibility of dynamically reconfiguring hydromechanics models under independent visual monitoring of three-dimensional physical phenomena and processes in real time. A direct computational experiment makes it possible to achieve practical...
Recently, the branch of mathematics associated with functional integration has been rapidly developing. For a long time it was a means of constructing perturbation theory and solving applied problems. However, recently it has become clear that it can be a very effective tool for creating high-performance algorithms. Moreover, it may be the only tool for developing algorithms for quantum...
The article describes the concept and main characteristics of the "Cybersecurity" master's program developed by the Faculty of Computational Mathematics and Cybernetics of Lomonosov Moscow State University jointly with the Cybersecurity Department of Sberbank PJSC. The goals, the main design principles, the architecture of the program's body of knowledge, its principal features,...
Artificial Neural Networks in High Energy Physics data processing (succinct survey) and probable future development
Abstract
The rising role of Artificial Neural Networks (ANN) as part of machine learning/deep learning (ML/DL) in High Energy Physics (HEP) and related areas has been evident over the last decades. Several reasons for this rise were observed. The...
Machine learning systems are today the main examples of the use of Artificial Intelligence in a wide variety of areas. From a practical point of view, we can say that machine learning is synonymous with the concept of Artificial Intelligence. In some works, this definition is somewhat limited, and they only talk about artificial neural networks and deep learning in the context of artificial...
This talk is devoted to studying the problem of detecting objects of various sizes, using an open dataset and the Yolo v5 neural network model as an example. The main focus is on studying how image pre-filtering affects object detection results and on developing a methodology for assessing this effect. In addition, the work assesses the effect of filtering out distortions caused by rain and...
This talk describes a method of image blending based on interpolating latent-space features during the generation process of diffusion models. The main focus is on implementation details and on studying how generation parameters affect the creation of the final image. Alternative methods for solving the problem are also reviewed and compared with the proposed method....
Recently, the landscape of computational infrastructure has been changing dramatically under the pressure of application requirements. The set of properties of modern applications can be summarized as follows: they are distributed, self-sufficient, work in real time, elastic, cross-platform, actively interact and synchronize, and are easy to update. The definitions of these terms are in [1]. For further...
We are investigating the quantum dynamics of a well-collimated electron beam transmitting through planar channels of the Si crystal. Electron states were represented by wave packets, while the electron beam was treated as an ensemble of noninteracting wave packets. The evolution of electron states was obtained using the method of Chebyshev global propagation, specifically modified to give...
The talk will cover the current status of work on the development of the BIOHLIT information system (IS), which is being created within a joint project between MLIT and LRB JINR. The system is designed to create a convenient environment for storing, processing and automating the analysis of data from experiments aimed at studying the radiobiological effects of exposure to ionizing radiation at...
The training of teams and crews for the control and operation of complex technical objects implies the training of a big number of people at once, who must jointly solve the assigned tasks. Unlike a simple training system, such simulators are built on the basis of a distributed computing environment, consisting of the workplaces of the training crew members and the control server. In the case...
The paper presents the results of applying soft and quantum computing technologies to the problems of learning, adaptation and self-organization of an intelligent control system for pressure stabilization in a nitrogen cryogenic plant at the magnet factory of VBLHEP JINR. The performance of the system is compared for different types of control models: a PID controller, a PID controller using a genetic...
This research paper explores methods for balancing privacy and performance in distributed systems, specifically within multilayered architectures. It proposes a potential solution for secure data exchange on a hybrid blockchain platform, leveraging cryptographic tools to protect sensitive data while maintaining system functionality. The paper emphasizes the importance of considering both...
Based on computer vision methods, algorithms have been developed for analyzing video data obtained in the "Morris water maze" behavioral test, which is used to assess memory function and spatial learning in small laboratory animals. The work was carried out within a joint project of MLIT and LRB JINR. For convenience and to verify the correctness of the obtained trajectories...
One of the application areas of artificial intelligence technologies is solving the stabilization problem in the control of technical systems, including industrial-grade systems.
The paper presents the results of a study on applying evolutionary and adaptive algorithms in intelligent control systems for pressure stabilization in a nitrogen cryogenic plant at the factory...
Abstract. This paper describes the decentralization of a task management algorithm in a distributed environment. The main criteria for task management are presented, and various approaches to designing this algorithm are described. The author considers the architecture of a blockchain-based task management system using the Parity Substrate framework of the Polkadot ecosystem....
The research computer infrastructure for working with experimental MRI/fMRI data of the brain of a human or a laboratory animal is described:
- System "Neurovisualization" of the IAP "Digital Laboratory", with the involvement of the supercomputer of the Research Center KI as a computing resource;
- Additional software services based on the IAP "Digital Laboratory" that implement new methods...
Quantum computers have the potential to solve problems that prove computationally hard for some classical algorithms. However, building physical quantum devices with a large number of qubits and high stability currently remains a difficult task. Developing and debugging quantum algorithms on simulators with a classical architecture can be used not...
One of the key technical features of the SPD (Spin Physics Detector) facility is triggerless data readout, dictated by a certain complexity of the physical processes under study. The data acquisition (DAQ) system aggregates data from the facility's detectors and organizes them into blocks for subsequent primary processing. The total data volume after aggregation can reach 20...
In the first part of the report, we examined control systems with constant coefficients of the conventional PID controller (based on a genetic algorithm) and intelligent control systems based on soft computing technologies. For demonstration, MATLAB/Simulink models and a test benchmark of a robot manipulator were used. Advantages and limitations of intelligent control systems based on...
The network infrastructure is an integral part of the major research infrastructure project "Multifunctional Information and Computing Complex (MICC) of JINR". The main goal is to provide a guaranteed and reliable traffic transmission that will fully meet the needs of scientific experiments. This presentation provides an overview of the local and external network infrastructure at JINR.
Reconstruction of neutron spectra over a wide energy range from $10^{-8}$ to $10^3$ MeV is very relevant for the purposes of ensuring radiation safety behind biological shields at high-energy accelerators and reactors. A Bonner multi-sphere spectrometer is used for measurements. However, to unfold the entire spectrum from the measurement data, it is necessary to solve the Fredholm integral...
Providing a reliable Internet connection is the key to the success of any network. This paper considers a highly reliable network topology for data transfer between nodes at JINR. The big challenge for the network service is to integrate the two GRID sites, the Tier 1 and Tier 2 data centers, with the backbone JINR LAN while upscaling data rates to 100G, and in...
The article proposes algorithms for the automatic diagnosis of human lung diseases (pneumonia and cancer) from images obtained by irradiation; the algorithms allow decisions to be made with the necessary reliability, that is, by limiting the probabilities of possible errors to a pre-planned level. The proposed algorithms have been tested using statistical simulation and...
An algorithm for searching scientific publications based on external citation information using neural network models
Databases of scientific publications currently contain millions of articles, and search methods for them are continuously evolving: from traditional text search to systems that take into account additional bibliometric information (citation indices), semantic search,...
According to the Food and Agriculture Organization, the world's food production needs to increase by 60-70 percent by 2050 to feed the growing population. However, the EU agricultural workforce has declined by 35% over the last decade, and 54% of agriculture companies have cited a shortage of staff as their main challenge. This, among other factors, has led to an increased interest in advanced...
- types of virtualization
- select a suitable open-source solution
- use the Proxmox VE cluster solution
In this work, we consider the problem of optimally forming user groups for multicasting when users are served by unicast and multicast connections using multibeam antennas. We formulated this problem as a subclass of the Bin Packing Problem (BPP) and proposed an exact algorithm for the optimal partitioning of users that minimizes the use of...
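To illustrate the Bin Packing Problem that the abstract refers to, here is a minimal sketch of the classic first-fit decreasing (FFD) heuristic. It is not the exact algorithm proposed by the authors, which the abstract does not describe. The names `demands` (per-user resource requirements) and `capacity` (resource available to one multicast group) are hypothetical and chosen only for illustration.

```python
def first_fit_decreasing(demands, capacity):
    """Group items into bins with the first-fit decreasing heuristic.

    demands  -- hypothetical per-user resource requirements
    capacity -- hypothetical resource limit of one group (bin)
    Returns a list of bins, each a list of the demands placed in it.
    FFD is a heuristic: the number of bins is not guaranteed to be minimal.
    """
    if any(d > capacity for d in demands):
        raise ValueError("a demand exceeds the bin capacity")

    bins = []   # contents of each bin
    loads = []  # current total load of each bin

    # Place the largest demands first, each into the first bin where it fits.
    for d in sorted(demands, reverse=True):
        for i, load in enumerate(loads):
            if load + d <= capacity:
                bins[i].append(d)
                loads[i] += d
                break
        else:
            # No existing bin can hold this demand: open a new one.
            bins.append([d])
            loads.append(d)
    return bins


groups = first_fit_decreasing([4, 8, 1, 4, 2, 1], capacity=10)
```

An exact method would have to search over partitions (for example by branch and bound), whereas a heuristic such as FFD is typically used as a fast baseline or as an initial upper bound.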
Ensuring the confidentiality and protection of personal information in big data is an important aspect in data processing. One of the effective methods to achieve a high level of protection is depersonalization of data. The article presents an overview of modern methods of preserving personal data when conducting various kinds of research, in business analytics, etc. The influence of...
One of the promising areas in the field of high-performance computing is co-scheduling, which makes it possible to schedule computational tasks for co-execution on a single node. The common approach of running only one task on each node at a time does not fully utilize the resources of the computer network. With a co-scheduling mechanism it is possible...
Developing SSO and using it in JINR applications
One of the most important components of the monitoring system LITMon MICC LIT JINR is the data storage system. Initially, it was based on the RRD database and a special pnp4nagios plugin, support for which ended in 2022, and required features no longer work. The RRD database is obsolete: it has ceased to meet performance requirements and has begun to consume more computing resources of the...
In the study of diseases of the elderly, five different types of instruments are used, each of which alone does not allow a reliable diagnosis. In addition, tests and examinations are carried out by a doctor who makes his conclusion. Often the doctor's conclusion contradicts the data of computer diagnostics. In this communication, an attempt is made to construct a computer diagnostics system...
Executing millions of scientific high-throughput computing (HTC) jobs on distributed heterogeneous computing resources poses challenges in observing their status and behavior after their completion. To address this, an approach was developed to analyze jobs using scatter plots, showcasing the dependency between job durations and the relative performance of CPU cores they were assigned to....
This talk is devoted to the problem of determining hand positions from key points, using an open dataset as an example and applying machine learning methods. The main focus is on developing key features that make it possible to build a high-quality and compact machine learning model. In addition, the work investigates the effectiveness of various models of machine...
Distributed systems require an efficient and reliable consensus mechanism to reach agreement between nodes. In recent years, two popular consensus algorithms, Practical Byzantine Fault Tolerance (PBFT) and Raft, have gained wide acceptance in the community due to their advantages. PBFT provides high speed and fault tolerance, while Raft is simple and easy to understand. However, each of them...
The main properties of mesons are considered within an effective quantum field theory with nonlocal interaction. A mathematical apparatus has been developed for the self-consistent solution of a system of nonlinear integral equations. A numerical calculation of the meson mass spectrum and the meson coupling constants has been carried out.
The talk considers a method developed by the authors for determining general movement coordination and the state of alcohol intoxication from the data of the vibrometric sensors of a smartphone located at the upper front part of a person's thigh (in a pocket). The vibration signal coming from the device is analyzed in the time domain. As features of the machine learning model...
During remote sensing of the Earth, satellite equipment registers solar radiation reflected by the Earth's surface. This reflected radiation travels through the atmosphere, which distorts the spectral characteristics of the radiation reaching the satellite's sensors. The task of atmospheric correction is to eliminate the influence of these distortions. Currently, data from most satellites are in...
Efficient control of experimental facilities and accelerator complexes of various levels of complexity is of great importance for modern science. Engineering solutions in this area vary, but they all come down to the use of specialized object-oriented distributed systems for controlling hardware.
This talk presents the operating algorithm of...
The properties of $\eta$-$\eta'$ mesons at finite temperature of nuclear matter are investigated. An algorithm for the self-consistent solution of the Schwinger-Dyson and Bethe-Salpeter equations has been developed. Numerical calculations of the mass spectrum of pseudoscalar mesons have been carried out.
The License Management System (LMS) was developed at the JINR Information Technology Laboratory. The purpose of the LMS is to automate the management, acquisition, maintenance and use of licensed software products. The article presents the results of the system's development over the last year. The mechanism for coordinating requests (workflow) was imported from "EDMS Dubna" and...
Generative models have become widespread over the past few years, playing a valuable part in content creation. Generative adversarial networks (GANs) are one of the most popular types of generative model. However, the computing power required to train stable, large-scale, high-resolution models can be enormous, making training or even running such models an expensive process. Study of neural...
The study is devoted to developing an algorithm for extracting the names of organizations from poorly structured data. Bibliographic information about the publications from the abstract database Scopus was taken as the initial data.
The main problem in extracting names of organizations from affiliations, apart from the presence of typos, is that the requirements of journals and conferences to...
Currently, problems related to task placement in clusters play an important role, since good placement can significantly reduce the execution time of parallel applications. To efficiently allocate tasks, the scheduler must consider both the topology of the cluster and that of the input task.
In this work, we study various cluster topologies and consider several task placement algorithms. In particular, we...
JINRLIB is a library of programs intended for solving a wide range of mathematical and physical problems. JINRLIB programs are written in different systems and in different programming languages and belong to various areas of computational mathematics and computational physics. There is a section of programs written using parallel computing technologies, in particular MPI and OpenMP....
An essential part of any system’s security architecture is an authentication mechanism – some algorithm or combination of algorithms making sure that only legitimate users can gain access to the system. Continuous authentication (CA) is a new approach to user authentication in distributed systems. Its main principle is that unlike a “traditional” approach, where a user is only authenticated...
Natural language processing technologies are one of the key areas in the field of data analysis. Natural language processing covers many tasks, including named-entity recognition, which makes it possible to extract valuable information from a large amount of data. The study is devoted to selecting the best software packages for named-entity recognition in Russian news texts....
The report discusses a corporate geographic information system designed to optimize management decisions for the operation of the laboratory building. The purpose of the service is the competent management of the building and its technical communications, monitoring of engineering networks, visual representation of the placement of staff, keeping a log of ongoing...
Digitalization is spreading through more and more spheres of modern life, driven by the need for more efficient and accurate methods of handling data. This paper focuses on the development of a system that can track the publication activity of researchers in a scientific organization. The system is developed in the framework of the project to track the publication...
Abstract. The general formulation of the identification problem is considered. The discussion covers how the set of admissible solutions changes when additional hypotheses are added (or existing ones are modified). The balanced identification technology is used to link mathematical models with data. Using simple illustrative examples, the dynamics of some statistical estimates of the accuracy...
Scheduling tasks and allocating resources in a cloud (distributed) system is significantly different from resource management of a single computer. Cloud (global) schedulers view the system as a large pool of resources to which they have full access. At the same time, the most urgent task remains to maintain a balance, which consists in managing each individual computing task in such a way...
The research is devoted to the development of requirements for new software that will automate the collection and analysis of data on the processing of a sample of biomaterial from existing systems. The paper compared the possibilities of data storage in information systems that are used in automated medical laboratories in Russia and abroad.
To improve the efficiency of the medical...
The paper considers methods to improve the performance of a multi-agent system for knowledge representation and processing. An approach to the development of system and application software agents is described, and methods for distributing agents across the nodes of the computing system and for constructing the optimal logical structure of a distributed knowledge base are considered. The scheme of...
The study of Big Data is important because understanding the technologies in this area makes it possible to use large amounts of information effectively. The authors of the article studied data from scientific and news sources and present an analysis of the development of big data technologies. The analysis examines the development of the Big Data market both worldwide and in individual countries...
The advantages of cloud technologies have allowed them to occupy a certain niche in the field of scientific computing. About a decade ago, following that trend, a cloud infrastructure was deployed at JINR and later in the scientific organizations of its Member States. These cloud resources were integrated into a Distributed Information and Computing Environment (DICE) to combine...
Modern systems and applications are increasingly distributed due to growing performance, scalability and availability requirements. Distributed computing makes it possible to flexibly aggregate the resources of individual machines into scalable computing infrastructures with the required characteristics. However, distributed systems are hard to build, test and operate because of their asynchronous and...
Abstract
The article (proceeding) explores the importance of horizontally scalable technologies for storing and processing digital footprints, a crucial component of IT-professional training for accelerating digital transformation. It begins by defining digital footprints, subsequently addressing their increasing role in modern IT education and digital transformation. The discussion...
Volunteer computing (VC) is a powerful way to harness distributed computing resources for large-scale scientific tasks. Its success depends directly on the number of participants, the number of PCs (and other devices), and the time they contribute. Project managers and organizers therefore use a mechanism for awarding conditional points to encourage participation in VC projects. The number of these...
The status of the INP BSU Tier 3 grid site BY-NCPHEP is presented. The experience of operation and the efficiency and flexibility of the cloud-based structure are discussed.
Linked open data is crucial for Semantic Web development due to its ability to provide both unambiguous computer interpretation and human understanding of information. Despite the active growth, including the variety of standards, methods, and tools for preparing linked data (LD), there is still a gap between the idea and its ubiquity. It is still not easy to discover LD, difficult to link them,...
The importance of developing computing infrastructure and services to support the accumulation, storage, and processing of research data is steadily increasing. e-Infrastructures have become an increasingly demanded and universal tool for supporting modern science, providing a wide set of services for working with research data: data collection, systematization and archiving, computing...
Efficient management and retrieval of scientific information are crucial in the era of big data and machine learning. This study presents a prototype of a recommendation system that helps researchers select the most suitable journal for publishing their scientific articles. The system utilizes metadata and keyword filtering techniques to retrieve relevant information from open APIs. By...
An approach to building distributed computing networks is being developed, based on dividing the main coordinating and control functions of a single center among a set of elements that "cover" (dominate) all other nodes of the communication system. Previously obtained results make it possible to estimate the reachability, i.e., the length of the longest shortest path (the diameter) of the corresponding graph and...
Containers are gaining more and more traction both in industry and science. The HEP community is also showing significant interest in adopting container technologies for software distribution. Encapsulating software inside of containers helps scientists create portable and reproducible research environments. However, running containerized workloads at scale via distributed computing...
Determining the degree of semantic similarity between texts is a key step in solving a range of problems in search engines, machine translation systems, and other areas related to natural language text processing. It includes text preprocessing, vectorization, feature extraction, choice of a metric, model building, etc.
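The pipeline stages named above (preprocessing, vectorization, choice of a metric) can be sketched in a few lines. This is a minimal illustration, assuming a bag-of-words vectorization and the cosine metric; the abstract does not say which vectorization or metric the authors use, and the function names `tokenize` and `cosine_similarity` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Preprocessing: lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def cosine_similarity(text_a, text_b):
    """Vectorize both texts as word-count vectors and compare them by cosine."""
    vec_a = Counter(tokenize(text_a))
    vec_b = Counter(tokenize(text_b))

    # Dot product over the words that occur in both texts.
    dot = sum(vec_a[w] * vec_b[w] for w in vec_a.keys() & vec_b.keys())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))

    # An empty text has no direction; define its similarity as zero.
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


score = cosine_similarity("grid computing for physics", "cloud computing for physics")
```

Word counts capture only lexical overlap, so a practical system would usually replace the `Counter` vectors with TF-IDF weights or learned embeddings while keeping the same metric step.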
The article [1] presents...
The technology of balanced identification of mathematical models from experimental data [1, 2] has long been successfully used in various fields of applied research [3]. Following the authors' recommendation, we will hereafter call it the SvF technology (Simplicity vs Fitting) for brevity. The algorithmic foundations of this mathematical technology are: Tikhonov regularization,...
The rapid growth of distributed computing systems has necessitated the development of efficient platforms for running optimization workflows. In this talk, we present a comprehensive approach to running optimization workflows using the Everest cloud platform, developed at IITP RAS. Everest enables the publication of code as web services and facilitates the allocation of computing resources,...
An integral part of the research carried out by scientific groups is collaborative work on various kinds of documents: articles, abstracts, meeting minutes, presentations, grant applications, reports, reference books, etc. For this purpose, JINR uses the DocDB system, which is designed for storage, version control, shared access, and exchange of documents among groups of up to...
The work is devoted to building a recommender system for analyzing the efficiency of algorithms for solving large-scale multidimensional optimization problems. Several existing systems for selecting efficient algorithms are considered. An approach to efficiency prediction is proposed, based on statistical analysis with hybrid data filtering methods, and a comparison with...
The widespread use of web technologies is a current trend in software development. More powerful computing resources with modern hybrid architectures are becoming available to users at any geographical location connected by a network to a supercomputer. Since modern high energy physics experiments are characterized by the complexity of the latest detectors and a very large channels...
The paper considers the adaptation of federated deep learning on a desktop grid system using the example of an image classification problem. Restrictions are imposed on data transfer between the nodes of the desktop grid only for a part of the dataset. The implementation of federated deep learning on a desktop grid system based on the BOINC platform is considered. Methods for generating local...
Humans and other animals can understand concepts from only a few examples, while standard machine learning algorithms require a large number of examples to extract hidden features. Unsupervised learning is the procedure of revealing hidden features from unlabeled data.
In deep neural network training, unsupervised data pre-training increases the final accuracy of the algorithm by decreasing an...
Resource management in cloud computing, and especially in FaaS (Function-as-a-Service), is a very active area of research, given the rise in the popularity of serverless computing. In serverless computing, users do not explicitly manage VMs or containers; instead, they upload their functions to the cloud, and the functions are executed when triggered by events, such as HTTP requests. All work...
The report is devoted to the practical aspects of modeling the operation of the BOINC computing infrastructure. The practical expediency of preliminary modeling for solving optimization problems by means of an evolutionary algorithm is substantiated. Particular attention is paid to modeling the occurrence of abnormal situations in the operation of a BOINC project. The main considered abnormal...
One of the important tasks of gamma-ray astronomy is the modeling of Extensive Air Showers (EAS) generated by cosmic rays. Monte Carlo generators are commonly used. One of the most popular programs for generating events in gamma-ray astronomy is the CORSIKA package based on the GEANT4 program. The problem with such generators is the extreme consumption of computer resources. One alternative...
Imaging Atmospheric Cherenkov Telescopes (IACTs) of the TAIGA gamma-ray observatory detect Extensive Air Showers (EASs) originating from the interactions of cosmic or gamma rays with the atmosphere. The telescopes thereby obtain images of the EASs. The ability to separate gamma rays from the hadronic cosmic-ray background in images is one of the main features of this type of detector. However, in...
Simulation of a public desktop grid system, or a volunteer distributed computing project, with the ComBoS software is considered. The features and limitations of a desktop grid system are determined using the example of a desktop grid on the BOINC platform. Scenarios with asynchronous calculation of several computing applications in one desktop grid are considered. The features of the modeling...
Due to the growing volume of applied computations in big data processing and artificial intelligence, as well as in solving traditional numerical simulation problems, there is a need for programs capable of being deployed and executed in hybrid environments consisting of an arbitrary set of networked computing resources. Such resources include virtual machines...
The talk provides an overview of the implementation of the Acceptance Test Driven Development (TDD) paradigm
for quality control/enhancement of the reconstruction engine in MPD offline data analysis framework MPDRoot.
The necessary changes in the codebase architecture and the ease-of-use of the TDD environment,
defining the pivotal success factor: the pool of possibilities (potential) for the...
The general goal of dynamic modeling of complex systems is to reveal the patterns of their behavior in time and changes in time of the quantitative characteristics of interacting components. The key condition for the adequacy of the model to a real object is the correspondence between the structure of the model and the real system, as well as the correspondence of the form of equations to the...
When considering quantum systems in phase space, the Wigner function is used as a quasiprobability density function. Finding the Wigner function involves calculating the Fourier transform of a certain combination of wave functions of the corresponding quantum system. As a rule, knowledge of the Wigner function is not the ultimate goal, and calculations of mean values of...
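For orientation, one common textbook convention for the Wigner function of a pure state with wave function $\psi(x)$ in one dimension is given below, together with the phase-space formula for a mean value. The abstract does not state which convention or which quantum systems the authors consider, so the symbols here are assumptions: $A_W(x,p)$ denotes the Weyl symbol of the operator $\hat A$.

$$W(x,p) = \frac{1}{2\pi\hbar}\int_{-\infty}^{\infty} \psi^{*}\!\left(x+\tfrac{y}{2}\right)\psi\!\left(x-\tfrac{y}{2}\right) e^{ipy/\hbar}\,dy, \qquad \langle \hat A\rangle = \iint W(x,p)\,A_W(x,p)\,dx\,dp .$$

With this normalization $\iint W(x,p)\,dx\,dp = 1$, and $W$ may take negative values, which is why it is a quasiprobability rather than a probability density.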
Computational studies have been carried out on the effectiveness of deep transfer learning methods for the classification of biomedical images. The ViT, Swin, and DeiT transformers, pre-trained on the ImageNet image set, were used as models. Comparisons of the results of the trained models are presented.
The issue of improving software efficiency for computing architectures that support vector extensions of the instruction set is considered. Modern compilers can perform automatic vectorization of computations, transforming programs from a scalar representation to a vector implementation. The work analyzes the efficiency of automatic vectorization performed...
A digital twin (DT) is a virtual representation of processes, physical objects, or systems that is used to assess, diagnose, optimize, and control their characteristics during design, in decision-making in various situations, and for the effective management of real systems. In information security systems, the DT concept solves several...