Irina Filozova
(JINR)
9/11/18, 1:30 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
This paper is dedicated to the current state of the Geometry Database (Geometry DB) for the CBM experiment and a first result of using the Geometry DB for NICA project. Geometry DB is an information system that supports the CBM geometry. The main aims of Geometry DB are to provide the storage of the CBM geometry, to manage the geometry modules, to assemble various setups as combinations of...
Daria Priakhina
(ЛИТ)
9/11/18, 1:45 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
В рамках работ по созданию компьютерной системы хранения и обработки данных установок B@MN и MPD, входящих в проект коллайдера NICA, возникает проблема выбора оптимальной конфигурации необходимого компьютерного и сетевого оборудования. Для решения этой проблемы требовалось разработать и исследовать модель перемещения данных внутри системы. Предыдущий опыт моделирования авторов настоящей...
9/11/18, 2:00 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
A critical challenge of high-luminosity Large Hadron Collider (HL-LHC), the next phase in LHC operation, is the increased computing requirements to process the experiment data. Coping with this demand with today’s computing model would exceed a realistic funding level by an order of magnitude. Many architectural, organizational and technical changes are being investigated to address this...
Dr
Andrey Demichev
(SINP MSU)
9/11/18, 2:15 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
Provenance metadata (PMD) contain key information that is necessary to determine the origin, authorship and quality of corresponding data, their proper storage, correct using, and for interpretation and confirmation of relevant scientific results. The need for PMD is especially essential when big data are jointly processed by several research teams, which is a very common practice in many...
Mr
Andrey Kiryanov
(PNPI)
9/11/18, 2:30 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
WLCG DataLake R&D project aims at exploring an evolution of distributed storage while bearing in mind very high demands of HL-LHC era. Its primary objective is to optimize hardware usage and operational costs of a storage system deployed across distributed centers connected by fat networks and operated as a single service. Such storage would host a large fraction of the WLCG data and optimize...
Mr
Haibo Li
(Institute of High Energy Physics,Chinese Academy of Sciences)
9/11/18, 2:45 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
Nowadays, data storage and management in cloud computing environment has been very important in high energy physics filed. The LHAASO(Large High Altitude Air Shower Observatory) experiment of IHEP will generate 2 PB per year in the future. These massive data processing faces many challenges in the distributed computing environment. For example, some sites may have no local HEP storage which...
Mrs
Marina Golosova
(National Research Center "Kurchatov Institute")
9/11/18, 3:30 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
ATLAS experiment at the CERN LHC is one of the most data-intensive modern scientific apparatus. To help managing all the experimental and modelling data, multiple information systems were created during the experiment's lifetime (more than 25 years). Each such system addresses one or several tasks of data and workload management, as well as information lookup, using specific sets of metadata...
Mr
Minh Duc Nguyen
(Skobeltsyn Institute of Nuclear Physics, Lomonosov Moscow State University)
9/11/18, 3:45 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
A distributed data warehouse system is one of the actual issues in the field of astroparticle physics. Famous experiments, such as Tunka, Taiga, produce tens of terabytes of data measured by their instruments. It is critical to have a smart data warehouse system on-site to store the collected data for further distribution effectively. It is also vital to provide scientists with a handy and...
Prof.
Vladimir Dimitrov
(University of Sofia)
9/11/18, 4:00 PM
10. Databases, Distributed Storage systems, Datalakes
Sectional reports
Several years after the initial announcement of the relational model of data, Codd published a review on the model, so called Version 2. This review is based on the experience of relational database systems implementation in the intermediate period. One of the main corrections are recommendation on date and time data types. This paper reinvestigate the topic from the nowadays point of view.