Montenegro, Budva, Becici, 25 September - 29 September 2017

Name: Montenegro, Budva, Becici, 25 September - 29 September 2017
Start: 2017-09-25T08:30:00+02:00
End: 2017-09-29T19:00:00+02:00
Location: Montenegro, Budva, Becici

25–29 Sept 2017

Montenegro, Budva, Becici

Europe/Podgorica timezone

Support

nec2017@jinr.ru

Applying Big Data solutions for log analytics in the PanDA infrastructure

28 Sept 2017, 12:30

15m

Conference Hall (Montenegro, Budva, Becici)

Conference Hall

Montenegro, Budva, Becici

Splendid Conference & SPA Resort, 85315 Becici, Montenegro Hotel Splendid

Sectional Distributed Computing. GRID & Cloud Computing Distributed Computing. GRID & Cloud computing

Mr Fernando Barreiro Megino (University of Texas at Arlington)

PanDA is the workflow management system of the ATLAS experiment at the LHC and is responsible for generating, brokering and monitoring up to two million jobs per day across 150 computing centers in the Worldwide LHC Computing Grid. The PanDA core consists of several components deployed centrally on around 20 servers. The daily log volume is around 400GB per day. In certain cases, troubleshooting a particular issue on the raw log files can be compared to searching for a needle in a haystack and requires a high level of expertise. Therefore we decided to build on trending Big Data solutions and utilize the ELK infrastructure (Filebeat, Logstash, Elastic Search and Kibana) to process, index and analyze our log files. This allows to overcome troubleshooting complexity, provides a better interface to the operations team and generates advanced analytics to understand our system. This paper will describe the features of the ELK stack, our infrastructure, optimal configuration settings and filters. We will provide examples of graphs and dashboards generated through the ELK system to demonstrate the potential of the system. Finally, we will show the current integration of Kibana with the PanDA monitoring frontend and other usage possibilities, such as proactive notification of exceptions in the system.

Mr Aleksandr Alekseev (National Research Tomsk Polytechnic University)

Dr Alexei Klimentov (Brookhaven National Lab) Mr Fernando Barreiro Megino (University of Texas at Arlington) Siarhei PADOLSKI (BNL) Tadashi Maeno (BNL) Tatiana Korchuganova (National Research Tomsk Polytechnic University)

Slides

NEC_2017.pdf

Montenegro, Budva, Becici, 25 September - 29 September 2017

Support

Applying Big Data solutions for log analytics in the PanDA infrastructure

Conference Hall

Montenegro, Budva, Becici

Speaker

Description

Author

Co-authors

Presentation materials

Choose timezone

Montenegro, Budva, Becici, 25 September - 29 September 2017

Support

Speaker

Description

Author

Co-authors

Presentation materials