ATLAS BigPanDA monitoring

A. Alekseev, A. Klimentov, T. Korchuganova, S. Padolski, T. Wenaus

Research output: Contribution to journalConference article

3 Citations (Scopus)

Abstract

BigPanDA monitoring is a web application that provides various processing and representation of the Production and Distributed Analysis (PanDA) system objects states. Analysing hundreds of millions of computation entities, such as an event or a job, BigPanDA monitoring builds different scales and levels of abstraction reports in real time mode. Provided information allows users to drill down into the reason of a concrete event failure or observe the broad picture such as tracking the computation nucleus and satellites performance or the progress of a whole production campaign. PanDA system was originally developed for the ATLAS experiment. Currently, it manages execution of more than 2 million jobs distributed over 170 computing centers worldwide on daily basis. BigPanDA is its core component commissioned in the middle of 2014 and now is the primary source of information for ATLAS users about the state of their computations and the source of decision support information for shifters, operators and managers. In this work, we describe the evolution of the architecture, current status and plans for the development of the BigPanDA monitoring.

Original languageEnglish
Article number032043
JournalJournal of Physics: Conference Series
Volume1085
Issue number3
DOIs
Publication statusPublished - 18 Oct 2018
Externally publishedYes
Event18th International Workshop on Advanced Computing and Analysis Techniques in Physics Research, ACAT 2017 - Seattle, United States
Duration: 21 Aug 201725 Aug 2017

    Fingerprint

ASJC Scopus subject areas

  • Physics and Astronomy(all)

Cite this

Alekseev, A., Klimentov, A., Korchuganova, T., Padolski, S., & Wenaus, T. (2018). ATLAS BigPanDA monitoring. Journal of Physics: Conference Series, 1085(3), [032043]. https://doi.org/10.1088/1742-6596/1085/3/032043