Data Knowledge Base for HENP Scientific Collaborations

V. A. Aulov, M. V. Golosova, M. A. Grigorieva, A. A. Klimentov, S. Padolski, T. Wenaus

Research output: Contribution to journalConference article

Abstract

Contemporary scientific experiments produce significant amount of data as well as scientific publications based on this data. Since volumes of both are constantly increasing, it becomes more and more problematic to establish a connection between a given paper and the underlying data. However, such an association is one of the crucial pieces of information for performing various tasks, such as validating the scientific results presented in paper, comparing different approaches to deal with a problem or even simply understanding the situation in some area of science. Authors of this paper are working under the Data Knowledge Base (DKB) R&D project, initiated in 2016 to solve this issue for the ATLAS experiment at CERN. This project is aimed at developing of the software environment, providing the storage and a coherent representation of the basic information objects. In this paper authors present a metadata model developed for the ATLAS experiment, the architecture of the DKB system and its main components. Special attention is paid to the Kafka-based ETL subsystem implementation and mechanism for extraction of meta-information from the texts of ATLAS publications.

Original languageEnglish
Article number032013
JournalJournal of Physics: Conference Series
Volume1085
Issue number3
DOIs
Publication statusPublished - 18 Oct 2018
Externally publishedYes
Event18th International Workshop on Advanced Computing and Analysis Techniques in Physics Research, ACAT 2017 - Seattle, United States
Duration: 21 Aug 201725 Aug 2017

Fingerprint

metadata
computer programs

ASJC Scopus subject areas

  • Physics and Astronomy(all)

Cite this

Aulov, V. A., Golosova, M. V., Grigorieva, M. A., Klimentov, A. A., Padolski, S., & Wenaus, T. (2018). Data Knowledge Base for HENP Scientific Collaborations. Journal of Physics: Conference Series, 1085(3), [032013]. https://doi.org/10.1088/1742-6596/1085/3/032013

Data Knowledge Base for HENP Scientific Collaborations. / Aulov, V. A.; Golosova, M. V.; Grigorieva, M. A.; Klimentov, A. A.; Padolski, S.; Wenaus, T.

In: Journal of Physics: Conference Series, Vol. 1085, No. 3, 032013, 18.10.2018.

Research output: Contribution to journalConference article

Aulov, VA, Golosova, MV, Grigorieva, MA, Klimentov, AA, Padolski, S & Wenaus, T 2018, 'Data Knowledge Base for HENP Scientific Collaborations', Journal of Physics: Conference Series, vol. 1085, no. 3, 032013. https://doi.org/10.1088/1742-6596/1085/3/032013
Aulov, V. A. ; Golosova, M. V. ; Grigorieva, M. A. ; Klimentov, A. A. ; Padolski, S. ; Wenaus, T. / Data Knowledge Base for HENP Scientific Collaborations. In: Journal of Physics: Conference Series. 2018 ; Vol. 1085, No. 3.
@article{96acbcde19fe4b8aba621db7bb86d966,
title = "Data Knowledge Base for HENP Scientific Collaborations",
abstract = "Contemporary scientific experiments produce significant amount of data as well as scientific publications based on this data. Since volumes of both are constantly increasing, it becomes more and more problematic to establish a connection between a given paper and the underlying data. However, such an association is one of the crucial pieces of information for performing various tasks, such as validating the scientific results presented in paper, comparing different approaches to deal with a problem or even simply understanding the situation in some area of science. Authors of this paper are working under the Data Knowledge Base (DKB) R&D project, initiated in 2016 to solve this issue for the ATLAS experiment at CERN. This project is aimed at developing of the software environment, providing the storage and a coherent representation of the basic information objects. In this paper authors present a metadata model developed for the ATLAS experiment, the architecture of the DKB system and its main components. Special attention is paid to the Kafka-based ETL subsystem implementation and mechanism for extraction of meta-information from the texts of ATLAS publications.",
author = "Aulov, {V. A.} and Golosova, {M. V.} and Grigorieva, {M. A.} and Klimentov, {A. A.} and S. Padolski and T. Wenaus",
year = "2018",
month = "10",
day = "18",
doi = "10.1088/1742-6596/1085/3/032013",
language = "English",
volume = "1085",
journal = "Journal of Physics: Conference Series",
issn = "1742-6588",
publisher = "IOP Publishing Ltd.",
number = "3",

}

TY - JOUR

T1 - Data Knowledge Base for HENP Scientific Collaborations

AU - Aulov, V. A.

AU - Golosova, M. V.

AU - Grigorieva, M. A.

AU - Klimentov, A. A.

AU - Padolski, S.

AU - Wenaus, T.

PY - 2018/10/18

Y1 - 2018/10/18

N2 - Contemporary scientific experiments produce significant amount of data as well as scientific publications based on this data. Since volumes of both are constantly increasing, it becomes more and more problematic to establish a connection between a given paper and the underlying data. However, such an association is one of the crucial pieces of information for performing various tasks, such as validating the scientific results presented in paper, comparing different approaches to deal with a problem or even simply understanding the situation in some area of science. Authors of this paper are working under the Data Knowledge Base (DKB) R&D project, initiated in 2016 to solve this issue for the ATLAS experiment at CERN. This project is aimed at developing of the software environment, providing the storage and a coherent representation of the basic information objects. In this paper authors present a metadata model developed for the ATLAS experiment, the architecture of the DKB system and its main components. Special attention is paid to the Kafka-based ETL subsystem implementation and mechanism for extraction of meta-information from the texts of ATLAS publications.

AB - Contemporary scientific experiments produce significant amount of data as well as scientific publications based on this data. Since volumes of both are constantly increasing, it becomes more and more problematic to establish a connection between a given paper and the underlying data. However, such an association is one of the crucial pieces of information for performing various tasks, such as validating the scientific results presented in paper, comparing different approaches to deal with a problem or even simply understanding the situation in some area of science. Authors of this paper are working under the Data Knowledge Base (DKB) R&D project, initiated in 2016 to solve this issue for the ATLAS experiment at CERN. This project is aimed at developing of the software environment, providing the storage and a coherent representation of the basic information objects. In this paper authors present a metadata model developed for the ATLAS experiment, the architecture of the DKB system and its main components. Special attention is paid to the Kafka-based ETL subsystem implementation and mechanism for extraction of meta-information from the texts of ATLAS publications.

UR - http://www.scopus.com/inward/record.url?scp=85055657947&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85055657947&partnerID=8YFLogxK

U2 - 10.1088/1742-6596/1085/3/032013

DO - 10.1088/1742-6596/1085/3/032013

M3 - Conference article

VL - 1085

JO - Journal of Physics: Conference Series

JF - Journal of Physics: Conference Series

SN - 1742-6588

IS - 3

M1 - 032013

ER -