Evaluating non-relational storage technology for HEP metadata and meta-data catalog

M. A. Grigorieva, M. V. Golosova, M. Y. Gubin, A. A. Klimentov, V. V. Osipova, E. A. Ryabinkin

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Large-scale scientific experiments produce vast volumes of data. These data are stored, processed and analyzed in a distributed computing environment. The life cycle of experiment is managed by specialized software like Distributed Data Management and Workload Management Systems. In order to be interpreted and mined, experimental data must be accompanied by auxiliary metadata, which are recorded at each data processing step. Metadata describes scientific data and represent scientific objects or results of scientific experiments, allowing them to be shared by various applications, to be recorded in databases or published via Web. Processing and analysis of constantly growing volume of auxiliary metadata is a challenging task, not simpler than the management and processing of experimental data itself. Furthermore, metadata sources are often loosely coupled and potentially may lead to an end-user inconsistency in combined information queries. To aggregate and synthesize a range of primary metadata sources, and enhance them with flexible schema-less addition of aggregated data, we are developing the Data Knowledge Base architecture serving as the intelligence behind GUIs and APIs.

Original languageEnglish
Article number012017
JournalJournal of Physics: Conference Series
Volume762
Issue number1
DOIs
Publication statusPublished - 21 Nov 2016

    Fingerprint

ASJC Scopus subject areas

  • Physics and Astronomy(all)

Cite this