Heidelberg 2022 – Scientific Programme
AKPIK: Working Group on Physics, Modern Information Technology and Artificial Intelligence
AKPIK 1: Data Integration & Processing
AKPIK 1.4: Talk
Monday, 21 March 2022, 17:00–17:15, AKPIK-H13
CaosDB – a scientific research data management toolkit
Daniel Hornung (1), •Florian Spreckelsen (1), and Johannes Freitag (2)
(1) IndiScale GmbH, Göttingen; (2) Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research, Bremerhaven
Processing interconnected, multi-modal data poses a challenge in many fields, especially when the data model, i.e. the way data is organized, changes over time or when its structure is poorly documented. The open-source software CaosDB is a toolkit for research data management. It was originally developed at the Max Planck Institute for Dynamics and Self-Organization (Göttingen) because existing software could not meet the scientists' needs.
We present examples where CaosDB helped make data FAIR (Findable, Accessible, Interoperable, Reusable) and show how it can simplify researchers' workflows through automated data collection and integration, export to data repositories, API libraries for third-party programs, integrated revisioning, and workflow state machines. If the data model needs to change, existing data can remain as-is, and future search queries will return matching results containing both “old” and “new” data. We demonstrate how raw and processed data, analysis settings and results, and even lab notebooks and publications can be linked to each other to improve the long-term usability of data and the reproducibility of results.
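The idea that a changing data model need not invalidate existing records can be illustrated with a small, self-contained Python sketch. This is not the CaosDB API: the `Record` class, the `find` function, and all property names and values below are hypothetical, chosen only to show how a query can match both "old" and "new" records and how records can be linked to each other.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Any


@dataclass
class Record:
    """Toy record (hypothetical, not a CaosDB class): a type name,
    free-form properties, and links to other records."""
    record_type: str
    properties: dict[str, Any] = field(default_factory=dict)
    links: list[Record] = field(default_factory=list)


def find(records: list[Record], record_type: str, **conditions: Any) -> list[Record]:
    """Return records of the given type whose properties equal all conditions.

    A record that lacks a queried property does not match that condition,
    so records written under an older data model are never rejected
    merely for missing newer properties that the query does not ask for.
    """
    return [
        r for r in records
        if r.record_type == record_type
        and all(r.properties.get(key) == value for key, value in conditions.items())
    ]


# Record stored under the "old" data model: only an operator is known.
old_experiment = Record("Experiment", {"operator": "A. Example"})

# Record stored after the data model gained a temperature property.
new_experiment = Record(
    "Experiment", {"operator": "A. Example", "temperature_K": 77}
)

# Link the newer experiment to its raw data file.
raw_data = Record("RawData", {"path": "run_002.dat"})
new_experiment.links.append(raw_data)

store = [old_experiment, new_experiment, raw_data]

# A query on a property shared by both model versions returns both records.
by_operator = find(store, "Experiment", operator="A. Example")

# A query on the newer property returns only records that carry it.
by_temperature = find(store, "Experiment", temperature_K=77)
```

In this sketch the old record is left untouched after the model change, yet it still appears in `by_operator`. A real research data management system would add persistence, access control, and a richer query language on top of this basic idea.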