Preprints
https://doi.org/10.5194/gchron-2022-27
https://doi.org/10.5194/gchron-2022-27
 
22 Dec 2022
22 Dec 2022
Status: this preprint is currently under review for the journal GChron.

XLUM: an open data format for exchange and long-term data preservation of luminescence data

Sebastian Kreutzer1,2,a, Steve Grehl3, Michael Höhne4, Oliver Simmank4, Kay Dornich4, Grzegorz Adamiec5, Christoph Burow6, Helen Roberts1, and Geoff Duller1 Sebastian Kreutzer et al.
  • 1Geography & Earth Sciences, Aberystwyth University, Wales, United Kingdom
  • 2Archéosciences Bordeaux, UMR 6034, CNRS - Université Bordeaux Montaigne, Pessac, France
  • 3HUK-Coburg, Coburg, Germany
  • 4Freiberg Instruments GmbH, Freiberg, Germany
  • 5Institute of Physics, Division of Geochronology and Environmental Isotopes, Silesian University of Technology, Gliwice, Poland
  • 6piazza blu2 GmbH, Cologne, Germany
  • apresent address: Institute of Geography, Heidelberg University, Heidelberg, Germany

Abstract. Open data has become the modern science meme, and major funding bodies and publishers support open data. On a daily basis, however, the open data mandate frequently encounters technical obstacles, such as a lack of a suitable data format for data sharing and long-term data preservation. Such issue is often community-specific and best addressed through community-tailored solutions. In Quaternary sciences, luminescence dating is widely used for constraining the timing of event-based processes (e.g., sediment transport). Every luminescence-dating study produces a vast body of primary data that usually remains inaccessible and incompatible with future studies or adjacent scientific disciplines. To facilitate data exchange, long-term data preservation, in short, open data, in luminescence dating studies, we propose a new XML-based structured data format called XLUM. The format applies a hierarchical data storage concept consisting of a root node (node 0), a sample (node 1), a sequence (node 2), a record (node 3) and a curve (node 4). The curve level holds information on the technical component (e.g., photomultiplier, thermocouple). A finite number of curves represent a record (e.g., an optically stimulated luminescence curve). Records are part of a sequence measured for a particular sample. This design concept allows the user to retain information on a technical component level from the measurement process. The additional storage of related metadata fosters future data mining projects on large datasets. The XML-based format is less memory efficient than binary formats, however, in focus is data exchange, preservation and hence XLUM long-term format stability by design. XLUM is inherently stable to future updates and backwards compatible. We support XLUM through a new R package, `xlum', facilitating the conversion of different formats into the new XLUM format. XLUM is licensed under the MIT licence and hence available for free to be used in open and closed-source commercial and non-commercial software and research projects.

Sebastian Kreutzer et al.

Status: open (until 15 Feb 2023)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • AC1: 'Link to webpage with XLUM specification', Sebastian Kreutzer, 22 Dec 2022 reply

Sebastian Kreutzer et al.

Data sets

XLUM data format specification: v1.0.0 Sebastian Kreutzer; Steve Grehl; Michael Höhne https://doi.org/10.5281/zenodo.7362438

Model code and software

xlum: read, write, and convert XLUM Data Sebastian Kreutzer, Christoph Burow https://doi.org/10.5281/zenodo.7362364

Sebastian Kreutzer et al.

Viewed

Total article views: 260 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
196 53 11 260 7 4
  • HTML: 196
  • PDF: 53
  • XML: 11
  • Total: 260
  • BibTeX: 7
  • EndNote: 4
Views and downloads (calculated since 22 Dec 2022)
Cumulative views and downloads (calculated since 22 Dec 2022)

Viewed (geographical distribution)

Total article views: 227 (including HTML, PDF, and XML) Thereof 227 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 03 Feb 2023
Download
Short summary
Open data has become the modern science meme. Funding bodies and publishers support open data. However, the open data mandate frequently encounters technical obstacles, such as a lack of a suitable data format for data sharing and long-term data preservation. Such issues are often community-specific and demand community-tailored solutions. We proposed a new human-readable data format for data exchange and long-term data preservation of luminescence data based called XLUM.