Age–depth relationships are key elements in paleoenvironmental studies, as they place proxy measurements into a temporal context. However, factors influencing the available radiocarbon data and the associated modeling process can cause age–depth relationships to diverge seriously from the true chronology, which is particularly challenging for paleolimnological studies in Arctic regions. This paper provides geoscientists with a tool-assisted approach to compare outputs from age–depth modeling systems and to strengthen the robustness of age–depth relationships. We primarily focused on the development of age determination data from a data collection of high-latitude lake systems (50 to 90° N).
Lake sediments are important terrestrial archives for recording climate variability in the high latitudes of the Northern Hemisphere (Biskaborn et al., 2016a; Smol, 2016; Lehnherr et al., 2018; Subetto et al., 2017; Syrykh et al., 2021; Diekmann et al., 2017). The identification of age–depth relationships in those lake sediments helps us to put their measured sediment properties in a temporal context (Bradley, 2015; Lowe and Walker, 2014; Blaauw and Heegaard, 2012). We can determine these relationships by directly counting the annual laminated layers (varves) (Brauer, 2004; Zolitschka et al., 2015), or by using indirect age determination methods such as radiocarbon, optically stimulated luminescence (OSL), or lead–cesium (lead-210/cesium-137) dating (Lowe and Walker, 2014; Bradley, 2015; Appleby, 2008; Hajdas et al., 2021). Defining a reliable age–depth relationship for paleoenvironmental studies in cold regions is particularly challenging, as varves only exist in rare cases and the determination of ages mostly depends on radiocarbon dating (Strunk et al., 2020, and references therein). Primarily because of financial restrictions, however, only a few selected samples are taken from sediment core sections to determine the corresponding ages of certain depths (Blaauw et al., 2018; Ciarletta et al., 2019; Olsen et al., 2017). We therefore rely on model calculations to define the ages between the samples. In addition to the mathematical challenges that arise when establishing age–depth relationships, the selection of appropriate dating material has an impact on the modeling process.
In the special case of Arctic lake systems, the amount of material for
radiocarbon dating, i.e. aquatic/terrestrial macrofossils and organic
remains, is extremely low (Abbott
and Stafford, 1996; Colman et al., 1996; Strunk et al., 2020). Radiocarbon
dating is therefore often based on the organic carbon content in bulk
sediment samples, which can be relatively small due to the lower
bioproductivity in those lakes (Strunk et al., 2020, and references
therein). However, the use of bulk sediments is problematic, as some portions of the contributing carbon do not originate at the time of deposition but may carry inherited ages from reworked older materials (Rudaya et al., 2016; Biskaborn et al., 2013b, 2019; Schleusner et al., 2015;
Palagushkina et al., 2017). Several methods are available for pre-treating
bulk sediment samples to address sample-based dating uncertainties
(Brock et al., 2010; Strunk et al., 2020; Rethemeyer et al., 2019; Bao et al., 2019; Dee et al., 2020). Each pre-treatment method may yield a different
result for the same material due to the influence of humic acids, fulvic
acids, and humins (Brock et
al., 2010; Strunk et al., 2020; Abbott and Stafford, 1996). Similarly, older, inert material incorporated by living organisms, known as the “reservoir effect” or “hard-water effect”, distorts the actual radiocarbon age, potentially by several thousand years.
There are numerous geochronological software systems (from now on simply called modeling systems) available to the geoscientific community, which try to solve the challenges stated above (Trachsel and Telford, 2017; Wright et al., 2017; Lacourse and Gajewski, 2020). Methods have been implemented for detecting outliers, accounting for varying sedimentation rates, or using bootstrapping processes to support the construction of an age–depth model (Parnell et al., 2011; Lougheed and Obrochta, 2019; Bronk Ramsey, 2009, 2008). However, the correct usage of those systems requires a high degree of understanding of the underlying mathematical methods and models. Trachsel and Telford (2017) noted that, although users affect the model outcome by setting priors and parameters, most users do not have any objective prior insight into choosing appropriate values. Wright et al. (2017), Trachsel and Telford (2017), and Lacourse and Gajewski (2020) even showed that the results produced by modeling systems can diverge from the true chronology. An in-depth comparison of the results is therefore laborious and error-prone. Due to time constraints, users usually select and apply only one modeling system for paleoenvironmental interpretation. However, comparing multiple modeling systems, despite their inherent differences, offers the benefit of reducing biases in the interpretation of age–depth relationships.
The objective of this paper is to reduce the effort involved in applying
different methods for determining age–depth relationships and to make their
results comparable. We provide a tool to link five selected modeling systems
in a single multi-language Jupyter Notebook. We introduce an ensemble
age–depth model that uses uninformed models to create data-driven,
semi-informed age–depth relationships. We demonstrate the power of our tool
by highlighting three case studies in which we examine our application for
individual sediment cores and a collection of multiple sediment cores.
Throughout this paper, the term “LANDO” refers to our implementation,
which stands for “Linked age and depth modeling”. The current development version of LANDO is accessible via GitHub (see the “Code and data availability” section).
In this paper, we use published age determination data from 55 sediment cores from high-latitude lake systems (50 to 90° N).
A key element in our data-science based approach for developing comparable age–depth relationships was to facilitate the use of modeling systems independent from their original proprietary development environment. A multi-language data analysis environment, such as SoS Notebook (Peng et al., 2018) or GraalVM (Niephaus et al., 2019), provides an interface that enables the comparison of modeling systems without being limited to one programming language or environment. Our implementation used SoS Notebook as its backbone. SoS Notebook is a native Python- and JavaScript-based Jupyter Notebook (Kluyver et al., 2016), which extends to other languages through so-called “Jupyter kernels”. We developed our implementation with the focus on four languages and their respective kernels: Python, R, Octave, and MATLAB. This selection allowed us to use the most common modeling systems.
According to Lacourse and Gajewski (2020), the most commonly used modeling systems are Bacon (Blaauw and Christen, 2011), Bchron (Haslett and Parnell, 2008; Parnell et al., 2008), OxCal (Bronk Ramsey, 1995; Bronk Ramsey and Lee, 2013), and clam (Blaauw, 2010). We additionally considered the MATLAB/Octave software Undatable (Lougheed and Obrochta, 2019), as an alternative to the classical Bayesian approach, and the R package hamstr (Dolman, 2022).
In our study, we were able to connect five of the above-mentioned modeling systems in SoS Notebook, namely Bacon, Bchron, clam, hamstr, and Undatable. All modeling systems assume a monotonic deposition process, i.e. a positive accumulation rate over the entire core length (Trachsel and Telford, 2017; Lougheed and Obrochta, 2019). The modeling system clam uses five different regression-based techniques in combination with a Monte Carlo procedure to repeatedly interpolate between calibrated dates. Because clam tries to fit the regression curves to the data, in some cases this can lead to age inversions, which clam automatically filters out (cf. Trachsel and Telford, 2017; Blaauw, 2010).
The modeling procedure of Undatable involves weighted random sampling from both the calibrated age and depth uncertainties (expressed as probability density functions) for all dating points and an advanced bootstrapping process over a user-defined number of simulations. The advanced bootstrapping procedure includes removing age inversions from the simulation runs as well as inserting connection points between calibrated dates to account for uncertainties in sediment accumulation rates between the dating points (cf. Lougheed and Obrochta, 2019).
The Bayesian modeling systems Bacon, Bchron, and hamstr subdivide the sediment core into smaller increments for the modeling process but differ in their division technique. Bacon separates the core into equal segments, while hamstr extends Bacon's algorithm by adding additional hierarchical accumulation structures to each segment (Trachsel and Telford, 2017; Dolman, 2022; Blaauw and Christen, 2011). Bchron estimates the number of increments between calibrated dates by a compound Poisson-gamma distribution (Trachsel and Telford, 2017; Parnell et al., 2011). For age–depth calculations, Bacon uses prior distributions for the accumulation rate (gamma distribution) and autocorrelation memory (beta distribution) between segments, which users can fit with values for the mean and shape of these distributions (Blaauw and Christen, 2011). Similarly, hamstr relies on user input for the shape of the gamma distribution and values for the memory but estimates the mean value for the accumulation rate from the available age determination data by using a robust linear regression (Dolman, 2022). Bchron does not require any specific hyperparameter selection due to its fully automated numerical best-fit approach (Wright et al., 2017; Haslett and Parnell, 2008). All three Bayesian modeling systems use iterations of the Markov chain Monte Carlo (MCMC) algorithm to estimate the calibrated ages and confidence intervals at each depth within the sediment core (Dolman, 2022; Blaauw and Christen, 2011; Haslett and Parnell, 2008).
The workflow of LANDO consists of five major components: input – preparation – execution – result aggregation – evaluation of model performance.
To work with LANDO, users need to provide age determination data, e.g., data from radiocarbon or OSL dating, and associated metadata as listed in Table 1. We developed two import options for the users: through a single spreadsheet or a connection to a database. For this study, we used a connection to a PostgreSQL database, which we developed following the conceptual framework described in Pfalz et al. (2021), via the Python package SQLAlchemy (Bayer, 2012). We divided age determination input data into two attribute categories: necessary and recommended. The category “necessary” focused on the prerequisites of the individual modeling systems as well as project-related attributes, such as unique identifiers, i.e., “measurementid” or “labid”. However, a larger, comprehensive set of descriptive metadata helps users better understand the data (Cadena-Vela et al., 2020; Thanos, 2017). We added four additional attributes from the category “recommended” to facilitate the interpretation of age–depth models regarding their age determination data.
Necessary and recommended attributes for age determination input data when used with LANDO. Attributes apply for both input methods through either a database or a spreadsheet.
If users decide to use a spreadsheet as an input option, then the spreadsheet should follow the same attribution as the database. In addition, we implemented an input prompt for further information, such as the year of core drilling and core length, to ensure comparability to our database implementation. We provide an example spreadsheet with all attributes in the expected format in the repository mentioned in the “Code and data availability” section of this paper.
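As a minimal sketch of this input step, the snippet below checks a spreadsheet-style CSV for a set of necessary attributes before import. The column names are illustrative stand-ins; the authoritative attribute set is the one listed in Table 1 and in the example spreadsheet.

```python
import csv
from io import StringIO

# Hypothetical subset of the "necessary" attributes; the authoritative
# list is given in Table 1 and in the example spreadsheet.
NECESSARY = {"measurementid", "labid", "coreid", "depth", "age", "error"}

def validate_rows(csv_text):
    """Return the spreadsheet rows, or raise if a necessary column is absent."""
    reader = csv.DictReader(StringIO(csv_text))
    missing = NECESSARY - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing necessary attributes: {sorted(missing)}")
    return list(reader)

rows = validate_rows(
    "measurementid,labid,coreid,depth,age,error\n"
    "1,Poz-12345,EN18218,10.5,1250,30\n"
)
print(rows[0]["labid"])  # Poz-12345
```

Rejecting a file early with a list of the missing columns mirrors the role of the “necessary” category: without these attributes, the downstream modeling systems cannot be run at all.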
The preparation component consisted of two separate steps. First, we checked
each age determination dataset to find out whether a reservoir effect was influencing
the radiocarbon data. In the absence of a known reservoir age or recent
surface sample, we used available radiocarbon data points and a
fast-calculating modeling system to predict the age of the uppermost layer
within a sediment core. In our approach, we used the hamstr package with a default
value of 6000 iterations. We then compared the predicted value for the uppermost layer with the year of the core retrieval, i.e., our target age. We accounted for uncertainty in the estimate by allowing an extra 10 % error between predicted age and target age. If a gap between predicted and target age remained, we assumed a reservoir effect was present. We approximated the reservoir effect by subtracting the target age from the mean predicted age and based the associated error on the 2σ range of the predicted ages.
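The reservoir check described above can be condensed as in the sketch below. This is our own reading of the workflow, not LANDO's actual code; the function name and the choice of basing the 10 % tolerance on the predicted mean are assumptions.

```python
def estimate_reservoir_effect(predicted_mean, predicted_2sigma, target_age,
                              tolerance=0.10):
    """Compare the modeled age of the uppermost layer with the target age
    (year of core retrieval, in cal yr BP) and, if the gap exceeds the
    tolerance, return the reservoir offset and its error.

    Basing the 10 % tolerance on the predicted mean is an assumption.
    """
    allowed = abs(predicted_mean) * tolerance
    if abs(predicted_mean - target_age) <= allowed:
        return 0.0, 0.0                      # no reservoir effect assumed
    offset = predicted_mean - target_age     # target subtracted from mean
    return offset, predicted_2sigma          # error from the 2-sigma range

# A core retrieved in 2018 CE has a target age of -68 cal yr BP (BP = 1950).
offset, err = estimate_reservoir_effect(900.0, 150.0, -68.0)
print(offset, err)  # 968.0 150.0
```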
As the second step in the preparation component, we built a module that automatically changes the format of the available data to the individually desired input of each of the five modeling systems implemented in LANDO. We primarily used the Python package pandas (Reback et al., 2020) for the transformation within the module. We transferred the newly transformed age determination data to the corresponding programming language for age–depth modeling using the built-in %get function of SoS Notebook.
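The transformation module can be illustrated with a toy dispatch from one harmonized record to per-system layouts. Bacon's core files do use a labID/age/error/depth column order, while the Undatable-style tuple below is only a hypothetical stand-in for its real input format.

```python
# One harmonized record after import; field names follow the input table.
record = {"labid": "Poz-12345", "age": 1250, "error": 30, "depth": 10.5}

def to_bacon(rec):
    # Bacon core files expect a labID, age, error, depth column order.
    return [rec["labid"], rec["age"], rec["error"], rec["depth"]]

def to_undatable(rec):
    # Hypothetical stand-in for Undatable's input, not its real format.
    return (rec["depth"], rec["age"], rec["error"])

TRANSFORMERS = {"Bacon": to_bacon, "Undatable": to_undatable}
print(TRANSFORMERS["Bacon"](record))  # ['Poz-12345', 1250, 30, 10.5]
```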
We developed LANDO with the specific ability of creating multiple age–depth models for multiple dating series from spatially distributed lake systems. Hence, reducing overall computing time was one of our highest priorities. We achieved this reduction by applying existing parallelization back ends for both R and Python, such as doParallel (Microsoft Corporation and Weston, 2020a) and Dask (Dask Development Team, 2016), respectively. For each modeling system in R, we wrote a separate script that takes advantage of the parallelization back end doParallel. Besides the individual modeling system packages, we made use of different R libraries, such as tidyverse (Wickham et al., 2019), parallel (R Core Team, 2021), foreach (Microsoft Corporation and Weston, 2020c), doRNG (Gaujoux, 2020), and doSNOW (Microsoft Corporation and Weston, 2020b). We neglected the use of parallelization for the Undatable software in MATLAB, since even the sequential execution for several sediment cores in our test setup was on the order of a few minutes. However, we achieved comparable results with Undatable in Octave using the parallelization package parallel (Fujiwara et al., 2021).
As mentioned before, the selection of model priors and parameters has an impact on the modeling outcome. This is challenging if no objective prior knowledge exists. To lower our impact and to avoid introducing biases in the modeling process, we used the default values from each modeling system as our own default values (Blaauw et al., 2021; Blaauw, 2021; Parnell et al., 2008; Dolman, 2022; Lougheed and Obrochta, 2019). In our adaptation of clam, the parameter “poly_degree” controls the polynomial degree of models for type 2, while the parameter “smoothing” controls the degree of smoothing for types 4 and 5. In the original version of clam, users adjust both parameters with the single option “smooth” (Blaauw, 2021). Furthermore, the default value for ssize within the original version of Bacon is 2000. We increased this value to 8000 to ensure good MCMC mixing for problematic cores, as recommended by Blaauw et al. (2021). For users who have in-depth knowledge about their sediment core and want to change certain values, we made crucial parameters accessible within SoS Notebook outside of the executing scripts. Table 2 provides an overview of all values which users can access and change for the individual systems. However, we limited access to some parameters for operational purposes, such as the number of iterations or the resolution of the output.
Default values for each modeling system, which users can access and change within LANDO.
After every model run, we received 10 000 age estimates (also known as “iterations” or “realizations”) per centimeter from each modeling system for every sediment core. We transferred these results back to Python using the built-in %put function of SoS Notebook, where, in the next module, we calculated the median and mean age values per centimeter as well as the 1σ and 2σ confidence intervals.
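For a single depth, reducing the realizations to these summary statistics can be sketched as below. Taking the 1σ and 2σ ranges as empirical quantiles is our assumption about a reasonable implementation, not necessarily LANDO's exact one.

```python
import statistics

def summarize_realizations(ages):
    """Reduce the age realizations at one depth to mean, median, and
    1-sigma/2-sigma ranges, the latter taken as empirical quantiles."""
    qs = statistics.quantiles(ages, n=200)  # 199 cut points in 0.5 % steps
    return {
        "mean": statistics.fmean(ages),
        "median": statistics.median(ages),
        "1sigma": (qs[31], qs[167]),  # ~16 % and ~84 % quantiles
        "2sigma": (qs[3], qs[195]),   # ~2 % and ~98 % quantiles
    }

summary = summarize_realizations(list(range(10001)))
print(summary["median"])  # 5000
```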
Approaches to calculating sedimentation rates within LANDO. The depth value represents the layer of interest within a sediment core for which the calculation is necessary.
Similarly, we used the 10 000 age estimates per centimeter to calculate the sedimentation rates. We used three different approaches: “naïve”, “moving average over three depths”, and “moving average over five depths”. Table 3 lists the corresponding equations for each approach. The user can decide which of the three approaches best applies to the individual sediment record. We summarized the output into basic summary statistics (mean, median, and 1σ and 2σ intervals).
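A sketch of the “naïve” and moving-average approaches, under the assumption that the naïve rate is one centimeter divided by the age difference between neighboring layers and that the moving averages smooth these naïve rates; the authoritative equations are those of Table 3.

```python
def naive_rate(ages_per_cm, z):
    """Rate at layer z: one centimeter over the age step to the layer above."""
    dt = ages_per_cm[z] - ages_per_cm[z - 1]
    return 1.0 / dt if dt > 0 else float("nan")

def moving_average_rate(ages_per_cm, z, window=3):
    """Average of the naive rates over `window` depths (3 or 5) around z."""
    half = window // 2
    idx = [i for i in range(z - half, z + half + 1)
           if 1 <= i < len(ages_per_cm)]
    rates = [naive_rate(ages_per_cm, i) for i in idx]
    return sum(rates) / len(rates)

ages = [0, 10, 30, 40, 60, 100]  # cal yr BP at 0, 1, 2, ... cm depth
print(naive_rate(ages, 2))       # 0.05
```

The window averaging damps single-step jumps in the naïve rates, which is why the choice between the three approaches depends on how noisy the individual sediment record is.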
To evaluate the performance of each modeling system, we looked at three
different case studies:
Case Study no. 1 – Comparison of multiple modeling systems for one sediment core with a continuously deposited sequence of dating points (“Continuously deposited sequence” – CS1)

Case Study no. 2 – Comparison of multiple modeling systems for one sediment core with a disturbed sequence (including inversions) of dating points (“Inconsistent sequence” – CS2)

Case Study no. 3 – Comparison of sedimentation rate changes for multiple sediment cores (“Multiple cores” – CS3).
We examined both sedimentation rate and age–depth modeling results in each
of the three case studies. For the first case study, we selected the
sediment core EN18218 (Vyse et al., 2021) to showcase the generated output of LANDO. The 6.53 m long sediment record was obtained from Lake Rauchuvagytgyn, Chukotka (67.78938° N; see Vyse et al., 2021, for full site details).
As a counterexample, for the second case study we chose the sediment core EN18208 (Vyse et al., 2020a). During the same expedition to Russia's Far East in 2018, scientists recovered this core from Lake Ilirney, Chukotka (67.34030° N; see Vyse et al., 2020a, for full site details).
Map of the geographical distribution of lake sediment cores used for our study (triangles).
List of all datasets used in this study. The main data source or repository is either the PANGAEA database, the PaleoLake database, or tables within the main body or supplementary material of publications. “Data accessible” links to the main data source. “Paper reference” includes a citation to the latest version of the corresponding dataset.
Both cores are also part of the “Multiple cores” case study with a total of 55 sediment cores (Fig. 1). More details on each sediment core are accessible in the corresponding references, which we list in Table 4.
To introduce the ensemble model in LANDO, we combined the outputs from all five modeling systems into one composite model. At each depth, we considered the outermost limits (min and max values) of all 1σ and 2σ confidence intervals across the systems as the bounds of the ensemble model.
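At a single depth, the envelope combination reduces to taking the outermost limits across the per-model confidence intervals; a minimal sketch (function and input names are illustrative):

```python
def ensemble_envelope(intervals_per_model):
    """Ensemble bounds at one depth: outermost limits (min and max) of the
    per-model confidence intervals, given as {model: (lower, upper)}."""
    lowers = [lo for lo, hi in intervals_per_model.values()]
    uppers = [hi for lo, hi in intervals_per_model.values()]
    return min(lowers), max(uppers)

cis = {"Bacon": (950, 1150), "Bchron": (900, 1200), "clam": (980, 1100)}
print(ensemble_envelope(cis))  # (900, 1200)
```

By construction, the envelope never excludes any age that a single system considers plausible, which is what makes the ensemble a conservative, data-driven combination of the uninformed models.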
For cases in which age–depth models do not agree with each other, e.g., “Inconsistent sequence” case study (CS2), we have built in the option of importing data from measured sediment properties, also known as proxies. Because of compositional and density variations in deposits, changes in sedimentation rates imply changes in the deposition of proxies (Baud et al., 2021; Biskaborn et al., 2021; Vyse et al., 2021). By including appropriate, independent proxy data on lithological changes within the sediment core, we can weight each model based on its performance to represent these variations in sedimentation rate. Users should provide the independent sediment proxy data as a file with two columns, namely “compositedepth”, which should be the measurement depths (as mid-point centimeter below sediment surface), and “value”, representing the values of the proxy. This simplification makes it possible to import different available proxies or statistical representations of proxy data, i.e., results from ordination techniques (PCA, MDS, etc.), into the optimization process and to visualize the behavior of the age–depth models in comparison to these proxies.
In order to evaluate the performance, we adapted the fuzzy change point approach by Hollaway et al. (2021) to work with our input data and desired outcome on a depth-dependent scale instead of a time series. Similar to Hollaway et al. (2021), our approach first detected change points within the proxy data and each modeling system output by fitting an ARIMA (autoregressive integrated moving average) model to the data and then extracted change points by applying the changepoint R package (Killick and Eckley, 2014; Killick et al., 2016) to the residuals of the ARIMA model. If we found no change points in the proxy data via this approach, we applied the changepoint R package to the raw independent sediment proxy data instead. Through the additional bootstrapping process introduced by Hollaway et al. (2021), we were able to set up confidence intervals for the extracted change points. Subsequently, we searched for the intersection between the change points, plus their confidence intervals, of each age–depth model and the independent proxy data. After converting the change points for both age–depth model and independent proxy data into triangular fuzzy numbers, we obtained similarity scores using the Jaccard similarity score of the fuzzy number pairs as described in Hollaway et al. (2021). The similarity score ranges between 0 (no match) and 1 (perfect match). However, the threshold for excluding an age–depth model from the generated combined model depends on the imported proxy data and the number of detected change points. Therefore, users can set the threshold according to their proxy within LANDO, but we set the default value for this threshold to 0.1, which corresponds to an overlap of 10 % of the change points between model and proxy data.
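The similarity scoring can be sketched with triangular fuzzy numbers built from a change point and its confidence bounds. The discretized integral ratio below is a generic Jaccard similarity for fuzzy sets, assumed here to approximate the score used by Hollaway et al. (2021).

```python
def tri_membership(x, a, b, c):
    """Membership of x in the triangular fuzzy number (a, b, c)."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def fuzzy_jaccard(tri1, tri2, steps=2000):
    """Jaccard similarity of two triangular fuzzy numbers, approximated as
    the ratio of the discretized integrals of min and max memberships."""
    lo = min(tri1[0], tri2[0])
    hi = max(tri1[2], tri2[2])
    num = den = 0.0
    for i in range(steps + 1):
        x = lo + (hi - lo) * i / steps
        m1, m2 = tri_membership(x, *tri1), tri_membership(x, *tri2)
        num += min(m1, m2)
        den += max(m1, m2)
    return num / den if den else 0.0

# Change points given as (lower bound, change point, upper bound) in cm:
print(fuzzy_jaccard((182.0, 189.5, 192.5), (182.0, 189.5, 192.5)))  # 1.0
```

Identical change points score 1, disjoint ones 0, and partially overlapping uncertainty ranges fall in between, which is what the exclusion threshold of 0.1 operates on.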
In addition to the criterion of preparing the proxy data in the format of depth vs. value in a separate file, we suggest using a proxy with a high resolution, which we define as more than 50 measurements per meter of core length. For our “Inconsistent sequence” case study (CS2), we used high-resolution elemental proxy data from XRF (X-ray fluorescence) measurements as our independent proxy data. As our evaluation element to optimize the age–depth models, we selected zirconium (Zr), which is an indicator of minerogenic/detrital input (Vyse et al., 2020a, and references therein). The Zr proxy data of EN18208 have a resolution of 200 measurements per meter of core length.
To achieve a realistic comparison between sediment cores in the “Multiple cores” case study
(CS3), we looked at the individual age–depth model outputs for each sediment
core to determine whether an optimization step was required. We only selected sediment cores with a published age–depth model (n = 33) for the standard output comparison.
To display the results from age–depth modeling and sedimentation rate
calculation, we decided to create our own plots, instead of reusing the
plots from each individual modeling system. Our plot header contains the
unique CoreID; additionally, the header indicates whether the user decided
to apply a reservoir correction to the radiocarbon data. Our single-core plots consist of two main panels: the left-hand panel shows the results from the age–depth modeling process, with the calibrated ages (in calibrated years BP) on the x axis and the depth (in cm) on the y axis, while the right-hand panel shows the corresponding sedimentation rates.
For all models, LANDO will display the median values for age and
sedimentation rate as solid lines. Both panels further display the
corresponding 1σ and 2σ confidence intervals.
If users decide to filter out unreasonable age–depth models, as in the “Inconsistent sequence” case study (CS2), LANDO offers the option to plot the independent proxy data and the lithology derived from it as an additional panel on the left-hand side for better interpretability. Further, LANDO highlights the boundaries of lithological change and their confidence intervals in both the sedimentation rate and age–depth model plots. The optimized plot includes, at the bottom, a goodness-of-fit score for each involved modeling system's representation of the change points.
When using LANDO for multiple sediment cores, the overall plot holds, for each sediment core, the results from the binned weighted average sedimentation rate calculation (as median sedimentation rate in cm yr⁻¹).
For people with color vision deficiency, we incorporated an extra option to plot the resulting age–depth plots with different line styles and textures to support the visual differentiation between the models. Figure S4 in the Supplement shows the color-blind-friendly output created by LANDO. With LANDO we want to support inclusivity in science, and we look forward to feedback from the community on how we can improve LANDO in this regard.
To identify similar temporal shifts in sedimentation regimes in our case
study “Multiple cores” (CS3), we examined our data collection of 55 sediment cores regarding a general tendency in sedimentation rate shifts. First, we considered the 11 700 years BP boundary as our marker for the change between Holocene and Late Pleistocene to separate the datasets (Rasmussen
et al., 2006; Lowe and Walker, 2014; Walker et al., 2008). We selected this
marker because numerous studies suggest a general difference in
sedimentation regimes between these periods (e.g.,
Baumer et al., 2021; Bjune et al., 2021; Kublitskiy et al., 2020; Müller
et al., 2009; Wolfe, 1996; Vyse et al., 2021). As only some of the models extended below the 11 700 years BP marker, the calculation of the mean sedimentation rate for the Late Pleistocene featured only a subset of sediment cores (total number of sediment cores with measurements in the Late Pleistocene: 20). Then, for each age model of the sediment cores in the subset, we used the 2σ ranges of the age estimates.
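The boundary-based split can be sketched as follows; pairing each binned sedimentation rate with its age and averaging per period is our simplified reading of the procedure.

```python
HOLOCENE_BOUNDARY = 11700  # cal yr BP (Rasmussen et al., 2006)

def _mean(values):
    return sum(values) / len(values) if values else None

def split_and_average(ages, rates, boundary=HOLOCENE_BOUNDARY):
    """Mean sedimentation rate per period; the Late Pleistocene mean is
    None for cores without values beyond the boundary (cf. the subset
    of 20 cores above)."""
    holocene = [r for a, r in zip(ages, rates) if a <= boundary]
    pleistocene = [r for a, r in zip(ages, rates) if a > boundary]
    return _mean(holocene), _mean(pleistocene)

ages = [500, 5000, 10000, 12500, 14000]   # bin midpoints, cal yr BP
rates = [0.05, 0.04, 0.03, 0.01, 0.02]    # cm/yr per bin
holo, pleis = split_and_average(ages, rates)
print(round(holo, 3), round(pleis, 3))    # 0.04 0.015
```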
All five modeling systems were able to produce an age–depth relationship for sediment core EN18218 (Lake Rauchuvagytgyn), with only small divergences between some of the calibrated ages. Figure 2 depicts the two visual outputs produced by LANDO. Figure 2a displays all models side by side, while Fig. 2b shows the combined output from all models.
Generated output from LANDO for sediment core EN18218 (Lake Rauchuvagytgyn).
All models revealed the highest sedimentation rates for the interval between 108 and 133 cm, with mean values ranging upward from 0.242 cm yr⁻¹.
In our ensemble model, we found the highest value for the weighted average sedimentation rate at 128 cm, with 0.4483 cm yr⁻¹.
For the second case study, we considered an example where the underlying age determination data within the core strongly contradict each other (see Fig. 3). Before modeling such an age–depth relationship with conflicting data, users need to investigate and try to understand the reasons for any outliers. Fitting any age–depth model, including the LANDO ensemble, to such divergent data should be done with extreme caution, and we do not recommend doing so without further deliberate investigation. Here we primarily aim to illustrate the range of age–depth models obtained within the ensemble as well as the results of the optimization with our proxy-based lithology.
Generated output from LANDO for sediment core EN18208 (OSL and radiocarbon ages).
Optimized visual output for EN18208 (OSL and radiocarbon ages).
During the standard modeling procedure with LANDO, four out of five modeling systems produced an output for sediment core EN18208 (Lake Ilirney). The modeling system clam was unable to produce an age–depth model for this core. Figure 3 shows the visual outputs with all models in panel (a) and the combined model in panel (b). Figure 4 consists of three panels showing the results from the proxy-based optimization process using zirconium (Zr). Figure 4a shows the visual output from the optimization process, while Fig. 4b and c illustrate the optimized age–depth model with the highest matching score and the resulting ensemble model, respectively.
While Undatable was the only modeling system that considered the dating point at 1066 cm before following the next dating point at 966 cm, the remaining three modeling systems assumed a steady accumulation (mean sedimentation rate: 0.0575 cm yr⁻¹).
Average sedimentation rate of EN18208 divided into proxy-derived lithological units. The calibrated mean model range indicates the mean age estimates of the ensemble model for the corresponding depths of the proxy-derived lithological unit (PLU).
During the optimization process, our adapted algorithm located four lithological boundaries with their uncertainty ranges from the independent proxy data: 189.5 cm (182–192.5 cm), 646 cm (638–657 cm), 890.5 cm (874–912 cm), and 1051.5 cm (1043–1061.5 cm). We found the highest matching score from the optimization for hamstr (Score: 0.0237). Table 5 shows the average sedimentation rate for each proxy-derived lithological unit (PLU) of the ensemble model of EN18208.
In contrast to the previous case studies, this case study focused on
understanding the development of sedimentation rates over time, with the
emphasis on the transition from the Holocene to the Pleistocene. We used age
determination data from 33 sediment cores with a published age–depth model
to show the standard output of LANDO for multiple sediment cores, while
using all datasets for the subsequent analyses. Figure 5 shows the ensemble
models with weighted average sedimentation rates binned into 1000-year bins
from our multi-core investigation with 33 published sediment cores (see
Fig. S1 for the individual models in the Supplement). We set the boundaries from 0 to 21 000 cal BP within these figures to cover the
time span from the present to the Last Glacial Maximum (LGM)
(Clark et al., 2009). Below the number for
each core in Fig. 5 are the proxies used for their optimization. In 17 out
of 55 cases within our entire collection, the ensemble model was based on
four out of five models, as neither clam nor Undatable was able to find a suitable age–depth model (for more details, please see Table S1 in the Supplement). The maximum time span covered by the sediment cores varied between 2000 years BP (CoreID: PG1972) and 320 000 years BP (CoreID: PG1351). The average non-optimized sedimentation rate ranged upward from 0.004 cm yr⁻¹.
Optimized combined models for 33 sediment cores with a published age–depth model, displayed as weighted average sedimentation rate (in centimeters per year, cm yr⁻¹).
Average sedimentation rate in centimeters per year (cm yr⁻¹) for the Holocene and the Late Pleistocene for each modeling system.
To visualize the difference in sedimentation rates between two neighboring
and fundamentally different environmental settings, i.e. Pleistocene glacial
and Holocene interglacial, we used the datasets that were split at the
Holocene–Pleistocene boundary at 11 700 years BP. Figure 6 shows the mean sedimentation rate for the Holocene and Late Pleistocene for each model with its 1σ uncertainty.
Boxplot representing the years with the largest absolute change in sedimentation rate for our data collection of 55 sediment cores. Sedimentation rate results from each model were binned into 100-year bins to allow comparisons between the modeling systems. The initial observation time span covers 8700 to 14 700 years BP. The orange line corresponds to the median value for each model.
Average sedimentation rate in centimeters per year (cm yr⁻¹) for the Holocene and Late Pleistocene datasets after individual separation at the largest change in sedimentation rate.
For our data compilation, we found the largest absolute change in
sedimentation rates within the modeling systems on average between 9600 and
11 900 years BP (Fig. 7). For our combined and optimized models, however, the largest change averaged between 10 500 and 10 700 years BP. Still, all
sediment cores covered the entire range of our initial time span from 8700
to 14 700 years BP within the models. Using the results of the largest change in sedimentation rate for each sediment core and model as new markers, we again split the datasets into two separate datasets. One dataset contained
mostly Holocene sedimentation rate values (Holocene dataset), while the
other contained mostly Late Pleistocene values (Late Pleistocene dataset).
Therefore, the initial display (Fig. 6) changed slightly to Fig. 8, with an increase in the total number of sediment cores in the Late Pleistocene dataset under the individual separation.
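The binning and change detection described above can be sketched in Python. This is a simplified stand-in for LANDO's actual routine; the function name, default arguments, and example data are illustrative only.

```python
def largest_change_year(ages_yr, rates, bin_width=100, lo=8700, hi=14700):
    """Bin sedimentation rates into fixed-width age bins and return the bin
    (in years BP) with the largest absolute change relative to the
    preceding bin."""
    bins = {}
    for age, rate in zip(ages_yr, rates):
        if lo <= age <= hi:
            b = int(age // bin_width) * bin_width
            bins.setdefault(b, []).append(rate)
    means = {b: sum(v) / len(v) for b, v in bins.items()}
    keys = sorted(means)
    changes = {k2: abs(means[k2] - means[k1]) for k1, k2 in zip(keys, keys[1:])}
    return max(changes, key=changes.get)

# illustrative input: a jump in sedimentation rate around 11 700 years BP
marker = largest_change_year([9000, 9100, 11700, 11800],
                             [0.01, 0.01, 0.05, 0.05])
```

Applied per sediment core and per model, such a marker year can then serve as the individual separation point between the Holocene and Late Pleistocene datasets.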
By comparing the two single-sediment-core cases, it becomes clear how age–depth relationships may diverge depending on the individual modeling system and its treatment of available dating points (cf. Wright et al., 2017; Trachsel and Telford, 2017; Lacourse and Gajewski, 2020). In the case of EN18218 (“Continuously deposited sequence” – CS1), all five implemented modeling systems yield an agreeing and continuous chronology. However, the two radiocarbon dates at 81.25 and 114.75 cm have a significant impact on the models' interpretation of these depths. Vyse et al. (2021) argued that these two dates are outliers resulting from reworking and mixing effects within the sediment column. According to the authors, no additional proxy data from EN18218 support the immediate increase in sedimentation rate at these depths, and, hence, they excluded both dates from the modeling process. Because we do not consider any additional proxy data to evaluate age–depth models in their geoscientific context but rather include all provided age determination data in the modeling process, the inclusion of these two radiocarbon dates leads to a higher sedimentation rate across all available models. Nonetheless, this example shows how the comprehensive application of the different modeling systems may help to identify doubtful dating points.
We saw a disagreement between the modeling systems in the case of sediment record EN18208 (“Inconsistent sequence” – CS2), which we expected prior to the execution of our application, due to the scattered dating points in the original data. Vyse et al. (2020a) linked this scatter of age data points observed in the interval between 282 and 755 cm of EN18208 to the redeposition of older carbon. They implied that to produce a reliable age–depth model they had to exclude both OSL and radiocarbon dating points for these depths. However, our optimized combined model agrees with their established age–depth model and can reproduce the characteristics of the existing model by Vyse et al. (2020a), without removing dating points. In addition, in three out of four cases, our proxy-derived lithology with its uncertainty matches the lithological boundaries set by the authors of the EN18208 study, according to criteria based on acoustic sub-bottom profiling. Only the first original boundary (196 cm) is outside our confidence interval from 182 to 192 cm. We still showed that our approach could set logical boundaries for sediment cores by solely relying on high-resolution proxy data.
Despite a strong similarity between our optimized model and the existing model developed by Vyse et al. (2020a), the highest-scoring model reached only a low similarity value (0.0237) on our similarity scale from 0 (no match) to 1 (perfect match). Although we chose the highest matching score to demonstrate LANDO's ability to filter out disagreeing models, we do not support the strategy of choosing a single age–depth model with such a low matching score. Rather, users should investigate the cause of the scatter in the age determination data and/or change the default values within LANDO. For example, to deal with the scatter in the data, users can increase the Undatable parameter “bootpc” to a higher value – as suggested by Lougheed and Obrochta (2019) – to account for a higher uncertainty in the given data. For palaeoenvironmental reconstruction, users should also propagate these increased uncertainties into their proxy interpretation, which is often underrepresented (Lacourse and Gajewski, 2020; McKay et al., 2021).
Even though LANDO can produce age–depth models for multiple sediment cores (“Multiple cores” – CS3), we must assume limitations in the geoscientific validity for some of the results. In a few cases, an optimization of age–depth models with independent proxy data would have been necessary, but such independent data were inaccessible or did not exist. Because the age–depth relationships between the implemented modeling systems disagree for these cases (see Fig. S1 in the Supplement), the results from our combined model might over- or underestimate the true sedimentation rate. On the other hand, optimization using proxy data can reduce these biases.
For instance, during the examination of the Holocene and the Pleistocene sedimentation rates (Fig. 6), we noticed that one sediment core (PG1228) had an extremely high mean sedimentation rate for the Holocene dataset in Undatable. Similar to the second case study (“Inconsistent sequence” – CS2), we found scattered age data points for this sediment core, which influenced the modeling process of Undatable. Further, the result then affected our combined model by increasing the overall sedimentation rate for the Holocene in this core. However, LANDO identified the Undatable model as an outlier based on the lithology established through independent TOC proxy data. The optimized model then agreed well with the original publication by Andreev et al. (2003a), which further increased the validity of our approach. Our findings suggest that high-resolution proxy data should accompany geochronological studies to enable a more concise and realistic assessment of the development of sedimentation rates over time in high-latitude lake systems.
We further improved the validity of some results of our multi-core study by comparing our LANDO output with the available age–depth models from publications. In four cases (CoreID: 2008-3, Co1309, LS-9, PG1205), we adjusted our initial output to the previously published age–depth models (Rudaya et al., 2012; Gromig et al., 2019; Pisaric et al., 2001; Wagner et al., 2000a). One reason for the discrepancy was that the age determination data were not available for the entire length of sediment cores and LANDO extrapolated beyond these dating points to match the core length. In the case of PG1205 (Wagner et al., 2000a) with a core length of 9.85 m, dating points were available for the upper 2.5 m (Table 4), and therefore LANDO extrapolated the remaining 7.35 m to cover the entire sediment core. However, the extrapolated results in accumulation rates do not reflect the geological history of the lake record provided by Wagner et al. (2000a). We therefore reduced the modeled length of the sediment core to the last dating point to avoid strong extrapolation. In the case of Co1309 (Gromig et al., 2019), the age–depth model required the introduction of a hiatus that would span from 14 to 80 cal BP (Andreev et al., 2019; Savelieva et al., 2019). However, while a specific customization (such as a hiatus) is possible for single-core cases, this is not possible in the current version of LANDO for multi-core investigation. To overcome this, we reduced the length of the record used in our study for core Co1309 to the depth of the last available dating point (Table 4), such that the LANDO output matches the age–depth relationship reported by Gromig et al. (2019).
The detection of sedimentation rate change as an indicator of the
Holocene–Pleistocene boundary yielded contrasting results. While the results
from hamstr were closest to the 11 700-year boundary, all other modeling systems placed the largest change in sedimentation rate either before or after 11 700 years BP. We hypothesize that three factors may have influenced all model results: (1) the age uncertainty (1σ)
From the beginning of the development of LANDO, we decided to adopt most of the default settings for each modeling system (Table 2). Regional studies, such as the one performed by Goring et al. (2012), have shown that specific prior information for the Bayesian modeling systems is needed to best fit the models to lakes within a geographical area. Without this regional information, changing settings within a modeling system to an arbitrarily higher or lower value without considering the regional diversity could lead to under- or overfitting if the constraints are too loose or too strict (Trachsel and Telford, 2017). In the special case that users have in-depth knowledge of one or multiple lake systems, they can easily adapt these parameters within LANDO, as we have made these settings accessible in the Jupyter Notebook itself.
Part of the reason we made this decision was that we acquired external age determination datasets for which we do not necessarily have all the essential information to specify each model. But we also wanted to simplify the process for users who do not have in-depth modeling knowledge. By using the default values, we can compare models based on their ability to work with the available data. Moreover, we trust that the developers set their default values based on systematic testing. Since we did not tune the age–depth models to each core, i.e. by changing the parameters within each modeling system, we generated “uninformed” models that solely work with the available age determination data. By combining these uninformed models into one model, we created an ensemble model that we consider to be data-driven and “semi-informed”.
The advantage of this data-driven, semi-informed model approach is that we are reducing the risk of overfitting by considering the uncertainty of all modeling systems. This allows us to reevaluate existing geoscientific interpretations with larger uncertainty by taking advantage of the ensemble outcome. Additionally, we found that the more information is accessible to generate age–depth models, the more accurate and less uncertain these models become. A higher density of age determination along the depth of the sediment core is desirable for future drilling campaigns (cf. Blaauw et al., 2018).
The disadvantage arises in our second case study (“Inconsistent sequence” – CS2) and the multi-core investigation (“Multiple cores” – CS3). For both cases we needed the optimization step to narrow down the most suitable age–depth models for each sediment core, since the unoptimized uncertainty band was otherwise too wide for a clear interpretation. The optimization requires additional and independent proxy data, which are not available for some of our cores, especially for sediment cores obtained some decades ago. Our optimizing step is therefore mainly suitable for recently retrieved and analyzed sediment cores.
In addition to the assessment of age-modeling quality, we also assessed the time and effort required to run the dating routines. We saw that Bacon had the highest runtime in all three case studies, which we link to our adjustment of the ssize parameter from 2000 (the default) to 8000 within the application. We increased this value to ensure good MCMC mixing for problematic cores, as suggested by Blaauw et al. (2021), as well as to guarantee enough iterations for our summarizing statistics to compare with other modeling systems. If users decide to reduce the value of ssize, we implemented an iterative process that checks whether Bacon produced enough iterations. If this is not the case, then LANDO will iteratively rerun the same sediment core with a higher ssize until it produces 10 000 iterations.
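The iterative rerun logic can be sketched as follows; `run_model` is a hypothetical callable standing in for a Bacon invocation, and the doubling increment is an assumption (LANDO's actual increment strategy may differ).

```python
MIN_ITERATIONS = 10_000

def run_until_enough(run_model, core_id, ssize=2000):
    """Re-run a modeling call with an increasing ssize until it yields
    enough iterations for the summarizing statistics.

    run_model(core_id, ssize) is assumed to return the number of MCMC
    iterations the run kept after burn-in and thinning."""
    while True:
        n_iter = run_model(core_id, ssize)
        if n_iter >= MIN_ITERATIONS:
            return ssize, n_iter
        ssize *= 2  # assumption: double and retry
```

For example, with a stand-in model that returns three iterations per ssize unit, `run_until_enough(lambda cid, s: s * 3, "EN18218")` would first fall short at ssize 2000 and succeed at ssize 4000.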
One unique feature of our application is the predominant use of parallelization within the age–depth modeling of multiple sediment cores. For instance, we used the Dask back end for our sedimentation rate calculation. The advantage over the popular Scala-based Apache Spark and its Python interface PySpark (Zaharia et al., 2016) is that the Dask back end is Python-based and well integrated into the Python ecosystem (cf. Dask Development Team, 2016). Therefore, Dask natively works with Python packages already implemented in LANDO. The key difference is that Dask neither provides a query optimizer nor relies on Map–Shuffle–Reduce, a data-processing technique for distributed computing, but instead uses generic task scheduling (cf. Dask Development Team, 2016). Still, parallelization libraries and back ends provide LANDO with additional speed-up that can promote future multi-core studies.
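The per-core parallelization pattern can be illustrated with Python's standard library standing in for the Dask back end; the helper function and its example inputs (depths and ages per core) are illustrative.

```python
from concurrent.futures import ThreadPoolExecutor

def sedimentation_rates(depths_cm, ages_yr):
    """Sedimentation rate (cm yr^-1) between consecutive dated depths."""
    pairs = list(zip(depths_cm, ages_yr))
    return [
        (d2 - d1) / (a2 - a1)
        for (d1, a1), (d2, a2) in zip(pairs, pairs[1:])
        if a2 != a1  # skip zero time spans
    ]

# illustrative dating points per core (depth in cm, age in yr BP)
cores = {
    "PG1972": ([0, 50, 100], [0, 1000, 2000]),
    "PG1351": ([0, 100], [0, 50000]),
}

# schedule one task per sediment core, as a generic task scheduler would
with ThreadPoolExecutor() as pool:
    futures = {cid: pool.submit(sedimentation_rates, d, a)
               for cid, (d, a) in cores.items()}
    rates = {cid: f.result() for cid, f in futures.items()}
```

Dask's generic task scheduling distributes such independent per-core computations in essentially this way, while additionally handling larger-than-memory collections.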
Within the ensemble model, we faced the challenge that the combination of all age distributions from the underlying age–depth models per centimeter represents a multi-modal distribution, especially in cases such as the “Inconsistent sequence” case study (CS2). It also means that the output of the ensemble model in these cases is susceptible to the inclusion/exclusion of any model. However, we consider using the weighted average median age to be a suitable solution for the multi-modal distribution problem, as it is a good indicator of the most probable age within each centimeter based on all modeling systems. But we advise users to use the age confidence intervals per centimeter in subsequent analyses, instead of relying solely on the weighted average median age (cf. Telford et al., 2004). By optimizing the ensemble model with the ability to include independent proxy data, users can increase the likelihood of a more probable mean age for their sediment core.
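A minimal sketch of the weighted average median age and a pooled confidence interval at a single depth; the per-model age samples and weights are illustrative numbers, not LANDO's actual weighting scheme.

```python
import statistics

# hypothetical per-model age samples (cal yr BP) at one depth increment
model_ages = {
    "Bacon":  [10150, 10200, 10250],
    "Bchron": [10400, 10450, 10500],
    "hamstr": [10180, 10220, 10260],
}
weights = {"Bacon": 1.0, "Bchron": 0.5, "hamstr": 1.0}  # illustrative

# weighted average of the per-model median ages
medians = {m: statistics.median(s) for m, s in model_ages.items()}
weighted_avg_median = (sum(weights[m] * medians[m] for m in medians)
                       / sum(weights[m] for m in medians))

# confidence interval taken from the pooled multi-modal distribution
pooled = sorted(a for s in model_ages.values() for a in s)
ci = (pooled[int(0.025 * (len(pooled) - 1))],
      pooled[int(0.975 * (len(pooled) - 1))])
```

The pooled interval deliberately stays wide when the models disagree, which is why reporting it alongside the weighted average median age is advisable.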
In the further course of development, we decided to limit the resolution of the age–depth relationships. Using a resolution of one-centimeter increments allows us to match most proxy measurements from each sediment core with our age–depth models, apart from high-resolution measurements such as XRF scans. To allow matching with high-resolution proxy data, we tested a higher resolution of 0.25 cm for our application. In the single-sediment-core cases (CS1 and CS2), this change did not affect the workflow of LANDO. By contrast, the “Multiple cores” case (CS3) ran into memory issues. Since SoS Notebook and our parallel back ends store the resulting data frames in memory, expanding the resulting data frames to a 0.25 cm resolution causes a 4-fold increase in memory use, which limits our capability to run our application on a single laptop. As an intermediate solution, we stored the results from each parallelization worker on disk to free the memory and performed the combining operations later. Based on this experience, we recommend working with data centers or increasing the available main memory (RAM) of the operating computer for multi-core studies with expected high-resolution output.
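The disk-spilling workaround can be illustrated with the standard library; the helper name, CSV layout, and example rows are hypothetical.

```python
import csv
import os
import tempfile

def spill_to_disk(core_id, rows, outdir):
    """Write one worker's result to disk to free main memory; the partial
    files are combined later in a streaming pass."""
    path = os.path.join(outdir, f"{core_id}.csv")
    with open(path, "w", newline="") as f:
        csv.writer(f).writerows(rows)
    return path

outdir = tempfile.mkdtemp()
# illustrative 0.25 cm resolution output: (depth in cm, age in yr BP)
results = {"PG1972": [[0.25, 12.5], [0.50, 25.0]]}
paths = [spill_to_disk(cid, rows, outdir) for cid, rows in results.items()]

# later: stream each file back one at a time instead of holding all in RAM
combined = []
for p in paths:
    with open(p, newline="") as f:
        combined.extend([float(x) for x in row] for row in csv.reader(f))
```

Only one partial file is resident in memory at a time during the combining pass, which is what keeps the peak memory footprint bounded.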
Another advantage of parallelization is that most modeling systems only run on one CPU/thread. Nowadays, however, both personal computers and data centers are made up of multiple CPUs/threads. Especially for larger multi-site studies, our application has the advantage of cutting the overall computing time by running each modeling system on multiple CPUs/threads simultaneously, even on personal computers. In comparison to serial execution of multiple models on one CPU/thread, which would take several hours, our parallel execution reduced the computing time per modeling system by a factor of up to 4. Considering that our setup consisted of 6 CPUs (12 threads) and 16 GB RAM, users can increase this factor even further by using larger computing facilities.
Sediment core length is the most limiting factor that determines the overall computing time in our application. However, we want to ensure that users can model each sediment core over its entire length to match proxy data with the correct age–depth relationships. Within LANDO, we addressed this problem by using extrapolation to calculate ages beyond the available dating points. The exception here is the modeling system Undatable, which models only between the first and last dating point, as these two dating points act as anchors for the bootstrapping process (Lougheed and Obrochta, 2019). As a result, we saw the sedimentation rate drop to zero twice at the end of the sedimentation rate calculations. We link this behavior to the end of the individual modeling processes of Undatable as well as of the other implemented systems.
Extrapolating the age–depth models beyond age determination points always bears the risk that the extrapolated dates do not reflect the actual age. The implemented modeling systems account for this circumstance by increasing the uncertainty for these undated regions (Blaauw, 2010). While we are aware of this potential issue, we wanted to allow users to take advantage of the full age–depth coverage for their sediment core. Blaauw et al. (2018) pointed out in their findings that “most existing late-Quaternary studies contain fewer than one date per millennium” and recommended increasing the number of dating points to “a minimum of 2 dates per millennium”. This recommendation would further decrease the need for extrapolation and reduce the overall uncertainty of age–depth models. We agree that more age control can improve the age–depth modeling results, but until the associated costs of analyzing organic material for radiocarbon dating decrease significantly (Hajdas et al., 2021; Zander et al., 2020), we recommend LANDO as a tool to improve age–depth modeling.
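A toy illustration of why extrapolated uncertainty should grow with distance from the last dating point; this is a deliberate simplification with an assumed growth factor, not the method any of the implemented modeling systems actually uses.

```python
def extrapolate_age(depths_cm, ages_yr, target_depth, unc_growth=0.5):
    """Linearly extend the age-depth trend of the two deepest dating points
    beyond the dated interval, inflating the uncertainty with distance."""
    # fit through the two deepest dating points (a strong simplification;
    # real systems model the whole dated sequence)
    (d1, a1), (d2, a2) = sorted(zip(depths_cm, ages_yr))[-2:]
    rate = (a2 - a1) / (d2 - d1)  # yr per cm
    age = a2 + rate * (target_depth - d2)
    # uncertainty grows with distance beyond the last dating point
    uncertainty = abs(target_depth - d2) * abs(rate) * unc_growth
    return age, uncertainty

# e.g. dating points at 0, 100, and 250 cm, extrapolated to 350 cm
age, unc = extrapolate_age([0, 100, 250], [0, 1000, 2500], 350)
```

Even in this toy form, a depth 100 cm below the last dating point already carries an uncertainty of several centuries, which mirrors why undated core sections should be interpreted cautiously.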
During the development of our approach, we realized that some programs were not executable or parallelizable under the current circumstances. For instance, we tested OxCal 4.4 as a stand-alone version on Windows with NodeJS (version 12.13.1.0) and with the R package oxcAAR (Martin et al., 2021) within our application. In the case of EN18208, execution took more than 3 h before the notebook lost connection to the OxCal interface. Furthermore, some cores never fully reached convergence within OxCal. We tried adapting our setups, including changing the internal constraints, i.e. the placement and number of boundaries, or using different deposition models, i.e. alternating between the sequential model (Sequence) and the Poisson-process deposition model (P_Sequence). According to Bronk Ramsey and Lee (2013), the long-term plan of OxCal is to make the entire source code openly accessible, which we fully support. An open source code would allow us to identify the current bottleneck so that we could implement OxCal in a future release.
To determine the best-fitting age–depth model through the clam modeling software, we added the “best-fit” option to LANDO by default. The best-fit option utilizes the negative log fit results from all clam outputs and identifies the fit with the lowest result as best fit. We included two further exclusion criteria for clam models within LANDO: if (a) there are too many age reversals within the models or (b) the fit reaches infinity. Under specific circumstances, some sediment cores will not have a fitting model, as is the case, for instance, in the “Inconsistent sequence” case study (CS2). Including models that do not fit the data would lead to erroneous estimations of the age–depth relationship. This comes at the cost of losing an established model in the combined model if no fitting clam model is available. However, we think that the benefit of having a better-fitting model outweighs this cost.
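The best-fit selection with its two exclusion criteria can be sketched as follows; the input data structure and the reversal threshold are assumptions for illustration, not clam's actual interface.

```python
import math

def best_clam_fit(fits, max_reversals=0):
    """Pick the clam model with the lowest negative log fit, excluding
    models with age reversals or an infinite fit.

    fits maps a model name to (neg_log_fit, n_reversals) -- a hypothetical
    structure standing in for the parsed clam output."""
    candidates = {
        name: nlf
        for name, (nlf, rev) in fits.items()
        if math.isfinite(nlf) and rev <= max_reversals  # exclusion criteria
    }
    if not candidates:
        return None  # no fitting clam model; clam drops out of the ensemble
    return min(candidates, key=candidates.get)

# illustrative values: an infinite fit and a reversal-laden model are excluded
choice = best_clam_fit({
    "linear": (120.5, 0),
    "poly2": (98.2, 0),
    "spline": (float("inf"), 0),
    "poly3": (80.0, 3),
})
```

Here the smooth spline is rejected for its infinite fit and the third-order polynomial for its age reversals, leaving the second-order polynomial as the best fit despite the lower raw score of the rejected model.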
Although Undatable is open source and the fastest modeling system within LANDO, its original development environment (MATLAB) is not free of charge. That is why we implemented Undatable in the open-source MATLAB-equivalent Octave. Since the Octave version of Undatable was slower than the original MATLAB version, we used the parallelization package parallel (Fujiwara et al., 2021) to provide comparable results in terms of computing time. To use Undatable with MATLAB within our application, users must acquire a license for MATLAB and link the MATLAB kernel to their license. Unfortunately, we do not have the capacity to provide individual licenses with LANDO. For users with an active MATLAB license, we provide the appropriate code to run the MATLAB version of Undatable in LANDO in the repository mentioned in the “Code and data availability” section.
We highly appreciate all the work that went into developing the stand-alone versions of each modeling system. Because LANDO relies on the work of these modeling systems, we encourage users of LANDO to cite the original modeling software alongside the LANDO publication in their work. Additionally, users should try the stand-alone versions for each modeling system to provide feedback to both LANDO and modeling system maintainers.
A potential expansion of LANDO within the multi-language environment is to extend the application so that future data analyses can use powerful tools, such as Python's machine learning libraries, e.g. keras (Chollet, 2015) and tensorflow (Abadi et al., 2016). We anticipate that other developers can use LANDO as their starting point in building a larger limnological data analysis application.
This paper introduced our application LANDO – a linked age–depth modeling notebook approach. We presented an improved age–depth modeling procedure for sediment cores from high-latitude lake systems by linking five established systems: Bacon, Bchron, clam, hamstr, and Undatable. The added value of our application is the reduced effort to use established modeling systems in a single Jupyter Notebook for both single and multiple dating series and at the same time make the results comparable. In addition, we introduced an ensemble model that uses the output from all models to create a more robust age–depth relationship. In the case of scattered age determination data, we further implemented an adapted version of the fuzzy change point approach that allows users to integrate independent proxy data as indicators of lithological changes. This option helps evaluate the performance of modeling systems across lithological boundaries while providing a more reliable ensemble age–depth model by filtering inappropriate model runs for problematic datasets. Our application also allows users to run large datasets with multiple sediment cores in parallel to reduce the overall computation time. In our data collection of 55 sediment cores from northern lake systems at high latitudes, we found that the main regime changes in sedimentation rates do not occur synchronously for all lakes at the Pleistocene–Holocene boundary. However, we linked this behavior to the uncertainty within the modeling process as well as the local variability of the sediment cores within the collection.
The LANDO code is accessible at GitHub (
The supplement related to this article is available online at:
GP wrote the paper with inputs from all co-authors. GP developed the application, designed and implemented the LANDO system, and conducted testing. BKB, BD, and JCF advised and supervised the work of GP. BKB, BD, LS, and DAS provided published and unpublished age determination data for this publication.
The contact author has declared that neither they nor their co-authors have any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The authors acknowledge the support of the Helmholtz Einstein Berlin International Berlin Research School in Data Science (HEIBRiDS), the Alfred Wegener Institute – Helmholtz Centre for Polar and Marine Research, the Einstein Center Digital Future, and the Humboldt University of Berlin. We would like to thank the two reviewers Bryan C. Lougheed and Timothy J. Heaton as well as the authors of the community comment, who provided extensive and engaged feedback that helped us improve the paper. The authors highly appreciate all the work that went into developing the stand-alone versions of each modeling system. Hence, we would like to thank Maarten Blaauw, Andrew C. Parnell, Andrew Dolman, Bryan C. Lougheed, and Stephen P. Obrochta for their continuous work on Bacon, Bchron, clam, hamstr, and Undatable.
The work of Liudmila Syrykh and Dmitry A. Subetto has been supported by the Ministry of Education and Science of the Russian Federation (grant no. FSZN-2020-0016). Publisher’s note: the article processing charges for this publication were not paid by a Russian or Belarusian institution.
This paper was edited by Michael Dietze and reviewed by Bryan C. Lougheed and Timothy Heaton.