the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Towards an improvement of optically stimulated luminescence (OSL) age uncertainties: modelling OSL ages with systematic errors, stratigraphic constraints and radiocarbon ages using the R package BayLum
Guillaume Guérin
Christelle Lahaye
Maryam Heydari
Martin Autzen
JanPieter Buylaert
Pierre Guibert
Mayank Jain
Sebastian Kreutzer
Brice Lebrun
Andrew S. Murray
Kristina J. Thomsen
Petra Urbanova
Anne Philippe
Statistical analysis has become increasingly important in optically stimulated luminescence (OSL) dating since it has become possible to measure signals at the singlegrain scale. The accuracy of large chronological datasets can benefit from the inclusion, in chronological modelling, of stratigraphic constraints and shared systematic errors. Recently, a number of Bayesian models have been developed for OSL age calculation; the R package “BayLum” presented herein allows different models of this type to be implemented, particularly for samples in stratigraphic order which share systematic errors. We first show how to introduce stratigraphic constraints in BayLum; then, we focus on the construction, based on measurement uncertainties, of dose covariance matrices to account for systematic errors specific to OSL dating. The nature (systematic versus random) of errors affecting OSL ages is discussed, based – as an example – on the dose rate determination procedure at the IRAMATCRP2A laboratory (Bordeaux). The effects of the stratigraphic constraints and dose covariance matrices are illustrated on example datasets. In particular, the benefit of combining the modelling of systematic errors with independent ages, unaffected by these errors, is demonstrated. Finally, we discuss other common ways of estimating dose rates and how they may be taken into account in the covariance matrix by other potential users and laboratories. Test datasets are provided as a Supplement to the reader, together with an R markdown tutorial allowing the reproduction of all calculations and figures presented in this study.
 Article
(364 KB)  Fulltext XML

Supplement
(665 KB)  BibTeX
 EndNote
Optically stimulated luminescence (OSL, called optical dating in Huntley et al., 1985) allows the dating of the last exposure of quartz grains to sunlight. The singlealiquot regenerative (SAR) dose protocol consists of comparing the natural luminescence signal to laboratorygenerated signals induced by artificial irradiation (Murray and Wintle, 2000; Wintle and Murray, 2006). The corresponding measurements, in particular at the singlegrain scale, result in large datasets characterised by significant scatter, owing to a number of dispersion factors (see for example Thomsen et al., 2005). An OSL age is then obtained by dividing the equivalent dose (i.e. in the case of coarse quartz grains, the dose absorbed by the mineral) by the dose rate to which quartz grains were exposed since the last exposure to light.
Statistical analysis, in geochronology, generally aims to improve the precision, accuracy, and/or range of dating methods. In the case of OSL dating, calibration errors on the laboratory source dose rate for natural dose estimation, and geochemical standards for dose rate assessment, have so far resulted in age uncertainties (at 1σ or 68 % confidence) of, at best, ∼5 % (see for example Duller, 2008; Guérin et al., 2013).
Note that in what follows, the unit of analysis is a sediment sample; we assume that each sample corresponds to a deposition event, and thus to a single age (no postdepositional mixing is considered). The system of analysis is the laboratory in which the measurements are performed and includes both the apparatus and associated calibration standards. It should be emphasised here that field equipment is part of what we call the laboratory; this is important for the definition of what we call systematic errors. By definition, an error is the difference between the measured or observed value of a physical quantity and its true (but unknown) value. Thus, by systematic errors, we refer to random errors affecting equipment calibration: whereas each of these errors may be assigned a Gaussian probability density function with zero mean and a known variance (the square root of the variance being generally referred to as uncertainty), at the scale of the laboratory this error takes a fixed, unknown value that affects all measurements in the same direction. Of course, other sources of errors may exist (for example when using the infinite matrix assumption to calculate grain size attenuation factors; see for example Guérin et al., 2012b), but in this article, we consider only known, quantified sources of errors.
Over the past few years, several models for routine Bayesian analysis of SAR OSL and dose rate data were developed to better reflect, and take advantage of, the measurement procedures implemented to calculate OSL ages. Among those models, Combès et al. (2015) proposed one for calculating the centraldose values for wellbleached samples, leading to higher overall accuracy (see Guérin et al., 2015a) compared to the most commonly used model for OSL data analysis (the centraldose model: CDM, Galbraith et al., 1999; note that we changed the original terminology following Galbraith and Roberts, 2012). Combès and Philippe (2017) developed models capable of dealing with individual and systematic multiplicative errors for OSL age calculation including stratigraphic constraints (for general introductions on a statistical analysis of OSL data, but also the statistical models discussed hereafter and associated prior distributions, the reader is referred to Combès et al., 2015; Combès and Philippe, 2017, and references therein).
To implement the Bayesian models of Combès et al. (2015) and Combès and Philippe (2017) in practice, and provide easy access to the community, an R package (R Core Team, 2020) named “BayLum” (Christophe et al., 2020; version 0.2.0) has been developed and released on the Comprehensive R Archive Network (CRAN; see also Mercier et al., 2016, for a first implementation of the centraldose model developed by Combès et al., 2015). The first features of this BayLum package were presented by Philippe et al. (2019), and its performance, when one is confronted either with large dose values or with dose variability issues, was tested in laboratorycontrolled experiments (Heydari and Guérin, 2018) and later applied to various case studies (Lahaye et al., 2018; Carter et al., 2019; Heydari et al., 2020, 2021; Chevrier et al., 2020).
The purpose of this paper is to focus on the treatment of stratigraphic constraints and systematic errors for chronological modelling using BayLum; i.e. it goes beyond than what was first demonstrated by Philippe et al. (2019). Together with the association of independent, more precise ages (^{14}C in this work), such modelling is expected to reduce OSL age uncertainties. In the past, other approaches to model systematic and random, individual errors in the field of palaeodosimetric dating methods were proposed; in particular, Millard (2006a, b) developed a Bayesian approach quite close to that presented here, but which – among different other things (see Combès and Philippe, 2017, for a more detailed discussion) – is limited in its applicability.
Herein we present a Bayesian modelling case study. (1) We start with how data should be pretreated prior to using the BayLum package; a simple example of chronological modelling (samples considered independent, i.e. without stratigraphic constraints and shared errors) is first presented, yielding an output from the BayLum package to serve as a reference for the following, more elaborate models. (2) In the next step, we detail how the user can integrate stratigraphic constraints and the effect on the chronological inference. It should be noted that we take here the stratigraphic information for granted, but we warn the user against treating such information lightly, as it bears great consequences on the age calculation (see discussions in Heydari et al., 2020, 2021). (3) Then, most importantly, we explain how to build a dose covariance matrix in practice to take into account systematic errors (for the definition of this matrix, the reader is referred to Combès and Philippe, 2017) and what effect it has on a series of ages. (4) For this purpose, we base our approach on dose rate measurements as performed by Guérin et al. (2015b) at the IRAMATCRP2A laboratory. The effect of integrating independent data such as radiocarbon ages, which usually do not share systematic errors affecting OSL data, is then illustrated. (5) Finally, we discuss different ways to measure dose rates and various assumptions that can be made regarding the nature (systematic or random) of additional sources of errors in OSL dating (see also Rhodes et al., 2003, for a similar discussion).
To help the reader, we provide in the Supplement an R markdown document with commented lines of code and example datasets, so that everything presented here may be reproduced.
2.1 Case study
To illustrate how to model OSL ages, both in stratigraphic constraints and sharing systematic errors, using the R BayLum package, we use the data from two sediment samples (FER 1 and FER 3) already dated by quartz OSL (Guérin et al., 2015b). These samples were taken from the archaeological site of La Ferrassie (France) and prepared following standard chemical preparation procedures applied to luminescencedating samples. While modelling with BayLum may be applied to both multigrain and singlegrain OSL datasets, in the following we only focus on singlegrain data, as this is probably where the need for appropriate statistical models is most acute (the reliability of multigrain OSL has been demonstrated when using a plain average (mean) for palaeodose estimation; see for example Murray and Olley, 2002; for theoretical justification, see Guérin et al., 2017). Singlegrain OSL data were measured using an automated Risø TL/OSL reader (DA 20) fitted with a singlegrain attachment (Duller et al., 1999; BøtterJensen et al., 2000). A standard SAR protocol (Murray and Wintle, 2000, 2003) was used to measure singlegrain equivalent doses, after checking its suitability for the samples under investigation. A comparison between quartz OSL and feldspar infrared stimulated luminescence (IRSL) signals for these two samples, as well as comparison with radiocarbon, showed that these samples were wellbleached at the time of deposition and unaffected by postdepositional mixing. As a result, the use of centraldose models is fully justified (it should be noted here that at the time of writing, BayLum does not yet include the Bayesian model of Christophe et al., 2018, allowing the analysis of poorly bleached samples).
2.2 Data pretreatment
The Bayesian modelling implemented in BayLum requires information of different natures: (i) raw OSL data in the form of BIN/BINX file(s), (ii) list(s) of grains to be included in the modelling (based on predefined selection criteria, e.g. on recycling and/or recuperation ratios), (iii) file(s) indicating how the data should be processed (signal integration channels, reproducibility of the instrument(s), etc.), and (iv) both natural (in Gy ka^{−1}) and laboratory (in Gy s^{−1}) dose rates. Based on these data, the calculations are performed all at once using Markov chain–Monte Carlo (MCMC) computations; as a result, unlike in standard frequentist data processing, there is no succession of steps in data analysis (for example, individual equivalent dose estimates are not parameterised, unlike when the CDM is used). While Combès et al. (2015) argue that this results in a better statistical inference about the age (or palaeodose), it also comes with a downside: the user cannot visualise the data during the statistical analysis. In particular, the fact that the user must specify the list of grains to be included in the analysis implies that one should always pretreat the samples in a standard way, by using for example “Analyst” (Duller, 2015) or the R “Luminescence” package (Kreutzer et al., 2012, 2020) to visually check the data but also to investigate the effect of various selection criteria on the datasets (see for example Thomsen et al., 2016, on the effect of applying various selection criteria with frequentist statistical models; see Heydari and Guérin, 2018, for a similar study in a Bayesian framework).
In other words, using BayLum for age calculation should not, and does not, prevent the user from a careful and critical examination of the measured OSL data. In particular, before running age calculations using the BayLum package, it is important that the user already has identified potential problems – e.g. saturation and/or dose rate variability (see Heydari and Guérin, 2018, for adapted modelling solutions).
We first ran the function Generate_DataFile()
for the OSL
samples FER 1 and FER 3, with the same lists of grains as those used for age
calculation by Guérin et al. (2015b): all grains with an uncertainty smaller
than 20 % on the first test dose signal were selected. A large number of
grains appeared to be in saturation for these samples (in Analyst, there is no
intersection of the natural $L/T$ signal, or the sum of this sensitivitycorrected natural signal and its uncertainty, with the dose–response curve).
As a result, following Thomsen et al. (2016), an additional selection criterion was
added, based on the curvature parameter of the dose–response curves. All
grains for which the D_{0} value, obtained with Analyst as described by
Guérin et al. (2015b), was smaller than 100 Gy were rejected from the
analysis (note however that such a selection criterion may not be necessary
when working with BayLum: Heydari and Guérin, 2018).
In practice, the data are contained in two folders named after the samples and provided in the Supplement. Each folder contains one BIN/BINXfile (i.e. OSL measurements; note that only a small fraction of the measured grains is included in the Supplement) and four CSV files:

“DiscPos.csv” lists all selected grains.

“Rule.csv” gives the rules for generating ${L}_{x}/{T}_{x}$ data (integration channels for both the natural or regenerated and test dose signals, uncertainty arising from the reproducibility of the OSL measurements, and number of SAR cycles to remove for curve fitting, if any – it may, for example, be desirable to remove recycled points and/or IR depletion points).

“DoseSource.csv” gives the laboratory source dose rate and its variance.

“DoseEnv.csv” gives the dose rate to which the sample was exposed during burial.
We ran the function AgeS_Computation()
with a prior age
interval limited to between 10 and 100 ka for each sample (so that the
bounds are far from the age values obtained using the arithmetic mean of
equivalent doses, namely 37±2 and 40±2 ka, respectively).
The dose–response curves were fitted, as in Analyst in our previous study, with
single saturating exponential functions passing through the origin. All
uncertainties, affecting both environmental and laboratory dose rates, were
included in the calculation, as is common practice in luminescence dating;
however, the covariance of ages was not modelled here, so the results are
equivalent to those one would obtain by running subsequent individual age
calculations for each of the two samples.
To run the AgeS_Computation()
function, the user must choose
a model for the distribution of individual equivalent doses around the
central dose; the different options are Cauchy, Gaussian or lognormal
distribution (in the latter case, the central dose may be estimated either
by the mean or the median of the distribution). A Cauchy distribution
(sometimes also called Lorentz distribution) is a symmetric distribution
which was chosen by Combès et al. (2015) because it has heavy tails;
i.e. extreme values have a nonzero probability. Hence, the Cauchy distribution
seemed to be wellsuited for the analysis of widely dispersed datasets
including outlier values such as singlegrain D_{e} distributions.
Coming back to the samples from La Ferrassie, on top of saturation problems Guérin et al. (2015b) also identified dose rate variability as an important factor of dispersion in equivalent doses: the values of the CDM overdispersion parameter for the D_{e} distributions of the samples were equal to 29±3 % and 35±3 %, respectively. If we assume that this overdispersion arises from dose rate variability to single grains of quartz, Heydari and Guérin (2018) using laboratorycontrolled experiments showed that the Cauchy distribution and the CDM should both lead to ∼5 %–10 % age underestimation, because both models are biased. Consequently, we did not use the Cauchy distribution model. Instead, we modelled the equivalent dose distribution by a lognormal distribution (one could also have chosen a Gaussian function) from which the mean (rather than the median) was used to estimate the central dose. Indeed, Guérin et al. (2017) formally demonstrated that the median of the lognormal distribution (as used in the CDM) is a biased estimator and leads to age underestimates when dose rates are dispersed.
After 5000 iterations of three independent Markov chains, we observed good convergence, as seen in the markdown document provided in the Supplement (for a discussion of the convergence of the Markov chains, the reader is referred to Philippe et al., 2019). The upper limit of the 95 % confidence intervals for the Gelman and Rubin indexes of convergence (Gelman and Rubin, 1992) were all smaller than 1.05, also indicating satisfying convergence of the three independent Markov chains (here again, the reader is referred to Philippe et al., 2019, who suggested 1.05 as the maximum acceptable value). The obtained 95 % credible intervals (CIs) for the ages of samples FER 1 and FER 3 are [34.1; 43.3] ka and [36.6; 47.8] ka, respectively (Fig. 1; Table 1) and are consistent with the ages obtained by Guérin et al. (2015b) with a much simpler approach (unweighted arithmetic mean of equivalent doses). It should be emphasised here that the two 95 % CIs for ages overlap. Figure 2 shows a bivariate scatter plot of a sample of observations from the joint posterior distribution of the two ages, as generated by the Markov chains; in such a plot, each point corresponds to one realisation of the ages of the two samples investigated in the MCMC. Figure 3 shows the corresponding probability densities for the ages estimated jointly, based on kernel density estimates (KDEs), and the marginal probability densities. No correlation is observed on the joint probability density, which is symmetrical and bellshaped. One can already compare the results obtained with this Bayesian model (lognormalaverage) for sample FER 3 with the radiocarbon ages obtained independently for the same layer by Guérin et al. (2015b). The 95 % CI for the three ^{14}C ages are bound by the interval [44.4; 47.3] ka, which means that the OSL and radiocarbon ages are in good agreement, which was not the case when calculating the ages with the CDM (38±2 ka; this OSL age corresponds to ∼15 % underestimation and is broadly consistent, within uncertainties, with theoretical predictions stated above). Thus, even without further modelling, the BayLum lognormalaverage model seems to provide OSL ages in better agreement with radiocarbon.
Samples FER 1 and 3 belong to two different stratigraphic layers: sample FER 1 (Layer 7) lies stratigraphically above sample FER 3 (Layer 5B), so we know
that the age of the sample FER 1 must be less than that of the sample FER 3 (i.e. sample
FER 1 is younger than sample FER 3; for detailed stratigraphic information
on the site of La Ferrassie, which is of paramount importance in this
section, the reader is referred to Guérin et al., 2015b). To encode this
information, the function AgeS_Computation()
takes as
argument the object StratiConstraints
, which is a matrix whose size depends
on the number of analysed samples. First, the data in the DATA object (which
is the output of the function Generate_DataFile()
) must be
ordered in stratigraphic order from top to bottom: thus, in our case the
list of names used by the function Generate_DataFile()
is FER 1, FER 3 (rather than FER 3, FER 1). Then, the stratigraphic matrix contains
numbers equal to 0 or 1, indicating the applied bounds to the age of each
sample. The matrix contains a number of rows equal to the number of samples
plus one and a number of columns equal to the number of samples. The first
row contains the value 1 in each column, which indicates that the younger
age bound specified as prior information (10 ka in our example, cf. Sect. 3
above) when running the function AgeS_Computation()
applies
to all samples. Then, for all j in $\mathit{\{}\mathrm{2},\mathrm{\dots},\text{Nb\_Sample}+\mathrm{1}\mathit{\}}$ and all
i in $\mathit{\{}j,\mathrm{\dots},\text{Nb\_Sample}\mathit{\}}$, StratiConstraints[j,i]
=1 if the age
of sample whose number ID is equal to j−1 is less than the age of sample whose
number ID is equal to i. Otherwise, StratiConstraints[j,i]
=0. In practice,
in our case StratiConstraints [1,]
$=(\mathrm{1},\mathrm{1})$, StratiConstraints [2,]
$=(\mathrm{0},\mathrm{1})$ (which means that sample FER 1 is not younger than itself but is
younger than sample FER 3) and StratiConstraints [3,]
$=(\mathrm{0},\mathrm{0})$ (which means
that sample FER 3 is neither younger than sample FER 1 nor than itself).
Note that in the markdown document provided in the Supplement, the
corresponding code lines have comments to make this description easier to
follow.
Running the function AgeS_Computation()
with this matrix of
stratigraphic constraints only marginally affects the ages; in this case,
the 95 % CIs become [34.3; 42.9] ka and [38.1; 48.5] for samples FER 1
and FER 3, respectively (Table 1). One can also look at the bivariate
scatter plot of observations from the joint posterior distribution (Fig. 4):
one can see that this scatter plot is truncated in the upper lefthand
corner – illustrating the fact that the age of sample FER 1 can never be
greater than that of sample FER 3 (see Fig. 2 for comparison). By contrast,
the KDE estimate (Fig. 5) also shows a positive correlation but does not
reveal the truncation (whereas the stratigraphic constraint imposes a null
probability for all pairs of ages above the 1:1 line).
5.1 General considerations
In the previous calculations, all the variance is treated as random, whereas common, systematic errors affect all ages in the same direction, although to varying degrees (so systematic errors are unlikely to result in stratigraphic inversions). One of the main advantages of applying the models implemented in the BayLum package – contrary to other chronological modelling tools such as “OxCal” (Bronk Ramsey and Lee, 2013) or “Chronomodel” (Lanos and Philippe, 2018) – lies in the possibility to include the structure of uncertainties specific to OSL dating. For instance, a radiocarbon age is derived only from the ratio of ^{14}C to ^{12}C (on top of which comes the more complex problem of calibration), whereas an OSL age involves a large number of measurements, each with its uncertainty (Aitken, 1985, 1998). The OSL measurements required for the determination of the palaeodose are relatively standardised through the widespread use of the SAR protocol (Murray and Wintle, 2000; Wintle and Murray, 2006). Conversely, there are several approaches – each with its equipment and standards – to determine the various dose rate components. Given that these dose rates derive from different types of radiation (alpha, beta, gamma and cosmic radiation) and are of various origins (mainly from potassium and the uranium and thorium decay chains), there are many more contributions to the age uncertainty from the dose rate term than from the palaeodose term, even though the size of the uncertainty on dose rate is of the same order of magnitude as that on palaeodose – see for example Murray et al. (2015). As a result, there are almost as many ways of estimating systematic and random uncertainties as there are (combinations of) ways to determine dose rates; in any case, the notion of systematic error is only valid in a given context, which must always be made explicit. Combès and Philippe (2017) detailed the mathematical formulation of the dose covariance matrix, which links the ages of several samples measured using the same equipment and standards through common (systematic) errors (see also Philippe et al., 2019). Nevertheless, the equations provided in this article are somewhat difficult to translate in practice; here, we propose to outline how we implement a covariance matrix adapted to (one example of) the measurements leading to OSL age calculation at the IRAMATCRP2A laboratory (Bordeaux). We emphasise that what follows is not prescriptive; it should be viewed as an example of a model of uncertainties. For alternative ways of estimating systematic and random errors, for example, due to different dose rates measurements, the reader is referred to the discussion (Sect. 7.1).
Here, we consider the case of a series of n sediment samples taken from one unique site and all measured using the same equipment and standards. Let us consider the following relationship between palaeodoses, dose rates and ages (Combès and Philippe, 2017):
where D_{i} is a random variable modelling the unknown palaeodose of sample i, N is the symbol for a Gaussian distribution, A_{i} is the unknown age estimate of sample i (that we are trying to determine), ${\dot{d}}_{i}$ is the total dose rate to which this sample was exposed since burial (${\dot{d}}_{i}$ is the observed dose rate, i.e. the result of the measurements) and Σ is the dose covariance matrix (for the full definition of the model, we refer the reader to Combès and Philippe, 2017). This covariance matrix verifies, for all (i,j),
where θ is the matrix that the user needs to specify to run the calculations with BayLum. It should be noted here that by default when running age calculations with BayLum, the offdiagonal elements are set to zero, i.e. the covariance in ages is not modelled.
Before entering the details specific to luminescence dating, let us consider a simple example of two measurements ${y}_{\mathrm{1}}={\mathit{\mu}}_{\mathrm{1}}+{e}_{\mathrm{1}}+f$ and ${y}_{\mathrm{2}}={\mathit{\mu}}_{\mathrm{2}}+{e}_{\mathrm{2}}+f$ where μ_{1} and μ_{2} are fixed measurands and e_{1}, e_{2} and f are all independent random errors from distributions with mean zero. The covariance of y_{1} and y_{2} is the variance of f (so the offdiagonal elements of the matrix are equal to this variance). For each sample, the diagonal element of the corresponding covariance matrix is the sum of all the components of variance for that sample. The variety of physical quantities to measure and determine dose rate, and their relationship with the dose rate contributions, will now be discussed with this simple definition in mind.
5.2 Implementation in practice
First, we detail the series of measurements carried out, and we introduce the corresponding notations for the estimates and associated uncertainties. Table 2 summarises all physical units and associated error standard deviations; as a general rule, we assume that all error terms are Gaussian variables with the expected value (mean) equal to zero and a fixed, known standard deviation (see for example Eq. 2 in Combès and Philippe, 2017). For clarity, in the following relative standard deviations are described by the letter σ, while absolute standard deviations are denoted by s; moreover, each standard deviation corresponding to random errors (i.e. when the error varies from sample to sample) is identified by the letter i in the subscript. The absence of this letter in the subscript indicates that the measurement error affects all samples.
5.2.1 Equivalent doses and OSL measurements
Equivalent doses are determined from OSL measurements performed on a luminescence reader equipped with a radioactive beta source, whose dose rate and associated relative standard deviation of the error, noted ${\dot{d}}_{\mathrm{lab}}$ and σ_{lab}, are known. There are several ways the latter term can be determined; in its simplest form, it includes the standard deviation of the error on the absolute dose absorbed by the standard reference material (in our case calibration quartz provided by DTU Nutech; see Hansen et al., 2015) and an error term due to replicate measurements of several aliquots of this calibration material. Using a large number of measurements repeated in time, as suggested by Hansen et al. (2015), may somewhat complicate the matter, but this goes beyond the scope of the present study.
In practice, regeneration doses are delivered by irradiating the aliquots for a given duration (in s). This duration is converted to absorbed energy dose (Gy) by multiplication with the source dose rate (Gy s^{−1}). Strictly speaking, the error on the source dose rate affects all regeneration doses, and so this error term should appear in the dose–luminescence relationship (right side of the directed acyclic graph shown in Fig. 7 of Combès and Philippe, 2017). However, it is common practice in luminescence dating to first calculate an equivalent dose in seconds of irradiation for each aliquot, then convert this to Gy and calculate an average (or determine another central parameter such as with the CDM), and only then consider σ_{lab}. This is what led (e.g., Jacobs et al., 2008) to the associated standard deviation being excluded from the total OSL age uncertainties, to test the assumption of a time gap between two series of ages. Here, for simplicity, we take the same route, and hence the relative error on the laboratory source dose rate becomes a relative, systematic error on the equivalent doses.
One may thus write that the error on the dose D_{i} arising from the calibration of the source follows a Gaussian distribution with mean 0 and variance $({\mathit{\sigma}}_{\mathrm{lab}}{A}_{i}{\dot{d}}_{i}{)}^{\mathrm{2}}$.
5.2.2 Dose rates
When it comes to the dose rate term, here we restrict ourselves to the case of coarse quartz grains measured after HF etching to remove the alpha dose rate component: the total natural dose rate is the sum of an internal dose rate, external beta and gamma dose rates, and cosmic dose rates.
Cosmic dose rates
We consider that cosmic dose rates are determined following Prescott and Hutton (1994) based on the burial depth of the dated samples, which may be different from the presentday thickness of the overburden. As a result, the error on cosmic dose rate estimates depends on the error estimation of this effective burial depth since the dated sediment was deposited. Because the relationship between cosmic dose rates and burial depths is not linear, and because the error on this burial depth may not be systematic (e.g. in cases where successive erosion and deposition events, of unknown duration, happened between the deposition of superimposed sedimentary layers – see Aitken, 1998, p. 65, for a discussion) even at the scale of a site, the error associated with cosmic dose rates cannot easily be treated as systematic. For $i=\mathit{\{}\mathrm{1},\mathrm{\dots},n\mathit{\}}$, ${\dot{d}}_{\mathrm{cosmic},i}$ and s_{cosmic,i} denote the estimate of the average cosmic dose rate to which sample i has been exposed and its associated standard deviation.
Beta dose rates
We consider the beta dose rates as determined from concentrations (or activities) of ^{40}K and in radioelements from the U and Th decay chains, converted to dose rates using specific conversion factors (e.g. Guérin et al., 2011). At the IRAMATCRP2A laboratory, these concentrations are usually determined with lowbackground, highresolution gammaray spectrometry following Guibert and Schvoerer (1991). The simplest case is that of ^{40}K, since only one peak is used (at 1.461 MeV); the concentration in sample i, denoted [K]_{i}, is equal to the concentration in the standard multiplied by the ratio in count rates (the count rate observed for the investigated sample is divided by the count rate observed for a reference material). Thus, we consider in this paper that the standard deviation of the error on the ^{40}K concentration includes three components: the standard deviation of the error on the concentration in the standard sample, and the counting uncertainties both on the standard and on the measured sample. The counting uncertainties are calculated, assuming Poisson statistics. Of these three sources of errors, only one is treated as random – namely the counting uncertainty of the sample; the other two standard deviations (corresponding to the counting of the standard and to the error of the radioelement concentration in the standard) are quadratically summed and considered as a systematic source of error. One considers for sample i the beta dose rate from potassium ${\dot{d}}_{\mathit{\beta},\mathrm{K},i}$ – after correction for grainsizedependent attenuation using the factors from Guérin et al. (2012b) and for moisture content following Nathan and Mauz (2008) (see the discussion section below regarding uncertainties on these correction factors). Neglecting uncertainties in the dose rate conversion factors, we call σ_{K,i} the relative random standard deviation of the error on the ^{40}K concentration; its systematic counterpart σ_{K} is common to all samples. It should be emphasised here that systematic errors on radioelement concentrations, although shared by all samples, will affect all ages in the same direction but not necessarily by the same amount (even in relative terms, contrary to the error on laboratory beta source calibration) because the relative contribution of beta dose rate from potassium to the total dose rate may vary from one sample to another. The beta dose rates from the U and Th series come from a number of radioelements in the corresponding chains; here, for simplicity we consider each series to be in secular equilibrium (this is generally the case for ^{232}Th but may not be true for the Useries; see for example Guibert et al., 1994, 2009; Lahaye et al., 2012). Thus, for each sample, the concentrations in ^{238}U and ^{232}Th are converted to dose rate contributions denoted ${\dot{d}}_{\mathit{\beta},\mathrm{U},i}$ and ${\dot{d}}_{\mathit{\beta},\mathrm{Th},i}$. In contrast to the case of ^{40}K, the analysis of the highresolution spectra for these radioactive chains is based on a number of primary gamma rays (whereas there is only one ray for K); more specifically, a weighted mean of the concentrations determined from each ray included in the analysis (after taking interference into account) is calculated to estimate the concentration of U and that of Th. As a result, the standard deviation of the error on the concentration in U (and that of Th) in the sample comes from two sources: the relative standard deviation on the concentration of the standard corresponds to a systematic error and is denoted σ_{U} for U (σ_{Th} for Th); conversely, the other relative standard deviations (arising from the counting of the standards and of the sample) are treated as random and quadratically summed to obtain σ_{U,i} (and σ_{Th,i}).
Internal dose rates
Unless the internal radioelement concentration is experimentally determined (in which case one needs to consider both systematic and random sources of error for each sample, as is done for beta dose rates), some have suggested using a fixed internal dose rate of 0.06±0.03 Gy ka^{−1} (Vagn Mejdahl, personal communication to Andrew Murray, based on Mejdahl, 1987). In this case, we may assume that the dated quartz grains are all of the same origin and have the same internal radioelement concentration; as a result, we associate a systematic standard deviation s_{int} with the internal dose rate ${\dot{d}}_{\mathrm{int}}$.
Gamma dose rates
Gamma dose rates ${\dot{d}}_{\mathit{\gamma},i}$ may be determined, as beta dose rates, from K, U and Th concentrations in the sediment. In this case, the reader is referred to the corresponding section above. However, it is relatively frequent, in the case of heterogeneous configurations at the 10 cm scale, that gamma dose rates received by the samples do not correspond to the infinite matrix gamma dose rates of the samples (see for example large gamma dose rate variations at the interface between sediment and bedrock in a cave reported by Guérin et al., 2012a: Fig. 7). In such contexts, gamma dose rates may be determined by in situ measurements with Al_{2}O_{3}:C artificial dosimeters: these dosimeters are measured with greenlight stimulation and their calibration is based on a block of homogeneous bricks located in the basement of IRAMATCRP2A (Richter et al., 2010; Kreutzer et al., 2018; note that we also discuss below – Sect. 7.1 – the use of portable spectrometers). Two sources of relative errors are taken into account: a random standard deviation (σ_{γ,i}) accounting for measurement uncertainties, and a shared calibration error including both standard deviations on (i) the true gamma dose rate in the block of bricks and on (ii) the measurement of the dosimeters irradiated inside the block for calibration of the source (σ_{γ}).
Water content
To account for the effect of water on dose rates, one commonly considers the following equation (Zimmerman, 1971; Aitken, 1985):
where ${\dot{d}}_{\mathit{\beta},i,\mathrm{dr}\mathit{\gamma}}$ is the beta dose rate in the dry sediment, WF_{i} represents the effective mass fraction of water in the sediment during burial, and x_{β} is a water correction coefficient accounting for the fact that water absorbs more beta dose than typical sedimentary elements, due to lower atomic numbers (Nathan and Mauz, 2008). A similar equation applies to gamma dose rates, with a corresponding factor x_{γ} (see Guérin and Mercier, 2012). The determination of the water content in the sediment over time is a challenging task as it involves many different parameters, including past rainfall – see for example Nelson and Rittenour (2015) for a discussion on how to determine water contents depending on sediment grain size, hydrometric regimes, etc. One commonly employed solution is to measure the water content at the time of sampling and assume it to be representative of that in the past; measuring the water content at saturation may then be a solution to evaluate an upper limit to this value; and depending on the context one may also propose a lower limit to the water content. One then obtains a way of quantifying the standard deviation of the error on the water content, although necessarily imperfect. Neglecting uncertainties on the water correction factors (x_{β} and x_{γ}) and calling s_{WF,i} the absolute standard deviation of the mass fraction WF_{i} for sample i, one may write
where ${s}_{\mathit{\beta},{\mathrm{H}}_{\mathrm{2}}\mathrm{O},i}$ is the standard deviation of the beta dose rate for sample i due to the uncertainty on its water mass fraction.
Similarly, one may write
where ${s}_{\mathit{\gamma},{\mathrm{H}}_{\mathrm{2}}\mathrm{O},i}$ is the standard deviation of the gamma dose rate for sample i due to the uncertainty on its water mass fraction. As a result,
To simplify the following equations, which are meant to be those used in practice, we introduce the relative standard deviation of the beta dose rate due to water content errors (${\mathit{\sigma}}_{\mathit{\beta},{\mathrm{H}}_{\mathrm{2}}\mathrm{O},i}$) and a parameter called λ_{i} defined by
Finally, it should be emphasised that uncertainty on water content may well correspond to errors which are neither really random nor really systematic; in our view different modelling choices may be put forward and implemented, depending on the particular sedimentological and pedological context.
The θ matrix
With these considerations in mind on errors and their nature, the corresponding θ matrix (Eq. 2) to model these uncertainties is a square matrix containing one line (and column) per sample. The diagonal elements correspond to the sum of a term arising from the error on the laboratory source dose rate (${\dot{d}}_{i}^{\mathrm{2}}{\mathit{\sigma}}_{\mathrm{lab}}^{\mathrm{2}}$) and the total dose rate variance for each sample, for each i:
This long list of variance terms may seem rather complicated. However, it corresponds to the total variance arising from the laboratory beta source calibration, the errors on cosmic dose rates, environmental beta dose rates, internal dose rates, gamma dose rates and finally the error arising from uncertainties in water content. In other words, we can also write
where ${s}_{{\dot{d}}_{i}}^{\mathrm{2}}$ is the variance of the dose rate to which sample i was exposed to during burial (it is the square of the uncertainty appearing next to the dose rate value in every luminescence dating article; in our example, this term is the second one in the files DoseEnv.csv provided in Supplement).
Then, for i≠j,
which characterises the amount of correlation between the doses of samples
i and j, multiplied by their ages. The θ matrix, like the dose
covariance matrix Σ, is a symmetric matrix. The diagonal
members correspond to individual variances, while the nondiagonal terms
express the fact that systematic, shared errors link the measurements of the
series of samples. As a result, running the functions AgeS_Computation()
and Age_OSLC14()
with a θ matrix in
which all nondiagonal members are set to zero would be equivalent to
running the same functions without the correlation matrix, or running the
function Age_Computation()
independently for each sample –
in which case all sources of error are treated as random.
5.3 Examples
5.3.1 An illustrative, simplistic example without stratigraphic constraints
For illustration purposes, first, we did not apply stratigraphic constraints. We started with a simplistic θ matrix containing in the diagonal the real error variances (Eq. 9) as determined by Guérin et al. (2015b); the σ_{lab} value was equal to 0.02 (2 % relative standard deviation of the calibration of the laboratory beta source). The simplification comes from the offdiagonal members, for which in Eq. (10) we set all s and σ values to be equal to 0, except for the σ_{lab} value, set to 0.05. Obviously, this is not selfconsistent, but it corresponds to (i) random and systematic errors of approximately the same magnitude (in practice, these two sources of errors are of the same order of magnitude – a few percentage points) and (ii) the simplest form of systematic errors. Indeed, in such a case, all ages are affected by the same relative amount in the same direction.
Here again, after 5000 iterations of three independent Markov chains, we observed good convergence. The obtained 95 % CIs are [33.9; 43.8] and [36.7; 48.1] ka for samples FER 1 and FER 3, respectively. Figure 6 shows bivariate scatter plots corresponding to the sampling of the Markov chains for the ages of samples FER 1 and FER 3 (which are calculated simultaneously), and Fig. 7 displays the KDE together with the marginal probability densities. This set of figures illustrates the reason for the generation of the two types of figures: the bivariate scatter plot is most appropriate for visualising the effect of stratigraphic constraints (Fig. 4 above), whereas probability density figures best illustrate the effect of modelling systematic errors. Indeed, as can be seen, there is a positive correlation between the ages of samples FER 1 and FER 3: the greater the age of sample FER 1, the greater the mean age of sample FER 3. In other words, if the age of sample FER 1 were underestimated, then in all likelihood, the age of sample FER 3 would also be underestimated. Furthermore, the length of the CI for the age of each sample is slightly larger than without modelling the covariance (cf. Table 1); i.e. modelling the covariances slightly increases the age uncertainties. However, the positive correlation of ages has other, direct consequences.
First, let us suppose that we have no knowledge of a stratigraphic link
between the two investigated samples, and we wish to test the hypothesis that
sample FER 1 is younger than sample FER 3. The credibility of such an
assumption can be tested using the function MarginalProbability()
of the
“ArchaeoPhases” R package (Philippe and Vibet, 2020) devoted to the analysis
of MCMC chains for chronological inference. Without using the covariance
matrix, the credibility of this hypothesis is 0.83; with the simplistic
θ matrix, the credibility becomes 0.94; in other words, modelling
the age covariance reflects more faithfully the measurements and their
uncertainties for such tests.
The second consequence concerns the duration of a hypothetical phase that
would encompass the deposition of sample FER 1 and that of sample FER 3.
Indeed, since the ages vary together in the MCMC, the duration of such a
phase should be smaller when modelling the covariance than when all the
variance in ages is treated as random. Indeed, we could verify this
assertion using the function PhaseStatistics()
of “ArchaeoPhases” (Philippe
and Vibet, 2020): with the simplistic covariance matrix, the 95 % CI
for the duration of this phase is [−1.4; 9.7] ka, whereas it is [−0.6; 7.6] ka when the ages are calculated using the simplistic θ matrix.
5.3.2 A real example, including stratigraphic constraints
In a real case, since the relative contributions of the different dose rate components vary from one sample to another, the correlation will be less pronounced. For more realistic calculations of the ages of samples FER 1 and FER 3, we took the same values as above for the diagonal terms of the θ matrix (Eq. 9); on the other hand, for the nondiagonal, covariance terms, we used the following values: σ_{lab}=0.02 (which corresponds to the experimentally determined calibration standard deviation, including the uncertainty of the dose delivered to calibration quartz; Hansen et al., 2015), σ_{K}=0.012, σ_{U}=0.007 , σ_{Th}=0.007 (for these values, which also include counting of the standards used, the reader is referred to Guibert et al., 2009; Guibert, 2002) and s_{int}=0.003 Gy ka^{−1}. In the Supplement we provide a calculation spreadsheet allowing the covariance matrix to be built, intended for adaptation to the userspecific needs.
At the site of La Ferrassie, the uncertainties associated with the gamma dose rate observations are more complex. Al_{2}O_{3}:C dosimeters were placed at the end of 25 cm long aluminium tubes and inserted horizontally in the stratigraphic section at the location of sediment sampling. In an ideal case, sediment should be uniform in a horizontal plane; however, for samples FER 1 and FER 3 only a rather thin layer of sediment remained against the cliff wall (the layers of the sample were not present at the site in any other location), which resulted in the dosimeters being inserted either in the karstic cliff (the limestone contains few radioelements compared to the sediments, as shown in Fig. 5 of Guérin et al., 2015b) or at the interface between the cliff and the sediment. As a result, we took for ${\dot{d}}_{\mathit{\gamma},i}$ the average between the gamma dose rates measured in situ (which underestimate the real gamma dose rate because the effect of the cliff is overrepresented) and the gamma dose rates derived from the K, U and Th concentrations in the samples. The associated standard deviation, σ_{γ,i}, was calculated as the difference between these two extreme values divided by 4, so that the 95 % CI covers all possible values. As this standard deviation is much larger than the analytical uncertainties, we neglected the latter and considered σ_{γ,i} to characterise random sources of errors since each sample has a different environment and may be more or less far from the cliff.
The samples FER 1 and FER 3 are directly above and below, respectively, the Châtelperronian layer at the site (layer 6). Sample FER 2 from this layer being poorly bleached, it is at present impossible to model with BayLum. However, an alternative to estimate the age of FER 2 consists of supposing that it has a uniform prior probability density between the ages of samples FER 1 and FER 3:
where A_{i} is the age of sample i, ${I}_{\left[{A}_{\mathrm{1}};{A}_{\mathrm{3}}\right]}$ is the indicator function between A_{1} and A_{3}, and π(A_{1},A_{3}data) is the posterior joint density of A_{1} and A_{3} knowing the data (i.e. the density estimated with BayLum). Doing so (see the markdown file for the corresponding code lines), working from the output of BayLum one obtains a 95 % CI of [36; 46] ka, which can be compared with the confidence interval of [36; 48] ka obtained by Guérin et al. (2015b) with minimum age modelling.
The BayLum package also offers the possibility to include radiocarbon ages
in the chronological models (Philippe et al., 2019); more specifically,
radiocarbon ages are calibrated within BayLum, using the function
AgeC14_Computation()
or Age_OSLC14()
(in the
latter case the function necessitates at least one OSL age calculation).
Introducing covariance matrices to account for systematic errors on OSL data
does not reduce the OSL age uncertainties; however, it becomes particularly
useful to correct for estimation biases when more precise ages, unaffected
by these systematic errors, are integrated into the models. To illustrate
this, we decided to construct two models constraining the age of FER 3; for
illustration purposes, in this section, we used the simplistic θ
matrix described above in Sect. 5.3.1. In the first case, we constrained
the age of this sample by imposing that a “young” radiocarbon age (young
compared to the age of sample FER 3 considered alone) has an age greater
than sample FER 3. In practice, we arbitrarily took a radiocarbon age of
38 000±400 BP, which corresponds to [37.6; 39.9] ka cal. BP (95 % CI using the IntCal20 curve, Reimer et al., 2020; the calibration was performed
using BayLum; see Philippe et al., 2019). Naturally, the credible
intervals (both 68 % and 95 %) for sample FER 3 are shifted towards
younger age values (see the truncation of the scatter plot illustrated in Fig. 3).
So are the credible intervals for sample FER 1, since the ages of the two OSL
samples are close to each other even when considered independently of
radiocarbon data (in other words, the radiocarbon age “pushes” the age of
sample FER 3, which in turn “pushes” the age of sample FER 1). In practice,
the 95 % CIs become [33.3; 41.2] ka and [36.9; 42.3] ka for samples FER 1
and FER 3, respectively. It can be noted here that in such a case the
precision of the age of sample FER 3 is increased (i.e. the length of the CI
is much smaller than without the constraining radiocarbon age). More
interestingly, in the second case, we constrained the age of sample FER 3 by
imposing that an “old” radiocarbon age (old compared to the age of sample
FER 3 considered alone) has an age younger than sample FER 3. In practice we
– again, arbitrarily – took a radiocarbon age equal to 44 000±400 BP, which corresponds to [45.4; 47.4] ka cal. BP (95 % CI). Here again,
the effect on the age of sample FER 3 is straightforward: the credible
intervals are shifted towards older ages (the 95 % CI for the age of
sample FER 3 becomes [45.7; 51.2] ka). Perhaps less intuitive is the effect
on the age of sample FER 1, which is not directly constrained by
radiocarbon: because the ages of the three samples are estimated jointly,
and because of the systematic errors on the OSL ages, the age of sample FER 1 is also shifted towards older ages: the corresponding 95 % CI becomes
[36.7; 45.8] ka.
7.1 Differing ways of estimating dose rates
Every laboratory uses its specific equipment and calibration standards; if similar equipment to that described above is used, then only the values of the different terms need be changed. This case is particularly relevant for equivalent dose measurements, and hence the term σ_{lab} associated with ${\dot{d}}_{\mathrm{lab}}$. Conversely, for dose rate determination, several other experimental devices and techniques are commonly used. If beta and/or gamma dose rates are determined based on the determination of concentration in K, U and Th, (by mass spectrometry, neutron activation, etc.), then the situation is similar to that described for beta dose rates above.
Counting techniques (alpha, beta and gamma in the case of the threshold technique: Løvborg and Kirkegaard, 1974) may also be used for beta and gamma dose rate estimation. In the case of beta counting, the conversion factor from count rate to dose rate depends on the emitting radioelement (Ankjærgaard and Murray, 2007; see also Cunningham et al., 2018). This dependency is a source of error that may not easily be characterised by a systematic error (so there is no contribution to the dose covariance matrix); indeed, this error on the conversion factor will vary from one site to another depending on the concentrations in K, U and Th (which are generally unknown if a counting technique is used), and even within one site from one sample to another (again by unknown amounts since the variability in K, U and Th is unknown).
The data acquired with field gamma spectrometers may be analysed in two ways: the “window” technique (see for example Aitken, 1985) corresponds to classical spectrometry analysis; in this case, the structure of uncertainties is the same as that for beta dose rates determined from highresolution gamma spectrometry (Eq. 11). On the other hand, threshold techniques consist of taking advantage of proportionality between gamma dose rates and (i) the number of counts recorded per unit time above a threshold (Løvborg and Kirkegaard, 1974) or (ii) the energy deposited per unit time above a threshold (energy threshold: Guérin and Mercier, 2011; Miallier et al., 2009). In the former case, the conversion from count rate to dose rate depends on the emitting radioelement, so no systematic error term may be isolated. Conversely, in the latter case (energy threshold), this dependency is negligible (Guérin and Mercier, 2011). As a result, the error on the dose rate of the calibration standard may be considered as systematic and thus contribute one term in the nondiagonal elements of the covariance matrix. Difficulties in implementing the threshold technique may occur in lowdoserate environments, because energy calibration may not be straightforward if the probe is not selfstabilised. In such cases, routines such as those implemented in the R package “gamma” (Lebrun et al., 2020) may be put to profit. Finally, ageing of the crystal may also result in timedependent errors – the latter must be taken care of by regular calibration experiments.
7.2 Error terms neglected in this study
As mentioned earlier in the section devoted to dose rate uncertainties, there are many possibilities to quantify but also to consider errors in dose rate measurements; one could mention here the uncertainties on attenuation factors and water correction factors. However, both of these factors are dependent on the infinite matrix assumption: attenuation in grains implies that something other than the grains does not attenuate radiation (i.e. conservation of energy implies that if there is lower dose rate inside a quartz grain, there must be a higher dose rate somewhere else – see Guérin et al., 2012b); water correction factors are often calculated assuming a homogeneous mixture of water and other sedimentary components (Zimmerman, 1971; Aitken and Xie, 1990; note that the composition of the sediment also necessarily affects the ratios of electron stopping powers and photon interaction cross sections – see Nathan and Mauz, 2008, for a discussion). Limitations of this infinite matrix assumption, which is not met in sand samples at the scale of beta dose rates, have already been pointed out (Guérin and Mercier, 2012; Guérin et al., 2012a, b; Martin et al., 2015). Consequently, it seems that routine determination of a realistic standard deviation of the attenuation and water content correction parameters is not straightforward.
Dose rate conversion factors were assumed above to be known without error; however, estimation errors affect halflives, emission probabilities, average emitted energies, etc. Liritzis et al. (2013) took these uncertainties into account to estimate standard deviations of the dose rate conversion factors (in practice, these standard deviations amount to ∼1 % for K dose rates, ∼2 % for U and ∼2 % for Th). These standard deviations could be included as sources of systematic errors when the contributions of K, U and Th are determined separately (note that when this is not the case, as when dosimeters are used for gamma dose rate estimation, or when beta counting is implemented for beta dose rate assessment, these sources of errors should be treated as random).
In this study, we worked with coarsegrain quartz extracts that had been etched with HF to remove the alphairradiated part of the grains. This being said, if alpha dose rates are taken into account, then the situation becomes similar to that of beta dose rates treated above; however, the sensitivity to alpha irradiation must then be taken into account. It is rather frequent in such a case to use published values from the literature (e.g. Tribolo et al., 2001; Mauz et al., 2006). Depending on the geological origin of the quartz (one or more sources), one may then assume either systematic or random errors in the alpha sensitivity.
7.3 Publication habits and reanalysis of previously published ages
Compared to other statistical models for OSL dating, the Bayesian models implemented in BayLum appear rather complicated, at least partly because modelling starts from the measured OSL data. By comparison, the input data to the CDM or the Average Dose Model (ADM: Guérin et al., 2017) are lists of equivalent doses and associated uncertainties, which means that OSL measurements have already been analysed to derive equivalent doses. Combès et al. (2015) argued that their complete model (implemented in BayLum), relating all the variables to one another, produces a more homogeneous and consistent inference compared to consecutive inferences (and indeed, when approaching saturation, i.e. when equivalent doses and associated uncertainties can hardly be parameterised, Heydari and Guérin (2018) demonstrated the advantage of BayLum models compared to parametric models such as the CDM and ADM). However, working with lists of equivalent doses and uncertainties – or even with estimates of central doses and associated uncertainties – taken as observations would make the Bayesian modelling proposed in BayLum and described in this paper more straightforward and transparent. Such an approach, called the “twosteps” model by Combès and Philippe (2017; see also Millard, 2006a, b, for earlier, similar models), would also offer the advantage of allowing reanalysis of already published data to derive more precise chronologies. However, for this purpose, the breakdown of all uncertainties and related standard deviations of errors is needed; nowadays, providing such key information for the modelling is not in the luminescence dating community's publication habits. That being said, with the growing number of metaanalyses of previously published data, and the availability of models such as BayLum to combine measurements with systematic errors, these habits might evolve in the future.
7.4 Notes of caution
As always when working with statistical models, one should first and foremost evaluate the measured data in light of the sampling context. We already mentioned the importance of grain selection (Sect. 2.2); but, perhaps more importantly, and especially since users of BayLum have to make modelling choices (e.g. regarding the dose–response curves fitted to OSL measurements or the distribution of individual equivalent doses around the central dose), it is crucial to carefully examine data and assess their quality before building potentially sophisticated models.
We would like to emphasise a few warnings regarding modelling samples in stratigraphic constraints, and the association of ages obtained by different methods. We would advise users, before combining for example radiocarbon and OSL ages, to first thoroughly examine the corresponding datasets independently: how were the data produced (with which experimental procedure)? Are the provided uncertainties reliable (or is there an unrecognised source of error that should be included in the evaluation of uncertainty)? Users are also encouraged to examine the consistency of results produced by each method, in light of the stratigraphy. In a second stage, before modelling of independent ages, we would recommend assessing the consistency of these datasets – do they (at least broadly) agree? If not, can a parsimonious explanation be found? For example, it is rather common, when performing Bayesian modelling with tools such as OxCal, to observe a large fraction of ages considered as outliers; such observations should urge users to examine their data again and come up with likely explanations (note that to date, no outlier model has been developed for the OSL ages in BayLum).
When it comes to imposing ordering constraints between ages as a result of stratigraphic observations, it is, of course, essential to leave no doubt about the validity of these stratigraphic constraints (the results of a model depend on the assumptions that are made, and the order in ages is a very strong constraint). Perhaps more importantly, even when stratigraphic constraints are valid, it is possible that applying them will not improve the statistical inference.
A simple example to illustrate this point is that of two superimposed, distinct layers (so that a stratigraphic order is clear) whose true ages are equal (or in practice, for which the age difference is negligible compared to the typical uncertainties of the implemented dating method). In such a case, modelling the ages with stratigraphic constraints is likely to result in a loss of accuracy (the age of the older layer will be overestimated, and that of the younger layer underestimated) compared to a model where no stratigraphic constraints are imposed. Future developments of the BayLum package might include the possibility to test different modelling scenarios by comparing the agreement between the observations and the posterior probability densities, for example using the Bayes information criterion (BIC).
New models for building chronologies based on OSL, with the possibility to incorporate radiocarbon, have been proposed in the literature (Combès et al., 2015; Combès and Philippe, 2017). These models have been demonstrated to improve the chronological inference based on OSL data and in particular, the accuracy of OSL ages (Guérin et al., 2015a; Heydari and Guérin, 2018). The R package BayLum was developed to implement these models; Lahaye et al. (2018), Carter et al. (2019) and Heydari et al. (2020, 2021) have used some of them to establish the chronologies of sedimentary sequences dated by OSL, resulting in generally more precise chronologies.
In this article, we have presented a case study on how to build simple models and observe output data, particularly through bivariate plots of age probability densities. Then, we have shown how to include stratigraphic constraints in the models; we have described how to fill the covariance matrices to account for systematic errors in OSL age estimation; and we have shown the effect of including independent age information in the models, namely radiocarbon ages. Different tools to visualise and further analyse the output of BayLum were demonstrated.
As a result, it is now possible to use various sources of information often available in practice when dating stratigraphic sequences. Age inferences based on OSL and independent data (e.g. radiocarbon) in stratigraphic constraints are expected to benefit in terms of accuracy, precision and robustness, through the application of such Bayesian models.
The code lines used to generate all figures are provided in the Supplement.
Part of the data used to generate the figures is provided as an example dataset in the Supplement.
The supplement related to this article is available online at: https://doi.org/10.5194/gchron32292021supplement.
GG performed the experiments. GG, SK and AP wrote the code and produced the supplementary Rmd file containing the code. GG prepared the paper with contributions from all coauthors, in particular during two workshops held in Bordeaux.
The authors declare that they have no conflict of interest.
The authors thank Andrew Millard (twice), Eddie Rhodes, Rex Galbraith and two anonymous referees for helpful comments on previous versions of this article.
This study received financial support from the Région Aquitaine (in particular through the CHROQUI programme) and from the LaScArBx project (project no. ANR10LABX52). Martin Autzen and JanPieter Buylaert received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme ERC2014StG 639904RELOS. Guillaume Guérin received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (grant no. 851793) Project QuinaWorld (grant no. ERCStG2019).
This paper was edited by Richard Staff and reviewed by Andrew Millard and Ed Rhodes.
Aitken, M. J.: Thermoluminescence dating, Academic Press, London, 359 pp., 1985.
Aitken, M. J.: An introduction to optical dating, Oxford University press, Oxford, 267 pp., 1998.
Aitken, M. J. and Xie, J.: Moisture correction for annual gamma dose, Ancient TL, 8, 6–9, 1990.
Ankjærgaard, C. and Murray, A. S.: Total beta and gamma dose rates in trapped charge dating based on beta counting, Radiat. Meas., 42, 352–359, 2007.
BøtterJensen, L., Bulur, E., Duller, G. A. T., and Murray, A. S.: Advances in luminescence instrument systems, Radiat. Meas., 32, 523–528, 2000.
Bronk Ramsey, C. and Lee, S.: Recent and Planned Developments of the Program OxCal, Radiocarbon, 55, 720–730, 2013.
Carter, T., Contreras, D. A., Holcomb, J., Mihailović, D. D., Karkanas, P., Guérin, G., Taffin, N., Athanasoulis, D., and Lahaye, C.: Earliest occupation of the Central Aegean (Naxos), Greece: Implications for hominin and Homo sapiens' behavior and dispersals, Sci. Adv., 5, eaax0997, https://doi.org/10.1126/sciadv.aax0997, 2019.
Chevrier, B., Lespez, L., Lebrun B., Garnier, A., Tribolo, C., Rasse, M., Guérin, G., Mercier, N., Camara, A., Ndiaye, M., and Huysecom, E.: New data on settlement and environment at the 2 Pleistocene/Holocene boundary in SudanoSahelian West Africa: 3 interdisciplinary investigation at Fatandi V, Eastern Senegal, PlosOne, 15, e0243129, https://doi.org/10.1371/journal.pone.0243129, 2020.
Christophe, C., Philippe, A., Guérin, G., Mercier, N., and Guibert, P.: Bayesian approach to OSL dating of poorly bleached sediment samples: Mixture Distribution Models for Dose (MD^{2}), Radiat. Meas., 108, 59–73, 2018.
Christophe, C., Philippe, A., Kreutzer, S., and Guérin, G.: “BayLum”: Chronological Bayesian Models Integrating Optically Stimulated Luminescence and Radiocarbon Age Dating, R package, version 0.2.0, available at: https://CRAN.Rproject.org/package=BayLum, last access: 26 November 2020.
Combès, B. and Philippe, A.: Bayesian analysis of individual and systematic multiplicative errors for estimating ages with stratigraphic constraints in optically stimulated luminescence dating, Quat. Geochronol., 39, 24–34, 2017.
Combès, B., Lanos, P., Philippe, A., Mercier, N., Tribolo, C., Guérin, G., Guibert, P., and Lahaye, C.: A Bayesian central equivalent dose model for optically stimulated luminescence dating, Quat. Geochronol., 28, 62–70, 2015.
Cunningham, A. C., Murray, A. S., Armitage, S. J., and Autzen, M.: Highprecision natural dose rate estimates through beta counting, Radiat. Meas., 120, 209–214, 2018.
Duller, G. A. T.: Singlegrain optical dating of Quaternary sediments: why aliquot size matters in luminescence dating, Boreas, 37, 589–612, 2008.
Duller, G. A. T.: The Analyst software package for luminescence data: overview and recent improvements, Ancient TL, 33, 35–42, 2015.
Duller, G. A. T., BøtterJensen, L., Murray, A. S., and Truscott, A. J.: Single grain laser luminescence (SGLL) measurements using a novel automated reader, Nuclear Instruments and Methods B, 155, 506–514, 1999.
Galbraith, R. F. and Roberts, R. G.: Statistical aspects of equivalent dose and error calculation and display in OSL dating: An overview and some recommendations, Quat. Geochronol., 11, 1–27, 2012.
Galbraith, R. F., Roberts, R. G., Laslett, G. M., Yoshida, H., and Olley, J. M.: Optical dating of single and multiple grains of quartz from Jinmium rock shelter, northern Australia: Part I, experimental design and statistical models, Archaeometry, 41, 339–364, 1999.
Gelman, A. and Rubin, D.: Inference from Iterative Simulation using Multiple Sequences, Statist. Sci., 7, 457–511, 1992.
Guérin, G. and Mercier, N.: Determining gamma dose rates by field gamma spectroscopy in sedimentary media: results of Monte Carlo simulations, Radiat. Meas., 46, 190–195, 2011.
Guérin, G. and Mercier, N.: Preliminary insight into dose deposition processes in sedimentary media on a grain scale: Monte Carlo modelling of the effect of water on gamma doserates, Radiat. Meas., 47, 541–547, 2012.
Guérin, G., Mercier, N., and Adamiec, G.: Doserate conversion factors: update, Ancient TL, 29, 5–8, 2011.
Guérin, G., Discamps, E., Lahaye, C., Mercier, N., Guibert, P., Turq, A., Dibble, H., McPherron, S., Sandgathe, D., Goldberg, P., Jain, M., Thomsen, K., PatouMathis, M., Castel, J.C., and Soulier, M.C.: Multimethod (TL and OSL), multimaterial (quartz and flint) dating of the Mousterian site of the Roc de Marsal (Dordogne, France): correlating Neanderthals occupations with the climatic variability of MIS 5–3, J. Archaeol. Sci., 39, 3071–3084, 2012a.
Guérin, G., Mercier, N., Nathan R., Adamiec, G., and Lefrais, Y.: On the use of the infinite matrix assumption and associated concepts: a critical review, Radiat. Meas., 47, 778–785, 2012b.
Guérin, G., Murray, A. S., Jain, M., Thomsen, K. J., and Mercier, N.: How confident are we in the chronology of the transition between Howieson's Poort and Still Bay, J. Hum. Evol., 64, 314–317, 2013.
Guérin, G., Combès, B., Lahaye, C., Thomsen, K. J., Tribolo, C., Urbanova, P., Guibert, P., Mercier, N., and Valladas, H.: Testing the accuracy of a single grain OSL Bayesian central dose model with knownage samples, Radiat. Meas., 81, 62–70, 2015a.
Guérin, G., Frouin, M., Talamo, S., Aldeias, V., Bruxelles, L., Chiotti, L., Dibble, H. L., Goldberg, P., Hublin, J.J., Jain, M., Lahaye, C., Madelaine, S., Maureille, B, McPherron, S. P., Mercier, N., Murray, A. S., Sandgathe, D., Steele, T. E., Thomsen, K. J., and Turq, A.: A Multimethod Luminescence Dating of the Palaeolithic Sequence of La Ferrassie Based on New Excavations Adjacent to the La Ferrassie 1 and 2 Skeletons, J. Archaeol. Sci., 58, 147–166, 2015b.
Guérin, G., Jain, M., Thomsen, K. J., Murray, A. S., and Mercier, N.: Modelling dose rate to single grains of quartz in wellsorted sand samples: the dispersion arising from the presence of potassium feldspars and implications for single grain OSL dating, Quat. Geochronol., 27, 52–65, 2015c.
Guérin, G., Christophe, C., Philippe, A., Murray, A. S., Thomsen, K. J., Tribolo, C., Urbanova, P., Jain, M., Guibert, P., Mercier, N., Kreutzer, S., and Lahaye, C.: Absorbed dose, equivalent dose, measured dose rates, and implications for OSL age estimates: Introducing the Average Dose Model, Quat. Geochronol., 41, 163–173, 2017.
Guibert, P.: Progrès récents et perspectives. Habilitation à diriger des recherches, 10 juillet 2002. Datation par thermoluminescence des archéomatériaux: recherches méthodologiques et appliquées en archéologie médiévale et en archéologie préhistorique, 3, Université de Bordeaux, Bordeaux, 27–50, 2002.
Guibert, P. and Schvoerer, M.: TL dating : Low background gamma spectrometry as a tool for the determination of the annual dose, Nucl. Tracks Rad. Meas., 18, 231–238, 1991.
Guibert, P., Schvoerer, M., Etcheverry, M. P., Szepertyski, B., and Ney, C.: IXth millenium B.C. ceramics from Niger: detection of a Useries disequilibrium and TL dating, Quaternary Sci. Rev., 13, 555–561, 1994.
Guibert, P., Lahaye, C., and Bechtel, F.: The importance of Useries disequilibrium of sediments in luminescence dating: A case study at the Roc de Marsal Cave (Dordogne, France), Radiat. Meas., 44, 223–231, 2009.
Hansen, V., Murray, A., Buylaert, J. P., Yeo, E. Y., and Thomsen, K.: A new irradiated quartz for beta source calibration, Radiat. Meas., 81, 123–127, 2015.
Heydari, M. and Guérin, G.: OSL signal saturation and dose rate variability: investigating the behaviour of different statistical models, Radiat. Meas., 120, 96–10, 2018..
Heydari, M., Guérin, G., Kreutzer, S., Jamet, G., Kharazian, M. A., Hashemi, M., Nasab, H. V., and Berillon, G.: Do Bayesian methods lead to more precise chronologies? “BayLum” and a first OSLbased chronology for the Palaeolithic openair site of Mirak (Iran), Quat. Geochronol., 59, 101082, https://doi.org/10.1016/j.quageo.2020.101082, 2020.
Heydari, M., Guérin, G., Zeidi, M., and Conard, N. J.: Bayesian luminescence dating at Ghāre Boof, Iran, provides a new chronology for Middle and Upper Paleolithic in the southern Zagros, J. Hum. Evol., 151, 1–15, https://doi.org/10.1016/j.jhevol.2020.102926, 2021.
Huntley, D. J., GodfreySmith, D. I., and Thewalt, M. L. W.: Optical dating of sediments, Nature, 313, 105–107, 1985.
Jacobs, Z., Roberts, R. G., Galbraith, R. F., Deacon, H. J., Grün, R., Mackay, A., Mitchell, P., Vogelsang, R., and Wadley, L.: Ages for the Middle Stone Age of southern Africa: implications for human behavior and dispersal, Science, 322, 733–735, 2008.
Kreutzer, S., Schmidt, C., Fuchs, M. C., Dietze, M., Fischer, M., and Fuchs, M.: Introducing an R package for luminescence dating analysis, Ancient TL, 30, 1–8, 2012.
Kreutzer, S., Martin, L., Guérin, G., Tribolo, C., Selva, P., and Mercier, N.: Environmental Dose Rate Determination Using a Passive Dosimeter: Techniques and Workflow for alphaAl2O3:C Chips, Geochronometria, 45, 56–67, 2018.
Kreutzer, S., Dietze, M., Burow, C., Fuchs, M. C., Schmidt, C., Fischer, M., Friedrich, J., Mercier, N., Smedley, R. K., Christophe, C., Zink, A., Durcan, J., King, G.E.,Phlippe, A., Guérin, G., Riedesel, S., Autzen, M., and Guibert, P.: Luminescence: Comprehensive Luminescence Dating Data Analysis. R package, version 0.9.7, available at: https://CRAN.Rproject.org/package=Luminescence, last access: 10 November 2020.
Lahaye, C., Guibert, P., and Bechtel, F.: Uranium series disequilibrium detection and annual dose determination: A case study on Magdalenian ferruginous heated sandstones (La Honteyre, France), Radiat. Meas., 47, 786–789, 2012.
Lahaye, C., Guérin, G., Gluchy, M., Hatté, C., Fontugne, M., ClementeConte, I., Santos, J. C., Villagran, X., Da Costa, A., Borges, C., Guidon, N., and Boëda, E.: Another site, same old song: the PleistoceneHolocene archaeological sequence of Toca da Janela da Barra do AntoniãoNorth, Piauí, Brazil, Quat. Geochronol., 49, 223–229, 2018.
Lanos, P. and Philippe, A.: Event date model: a robust Bayesian tool for chronology building, Communications for Statistical Applications and Methods, 25, 1–28, 2018.
Lebrun, B., Frerebeau, N., Paradol, G., Guérin, G., Mercier, N., Tribolo, C., Lahaye, C., and Rizza, M.: gamma: An R Package for Dose Rate Estimation from InSitu GammaRay Spectrometry Measurements, Ancient TL, 18, 1–5, 2020.
Liritzis, I., Stamoulis, K., Papachristodoulou, C., and Ioannides, K.: A reevaluation of radiation doserate conversion factors, Mediterranean Archaeology and Archaeometry, 13, 1–15, 2013.
Løvborg, L. and Kirkegaard, P.: Response of ${\mathrm{3}}^{\prime \prime}\times {\mathrm{3}}^{\prime \prime}$ NaI(Tl) detectors to terrestrial gamma radiation, Nucl. Instr. Methods, 121, 239–251, 1974.
Martin, L., Mercier, N., Incerti, S., Lefrais, Y., Pecheyran, C., Guérin, G., Jarry, M., Bruxelles, L., Bon, F., and Pallier, C.: Dosimetric study of sediments at the beta dose rate scale: Characterization and modelization with the DosiVox software, Radiat. Meas., 81, 134–141, 2015.
Mauz, B., Packman, S., and Lang, A.: The alpha effectiveness in siltsized quartz: New data obtained by single and multiple aliquot protocols, Ancient TL, 24, 47–52, 2006.
Mejdahl, V.: Internal radioactivity in quartz and feldspar grains, Ancient TL, 5, 10–17, 1987.
Mercier, N., Kreutzer, S., Christophe, C., Guérin, G., Guibert, P., Lahaye, C., Lanos, P., Philippe, A., and Tribolo, C.: Bayesian statistics in luminescence dating: The “baSAR”model and its implementation in the R package “Luminescence”, Ancient TL, 34, 14–21, 2016.
Miallier, D., Guérin, G., Mercier, N., Pilleyre, T., and Sanzelle, S.: The Clermont radiometric reference rocks: a convenient tool for dosimetric purposes, Ancient TL, 27, 37–42, 2009.
Millard, A.: Bayesian analysis of ESR dates, with application to border cave, Quat. Geochronol., 1, 159–166, 2006a.
Millard, A.: Bayesian analysis of Pleistocene chronometric methods, Archaeometry, 48, 359–375, 2006b.
Murray, A. S. and Olley, J. M.: Precision and Accuracy in the Optically Stimulated Luminescence Dating of Sedimentary Quartz: A Status Review, Geochronometria, 21, 1–16, 2002.
Murray, A. S. and Wintle, A. G.: Luminescence dating of quartz using an improved singlealiquot regenerativedose protocol, Radiat. Meas., 32, 57–73, 2000.
Murray, A. S. and Wintle, A. G.: The single aliquot regenerative dose protocol: potential for improvements in reliability., Radiat. Meas., 37, 377–381, 2003.
Murray, A. S., Buylaert, J.P., and Thiel, C.: A luminescence dating intercomparison based on a Danish Beachridge sand, Radiat. Meas., 81, 32–38, 2015.
Nathan, R. P. and Mauz, B.: On the doserate estimate of carbonaterich sediments for trapped charge dating, Radiat. Meas., 43, 14–25, 2008.
Nelson, M. S. and Rittenour, T. M.: Using grainsize characteristics to model soil water content: Application to doserate calculation for luminescence dating, Radiat. Meas., 81, 142–149, 2015.
Philippe, A. and Vibet, M.: “Analysis of Archaeological Phases Using the R Package ArchaeoPhases, J. Statist. Softw., 93, 1–25, 2020.
Philippe, A., Guérin, G., and Kreutzer, S.: “BayLum” an R package for Bayesian Analysis of OSL Ages & Chronological Modelling, Quat. Geochronol., 49, 16–24, 2019.
Prescott, J. R. and Hutton, J. T.: Cosmic ray contributions to dose rates for luminescence and ESR dating: large depths and longterm time variations, Radiat. Meas., 23, 497–500, 1994.
Prescott, J. R. and Hutton, J. T.: Cosmic ray and gamma ray dosimetry for TL and ESR, Nucl. Tracks Rad. Meas., 14, 223–227, 1994.
R Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, available at: https://rproject.org, last access: 6 November 2020.
Reimer, P. J., Austin, W. E. N., Bard, E., Bayliss, A., Blackwell, P. G., Bronk Ramsey, C., Butzin, M., Cheng, H., Edwards, R. L., Friedrich, M., Grootes, P. M., Guilderson, T. P., Hajdas, I., Heaton, T. J., Hogg, A. G., Hughen, K. A., Kromer, B.; Manning, S. W., Muscheler, R., Palmer, J. G., Pearson, C., van der Plicht, J., Reimer, R. W., Richards, D. A., Scott, E. M., Southon, J. R., Turney, C. S. M., Wacker, L., Adolphi, F., Büntgen, U., Capano, M., Fahrni, S. M., FogtmannSchulz, A., Friedrich, R., Köhler, P., Kudsk, S., Miyake, F., Olsen, J., Reinig, F., Sakamoto, M., Sookdeo, A., and Talamo, S.: The IntCal20 northern hemisphere radiocarbon age calibration curve (0–55 cal kBP), Radiocarbon, 62, 725–757, 2020.
Rhodes, E. J., Ramsey, C. B., Outram, Z., Batt, C., Willis, L., Dockrill, S., and Bond, J.: Bayesian methods applied to the interpretation of multiple OSL dates: high precision sediment ages from Old Scatness Broch excavations, Shetland Isles, Quaternary Sci. Rev., 22, 1231–1244, 2003.
Richter, D., Dombrowski, H., Neumaier, S., Guibert, P., and Zink, A.: Environmental gamma dosimetry for insitu sediment measurements by OSL of aAl2O3:C, Radiat. Prot. Dosim., 141, 27–35, 2010.
Thomsen, K. J., Murray, A. S., and BøtterJensen, L.: Sources of variability in OSL dose measurements using single grains of quartz, Radiat. Meas., 39, 47–61, 2005.
Thomsen, K. J., Murray, A. S., Buylaert, J.P., Jain, M., HeltHansen, J., and Aubry, T.: Testing singlegrain quartz OSL methods using known age samples from the BordesFitte rockshelter (Roches d'Abilly site, Central France), Quat. Geochronol., 31, 77–96, 2016.
Tribolo, C., Mercier, N., and Valladas, H.: Alpha sensitivity determination in quartzite using an OSL single aliquot procedure, Ancient TL, 19, 47–50, 2001.
Wintle, A. G. and Murray, A. S.: A review of quartz optical stimulated luminescence characteristics and their relevance in singlealiquot regeneration dating protocols, Radiat. Meas., 41, 369–391, 2006.
Zimmerman, D. W.: Thermoluminescence dating using fine grains from pottery, Archaeometry, 13, 29–52, 1971.
 Abstract
 Introduction
 Samples and methods
 First simple model and output
 Stratigraphic constraints
 Dealing with multiple sources of errors through a covariance matrix
 Integration of independent chronological data (radiocarbon)
 Discussion
 Conclusion
 Code availability
 Data availability
 Author contributions
 Competing interests
 Acknowledgements
 Financial support
 Review statement
 References
 Supplement
 Abstract
 Introduction
 Samples and methods
 First simple model and output
 Stratigraphic constraints
 Dealing with multiple sources of errors through a covariance matrix
 Integration of independent chronological data (radiocarbon)
 Discussion
 Conclusion
 Code availability
 Data availability
 Author contributions
 Competing interests
 Acknowledgements
 Financial support
 Review statement
 References
 Supplement