the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
The daughter–parent plot: a tool for analyzing thermochronological data
Birk Härtel
Eva Enkelmann
Download
- Final revised paper (published on 26 Jul 2024)
- Supplement to the final revised paper
- Preprint (discussion started on 25 Jan 2024)
Interactive discussion
Status: closed
-
RC1: 'Comment on gchron-2024-1', Stephen Cox, 22 Feb 2024
This submission is designed to argue that isochron plots, rather than age-eU plots, should be used to interpret compositional trends in thermochronological data. This point has been made before, notably by Vermeesch (2008) and by this same lead author in a paper published in December 2022. Aside from some simulated data plots that might have been beneficial to bolster the theoretical arguments made in the 2022 paper, it is not clear to me what innovation is presented in the present work. I would request that the authors clearly explain the following in detail:
- Why are we calling these "daughter-parent plots" and not isochron plots, as they have been known in the literature for decades? Like the similar "isotope correlation diagram," this seems unnecessary and far more likely to cause confusion than the technical clarification that the authors presumably intend. The burden placed on the person who would request a change scales with the preponderance of contrary historical and common usage. In this case, the burden is a heavy one and the authors do not even engage with it. At the very least, it should be openly acknowledged that this is just a name change for an extremely familiar tool, otherwise it seems like this is designed to confuse the reader into thinking this is something new.
- How does this work expand meaningfully on the arguments made in Härtel et al., 2022 and already largely covered by previous work (e.g., Vermeesch 2008)? Generating some new plots from simulated data does not constitute an advance that merits an additional paper. Using a longer format journal as an extended discussion section or appendix for another publication is bad practice.
Beyond those two concerns, I question the practical utility of this approach and I see a real potential for misuse and confusion. Generally speaking, it is best to stick to the simplest possible geological interpretations unless there is a convincing physical AND statistical basis for doing otherwise. The examples in figure 1 and sections 2.2 and 3.3 should be backed up by both numerous real world datasets demonstrating the pattern and independent physical evidence that the purported physical phenomena are the reason that the patterns exist.
As it stands, the manuscript does not contain very much real data, and it makes a number of assertions that are not backed up by any such data. For example, in line 163-164, the authors assert that "the arithmetic mean age does usually not provide a reliable sample age." The citations provided are again Vermeesch 2008 and Hartel et al. 2022. This argument does not appear to be developed here or in the 2022 paper. The Vermeesch 2008 paper, on the other hand, demonstrates that this is true only in extremely rare special cases. Figures 4 and 5 provide the only real data, but they do not provide much evidence for the assertions made in figure 1. Figure 4a shows significant statistical scatter, enough that it clearly does not matter exactly how the age is calculated. Figure 4b seems like a better example of the uncertainty being inappropriately reduced (again without a statistically significant change in the reported age) on the basis of a questionable claim about parentless helium than an example of this technique working. And the data in figure 4c are apparently not interpretable regardless of the presentation method. Figure 5 shows another pitfall of insisting on plotting D vs P rather than age vs eU because of concern over rare cases of spurious correlations: without age on the plot, the usefulness of these figures in making actual geological interpretations is much reduced compared to the original presentations. And it does not seem that the authors are asserting that the original sources of these datasets made any mistakes or missed any interpretations on the basis of how they chose to present the data.
If the purpose of this paper is ostensibly to provide a practical "workflow" for real world samples, it should present real world examples showing a significant benefit from an approach that will otherwise add complication, confusion, and the potential for overinterpretation to current workflows. If it is not the case that these scenarios are commonplace and can be readily distinguished, the proposed data reduction workflow is at best unnecessary and at worst will lead to spurious overinterpretations of scattered or otherwise flawed datasets.
Citation: https://doi.org/10.5194/gchron-2024-1-RC1 -
AC1: 'Reply on RC1', Birk Härtel, 23 Mar 2024
We appreciate the review comments by Stephen Cox, whose suggestions will significantly improve our manuscript. We are pleased that he does not contest the theoretical concept of the workflow, nor does he point out any technical mistake. However, there seems to be a misunderstanding regarding the scope and purpose of this contribution, and he suggests to include examples of real-world datasets to improve the impact and show the applicability of the D-P plot.
Below we respond to each of these major and all minor comments in detail. R1 marks the reviewer#1 comments and A the authors’ replies.
R1: This submission is designed to argue that isochron plots, rather than age-eU plots, should be used to interpret compositional trends in thermochronological data. This point has been made before, notably by Vermeesch (2008) and by this same lead author in a paper published in December 2022. Aside from some simulated data plots that might have been beneficial to bolster the theoretical arguments made in the 2022 paper, it is not clear to me what innovation is presented in the present work.
A: We see a misunderstanding here, regarding the scope and motivation of our manuscript. It is not our aim to replace the age-eU plot by the D-P plot or to repeat the work of Vermeesch (2008) or Härtel et al. (2022). While these publications are concerned with applying the D-P plot to specific problems, we look at the bigger picture of making sense of thermochronological data in general. The mentioned publications are theoretical and mathematics-heavy and for that reason overlooked by the many users of thermochronometric data that are mostly interested in answering their geological question. We think that the D-P plot sits at the important interface between the analytical results and more specific data-analysis tools, such as radial plots or age vs. grainsize plots. However, there is a gap in thermochronological literature on how to use the D-P plot for data analysis towards a geological interpretation.
Our aim is to fill this gap and provide guidance to an audience that is less knowledgeable on the analytics of laboratory data such as Dr. Cox. Our own experience from providing low-temperature analytical services for others (university collaborators and students, geological surveys, and industry) has shown that many of them feel lost and confused how to proceed when given their data. The simply referral to publications such as Vermeesch (2008), Härtel et al. (2022) is not sufficient. Thus – the purpose of this manuscript is to target this “user” audience, which is significantly large, and describe how to use patterns in the D-P plot for data analysis and linking its use to sample-age calculation and the use of more common data-analysis tools (e.g., Flowers et al., 2022a, b ; Kohn et al., 2024).
This comment made us aware that this goal and target audience has not been stated and we revised our introduction section to clearly state our motivation (see also responses and revisions below that are targeted towards this goal).
R1: I would request that the authors clearly explain the following in detail: Why are we calling these "daughter-parent plots" and not isochron plots, as they have been known in the literature for decades? Like the similar "isotope correlation diagram," this seems unnecessary and far more likely to cause confusion than the technical clarification that the authors presumably intend.
The burden placed on the person who would request a change scales with the preponderance of contrary historical and common usage. In this case, the burden is a heavy one and the authors do not even engage with it.
At the very least, it should be openly acknowledged that this is just a name change for an extremely familiar tool, otherwise it seems like this is designed to confuse the reader into thinking this is something new.
A: We understand that there is a strong similarity between the D-P and the isochron plot, which might be confusing to readers familiar with the concept from radioisotopic dating, which is the background of Dr. Cox.
We prefer to stick to the name ‘daughter-parent plot’ due to the following reasons: (1) technically, the D-P plot is not the same as classic isochron plot. In contrast to the D-P plot, the latter assumes the initial presence of the radiogenic daughter isotope by default, and its x and y data are same-denominator ratios with error correlation. (2) In terms of the intended audience, our paper mainly aims at the low-temperature thermochronology community, in which such a plot is not a common tool, and when used, there is no consensus on calling it “isochron plot” (Fanale and Kulp, 1962; Green, 1981; Wernicke and Lippolt, 1993; Galbraith, 1997; Dunkl, 2002; Vermeesch, 2008; Hueck et al., 2018; He et al., 2021; Meier et al., 2024). (3) from a practical perspective, the name “isochron plot” emphasizes fitting a line to the data and reporting an isochron age. This is not a good use of the plot in cases where the data do not define a linear trend but where the D-P plot is still a useful tool for characterizing the age variation. (4) the name daughter-parent plot is very descriptive of what is plotted and thus easily accessible for users.
We agree with the reviewer that this naming may cause confusion for those with a background in geochronology and using isochron plots, for that reason, we will revise our manuscript and explain the apparent similarities and differences between the D-P and the isochron plot.
R1: How does this work expand meaningfully on the arguments made in Härtel et al., 2022 and already largely covered by previous work (e.g., Vermeesch 2008)? Generating some new plots from simulated data does not constitute an advance that merits an additional paper. Using a longer format journal as an extended discussion section or appendix for another publication is bad practice.
A: This manuscript focuses on the more general problem of analyzing thermochronological data from D-P relationships. We are not aware of any publication detailing an interpretation strategy for thermochronological data using the D-P plot as our Figure 2 and section 3.3, both of which are uncontested by Dr. Cox.
We disagree that this article should be an appendix or extended discussion section to Härtel et al. (2022). This publication dealt with problems related to the uses of age vs. eU plots due to spurious correlation. Hence – they deal with the mathematical reasons of observed correlations that many users of this plot try to interpret in a geological meaningful way. The D-P plot was suggested in these papers as an alternative plot that is mathematically correct. However, none of these papers suggest how to analyze thermochronological data in general or the D-P plot specifically.
Based on this and previous comments, we agree that the goal of this manuscript needs to be better formulated. In addition, we expanded our revised manuscript to include several examples of published real-world data that we analyzed using our proposed guide for the D-P plot (see comment below).
R1: Beyond those two concerns, I question the practical utility of this approach and I see a real potential for misuse and confusion. Generally speaking, it is best to stick to the simplest possible geological interpretations unless there is a convincing physical AND statistical basis for doing otherwise. The examples in figure 1 and sections 2.2 and 3.3 should be backed up by both numerous real world datasets demonstrating the pattern and independent physical evidence that the purported physical phenomena are the reason that the patterns exist.
As it stands, the manuscript does not contain very much real data, and it makes a number of assertions that are not backed up by any such data.
A: We agree with the suggestion of supporting our conceptual patterns in the D-P plots with actual geological data. We will add several well-studied examples for all D-P relationships shown in Figure 1 to our manuscript including fission-track and (U-Th)/He datasets.
R1: For example, in line 163-164, the authors assert that "the arithmetic mean age does usually not provide a reliable sample age." The citations provided are again Vermeesch 2008 and Hartel et al. 2022. This argument does not appear to be developed here or in the 2022 paper. The Vermeesch 2008 paper, on the other hand, demonstrates that this is true only in extremely rare special cases.
A: Concerning the arithmetic mean age as sample age, the submitted manuscript and the cited sources support our argument:
(1) Section 3.3 of this manuscript points to Appendix C, which explicitly discusses the bias of the arithmetic mean age.
(2) The last paragraph of section 2 in Härtel et al. (2022) provides a short discussion on the effects of systematic offsets on mean ages. The supplementary material of that article also provides the equations for mean-age bias (arithmetic, geometric and harmonic) caused by random error of D and P.
(3) As Dr. Cox correctly points out, the abstract of Vermeesch (2008) mentions that the difference between the arithmetic mean and the central age is “relatively small” in the case of typical reproducibility of single-grain ages in (U-Th)/He dating. This is not the same as the arithmetic being reliable. In addition, this assumption may not hold for fission-track data, whose reproducibility varies strongly with the track counts.
The main text of that article presents different sample ages for (U-Th)/He dating instead of the arithmetic mean, with each having its specific advantages and applications (section 3). In the case of parentless helium (section 3.2), for example, Vermeesch (2008) recommends using the isochron age; in this case neither the central age nor the arithmetic mean age represent the sample age well.
R1: Figures 4 and 5 provide the only real data, but they do not provide much evidence for the assertions made in figure 1. Figure 4a shows significant statistical scatter, enough that it clearly does not matter exactly how the age is calculated. Figure 4b seems like a better example of the uncertainty being inappropriately reduced (again without a statistically significant change in the reported age) on the basis of a questionable claim about parentless helium than an example of this technique working. And the data in figure 4c are apparently not interpretable regardless of the presentation method.
A: In general, Figure 4 seems to have caused quite some confusion. Its aim was to provide examples of using the workflow in Figure 2, not to provide evidence for the D-P relationships in Figure 1.
In Figure 4a, there is little difference between the pooled and the isochron age not because of statistical scatter, but because D and P are approximately proportional. Sample ages deviate more from each other the more the D-P data differ from proportionality (see Appendix C). We agree that the isochron and the pooled age in Figure 4b overlap within 2s uncertainties. However, this does not mean they are the same, as can be seen from the large difference between the uncertainties and the intercept of the isochron, which is significantly different from zero. The data in Figure 4c are difficult to interpret, but their scatter does not mean they are uninterpretable (see section 3.3.6). The scatter may be due to kinetic differences between the grains due to a property not captured by the measurements, but even without this information it is important to document the D-P relationship (see Table 1), instead of not showing the data.
We will make an effort to re-work Figure 4 and section 3.4 to clarify their purpose and potentially exchange some of the examples to show a more diverse range of scenarios.
R1: Figure 5 shows another pitfall of insisting on plotting D vs P rather than age vs eU because of concern over rare cases of spurious correlations.
A: It is not our intention to insist on analyzing thermochronological data using only the D-P plot (see section 3.3). We prioritize showing D-P plots in Fig. 5 because they are the focus of this manuscript; the original sources of these data display other plots. Our reason to stress the importance of avoiding spurious correlations is their frequent occurrence in ratio-vs.-denominator plots (see textbooks by Chayes, 1978; Rollinson, 1993).
R1: Without age on the plot, the usefulness of these figures in making actual geological interpretations is much reduced compared to the original presentations.
A: We agree that it is often useful to compare single-grain ages to other parameters, e.g. grain size or chemistry. However, our approach prioritizes detecting systematic offsets and analyzing outliers from the D-P relationship (see L.31), before moving on to more specific data-analysis tools (e.g., section 3.3.2, Figure 3c).
We will make sure to state this explicitly in the revised manuscript and will add a direct comparison between the D-P plot and those directly showing the age.
R1: And it does not seem that the authors are asserting that the original sources of these datasets made any mistakes or missed any interpretations on the basis of how they chose to present the data.
A: The fact that we do agree with the original interpretation of these selected examples, does not mean that our approach does not work. The purpose of Figure 5 was to show the D-P plot applied to multi-sample datasets, e.g. for pooling small datasets and comparing them to modelling results (5a), comparing data with to external parameters such as spatial information (5b), or directly comparing the daughter retention of different minerals in the same sample (5c). We understand that it did not reach this goal, and therefore decided to remove it from the manuscript.
R1: If the purpose of this paper is ostensibly to provide a practical "workflow" for real world samples, it should present real world examples showing a significant benefit from an approach that will otherwise add complication, confusion, and the potential for overinterpretation to current workflows.
A: We agree that showing D-P plots from real datasets displaying relationships in Figure 1 would be a strong addition to the manuscript.
We will therefore include new figures with a range of D-P patterns of published datasets and add a comparison of the D-P and its properties compared to other plots (e.g. age-eU, radial or age-grainsize plots).
R1: If it is not the case that these scenarios are commonplace and can be readily distinguished, the proposed data reduction workflow is at best unnecessary and at worst will lead to spurious overinterpretations of scattered or otherwise flawed datasets.
A: We agree wit Dr. Cox that for implementing a new data-analysis scheme, it is advantageous to show that it is more than a theoretical concept. In our opinion, the proposed analytical strategy is relatively robust to over-interpretation because the D-P plot displays the data without a preconceived idea regarding the cause of age variation. Relying on a simple scatter plot with two independent axes representing minimally processed data facilitates recognizing non-ideal (not necessarily ‘flawed’) datasets.
We will add well-documented examples to the manuscript to support this point.
Citation: https://doi.org/10.5194/gchron-2024-1-AC1
-
RC2: 'Comment on gchron-2024-1', Anonymous Referee #2, 23 Feb 2024
The manuscript “The daughter-parent plot: a tool for analyzing thermochronological data” by Härtel and Enkelmann presents the case for using a “daughter-parent” plot for thermochronological data. The manuscript is well written and follows a logical progression of introduction of the technique, background, and application of the method using synthetic and real examples. The main idea of the paper is the presentation of the D-P plot and assigning a decision-based classification scheme based on data relationships.
Main comments:
My initial concern is that the paper is reframing established geochronological concepts that may further confuse readers. Therefore, my concerns may be more related to the specific framing of the paper. Their manuscript shows various synthetic and some real examples but in my opinion does not present a strong argument for adoption of the “D-P plot.” I don’t understand why the D-P plot should be used in place of any of the other well established characterization methods that are discussed in the paper (e.g., isochron). There also seems to be considerable material overlap with previous paper(s) by the first author, specifically their 2022 paper (EPSL v. 599).
To reiterate, a detail I keep coming back to as I read the manuscript is that in most cases, the pooled/central age and isochron ages etc. seem to be the end goal of the decision-tree approach—but those methods already have established and definable criteria for their use and have been used in the geochronology community for decades. They are conveniently summarized in the Figure 2 text in the manuscript and Section 3.3. Why should the D-P plot be used with or in place of the pre-existing methods, especially if it is an intermediate step? The D-P plot seems to be a reinvention (or redefinition) of criteria that are already utilized for determining the model age of a thermochronological sample.
For example, when assessing a dataset of an assumed single population of grain ages, the choice between pooled age versus central age is mostly a statistical one. We already use radial plots and KDEs for identifying and quantifying different age populations, why should the D-P plot also be used? ...or used instead of these other plots? Another point is that if data display some weak trend but have large uncertainties, they of course can be treated as a “cluster”—this happens quite frequently with fission-track datasets generated by the external detector method. Again, why should the D-P plot be used instead of, or in addition to, pre-existing methods? Some of the arguments posed do not always hold up. EDM data characterized by large single-grain uncertainties may actually mask real age populations and therefore the data should NOT be treated as a data cluster. Would the D-P plot be useful in such a case? In some instances, other data (apatite composition, U-Pb ages, provenance, etc.) tell us there are likely multiple kinetic populations. This must be assessed by a skilled and experienced thermochronologist. As we well know, statistical methods are not infallible. The data are the data, and usually do not conform to our expectations.
This is my opinion, but I generally think that strict decision-based approaches are problematic for some types of data analysis, mainly because they are often subjective in the sense that they are based on a few known or conjured scenarios. That is not to say such approaches are entirely without merit, but it can be a slippery slope in terms of their careful use and application. What happens when data do not adhere to the provided scheme? I vaguely understand the intention behind the D-P plot, but in reality most users simply (and rather unfortunately) want The Answer and don’t want to spend lots of time doing detailed data analysis and interpretation. This is problematic and thus presents an opportunity for blind application by users, which is a nontrivial matter in thermochronological data interpretation. An analogous case is the practice of binning and averaging U-Th/He dates by eU for modeling—there is simply no legitimate or statistical rationale for doing it.
Other comments:
Outside of mineral age standards, how often do thermochronology datasets not meet the “small dataset” consideration in Section 4.1? The authors essentially define this case as the norm and say the D-P plot can’t really be used other than as a “qualitative guide”. This is a bit confusing.
Figure 5 presents a good example of showing data in He (nmol/g; daughter) vs. eU (ug/g; parent)—how do those plots compare to the normal plots of “apparent age” vs. eU? That should be shown. If they are the different, is there an advantage of the D-P plot? If they are the same, why use it? It is not clear to me why the datasets are shown in Figure 5? What is the message of this figure? For example, fig. 5a shows Miltich (2005) zircon helium data with different radiation damage relationship prediction curves for different thermal history scenarios. Is there a damage scenario the authors prefer or are they arbitrarily shown, and if arbitrary, why show them at all? Are the high-He grains outliers or is there just data scatter? Fig. 5c shows eU and He concentration for 3 different minerals. Suffice to say, they should plot differently since each mineral has different and characteristic parent/daughter isotopes, depending upon the mineral. What should the takeaway of this figure be?
Some line comments:
Line 17: what is meant by “consistent and traceable” with respect to thermochronometric data analysis? When someone plots U-Th/He dates versus eU, are there obvious issues with consistency and how it was done?
Line 21: use of word “ingredient” is odd, maybe change that.
Figure 4c: remove the dashed lines and min/max ages. As shown this looks like a radial plot and is misleading.
Citation: https://doi.org/10.5194/gchron-2024-1-RC2 -
AC2: 'Reply on RC2', Birk Härtel, 23 Mar 2024
We are pleased to receive the comments of reviewer #2 as they stimulate an in-depth discussion of data analysis. While the reviewer approves our general approach, there are some questions about the usefulness of the D-P plot. Essentially, the reviewer raises the concern that the D-P plot might not be necessary for analyzing data due to the multitude of readily available tools. In response to the reviewer’s concerns, we will emphasize the unique benefits of the D-P plot in the introducing sections of our revised manuscript and clarify our motivation for providing this workflow.
Below we answer each of the comments; R2 marks the reviewer#2 comments and A the authors’ replies.
R2: The manuscript “The daughter-parent plot: a tool for analyzing thermochronological data” by Härtel and Enkelmann presents the case for using a “daughter-parent” plot for thermochronological data. The manuscript is well written and follows a logical progression of introduction of the technique, background, and application of the method using synthetic and real examples. The main idea of the paper is the presentation of the D-P plot and assigning a decision-based classification scheme based on data relationships.
My initial concern is that the paper is reframing established geochronological concepts that may further confuse readers. Therefore, my concerns may be more related to the specific framing of the paper. Their manuscript shows various synthetic and some real examples but in my opinion does not present a strong argument for adoption of the “D-P plot.” I don’t understand why the D-P plot should be used in place of any of the other well established characterization methods that are discussed in the paper (e.g., isochron).
A: We are glad to see the general positive response of the reviewer, but see a misunderstanding that occured. Our main argument is not that the current data-analysis schemes are wrong, but that there are unique benefits of using the D-P plot (see below), that are underappreciated because of few thermochronological studies actually applying it. Besides, we envision the D-P plot as a simple device to navigate through the large number of data-analysis tools available, some of which are applicable to a given sample/dataset while others are not. Our aim is therefore to point out the benefits of D-P plotting for thermochronological data and provide guidance for integrating the D-P plot into a data-analysis scheme.
We will clarify our motivations in the introduction of the revised manuscript.
R2: There also seems to be considerable material overlap with previous paper(s) by the first author, specifically their 2022 paper (EPSL v. 599).
A: We will reduce the overlap with our previous works in the revised manuscript.
R2: To reiterate, a detail I keep coming back to as I read the manuscript is that in most cases, the pooled/central age and isochron ages etc. seem to be the end goal of the decision-tree approach—but those methods already have established and definable criteria for their use and have been used in the geochronology community for decades. They are conveniently summarized in the Figure 2 text in the manuscript and Section 3.3. Why should the D-P plot be used with or in place of the pre-existing methods, especially if it is an intermediate step? The D-P plot seems to be a reinvention (or redefinition) of criteria that are already utilized for determining the model age of a thermochronological sample.
A: We agree that there are established criteria for using certain sample ages or data-analysis tools and we do not wish to contest these. The reviewer is also correct that the D-P plot contains information also provided by other data-analysis tools. Still, we argue that it is not only a redefinition of known criteria, but actually has its own unique benefits:
-
it is the only thermochronological data plot for detecting systematic offsets in daughter or parent concentrations (e.g. Vermeesch, 2008 for “parentless He” from inclusions in apatite), which compromise most data-analysis tools (section 3.3.3).
-
It enables us to detect radiation-damage effects on daughter retention without being misled by spurious correlation (e.g. Carter, 1990; Härtel et al., 2022).
-
It allows identifying outliers in the D-P plot with respect to their daughter and parent concentrations and not the single-grain age alone (e.g., He et al., 2021). Examining datasets for outliers is essential for data analysis, and the D-P plot provides an additional perspective.
We will emphasize these benefits in section 2 of the revised manuscript and move the description of the impact of systematic offsets from section 3.3.3 to this section.
R2: For example, when assessing a dataset of an assumed single population of grain ages, the choice between pooled age versus central age is mostly a statistical one.
A: The reviewer correctly points out that the choice between the pooled and central age usually follows a statistical criterion, for example the χ2 test. For this decision no D-P plot is necessary, however, the D-P plot can be a criterion to assume a single age population beforehand, or, more importantly, point at a systematic offset or radiation-damage-dependent retention. Both of these factors may (1) cause the χ2 test to fail, and (2) compromise an interpretation based on the pooled or central age.
R2: We already use radial plots and KDEs for identifying and quantifying different age populations, why should the D-P plot also be used? ...or used instead of these other plots?
A: Again, we agree with the reviewer that there is nothing wrong with using KDE or radial plots; these are the tools of choice to analyze different age populations. However, we argue that there are some considerations necessary before applying them, to make sure that shoulders in the KDE curve are not due to statistical outliers or systematic offsets; or that observed patterns in the radial plot reflect age populations rather than age-uncertainty correlations (e.g., Jonckheere et al., 2024). Assessing the age variation, finding outliers (section 3.3), or testing for systematic offsets (section 3.3.3) can be done conveniently by the D-P plot. On top of that, the D-P plot may already point to distinct age populations in a sample. That is why we suggest the D-P plot as first step for the data analysis and suggest the KDE and radial plots as follow-up tools if different age populations seem plausible.
R2: Another point is that if data display some weak trend but have large uncertainties, they of course can be treated as a “cluster”—this happens quite frequently with fission-track datasets generated by the external detector method. Again, why should the D-P plot be used instead of, or in addition to, pre-existing methods?
A: Again, there is nothing wrong with using the classical methods instead of the D-P plot to analyze clustered data, if one can exclude systematic effects. Still, the D-P plot may be a convenient visualization tool to reach the conclusion that such data form a cluster as opposed to a linear relationship. In addition, the D-P plot shows how much of the individual age uncertainty stems from induced- and spontaneous-track counts; this is especially useful for comparing samples with different U content at a glance.
R2: Some of the arguments posed do not always hold up. EDM data characterized by large single-grain uncertainties may actually mask real age populations and therefore the data should NOT be treated as a data cluster. Would the D-P plot be useful in such a case? In some instances, other data (apatite composition, U-Pb ages, provenance, etc.) tell us there are likely multiple kinetic populations. This must be assessed by a skilled and experienced thermochronologist. As we well know, statistical methods are not infallible. The data are the data, and usually do not conform to our expectations.
A: We thank the reviewer for bringing up this example. We agree that in this special case the interpretation as a cluster is not necessarily correct (see section 3.3.2), but that the careful comparison to other, independent data is necessary to separate the different populations. Color-coding data in the D-P plot as shown in Figure 3c helps to test for relationships between age and grain properties, but also to discern possible relationships of these properties (e.g., Cl content in apatite) to the parent or daughter concentrations.
We will add this argument to section 3.3.2 in the revised manuscript.
R2: This is my opinion, but I generally think that strict decision-based approaches are problematic for some types of data analysis, mainly because they are often subjective in the sense that they are based on a few known or conjured scenarios. That is not to say such approaches are entirely without merit, but it can be a slippery slope in terms of their careful use and application. What happens when data do not adhere to the provided scheme?
A: It is correct that the weak spot of a strict decision-based approach is that some datasets may not be assignable unambiguously to a class. That is why our sections 3.2 and 3.3 provide additional discussion to the criteria used in the workflow (Figure 2), point out alternative classifications, e.g., in the complicated case of scattered data, and suggests further tools or gathering constraints from other data.
On the other hand, we think that a decision-based data-analysis scheme is helpful to keep the analysis process consistent, especially for studies involving many samples. On top of that, decisions based on simple criteria also enable inexperienced analysts to choose the right plots and interpretation tools rather than randomly applying them.
R2: I vaguely understand the intention behind the D-P plot, but in reality most users simply (and rather unfortunately) want The Answer and don’t want to spend lots of time doing detailed data analysis and interpretation. This is problematic and thus presents an opportunity for blind application by users, which is a nontrivial matter in thermochronological data interpretation. An analogous case is the practice of binning and averaging U-Th/He dates by eU for modeling—there is simply no legitimate or statistical rationale for doing it.
A: We agree that proper data analysis is not very attractive and often overlooked by many users, translating their data too quickly into geological models. In our opinion, this is also a function of the availability of easy-to-use software tools and the accessibility of data analysis literature for carrying out the analysis by users that get data delivered from laboratories.
That is why we will add a new section on Incaplot (Härtel, 2023) to the revised manuscript. Incaplot is a free graphical-user-interface program dedicated to producing D-P plots for thermochronological data in a straightforward manner with options for isochron fitting, pooled- and single-grain age calculation. The program is already in use in the Calgary Geo- and Thermochronology group and further updates will enhance its capabilities and usefulness.
R2: Outside of mineral age standards, how often do thermochronology datasets not meet the “small dataset” consideration in Section 4.1? The authors essentially define this case as the norm and say the D-P plot can’t really be used other than as a “qualitative guide”. This is a bit confusing.
A: We state in section 4.1 that this problem mainly concerns whole-grain (U-Th)/He data where 5 or less single-grain ages are common. For other methods such as FT dating, or laser-ablation based (U-Th)/He dating or zircon Raman dating, this is not a concern because >15 ages are typically measured per sample. Small samples are not only a problem for the D-P plot, but for any plots used in thermochronology to identify patterns/trends (age vs. grain size, age vs eU, radial plot). This is because of the data potentially not representing the sample well, hence, the question is if observed trends are actually significant.
R2: Figure 5 presents a good example of showing data in He (nmol/g; daughter) vs. eU (ug/g; parent)—how do those plots compare to the normal plots of “apparent age” vs. eU? That should be shown. If they are the different, is there an advantage of the D-P plot? If they are the same, why use it?
A: Yes, we agree that comparing the D-P plot with the age-eU plot is useful. However Härtel et al., 2022 specifically dealt with spurious correlation in the age-eU plot that does not affect the D-P plot. Still, we will include age-eU plots in the revised manuscript, pointing at this problem.
R2: It is not clear to me why the datasets are shown in Figure 5? What is the message of this figure? For example, fig. 5a shows Miltich (2005) zircon helium data with different radiation damage relationship prediction curves for different thermal history scenarios. Is there a damage scenario the authors prefer or are they arbitrarily shown, and if arbitrary, why show them at all? Are the high-He grains outliers or is there just data scatter? Fig. 5c shows eU and He concentration for 3 different minerals. Suffice to say, they should plot differently since each mineral has different and characteristic parent/daughter isotopes, depending upon the mineral. What should the takeaway of this figure be?
A: We appreciate the reviewer’s observations on Figure 5. Our intention was to show possible uses of D-P plots for comparing multiple samples from the same geological context or different thermochronometers from the same. It seems like this figure caused more confusion than it helped to understand the concept. That is why we will remove it from the manuscript. Following the request of reviewer #1, we will add real-world data to the manuscript and hope that this will be more useful to the overall concept of the paper.
R2: Line 17: what is meant by “consistent and traceable” with respect to thermochronometric data analysis? When someone plots U-Th/He dates versus eU, are there obvious issues with consistency and how it was done?
A: We mean that by reporting a type of D-P relationship instead of only a sample age or t-T model, it is easier for other thermochronologists to understand the data-analysis process. This is less on how someone did a certain plot, but on tracing why they chose to do so.
We will revise this section of the manuscript for clarification.
R2: Line 21: use of word “ingredient” is odd, maybe change that.
A: We will revise this sentence.
R2: Figure 4c: remove the dashed lines and min/max ages. As shown this looks like a radial plot and is misleading.
A: As the reviewer correctly points out, displaying an age as a line in the D-P plot works in the same way as in the radial plot, but we see no risk of mixing up the two due to the different plot construction and labels. We will therefore keep the lines and ages in this figure.
Citation: https://doi.org/10.5194/gchron-2024-1-AC2 -
-
AC2: 'Reply on RC2', Birk Härtel, 23 Mar 2024
-
CC1: 'Comment on gchron-2024-1', James Metcalf, 26 Feb 2024
Would it be possible to see the synthetic data that was used in the construction of the plots? Specifically what U, Th, Sm ratios/concentrations, ages, and grains sizes are being used to demonstrate the behavior?
Citation: https://doi.org/10.5194/gchron-2024-1-CC1 -
AC3: 'Reply on CC1', Birk Härtel, 23 Mar 2024
We are pleased to see the interest in the synthetic data we generated for our conceptual plots. We will add a supplementary file to the revised manuscript containing these data.
Citation: https://doi.org/10.5194/gchron-2024-1-AC3
-
AC3: 'Reply on CC1', Birk Härtel, 23 Mar 2024