Articles | Volume 7, issue 1
https://doi.org/10.5194/gchron-7-45-2025
https://doi.org/10.5194/gchron-7-45-2025
Research article
 | 
25 Feb 2025
Research article |  | 25 Feb 2025

The need for fission-track data transparency and sharing

Murat T. Tamer, Ling Chung, Richard A. Ketcham, and Andrew J. W. Gleadow
Abstract

We report a new image-based inter-analyst study to investigate apatite fission-track grain selection and analysis by 13 participants from an image dataset that included grains of variable quality. Results suggest that participants with less experience selected a higher percentage of unsuitable grains, while participants from the same laboratories generally provided similar results. Less analytical experience may result in the rejection of suitable grains or the inclusion of unsuitable ones. While inappropriate omission and inclusion can both bias results, the latter is more pernicious due to the standard practice of achieving a pre-decided number of analyses; particularly in difficult samples, there is a danger of “compromising data quality” and integrity by weakening selection criteria. Juxtaposing selected regions of interest (ROIs) on the same grains indicates that zoned grains and grains with inclusions and defects yield varying track density estimates, indicating that ROI placement can be an influential factor. We hereby propose the development of image data repositories for the purpose of achieving global data transparency. In this repository, images and analyses can be accessed, reviewed, and reanalyzed. In addition, we propose the implementation of global guidance for fission-track analysis, digital teaching modules, and open science. We also point out the need for new approaches to zeta calibration that include consideration of grain quality, methods of uranium determination, and etching protocols.

Share
1 Introduction

Apatite fission-track dating and thermal history modeling are widely used for near-surface research in earth sciences across a large spectrum of subjects such as landscape evolution (Reiners and Shuster, 2009; Lemot et al., 2023; Gallen et al., 2023), climate change (Barnes et al., 2012; Qiu and Liu, 2018; Yu et al., 2022), glacier-driven exhumation (Balestrieri et al., 1991; Fitzgerald and Goodge, 2022; Karaoğlan et al., 2023), natural resource exploration (Dumitru et al., 1991; Deng et al., 2015; Qiu et al., 2023; Gülyüz et al., 2024), and biodiversity (Kohn et al., 1992; Torres et al., 2013; Bernet et al., 2023). With the prerequisite that suitable apatite crystals are available, six essential “ingredients” are required for fission-track time–temperature modeling: (1) selection of grains on the polished and etched grain mount that are suitable for analysis, which then consists of the track densities calculated from (2) track counts over (3) a selected region of interest; (4) preferably more than a few tens of confined track lengths per sample; (5) mean etch figure diameter parallel to the c axis (Dpar) (Donelick, 1993; Burtner et al., 1994; Donelick et al., 1999) or chemical information to infer kinetics; and (6) an estimate of the 238U concentration (Tagami and O'Sullivan, 2005). All of these inputs are still largely analyst-driven, although some new technologies are being developed to alleviate this. Recent developments in image analysis and AI have contributed significant advances in auto-counting and auto-measurement (Gleadow et al., 2009, 2019; Nachtergaele and De Grave, 2021; Li et al., 2022; Ren et al., 2023; Boone et al., 2023a) but are not yet in a position to replace human decision-making. Similarly, laser ablation mass spectrometry (LA-ICP-MS) has become an alternative (Hasebe et al., 2004) to the widely used external detector method (EDM) (Gleadow and Lovering 1977) for uranium content determination, and U mapping has been developed to help account for U zonation (Ansberque et al., 2021); these obviate some human decisions but may add others.

Previous apatite fission-track inter-laboratory and inter-analyst experiments showed significant variation in measurements for the same samples and even standards (Naeser et al., 1981; Miller et al., 1985, 1990, 1993; Barbarand et al., 2003; Ketcham et al., 2009; Sobel and Seward, 2010; Ketcham et al., 2015; Ketcham et al., 2018; Tamer et al., 2019). These variations have been attributed to a broad range of factors, including instrumentation, analytical preferences, etching protocol, and analyst selection criteria. A common feature in these experiments (except Tamer et al., 2019, which compared only two analysts) is that there was no direct control over what analysts observed. In most cases participants had their own aliquots of study samples, and in experiments where all analysts measured the same grain mounts they undoubtedly looked at different sets of grains. Moreover, until the advent of efficient computational tools, there has been a limited ability to document and compare the counted areas within measured grains. As a result, ingredients (1) and (3) above have not been quantitatively explored as sources of variation in dates, even though they may exert a first-order influence on the data quality and extractable thermal history information. Similarly, again with the exception of Tamer et al. (2019), different analysts have never evaluated and measured the same sets of features for confined track measurement, and their decisions concerning individual features have not been captured, limiting the means to compare and evaluate ingredient (4).

From a given set of grains, grain selection influences results in several ways. Grains where oily and aqueous fluids have penetrated into tracks may hinder the recognition of some surface tracks or cause confined tracks to appear shorter and thus more annealed (Ketcham et al., 2015), respectively leading to underestimation of ages or overestimation of temperatures. Grains with excessive defects, such as polishing artifacts or etched dislocations, may cause misidentification of some spurious features as actual tracks and cause overestimation of ages. Conversely, recognition of some defects in a sample may cause an analyst to alter their counting criteria to be more likely to reject borderline features they interpreted as tracks when conducting their zeta calibration measurements. Track density can vary by up to 35 % if the grain is not oriented with the c axis in the viewing plane (Aslanian et al., 2022), making both the resulting age and etch figure dimensions (e.g., Dpar) inaccurate, thereby affecting estimates of kinetics and initial track length. A perceived need to meet targets for the number of grains analyzed may cause an analyst to select borderline-suitable grains or tracks that may not have been selected otherwise (Tamer and Ketcham, 2023).

Whereas the area counted for fission-track density determinations has historically been defined by squares in an eyepiece reticule, recent image-based systems allow the user to draw an arbitrarily shaped region of interest. In both cases, this process must be executed with care. Regions of interest need to be placed so that the grain surface they encompass is not biased with respect to the ability to host detectable tracks. Regions of interest within one fission-fragment range of the grain edge will not sample tracks from a full 4π geometry (Fleischer et al., 1975), and including sizable defects and cracks in the region of interest may result in uncountable areas; both effects will bias ages towards lower values. Regions of interest that include zones with different U content complicate the accurate determination of U across the track-generating region (Vermeesch, 2017) and suffer edge effects from sampling a 4π region that hosts variable U concentrations. This bias can result from both laser ablation, where typically a smaller area is sampled for the U determination than for the spontaneous track count, and the external detector method, where perfect matching between spontaneous and induced track regions of interest can be difficult to achieve, especially where the track density is low.

The minimum recommended number of grains for age measurements for igneous-type samples is ∼10 (Wagner and Van den Haute, 1992) and at least 20 for sedimentary rocks (Kohn et al., 2024). In both cases, even more are needed if there is any indication of kinetic variation (Donelick et al., 2005). For detrital samples the recommended number is ∼120 or more (Vermeesch, 2004). These guidelines are enforced not only at the laboratory level, but also during the peer review process. If grains are few or of low quality, an analyst may consciously or unconsciously add some borderline-quality grains to meet goals for data quantity. Similarly, having pre-determined goals for the number of confined tracks per sample can incentivize accepting lengths that might otherwise be passed over. For both data types, aiming for specific quantities of data may eventually cause a loss of quality.

We carried out a new apatite fission-track (FT) inter-analyst experiment designed to investigate variability in grain selection and region of interest definition criteria. Building upon a previous two-analyst study (Tamer et al., 2019), we also tested the identification of confined track lengths. Participants were asked to perform apatite FT analysis on a selection of grains drawn from an identical image set featuring variable grain quality using software that records all details of the analysis as overlays in an .xml file, thereby allowing for subsequent review. Analysts were also asked to fill out a questionnaire about their approach.

2 Materials and methods

2.1 Image data repository

We created an image data repository consisting of 41 apatite grain and three graticule (length calibration grid on a microscope slide) images from the in-house fission-track data repositories at the University of Melbourne (UM) and the University of Texas at Austin (UT). Images from UM were captured by Ling Chung using a Zeiss Axio Imager M1m microscope with an IDS µEye camera and white balance correction, while images from UT were taken by Sean Sanguinito and Murat Tamer using a Zeiss Axio Imager M2m microscope with an Olympus SYS UC30 camera and no white balance. Grains from UT were etched with 5.5 M HNO3 at 21 °C for 20 s (Carlson et al., 1999), while the grains from UM were etched with 5M HNO3 at 20 °C for 20 s (Gleadow et al., 1986; Green et al., 1986). The images used in this study can be viewed at geochron@home (Vermeesch, 2024, https://doi.org/10.5281/zenodo.13777917). A total of 36 grains were selected for track density measurements and 5 for confined track length measurements. To test the self-reproducibility of the analytical results, we repeated one grain image as two different grain areas (Grains 07 and 16). The grain descriptions are given in Table 1.

Table 1Description of images. UT: University of Texas at Austin, UM: University of Melbourne. * If U content (ppm) is determined using the LA-ICP-MS approach, there is a need to cross-check the counting area as the track distribution is slightly uneven.

Download Print Version | Download XLSX

2.2 Announcement and participant instructions

The announcement of the study was made at the 17th International Conference on Thermochronology (2021, Santa Fe) (Tamer et al., 2021) and in relevant email lists. The participants were asked to perform track density and confined track length and Dpar measurements using their preferred approach, including any analytical software, manual measurement, or AI-based analysis. The participants were not instructed to reach a given number of grains or confined track length analyses but were instructed to skip or accept grains for analysis according to their own judgment.

This experiment utilized Fission Track Studio, a dual software suite developed by the Melbourne Thermochronological Research Group (MTRG) that is capable of automatic grain stack-image acquisition (TrackWorks) and image review and measurement (FastTracks). The FastTracks program offers manual and automated analytical tools for obtaining all essential parameters for FT dating as well as a cross-section tool for precise dip angle determination for length measurements. All analytical results were recorded in an .xml file that can be reloaded for a follow-up analysis and review. The University of Melbourne provided a temporary FastTracks license and a detailed user manual for those who wanted to participate in this study. The participants had the option to reveal their names and affiliations or to be anonymous. A participant's submission was accepted only if the analysis was performed by a single analyst.

2.3 Reviewer criteria

In the absence of absolute standards, we used the grain selection criteria of Ling Chung and the confined track length measurement judgments of Murat Taner Tamer as reference points for the review of the participant results. However, no fission-track analyst can claim complete certainty in their judgments about track features and we do not suggest that these reference results represent “true” values. Rather they are simply used as reference values that are probably typical of reasonably experienced analysts. They were used as the starting point for a detailed grain-by-grain and track-by-track discussion with the participants to arrive at a consensus and to ascertain which factors are most likely to lead to discrepancies between analysts. Such a detailed analysis has not previously been undertaken to our knowledge and would be all but impossible without the image-based approach used here. To test the objectivity of the reviewers, the participants were shown their selections and measurements that the reviewers considered questionable or, in some cases, erroneous, after which they could acknowledge or dispute the review.

Grains were judged to be suitable, unsuitable, or borderline by Ling Chung. A grain having any of the three following properties was considered unsuitable: (1) the polished surface was not parallel to the c axis (Fig. 1a, b), (2) fluids were present in tracks (Fig. 1c, d), and (3) an excessive number of inclusions, defects, and/or uncertain features was intermingled with actual tracks (Fig. 1e, f). Additionally, heterogeneous U distribution within the grain, judging from the distribution of spontaneous tracks, can be a complicating factor, especially if LA-ICP-MS spot analysis is used for U determination. Induced tracks in the EDM can provide an improved basis for recognizing zoning, but zoning can also accentuate the consequences from misalignment of the spontaneous and induced track regions. In samples with low abundance and/or low quality of grains, some borderline-quality grains which are not 100 % parallel to the c axis and/or contain a distinguishable abundance of defects and inclusions may be included in the resulting datasets (Fig. 1g, h). Selection of the region of interest may become a challenge for inclusion- and/or defect-rich (Fig. 1i, j) and zoned grains (Fig. 1k, l). A track is defined as “confined” if both ends are not exposed at the surface (Fig. 1m, n), and the length of the confined track is measured as long as the track tips are visible and unobscured by surrounding features (Fig. 1o, p). Moreover, it is important that confined tracks are not filled, fully or partially, with fluid, which may result from fingerprints; this significantly affects their optical contrast within the host mineral (Fig. 1q, r). Any measurement that does not meet these criteria is considered invalid.

A total of 13 analysts returned completed questionnaires, though one of them could not provide the .xml file due to technical problems. Although Ling Chung and Murat Taner Tamer evaluated each other's analyses, we consider them to be reviewers in this study. Two additional analysts from the same laboratory with different years of experience submitted answers in one combined .xml file. Because they were unable to disentangle their results or re-conduct them independently, their results are not included in the analysis below. While some participants wished to remain anonymous, others chose to be transparent with their identities; Table S1 provides the list of participants. Participants' overall and recent experience and their current fission-track setup, a summary of the questionnaire, selection percentages for grains, and validity percentages for confined track lengths are provided in Table 2. Excluding one of the repeated grains for checking self-reproducibility (Grain 16), of the 35 grain image sets, Ling Chung estimated that 22 are suitable and 13 are unsuitable for fission-track analysis. In grain-by-grain checking, we counted how many of the suitable grains were selected and how many of the unsuitable grains were rejected. Some of the participants used the FastTracks automatic tools for c-axis orientation and Dpar length measurements, but we did not track whether these results were accepted “as is” or subsequently modified. The resulting suitable and unsuitable grain selection percentages are reported as percentages. Murat Taner Tamer examined every measured confined track length to check validity as determined above. The percentage of valid measurements is reported as the percentage of measured tracks divided by the total number of valid measurements. We did not evaluate how many valid tracks were excluded, as there was no way to determine whether such tracks were intentionally omitted or simply missed.

https://gchron.copernicus.org/articles/7/45/2025/gchron-7-45-2025-f01

Figure 1Examples of grains and tracks that could cause erroneous grain and region of interest selection or invalid confined track length measurement for fission-track analyses. (a) A grain not parallel to the c axis appears to be suitable in transmitted light, (b) but varying etch pit orientations clearly show that the grain is not parallel to the c axis in reflected light. (c, d) Transmitted and reflected light images showing how a dirty grain surface with liquids in tracks can impair track recognition. (e, f) Transmitted and reflected light images of a grain showing an excess of non-track features such as defects and inclusions that are difficult to distinguish from actual tracks. (g, h) Transmitted and reflected light images of a grain with a low and uneven track density, suggesting the possible presence of zoning that would need to be accounted for. (i, j) Transmitted and reflected light images of a grain showing large defects and inclusions occupying space and obscuring fission tracks, resulting in underestimation of the fission-track density if these features are included in the region of interest. (k, l) Transmitted and reflected light images of a zoned grain, for which different placements of the region of interest and location of ablation point(s) may yield divergent ages. (m, n) Transmitted and reflected light images of a would-be confined track, where the reflected light image shows that one tip may be exposed to the surface. (o, p) Confined tracks may also be rendered invalid for measurement by obscuring features (q, r) or partial fluid fillings.

Download

Table 2Summary of the questionnaire, percentages of grain selections, and confined track length measurement validity.

NA: not available; AS: auto-scan, C: custom; D: Dumitru system. * The analyst did not participate in the experiment but evaluated the grains as suitable and unsuitable.

Download Print Version | Download XLSX

3 Results and discussion

3.1 Graticule calibration

Graticule images taken by microscopes at UM and UT were included in the dataset for calibration. Although calibration is an essential step before performing an analysis, only five participants reported measuring them. Some omissions may have been due to not fully understanding the terms of the experiment. To make the comparison of results easier, we used the default graticule calibration for all analysts. The graticule measurements are summarized in Table S2. Using default calibration, analysts performed measurements with >99.0 % accuracy. Considering the limits of optical microscopy, this accuracy provides measurements within analytical errors.

3.2 Self-reproducibility

Grains 07 and 16 are duplicated images of the same grain in our dataset. While some participants skipped Grain 16 after noticing the repetition, some performed density measurements on both grains. Although these remeasurements demonstrated high self-reproducibility (Fig. 2), minimum and maximum densities vary by ±30 %. The difference can be traced to the varying region of interest selection, light source utilization (transmitted only, reflected only, or both), and track counting routines. Although the zeta method (Hurford and Green, 1983) is intended to normalize some differences among analysts, the degree of variation calls into question whether the normalization approach is being asked to accomplish more than it should.

https://gchron.copernicus.org/articles/7/45/2025/gchron-7-45-2025-f02

Figure 2Self-reproducibility of track density (ρ) determinations on replicated grain images for seven analysts.

Download

3.3 Post-review follow-up and objectivity of the review

After the initial review of grain and confined track length measurements, a follow-up meeting with each participant was conducted to discuss each judgment deemed questionable or unsuitable by the reviewers. Virtually all participants acknowledged all inappropriate grain selections and confined track length measurements, but Participant 6 considered Grain 08 and Grain 31 to be borderline instead of unsuitable. This high rate of acknowledgment by the participants supports the soundness of the criteria utilized by the reviewers. According to the participants, inappropriate selection and measurements stemmed from different factors. While some participants cited a lack of attention to details (e.g., poor identification of track ends), others stated that they have been choosing some unsuitable grains in their routine fission-track studies since their training. Some of the participants mentioned that they knowingly added unsuitable grains to the datasets in the past to meet the expected number of grains.

3.4 Track density and confined track length distributions

This study is principally designed to evaluate identification of individual features rather than measurement averages and standard deviations. If invalid track lengths are measured or unsuitable grains are analyzed which give results similar to valid lengths or suitable grains, summary statistics will not suggest any problem; in fact, they would appear to improve because of the increased number of analyses, the perverse incentive we wish to counteract. We calculated the suitable and unsuitable grain selection percentages (Table 3) based on the number of each grain type as listed in Table 1. The confined track length measurement validity percentage is calculated as the number of valid track length measurements divided by the total number of confined track length measurements for each analyst (Table 4). There are several valid lengths etched by both 5.0 and 5.5 M etchants that may be under-etched, which were not measured by the reviewer but were measured by some analysts. The reviewer evaluated the lengths based only on the criteria laid out in Sect. 2.3 and not specifically by how well-etched the tracks are.

When compiling summary statistics, we separated the density estimates based on suitable grains and valid track length measurements from estimates the included unsuitable ones, and we compared them using dispersion and χ2 probability values for the density data and the mean and standard deviation of mean lengths (Tables 3 and 4). Initial density determinations yield a dispersion value of 10 and a χ2 probability of 0.03. Exclusion of unsuitable grain data provided significant improvements in net density similarity, with dispersion and χ2 probability of 0 and 0.42, respectively (Table 3). Excluding invalid confined tracks raises the average mean length by ∼0.3µm, well beyond the 0.08 µm precision limit estimated by doubling the standard error, and reduces the group standard deviation by 20 % (Table 4).

The histograms of track density and confined track length distributions of each participant provide additional insights (Fig. 3). The density distributions of suitable grains are more consistent than for unsuitable grains, and the inclusion of unsuitable grains in all cases skewed the track density distribution to lower values. Participants 1 and 10 and Participants 8 and 9 are from the same two laboratories and their results are similar for their respective track density results. This may be related to the shared training and/or analytical routine in counting, though for length measurements Participants 8 and 9 had more divergent results, possibly due to different personal selection criteria. The confined length histograms indicate that both the number and the choice of tracks measured varied considerably between participants. Dpar measurements of suitable grains tend to be more similar with some outliers, while the Dpar measurements of unsuitable grains have higher dispersion (Fig. S1). Post-experiment interviews with participants suggested that the dispersion of Dpar measurements on suitable grains may have stemmed from different levels of zoom applied.

Table 3Track density estimations. N: number of analyzed grains; ρ: track density.

n/a: not applicable. * The analyst did not participate in the experiment but evaluated the grains as suitable and unsuitable. Numbers in parentheses denote standard errors. Dispersion and χ2 are calculated using Radialplotter (Vermeesch, 2009). Suitable and unsuitable grain selection percentages are calculated based on the number of grains of each type selected as listed in Table 1.

Download Print Version | Download XLSX

Table 4Confined track length measurements. N: number of lengths; lm: mean track length; σ: standard deviation. Numbers in parentheses denote standard errors. The confined track length measurement validity percentage is calculated as the number of valid track length measurements divided by the total number of confined track length measurements for each analyst.

n/a: not applicable

Download Print Version | Download XLSX

https://gchron.copernicus.org/articles/7/45/2025/gchron-7-45-2025-f03

Figure 3Track density (ρ) and confined track length distributions of each participant and the reviewers (2 and 10, in red). Participant numbers are indicated at the top left. The cumulative result for all participants is shown at the bottom right (1–15). Dark gray shows measurements of grains and confined tracks assessed to be unsuitable, and light gray displays the measurements of suitable selections.

Download

3.5 Impact of experience

The acceptance percentage of suitable grains shows only a weak relationship with years of experience (Fig. 4a). With increasing years of experience, the acceptance percentage of unsuitable grains decreases sharply, while the confined track validity percentage increases (Fig. 4b). Accordingly, those who select higher percentages of unsuitable grains tend to have lower validity percentages of confined track length measurements as well (Fig. 4c). These results highlight analyst experience as an important factor in data quality, although some less experienced analysts performed as well as much more experienced ones.

https://gchron.copernicus.org/articles/7/45/2025/gchron-7-45-2025-f04

Figure 4(a) Correlation of years of fission-track experience against acceptance percentage of suitable and unsuitable grains for density measurement and (b) confined track length measurement validity percentage. (c) The confined track length measurement validity percentage against the acceptance percentage of unsuitable grains. The sizes of the circles in (b) and (c) reflect the number of confined track length measurements. (d) Participant sections of the region of interest are juxtaposed in a defect- and/or inclusion-rich grain and (e) a zoned grain.

Download

Participant 5 yielded the lowest acceptance percentages of 50 % for suitable grains and 0 % for unsuitable grains. Participant 5 has been working exclusively on high-quality samples from one region for their entire fission-track experience (5 years), leading to selecting only the best-looking grains. This type of bias has been termed the “mere exposure effect” or “familiarity principle”, which is the tendency to develop preferences for things because they are familiar (Tversky and Kahneman, 1974). An analyst with narrower grain quality experience may miss available thermal history information by omitting objectively suitable, but less familiar, grains.

3.6 Region of interest (ROI) selection

Track density measurements on defect- and/or inclusion-free grains with homogeneous track distributions may not be greatly affected by different region of interest (ROI) selections, but the number and size of defects within a given ROI can cause an underestimation of density by obscuring tracks (Fig. 4d). Similarly, the selection of high- and low-track-density areas within a zoned grain can yield widely varying density determinations (Fig. 4e). A single-spot or even dual-spot laser ablation approach on such grains may result in a significant dispersion of dates depending on the analyst's ROI selection. Several participants placed ROIs in too close proximity to the mineral border (<10µm; Fig. 4d, e), where track registration is below the required 4π geometry (Fleischer et al., 1975).

3.7 Light source utilization and a case correction on a single grain

To demonstrate some effects of the light source and ROI specification on density measurement, at the request of one participant, we reanalyzed their track density for Grain 07, which was significantly lower than most of the group. Figure 5 shows the comparison on a track-by-track basis. Perhaps importantly, this analyst revealed a preference for counting tracks in transmitted light only. However, counting tracks solely in transmitted light images can cause an underestimation of the track density (Aslanian et al., 2022; Tamer and Ketcham, 2023). Figure 5a and b show the transmitted and reflected light image for this particular grain, with the participant analysis in blue and reanalysis using images from both light sources in red with excluded counts in yellow. This reanalysis suggested that some genuine tracks had been overlooked, especially in the region on the right that is shaded in transmitted light. After adjustment of the ROI, the addition of the overlooked tracks in reflected light, and the exclusion of a track showing a defect-like pattern, the track density increased by ∼35 %.

https://gchron.copernicus.org/articles/7/45/2025/gchron-7-45-2025-f05

Figure 5Transmitted and reflected light images of Grain 07 (a, b) with the analysis of one participant (blue) and a reviewer's proposed correction for inclusion (red) and exclusion of tracks (yellow).

Download

3.8 Highlights of participant comments on the dataset

Participants 1 and 5 found the images taken with no white balance to be nonideal for counting, and Participants 5 and 7 mentioned that the grains etched with the protocol of 5.0 M HNO3 for 20 s at 20 °C (Gleadow et al., 1986; Green et al., 1986) appear to be under-etched.

4 Implications, suggestions, and conclusions

Whether an analyst selects unsuitable grains and/or conducts invalid confined track length measurements will depend in part on their years of experience and training, but it may also be affected by the difficulty in finding sufficient grains to meet analytical goals. When the realities of a nonideal sample conflict with an imposed requirement for how much data are required for study objectives, the result can end up being improper thermal history information. Comprehensive laboratory training and calibrations are essential for fission-track analysts to avoid these problems. Results of graticule and confined track length calibrations and the identity of the analyst should be stated in publications.

ROI selection may cause varying track density determinations, especially in zoned and defect-rich grains. While a single-spot laser ablation analysis is a time-efficient approach, its application to such grains may result in varying U determinations unless the laser spot covers a high proportion of the counted area (e.g., Cogné et al., 2020). Laser uranium mapping (Ansberque et al., 2021) or EDM (Gleadow and Lovering, 1977) approaches require more work but in ideal cases may better represent the selected region of interest. However, precise matching of spontaneous and induced track areas in the EDM can also be difficult in some cases. These approaches may also be more effective in identifying zoned grains when the spontaneous track density is low.

Zeta calibration (ζ) using a set of age standards is intended to normalize uncertainties in some parameters in the age equation, such as thermal neutron fluence (φ) and the spontaneous fission decay constant (λf), and to account for varying counting efficiencies of different analysts (Hurford and Green, 1983). This method assumes, however, that a calibration derived from measurements on near-ideal standard samples with minimal inclusions and defects and limited annealing (e.g., Durango, Fish Canyon Tuff apatite) also reflects analyst judgments in unknown samples, which may yield any quality of grains. Zoning, high levels of inclusions and defects, variable ROI selection, and even a more complex track length distribution may cause significant divergences unaccounted for by zeta calibration, particularly for a less experienced analyst. These have been studied partially (Vermeesch, 2017; Cogné and Gallagher, 2021) but further work is needed on this matter. The analyst should remember that not all samples yield usable data.

Among the available confined tracks, analysts can select tracks with different effective etch times based on their individual perceptions and criteria. Some participants highlighted that the grains etched with 5.0 M HNO3 at 20 °C for 20 s (e.g., Gleadow et al., 1986) appear under-etched, which agrees with a previous inter-analyst comparison experiment (Tamer et al., 2019). A proposed two-step etching protocol (5.5 HNO3 21 °C 20+10 s) allows analysts to select any suitable track but ensures that the final confined track length dataset does not contain under-etched tracks (Tamer and Ketcham, 2023). However, a protocol for modeling such data with existing annealing equations has not yet been put forward.

The application of AI and machine learning methods has become a popular topic in various research fields in earth sciences including fission-track counting and confined track length measurements. The quality of any automated analyses will be defined not only by sophisticated algorithms but also the fission-track analysis experience of the initial “teacher” of the AI, including hardware preferences during image acquisition and the resulting image quality. Determining which grains and tracks are suitable for measurement represents an additional challenge for AI method development; training cannot be based on good images alone but must also include features that should be avoided based on sometimes subtle indications.

The accessibility of the “ingredients” of thermal history modeling is limited to data summary tables and sometimes raw data as supplementary files in research articles. Although fission-track data have generally fared well in inter-laboratory age comparisons in recent years, these have tended to utilize relatively straightforward samples. This study illustrates some of the potential hazards of fission-track analysis of more challenging materials but also presents pathways toward improving data reliability. In particular, the opportunity is coming into view for the fission-track community to share data on a new level, allowing analysts to see, learn from, and discuss each others' image data.

Recent developments in data repositories and metadata reporting are healthy signs of an emerging open science culture and up-to-date reporting in low-T thermochronology. However, these are currently limited to collecting and presenting the data in their corresponding geolocations (Boone et al., 2022, 2023b) as well as data reporting formats and table contents (Kohn et al., 2024). Given the continuing relevance of fission-track data, we recommend building toward a global infrastructure and culture enabling and encouraging data transparency and sharing through the formation of online digital image repositories (such as geochron@home; Vermeesch et al., 2023), which can accommodate fission-track image data. Furthermore, proper analyst training and reconsideration of laboratory routines for image acquisition are needed. It has been over 60 years since the fission-track dating method was first established (Price and Walker, 1962), and no clear guidelines have been formulated on “musts” and “cans” in fission-track practice. While existing fission-track laboratories develop their own preferences and routines, new laboratories often represent branching points, which can be a source of necessary and beneficial innovation but also undocumented and undesirable divergence. A global community repository hosting guidelines for best-practice fission-track analyses and fission-track training modules is needed, as are reference libraries of interpreted image sets. Adoption of an open science culture will ultimately benefit every fission-track laboratory and increase data quality.

Code and data availability

The image data used for this study, and the analysis results, have been uploaded to geochron@home (https://doi.org/10.5281/zenodo.13777917, Vermeesch, 2024). The analyst numbers in Table S1 are replaced with random letters in geochron@home to preserve anonymity.

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/gchron-7-45-2025-supplement.

Author contributions

Conceptualization: MTT, LC, RAK, AJWG. Data curation: MTT, LC. Formal analysis: MTT, LC. Software: LC, AJWG. Investigation: MTT, LC. Methodology: MTT, LC, RAK, AJWG. Resources: MTT, LC, RAK, AJWG. Writing – original draft preparation: MTT, LC, RAK, AJWG. Writing – review and editing: MTT, LC, RAK, AJWG.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We appreciate the contributions of all of the participants to this effort. This research was supported by the Geology Foundation of the Jackson School of Geosciences and Fundamental Research Funds in the Institute of Geology, China Earthquake Administration (grant no. IGCEA2229), the National Science Foundation of China (grant no. 4231101318), and Project 3.51 of the AuScope Program within the Australian National Collaborative Research Infrastructure Strategy. We thank Edward Sobel and one anonymous reviewer for the reviews and Shigeru Sueoka for editorial handling.

Financial support

This research has been supported by the Geology Foundation, Jackson School of Geosciences, University of Texas at Austin; the Institute of Geology, China Earthquake Administration (grant no. IGCEA2229); the National Natural Science Foundation of China (grant no. 4231101318); and Project 3.51 of the AuScope Program within the Australian National Collaborative Research Infrastructure Strategy.

Review statement

This paper was edited by Shigeru Sueoka and reviewed by Edward Sobel and one anonymous referee.

References

Ansberque, C., Chew, D. M., and Drost, K.: Apatite fission-track dating by LA-Q-ICP-MS imaging, Chem. Geol., 560, 119977, https://doi.org/10.1016/j.chemgeo.2020.119977, 2021. 

Aslanian, C., Jonckheere, R., Wauschkuhn, B., and Ratschbacher, L.: Short communication: Experimental factors affecting fission-track counts in apatite, Geochronology, 4, 109–119, https://doi.org/10.5194/gchron-4-109-2022, 2022. 

Balestrieri, M. L., Bigazzi, G., and Ghezzo, C.: The transantarctic mountains: a natural laboratory for apatite fission-track analysis. Results from Italian Antarctic expeditions, Radiat. Meas., 31, 621–626, https://doi.org/10.1016/S1350-4487(99)00154-7, 1991. 

Barbarand, J., Hurford, T., and Carter, A.: Compositional and structural control of fission-track annealing in apatite, Chem. Geol., 198(1-2), 107-137, https://doi.org/10.1016/S0009-2541(02)00424-2, 2003. 

Barnes, J. B., Ehlers, T. A., Insel, N., McQuarrie, N., and Poulsen, C. J.: Linking orography, climate, and exhumation across the central Andes, Geology, 40, 1135–1138, https://doi.org/10.1130/G33229.1, 2012. 

Bernet, M., Wootton, L., Valla, P., Lavergne, S., Boucher, F., Barun, J., Husson, L., Robert, X., and Balvay, M.: Tracing Western Alps mountain building and high mountain biodiversity since the the early-mid Miocene. 18th International Conference of Thermochronology, 3–8 September 2023, Riva del Garda, Italy, 2023. 

Boone, S. C., Dalton, H., Prent, A., Kohlmann, F., Theile, M., Gréau, Y., Florin, G., Noble, W., Hodgekiss, S. A., and Ware, B.: AusGeochem: An Open Platform for Geochemical Data Preservation, Dissemination and Synthesis, Geostand. Geoanal. Res., 46, 245–259, https://doi.org/10.1111/ggr.12419, 2022. 

Boone, S. C., Faux, N., Nattala, U., Jiang, C., Church, T., Chung, L., Mcmillan, M., Jones, S., Jiang, H., Liu, D., Ehinger, K., Drummond, T., Kohn, B., and Gleadow, A. J. W.: Towards Fully Automated Digital Fission-Track Analysis Through Artificial Intelligence, 18th International Conference of Thermochronology, Riva del Garda, Italy, 2023a. 

Boone, S. C., Kohlmann, F., Noble, W., Theile, M., Beucher, R., Kohn, B., Glorie, S., Danišík, M., Zhou, R., McMillan, M., and Nixon, A.: A geospatial platform for the tectonic interpretation of low-temperature thermochronology Big Data, Sci. Rep., 13, 8581, https://doi.org/10.1111/ggr.12419, 2023b. 

Burtner, R. L., Nigrini, A., and Donelick, R. A.: Thermochronology of Lower Cretaceous source rocks in the Idaho-Wyoming thrust belt, Am. Assoc. Petr. Geol. B., 78, 1613–1636, 1994. 

Carlson, W. D., Donelick, R. A., and Ketcham, R. A.: Variability of apatite fission-track annealing kinetics: I. Experimental results, Am. Mineral., 84, 1213–1223, https://doi.org/10.2138/am-1999-0901, 1999. 

Cogné, N. and Gallagher, K.: Some comments on the effect of uranium zonation on fission track dating by LA-ICP-MS, Chem. Geol., 573, 120226, https://doi.org/10.1016/j.chemgeo.2021.120226, 2021. 

Cogné, N., Chew, D. M., Donelick, R. A., and Ansberque, C.: LA-ICP-MS apatite fission track dating: A practical zeta-based approach, Chem. Geol., 531, 119302, https://doi.org/10.1016/j.chemgeo.2019.119302, 2020. 

Deng, J., Wang, C., Bagas, L., Carranza, E. J. M., and Lu, Y.: Cretaceous–Cenozoic tectonic history of the Jiaojia Fault and gold mineralization in the Jiaodong Peninsula, China: constraints from zircon U–Pb, illite K–Ar, and apatite fission track thermochronometry, Mineralium Deposita, 50, 987–1006, https://doi.org/10.1007/s00126-015-0584-1, 2015. 

Donelick, R. A.: A method of fission track analysis utilizing bulk chemical etching of apatite, U.S. Patent Number 5,267,274, 1993. 

Donelick, R. A., Ketcham, R. A., and Carlson, W. D.: Variability of apatite fission track annealing kinetics II: Crystallographic orientation effects, Am. Mineral., 84, 1224–1234, 1999. 

Donelick, R. A., O'Sullivan, P. B., and Ketcham, R. A.: Apatite fission-track analysis, Reviews in Mineralogy and Geochemistry, 58, 49–94, https://doi.org/10.2138/rmg.2005.58.3, 2005. 

Dumitru, T. A., Hill, K. C., Coyle, D. A., Duddy, I. R., Foster, D. A., Gleadow, A. J. W., Green, P. F., Kohn, B. P., Laslett, G. M., and O'Sullivan, A. J.: Fission track thermochronology: application to continental rifting of south-eastern Australia, The APPEA Journal, 31, 131–142, https://doi.org/10.1071/AJ90011, 1991. 

Fitzgerald, P. G. and Goodge, J. W.: Exhumation and tectonic history of inaccessible subglacial interior East Antarctica from thermochronology on glacial erratics, Nat. Commun., 13, 6217, https://doi.org/10.1038/s41467-022-33791-y, 2022. 

Fleischer, R. L., Price, P. B., and Walker, R. M.: Nuclear Tracks in Solids: Principles and Techniques, University of California Press, Berkeley, 605 pp., 1975. 

Gallen, S. F., Seymour, N. M., Glotzbach, C., Stockli, D. F., and O'Sullivan, P.: Calabrian forearc uplift paced by slab–mantle interactions during subduction retreat, Nat. Geosci., 16, 513–520, https://doi.org/10.1038/s41561-023-01185-4, 2023. 

Gleadow, A. J. W. and Lovering, J. F.: Geometry factor for external detectors in fission track dating, Nuclear Track Detection, 1, 99–106, https://doi.org/10.1016/0145-224X(77)90003-5, 1977. 

Gleadow, A. J. W., Duddy, I. R., Green, P. F., and Lovering, J. F.: Confined fission track lengths in apatite: a diagnostic tool for thermal history analysis, Contrib. Mineral. Petrol., 94, 405–415, https://doi.org/10.1007/BF00376334, 1986. 

Gleadow, A. J. W., Gleadow, S. J., Belton D. X., Kohn, B. P., Krochmal, M. S., and Brown, R. W.: Coincidence mapping – a key strategy for the automatic counting of fission tracks in natural minerals, Geol. Soc. Lond. Special Publication, 324, 25–36, https://doi.org/10.1144/SP324.2, 2009. 

Gleadow, A. J. W., Kohn, B., and Seiler, C.: The Future of Fission-Track Thermochronology, in: Fission-Track Thermochronology and its Application to Geology, edited by: Malusà, M. and Fitzgerald, P., Springer Textbooks in Earth Sciences, Geography and Environment, Springer, Cham, https://doi.org/10.1007/978-3-319-89421-8_4, 2019. 

Green, P. F., Duddy, I. R., Gleadow, A. J. W., Tingate, P. R., and Laslett, G. M.: Thermal annealing of fission tracks in apatite 1. A qualitative description, Chem. Geol. (Isotope Geoscience Section), 59, 237–253, https://doi.org/10.1016/0168-9622(86)90074-6, 1986. 

Gülyüz, N., Gülyüz, E., Karaoglan, F., and Kuscu, I.: Low temperature thermochronology reveals tilting of crystalline bodies, Halilaga porphyry Cu-Au deposit, NW Anatolia: Implications for exploration of porphyry copper deposits and interpretation of low-temperature thermochronology data for regional tectonics, Ore Geol. Rev., 166, 105958, https://doi.org/10.1016/j.oregeorev.2024.105958, 2024. 

Hasebe, N., Barbarand, J., Jarvis, K., Carter, A., and Hurford, A. J.: Apatite fission-track chronometry using laser ablation ICP-MS, Chem. Geol., 207, 135–145, https://doi.org/10.1016/j.chemgeo.2004.01.007, 2004. 

Hurford, A. J. and Green, P. F.: The zeta age calibration of fission-track dating, Chem. Geol., 41, 285–317, https://doi.org/10.1016/S0009-2541(83)80026-6, 1983. 

Karaoğlan, F., Karatas, B., Ozdemir, Y., Gulyuz, E., Vassiliev, O., Selbesoglu, M. O., and Gildir, S.: The geo/thermochronology of Dismal Island (Marguerite Bay, Antarctic Peninsula), Turkish Journal of Earth Sciences, 32, 8, https://doi.org/10.55730/1300-0985.1887, 2023. 

Ketcham, R. A., Donelick, R. A., Balestrieri, M. L., and Zattin, M.: Reproducibility of apatite fission-track length data and thermal history reconstruction, Earth Planet. Sc. Lett., 284, 504–515, https://doi.org/10.1016/j.epsl.2009.05.015, 2009. 

Ketcham, R. A., Carter, A., and Hurford, A. J.: Inter-laboratory comparison of fission track confined length and etch figure measurements in apatite, Am. Mineral., 100, 1452–1468, https://doi.org/10.2138/am-2015-5167, 2015. 

Ketcham, R. A., van der Beek, P., Barbarand, J., Bernet, M., and Gautheron, C.: Reproducibility of thermal history reconstruction from apatite fission-track and (U-Th)/He data, Geochem. Geophys. Geosyst., 19, 2411–2436, https://doi.org/10.1029/2018GC007555, 2018. 

Kohn, B. P., Pillans, B., and McGlone, M. S.: Zircon fission track age for middle Pleistocene Rangitawa Tephra, New Zealand: stratigraphic and paleoclimatic significance, Palaeogeogr. Palaeocl., 95, 73–94, https://doi.org/10.1016/0031-0182(92)90166-3, 1992. 

Kohn, B. P., Ketcham, R. A., Vermeesch, P., Boone, S. C., Hasebe, N., Chew, D., Bernet, M., Chung, L., Danisik, M., Gleadow, A. J. W., and Sobel, E. R.: Interpreting and reporting fission-track chronological data, Geol. Soc. Am. B., 136, 3891–3920, https://doi.org/10.1130/B37245.1, 2024. 

Li, R., Xu, Z., Su, C., and Yang, R.: Automatic identification of semi-tracks on apatite and mica using a deep learning method, Comput. Geosci., 162, 105081, https://doi.org/10.1016/j.cageo.2022.105081, 2022. 

Lemot, F., Valla, P. G., van der Beek, P., Jagercikova, M., Niedermann, S., Carcaillet, J., Sobel, E.R., Andò, S., Garzanti, E., Robert, X., and Balvay, M.: Miocene cave sediments record topographic, erosional and drainage development in the Western European Alps, Earth Planet. Sc. Lett., 621, 118344, https://doi.org/10.1016/j.epsl.2023.118344, 2023. 

Miller, D. S., Duddy, I. R., Green, P. F., Hurford, A. J., and Naeser, C. W.: Results of interlaboratory comparison of fission-track age standards: Fission-track workshop – 1984, Nuclear Tracks and Radiation Measurements, 10, 383–391, https://doi.org/10.1016/0735-245X(85)90127-9, 1985. 

Miller, D. S., Eby, N., McCorkell, R., Rosenberg, P. E., and Suzuki, M.: Results of interlaboratory comparison of fission track ages for the 1988 fission track workshop, International Journal of Radiation Applications and Instrumentation Part D, 17, 237–245, https://doi.org/10.1016/1359-0189(90)90041-U, 1990. 

Miller, D. S., Crowley, K. D., Dokka, R. K., Galbraith, R. F., Kowallis, B. J., and Naeser, C. W.: Results of interlaboratory comparison of fission track ages for 1992 Fission Track Workshop, Nuclear Tracks, 21, 565–557, https://doi.org/10.1016/1359-0189(93)90197-H, 1993. 

Nachtergaele, S. and De Grave, J.: AI-Track-tive: open-source software for automated recognition and counting of surface semi-tracks using computer vision (artificial intelligence), Geochronology, 3, 383–394, https://doi.org/10.5194/gchron-3-383-2021, 2021. 

Naeser, C. W., Zimmermann, R. A., and Cebula, G. T.: Fission-track dating of apatite and zircon: an interlaboratory comparison, Nuclear Tracks, 5, 65–72, https://doi.org/10.1016/0191-278X(81)90027-5, 1981. 

Price, P. B. and Walker, R. M.: Chemical etching of charged particle tracks in solids, J. Appl. Phys., 33, 3407–3412, https://doi.org/10.1063/1.1702421, 1962. 

Qiu, K. F., Deng, J., Sai, S. X., Yu, H. C., Tamer, M. T., Ding, Z. J., Yu, X. F., and Goldfarb, R. J.: Low-temperature thermochronology for defining the tectonic controls on heterogeneous gold endowment across the Jiaodong Peninsula, eastern China, Tectonics, 42, e2022TC007669, https://doi.org/10.1029/2022TC007669, 2023. 

Qiu, N. and Liu, S.: Uplift and denudation in the continental area of China linked to climatic effects: Evidence from apatite and zircon fission track data, Sci. Rep., 8, 9546, https://doi.org/10.1038/s41598-018-27801-7, 2018. 

Reiners, P. W. and Shuster, D. L.: Thermochronology and landscape evolution, Phys. Today, 62, 31–36, https://doi.org/10.1063/1.3226750, 2009. 

Ren, Z., Li, S., Xiao, P., Yang, X., and Wang, H.: Artificial intelligent identification of apatite fission tracks based on machine learning, Mach. Learn.-Science and Technology, 4, 045039, https://doi.org/10.1088/2632-2153/ad0e17, 2023. 

Sobel, E. R. and Seward, D.: Influence of etching conditions on apatite fission-track etch pit diameter, Chem. Geol., 271, 59–69, https://doi.org/10.1016/j.chemgeo.2009.12.012, 2010. 

Tagami, T. and O'Sullivan, P. B.: Fundamentals of fission-track thermochronology, Reviews Mineralogy and Geochemistry, 58, 19–47, https://doi.org/10.2138/rmg.2005.58.2, 2005. 

Tamer, M. T. and Ketcham, R. A.: How many vs which: On track selection criteria for apatite fission track analysis, Chem. Geol., 634, 121584, https://doi.org/10.1016/j.chemgeo.2023.121584, 2023. 

Tamer, M. T., Chung, L., Ketcham, R. A., and Gleadow, A. J.: Analyst and etching protocol effects on the reproducibility of apatite confined fission-track length measurement, and ambient-temperature annealing at decadal timescales, Am. Mineral., 104, 1421–1435, https://doi.org/10.2138/am-2019-7046, 2019. 

Tamer, M. T., Chung, L., Ketcham, R. A., and Gleadow, A. J. W.: Inter-analyst comparison and reproducibility of apatite fission track analysis, 17th International Conference of Thermochronology, Santa Fe, USA, 2021, https://doi.org/10.1002/essoar.10507907.2, 2021. 

Torres, V., Hooghiemstra, H., Lourens, L., and Tzedakis, P. C.: Astronomical tuning of long pollen records reveals the dynamic history of montane biomes and lake levels in the tropical high Andes during the Quaternary, Quaternary Sci. Rev., 63, 59–72, https://doi.org/10.1016/j.quascirev.2012.11.004, 2013. 

Tversky, A. and Kahneman, D.: Judgment under Uncertainty: Heuristics and Biases: Biases in judgments reveal some heuristics of thinking under uncertainty, Science, 185, 1124–1131, https://doi.org/10.1126/science.185.4157.1124, 1974. 

Vermeesch, P.: How many grains are needed for a provenance study?, Earth Planet. Sc. Lett., 224, 441–451, https://doi.org/10.1016/j.epsl.2004.05.037, 2004. 

Vermeesch, P.: RadialPlotter: a Java application for fission track, luminescence and other radial plots, Radiat. Meas., 44, 409–410, 2009. 

Vermeesch, P.: Statistics for LA-ICP-MS based fission track dating, Chem. Geol., 456, 19–27, https://doi.org/10.1016/j.chemgeo.2017.03.002, 2017. 

Vermeesch, P.: pvermees/GaHa: v.1 (v.1), Zenodo [code and data set], https://doi.org/10.5281/zenodo.13777917, 2024. 

Vermeesch, P., Band, T., Duong, E., and Carter, A.: Crowd sourced fission track dating with geochron@home, 18th International Conference on Thermochronology, 3–8 September 2023, Riva del Garda, Italy, Abstract Volume, 190, 2023.  

Wagner, G. A. and Van den Haute, P.: Fission-Track Dating Method, 59–94, Springer Netherlands, 1992. 

Yu, J., Zheng, D., Pang, J., Li, C., Wang, Y., Wang, Y., Hao, Y., and Zhang, P.: Cenozoic mountain building in eastern China and its correlation with reorganization of the Asian climate regime, Geology, 50, 859–863, https://doi.org/10.1130/G49917.1, 2022. 

Download
Short summary
We present the first new image-based study to reveal how choices made by different analysts affect the results obtained by fission-track analysis. Participants analyzed an identical image dataset with varying grain quality. Experienced analysts tend to select lower numbers of unsuitable grains and conduct lower numbers of invalid length measurements. Fission-track studies need image data repositories, teaching modules, guidelines, an open science culture, and new approaches for calibration.

Share