Error Propagation

Bence

A recent question here on the forum was about iolite's error propagation: what is it, and how is it calculated?

The purpose of error propagation is to ensure that you have a reasonable estimate of the uncertainty of your calculated result(s). Depending on what you consider "reasonable" this might be: what range of values would I get if I measured this sample repeatedly in different sessions? This is normally impossible with samples as they might be consumed during their analysis. So we have to rely on repeat measurements of reference materials to estimate this uncertainty.

This estimate should take into account some of the decisions you've made in your data processing. For example, it should account for how you've modeled your baselines using splines, and similarly what spline types you used for your reference materials.

You might be thinking: but iolite already provides an uncertainty for each result. This is the "internal uncertainty" because it's the variation of the points within your selection for your chosen channel. If we take a result for Pb206/U238 final as an example, for a selection you'll get a mean/median value and some uncertainty. This is calculated for the points within the selection for that channel. However, this uncertainty doesn't take into account things such as the effect of splining your baselines, your choice of downhole fit etc. So we find that these internal uncertainties are usually lower than we'd expect if we'd measured the same reference material over and over again, and this suggests that there are other sources of uncertainty not captured just by reporting the uncertainty of the points within your selection.

So Chad Paton (and some other advisors) came up with the idea of looking at your primary reference material results to look at how much the results vary in comparison with the internal uncertainty.
To do this, iolite uses a metric called the "Mean Squared Weighted Deviation" or MSWD for short. This metric looks at a group of measurements, each with their own uncertainty, and determines whether the spread in results is realistic given the uncertainties on each of the measurements. We can use this to identify when our internal uncertainties are perhaps too small given our spread in means.

In the early days when this was first introduced, there were not a lot of secondary reference materials measured along with our samples, so instead of using the secondary reference materials, the error propagation routine uses the primary reference material results. Now we can't use them just as they are because they've been used in the calculation of the final results, and that would be circular reasoning. Instead what the error propagation routine does is pulls each primary RM measurement out, and treats it like it was an unknown. When the measurement is removed, the splines are recalculated and the result for that measurement is recorded. After this is done for all the measurements of the primary RM*, we have a pool of results that we can examine with our MSWD statistic to see if the internal uncertainties match up with the spread in results. This pool of results, where the primary RM is pulled out one measurement at a time and treated as an unknown, is called the "pool of pseudo-secondary reference materials results". It's pseudo-secondary because it's actually our primary RM, but we're treating it as an unknown.

One property of the MSWD is that it will be greater than 1 if the spread in measurements is larger than that predicted by the internal uncertainties. If this is the case for the pool of pseudo-secondary RM results, it suggests that the internal uncertainties are not taking into account all the uncertainty in our experiment. So iolite will add a little bit of uncertainty to the results and recalculate the MSWD. This little bit of extra uncertainty is called the "excess uncertainty" because it's added onto the internal uncertainty. iolite keeps adjusting this excess uncertainty until it gets an MSWD of roughly 1 for the pool of pseudo-secondary results. It then records this excess uncertainty as a percentage so that it can be applied to all the other results by adding it in quadrature.

The "propagated uncertainties" you see in iolite are the internal uncertainty with the excess uncertainty added to it.

If this was to work perfectly, you could look at your secondary reference materials, and with their propagated uncertainty, they should have an MSWD of 1 because the uncertainty of each measurement matches the spread in measurements.

Sometimes iolite doesn't need to add any excess uncertainty at all because the pool of pseudo secondary results has an MSWD of 1 to start with.

Sometimes you might see a message that iolite did not calculate propagated uncertainties because you haven't measured the primary RM enough times. To get a reasonable estimate of the MSWD, you need to have measured the primary RM about 15 times. If we calculated the MSWD on fewer results, would could be adding unnecessary excess uncertainty to our results just because the MSWD is faulty, which iolite would rather not do, so it just tells you why it didn't propagate the errors.

Note that all of this may not be necessary if you collect your data into iolite's Database feature and calculate uncertainties from that. You would still need to add any excess uncertainty you might calculate to each of your results, but there is a script that can do that for you.

I hope that clears up some of these concepts. Please post below if you have any questions.

-Bence

We don't actually use all the results of the primary RM because the spline might go funny (unconfined) if we took out the first and last measurements. So we can only, at most, use the total number of measurements - 2 to create our pool of results to test the MSWD.

ellenalexander

Hi Bence,

Is there documentation somewhere of the actual calculations being made? I am getting spuriously low MSWD values even on the basic error calculation, and they only get lower for the propagated error. I get extremely large individual sample uncertainty with low MSWD, suggesting the analytical error is being significantly overestimated in the calculation. Could you please point me to where to find the source code and/or explain the actual calculations implemented for both the internal and propagated error calculations?

Thanks,
Ellen

Bence

ellenalexander

I'll look into posting the relevant bit of the code somewhere public, but at the moment because it's written in C++ it's a little hard to expose (c.f. iolite v3 where we just made the Statistical functions.ipf file unencrypted).

In the meantime, if you could please send us an example where the error propagation decreases the MSWD, that would be very helpful, and we can look at fixing it if the issue is in the code.

Thanks,
Bence

ellenalexander

Hi Bence,

Here is a google drive folder with some example data (ICP data is in the .zip file). I've attached a screenshot with the age results calculated with VizualAge showing the way the MSWD decreases and the uncertainties increase. It seems that even the internal uncertainties are overestimating analytical error, and the "propagated" uncertainties exacerbate the problem.

I don't necessarily feel the need to see the code, but it is really important for me as a user to know what's actually being calculated from the raw data; otherwise I can't tell if the problem is with the raw data or with the data reduction. It would be great to see some documentation specifically explaining the math behind the calculation steps, because right now it's impossible to tell whether my data are the problem or the data reduction is giving spurious results without just doing the data reduction manually.

Thanks,
Ellen

Bence

Hi Ellen,

I totally agree that we need to improve our documentation of the error propagation maths/process. I'll start adding to the documentation and post back here when it's done.

However, in the mean time I can answer why it appears 'things are going wrong' with your data. It appears from your screenshot that you have just one selection group (ignoring baselines for now) that is your primary reference material, and that you're looking at the Final 206Pb/238U ratio channel. The reason that the MSWD of this channel is much less than 1 is because it is the primary reference material. The final step of the U/Pb data reduction process is to normalise to the primary reference material to correct for mass bias and sensitivity drift. This effectively gives you the 'right answer' for your primary RM because we're correcting to it and any scatter in this group is just due to the spline not quite going through the center of your selections. If you chose a completely un-smoothed spline, you should get close to the exact same value for all selections in this group, and so variation in this group of results (i.e. your selections) is not just due to sampling a normal (Gaussian) distribution.

Because we are normalising to the accepted value for the primary RM, we can't use this final ratio channel to determine excess uncertainties (because the selections will be all the same value ± some variation due to the spline fitting). That is why the error propagation is done using a channel that hasn't had it's variation affected by normalisation (normally the downhole-corrected ratio DC 206Pb/238U). Because the group of results for the Final 206Pb/238U channel should have all one value, any uncertainties on our selections are going to result in an MSWD of much less than one, because this is not a normally distributed group of results.

A much better check of whether the error propagation is working is to look at your secondary reference materials. If the MSWD for the final ratios of these groups of results is much less than one, that would suggest that the uncertainties for each selection are over-estimated, and in that case, I'd recommend sending us your example dataset so that we can check what's going on.

So, to summarise, don't expect an MSWD of 1 for the final ratios for your primary RM: either internal or propagated. Check your secondary RMs to make sure your error propagation makes sense.

We'll work on making the documentation more comprehensive and post here when it's ready.

If you have any questions about any of the above, please let us know. It's really good to discuss these things as I'm sure others have similar questions and concerns and getting some discussion going is the best way to ensure everyone is happy/confident with their data reduction.

-Bence

Bence

Also, I just wanted to add that if you have only measured one sample within your experiment, you can split the selections into a group for the primary RM, and another group that also contains selections of primary RM that is treated as an unknown. I hope that makes sense? Please let me know if not 😃 .

-Bence

LyndseyF

Was this ever added to the documentation? I'm currently trying to understand what the 2SE value is for a 3D TE data set but I can't seem to find any info on how this is calculated.

Bence

Hi @LyndseyF

The error propagation process is different for U-Pb (and other isotope systems where the error propagation procedures are used), which is described here. Joe and I have also been working on a tool to visualise the process, but it isn't quite ready yet. I'll post here when it is.

For 3DTE, the 2SE is simply the uncertainty for data points in the selection's interval for the output channel (e.g. Sr88_ppm). At present, we do not propagate the uncertainty of the block fit, or any of the other corrections that are available in the 3DTE DRS. You can include the uncertainty of the Ref Mat value for each element by checking the 'RefMatUncertInc' option at Export.

If you have any other questions, please just let me know.

-Bence