Making the Most of Uncertain Low-Level Measurements

80
Making the Most of Uncertain Low-Level Measurements Presented to the Savannah River Chapter of the Health Physics Society Aiken, South Carolina, 2011 April 15 Daniel J. Strom, Kevin E. Joyce, Jay A. MacLellan, David J. Watson, Timothy P. Lynch, Cheryl. L. Antonio, Alan Birchall, Kevin K. Anderson, Peter A. Zharov Pacific Northwest National Laboratory [email protected] +1 509 375 2626 PNNL-SA-75679

description

Making the Most of Uncertain Low-Level Measurements . Presented to the Savannah River Chapter of the Health Physics Society Aiken, South Carolina, 2011 April 15 - PowerPoint PPT Presentation

Transcript of Making the Most of Uncertain Low-Level Measurements

Health Physics Society Minneapolis 2009

Making the Most of UncertainLow-Level Measurements Presented to the Savannah River Chapter of the Health Physics SocietyAiken, South Carolina, 2011 April 15

Daniel J. Strom, Kevin E. Joyce, Jay A. MacLellan, David J. Watson, Timothy P. Lynch, Cheryl. L. Antonio, Alan Birchall, Kevin K. Anderson, Peter A. Zharov

Pacific Northwest National [email protected] +1 509 375 2626

PNNL-SA-75679Health Effects of RadiationCCHPS Health Physics Review 2008Original content copyright (c) 2008 Daniel J. Strom1Making the Most of Uncertain Low-Level MeasurementsStrom DJ, KE Joyce, JA MacLellan, DJ Watson, TP Lynch, CL Antonio, A Birchall, and PA Zharov Many measurement techniques in occupational and environmental monitoring produce results that have very substantial uncertainties. In many cases, a blank or background value must be subtracted from a gross result to produce a net result, a process that can produce non-physical, negative net results. When uncensored measurement results on a population are accompanied by well-characterized uncertainties, a novel method developed at PNNL separates the variance of the observations into two components, one arising from uncertainty and the other arising from population variability. Assuming the uncertain measurements are independent and unbiased, the method forces the arithmetic mean of the measurands (the true values) to equal that of the measurements. Further assuming that the measurands are lognormally distributed, the geometric mean and geometric standard deviation are calculated for a distribution of possibly true results. Assuming that the Bayesian prior probability density functions (PDF) of each individual measurement is a lognormal distribution comprised of every other measurand, posterior PDFs for each measurand are computed using Bayess theorem. The method corrects physically meaningless negative measurement results into PDFs of small positive values, but does little to alter large positive measurement results. Some surprising implications are illustrated by application to bioassay measurements of Sr-90, Cs-137, and Pu-239.Abstract submitted to The 44th Annual Midyear Meeting of the Health Physics Society, Charleston, SC. PNNL-SA-75679.Prologue2Uncertainty is different for sets of sets of data than it is for single data pointsIf you have more than one uncertain measurement, you need to learn about measurement error modelsHPs generally do not speak the language of statisticians well enough to be comprehended is not a synonym for standard deviations is not is not We have to get smarter!Or some biostatistician will commit regression calibration on our numbers!

Carroll RJ, D Ruppert, LA Stefanski, and CM Crainiceanu. 2006. Measurement Error in Nonlinear Models: A Modern Perspective. Chapman & Hall/CRC, Boca Raton.3OutlineCensoringThe lognormal distributionMeasurements and measurandsRequirements and assumptions for this novel methodPopulation variability and measurement uncertaintyDisaggregating the varianceDistribution of measurandsThe everybody prior4Outline 2Probability distributions for individual measurandsThe Bayesian approach The everybody else priorApplications to real radiobioassay dataThe importance of accurate uncertainty Bohrs correspondence principleConclusions 5CensoringChanging a measurement resultCommon practicesSet negative values to 0Set all results less than some value to0 the valueThe valueA non-numeric character like MChanging measurement results causes great problems in statistical inferenceDR Helsel. 2005. Nondetects and data analysis. Statistics for censored environmental data. John Wiley & Sons.This method requires uncensored data6The Lognormal DistributionFrequently observed in NatureMultiplication of arbitrary distributions results in lognormalsOtt WR. 1990. A Physical Explanation of the Lognormality of Pollutant Concentrations. J.Air Waste Mgt.Assoc. 40 (10):1378-13837Measurand, Measurement, Error, and Uncertainty (ISO)measurand: particular quantity subject to measurementalso, the true value of the quantity subject to measurementresult of a measurement: value attributed to a measurand, obtained by measurementerror: the unknown difference between the measurand and the measurementthis is a different meaning from the theoretical concept in statistics!uncertainty: a quantitative estimate of the magnitude of the errorstatisticians often do not distinguish between error and uncertainty and may use them synonymously8Requirements and AssumptionsThis method requires uncensored datasmall values are reported as they are calculated, with no rounding, setting negative values to zero, or otherwise changingAssume measurands are lognormally distributedMany populations in nature are lognormally distributedLognormal common in radiological and environmental measurementsOther functions could be used as long as they have a mean9Population Variability and Measurement UncertaintyThe sample variance of a set of measurements on a population arises from two sources:population variabilitymeasurement errorIf measurements have no error, then all observed sample variance is due to variability in the population10Measurement Error ModelTrue values (measurands) ti give rise to measured values miWe have good independent estimates of the combined standard uncertainty ui of each measurement mi mi = ti + uiui ~ N(0, ui2)We calculate the sample variance of miWe use sample variance and a summary measure of the ui to estimate the variance due to population variability of ti Observed Spread11Spread of Measurement Results (Sample Variance) Is Due to 2 CausesVariability within PopulationAverage Measurement Uncertainty12Spread of Measurement Results (Sample Variance) Is Due to 2 CausesVariability within PopulationuRMSObserved Spread13Spread of Measurement Results (Sample Variance) Is Due to 2 CausesuRMSs(mi)

The reliability or attenuation or variability fraction is

Analogous to a correlation coefficientr2: fraction of variance explained by modelr 2: fraction of variance due to measurand variability

Sample Variance of the Measurements

Estimated Variance of the Measurands

Mean Square Measurement Uncertainty14Estimating the Variance of the Distribution of Measurands Known Known Calculated

Health Effects of RadiationCCHPS Health Physics Review 2008Original content copyright (c) 2008 Daniel J. Strom1415Distribution of MeasurandsThe estimated variance of the measurands is Assume measurands are lognormally distributedAssume the expectation of the measurands equals the mean of the measurements:

measurements are unbiasedthis assumption respects the dataCalculate the parameters of the lognormalgeometric meangeometric standard deviation sGThis is the distribution of possibly true values

16Analysis of Baseline Radiobioassay Data90Sr: 128 baseline urine bioassaysEveryone is exposed to global falloutgas proportional counter100-minute counts137Cs: 5,337 baseline in vivo bioassaysEveryone is exposed to global fallout & Chernobylcoaxial high-purity germanium (HPGe) scanning system 10-minute scans239+240Pu: 3,270 baseline urine bioassaysAll exposure is occupational; essentially no environmental exposure in North America-spectrometry~2,520 minute counts

probability density137Cs (mBq/kg)

239Pu (Bq/sample)probability density

probability density90Sr (mBq/day)The Everybody Probability Density Function (PDF): A Distribution of Possibly True ValuesHistogram and PDF have identical arithmetic meansHistogram of dataPDF of measurands18Probability Distributions for Individual MeasurandsNow that we have the lognormal PDF of all measurands, what can we say about individual measurands?Each individuals measurand is somewhere within the population of measurandsWe now assume that each mi, ui pair is the mean and standard deviation of the Normal likelihood PDF for individual iAssume the ith measurement was the last one made in the populationWhen the ith measurement was made, the other M1 m and u values were knownUse this with Bayess theorem19The Bayesian Approach to Assigning Possibly True Results to Individuals

Thomas Bayes 1702 176120Bayesian Method for IndividualsInstead of the everybody PDF, the everybody else PDF is used as the prior for each individualEach individuals likelihood is a normal distribution with mean mi and standard deviation uiUsing Bayess theorem, we developed a method to derive a posterior probability density function (PDF) for each individuals measurand ti

21Applications to Real Radiobioassay Data

Impossible!For Pu measurements, either the uncertainties ui are overestimated, or a covariance term has been neglected.

s(xi)

22137Csr2=0.35137CsVariability Fractions r2

2390Srr2=0.15137Csr2=0.3590Sr137CsVariability Fractions r2

24239Pur2~090Srr2=0.15137Csr2=0.35239Pu90Sr137CsVariability Fractions r2

2590Sr Results for 4 IndividualsMeasurementLikelihood PDFPriorMeasurandUncensored Data Are Critical!Negative Result

Result Average

Result = Large Positive

Result 0

Health Effects of RadiationCCHPS Health Physics Review 2008Original content copyright (c) 2008 Daniel J. Strom25A Movie of 128 90Sr ResultsShort Dashes (Green): Likelihood (Data)Long Dashes (Red): Everybody Else PriorSolid (Blue): Posterior26

27

90Sr Measurands v Measurements

28

90Sr Measurands v Measurements

Assigned Uncertainty

292929Effect of Reducing Uncertainty29

Assigned Uncertainty

30303030Effect of Reducing Uncertainty30

Assigned Uncertainty

3131313131Effect of Reducing Uncertainty31

Assigned Uncertainty

3232323232Effect of Reducing Uncertainty32

Assigned Uncertainty

33

3333333333Effect of Reducing Uncertainty33

Assigned Uncertainty

34

uRMS(i)Visualizing Uncertainty Reductionr2 = 0.15

35uRMS(i)Visualizing Uncertainty Reductionr2 = 0.15r2 = 0.57

36uRMS(i)Visualizing Uncertainty Reductionr2 = 0.15r2 = 0.57r2 = 0.78

37uRMS(i)Visualizing Uncertainty Reductionr2 = 0.15r2 = 0.57r2 = 0.78r2 = 0.94

38uRMS(i)r2 = 0.15Visualizing Uncertainty Reductionr2 = 0.57r2 = 0.78r2 = 0.94r2 0

39The Common View: The Measurement Is the MeasurandOops! Activity < 0 is meaningless.Oh, no! Results arebelow some level(DL, DT, LOD, etc.).Might not be real!

Tilt

40The Bayesian View: The Measurement and the Prior Give the Measurand

41The Bayesian View: The Measurement and the Prior Give the Measurand

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

64

The Bayesian View: The Measurement and the Prior Give the MeasurandHealth Effects of RadiationCCHPS Health Physics Review 2008Original content copyright (c) 2008 Daniel J. Strom6465

5,337 137Cs Measurements Showing Uncertainty in Measurements & Measurands

66Assigned Uncertainty

5,337 137Cs Results with Same UncertaintyHealth Effects of RadiationCCHPS Health Physics Review 2008Original content copyright (c) 2008 Daniel J. Strom66

67Assigned Uncertainty

5,331 137Cs Results68The Importance of Accurate UncertaintyNearly the same measurement results(lower) 2.5s(upper)Upper posterior resembles likelihood (i.e., measurement)Lower posterior resembles prior

Upper (red) pointLower (yellow) pointeverybody else priorlikelihood (data)posterior (measurand)everybody else priorlikelihood (data)posterior (measurand)69

5,331 137Cs Results70

Assigned Uncertainty

5,331 137Cs Results Log Scale71Innovation 1this work addresses the situation in which each measurement is accompanied by a very good estimate of its uncertaintynot described in the literature reviewedwhich occurs routinely in radiochemical and radiobioassay measurements72Innovation 2This work provides a solution to the vexing problem of making sense of negative measurement results for a quantity, such as activity in becquerels, which physically must be nonnegativenone of the literature addresses negative valuesThe method makes sense of uncertain low-level measurementswithout injecting a bias into the dataset by left-censoringby implicitly recognizing that spurious negative results are accompanied by an equal amount of spurious positive signal73Innovation 3This work provides posterior estimates, in the form of probability distributions, of the true value of each measurandwhile the literature is concerned with correcting estimates of slopes of dose-response relationships for the effects of classical measurement error74Innovation 4This work shows that accurate estimates of uncertainty are as important as the values of the measurement resultsoverestimates of uncertainty can lead to nonsense results75Innovation 5This work provides the ability to explore the impact of the magnitude of uncertainty on the posterior distribution of measurands by thought experiments involving substitution of the mean square measurement uncertainty, or some multiple or submultiple of it, for the individual uncertainties 76Innovation 6The method is shown to closely correspond to classical (frequentist) methods when uncertainty is relatively small77Innovation 7This work answers the questions, conditional on plausible assumptions, What true state of nature gave rise to this set of observations?For each individual measurement result, what are the probable values of the measurand that led to this measurement result?The authors believe that the method represents a significant step forward in the making sense of groups of uncertain, low-level radioactivity measurements78ConclusionsSample variance of a set of measurements is disaggregated intomeasurement uncertaintypopulation variabilityA reasonable, possible distribution of measurands for a population is the resultWhen , positive posterior PDFs of the measurand are computed using everybody else priors for each individualnegative values are eliminatedmean of measurements is preservedWhen there is essentially no variance in the data due to population variability, the method cannot be expected to work, and it does not work

79Conclusions: Utility and Correspondence The method eliminates negative measurement results in an uncensored data set and preserves the arithmetic mean of the data setIf measurement results have a relatively large uncertainty, the posterior PDF of the measurand resembles the priorIf the measurement results have a relatively small uncertainty, the posterior PDF of the measurand resembles the likelihood, that is, it is relatively close to the measurement result before application of the Bayesian methodsBest estimate of uncertainty is just as important as measurement!As required by Bohrs correspondence principle, results produced by the methods introduced here correspond to results of traditional statistical inference in the domain in which that inference is known to be correctAuthors80

Strom MacLellan Joyce Watson Lynch Antonio Zharov Birchall (Mayak PA) (UK HPA) (Scherpelz, Vasilenko)AcknowledgmentsPNNL: Kevin Anderson, Gene Carbaugh, Michelle Johnson, Bruce Napier, Bob Scherpelz, Paul Stansbury, Rick TraubSUBI: Vadim Vostrotin