Managing Calibration Confidence in the Real World

Similar documents
Guard-banding Methods-An Overview

Measurement Decision Risk and Decision Rules in the new ISO/IEC 17025

Chapter 16. Financial Statements for a Partnership. South-Western Educational Publishing

DATA GAPS AND NON-CONFORMITIES

Window Width Selection for L 2 Adjusted Quantile Regression

Journal Of Financial And Strategic Decisions Volume 10 Number 2 Summer 1997 AN ANALYSIS OF VALUE LINE S ABILITY TO FORECAST LONG-RUN RETURNS

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

Evaluation of Proficiency Testing Results and the elimination of Statistical Outliers. Mr. Neville Tayler South African National Accreditation System

Traditional Optimization is Not Optimal for Leverage-Averse Investors

Alternative VaR Models

Spline Methods for Extracting Interest Rate Curves from Coupon Bond Prices

The Brattle Group 1 st Floor 198 High Holborn London WC1V 7BD

Overview of Standards for Fire Risk Assessment

A STUDY ON INFLUENCE OF INVESTORS DEMOGRAPHIC CHARACTERISTICS ON INVESTMENT PATTERN

Predicting Economic Recession using Data Mining Techniques

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

Stock Trading System Based on Formalized Technical Analysis and Ranking Technique

Honest Precision to Tolerance Ratios

Superiority by a Margin Tests for the Ratio of Two Proportions

ASSUMPTIONS III. OPERATING PROCEDURE. The operating procedure of six sigma single sampling variables plan is described below:

Approximating the Confidence Intervals for Sharpe Style Weights

Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions

ASTM D02, Dec./ Orlando Statistics Seminar

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

THE ECONOMICS OF BANK ROBBERIES IN NEW ENGLAND 1. Kimberly A. Leonard, Diane L. Marley & Charlotte A. Senno

(recast) (Text with EEA relevance)

Journal of Insurance and Financial Management, Vol. 1, Issue 4 (2016)

RISK ANALYSIS AND CONTINGENCY DETERMINATION USING EXPECTED VALUE TCM Framework: 7.6 Risk Management

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Term Par Swap Rate Term Par Swap Rate 2Y 2.70% 15Y 4.80% 5Y 3.60% 20Y 4.80% 10Y 4.60% 25Y 4.75%

to the end of the i tn year following banding). 00 is the probability

Impact of Weekdays on the Return Rate of Stock Price Index: Evidence from the Stock Exchange of Thailand

Research Article Portfolio Optimization of Equity Mutual Funds Malaysian Case Study

Comment Letter No. 2

STATE OF CONNECTICUT PUBLIC UTILITIES REGULATORY AUTHORITY DOCKET NO

Non-Inferiority Tests for the Ratio of Two Proportions

At the height of the financial crisis in December 2008, the Federal Open Market

Monitoring Processes with Highly Censored Data

Measuring investment in intangible assets in the UK: results from a new survey

ROBUST CHAUVENET OUTLIER REJECTION

Chapter 9 Case on Quality Engineering: Acceptance Sampling

Portfolio Management Using the Bloomberg Professional Service

ISyE 512 Chapter 6. Control Charts for Variables. Instructor: Prof. Kaibo Liu. Department of Industrial and Systems Engineering UW-Madison

VALIDATING MORTALITY ASCERTAINMENT IN THE HEALTH AND RETIREMENT STUDY. November 3, David R. Weir Survey Research Center University of Michigan

Quantitative Trading System For The E-mini S&P

R & R Study. Chapter 254. Introduction. Data Structure

Biol 356 Lab 7. Mark-Recapture Population Estimates

Two-Sample T-Tests using Effect Size

Comparison of OLS and LAD regression techniques for estimating beta

Lecture Quantitative Finance Spring Term 2015

Supplementary Material: Strategies for exploration in the domain of losses

An Examination of the Predictive Abilities of Economic Derivative Markets. Jennifer McCabe

Artificially Intelligent Forecasting of Stock Market Indexes

STOCHASTIC COST ESTIMATION AND RISK ANALYSIS IN MANAGING SOFTWARE PROJECTS

Richardson Extrapolation Techniques for the Pricing of American-style Options

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method

Modelling the Sharpe ratio for investment strategies

An IMEX-method for pricing options under Bates model using adaptive finite differences Rapport i Teknisk-vetenskapliga datorberäkningar

LAST SECTION!!! 1 / 36

Anomalies under Jackknife Variance Estimation Incorporating Rao-Shao Adjustment in the Medical Expenditure Panel Survey - Insurance Component 1

STAT 157 HW1 Solutions

Using Fractals to Improve Currency Risk Management Strategies

In the past decade, there has been a dramatic shift in the

Chapter 4 Variability

Comment on Gary V. Englehardt and Jonathan Gruber Social Security and the Evolution of Elderly Poverty

Indiana Fiscal Policy Institute Brief REVENUE FORECASTING: INDIANA S METHOD AND THE RESULTS IT PRODUCES

A Skewed Truncated Cauchy Logistic. Distribution and its Moments

Historical Trends in the Degree of Federal Income Tax Progressivity in the United States

Developing Optimized Maintenance Work Programs for an Urban Roadway Network using Pavement Management System

Portfolio Optimization using Conditional Sharpe Ratio

Audit Sampling: Steering in the Right Direction

STOCHASTIC COST ESTIMATION AND RISK ANALYSIS IN MANAGING SOFTWARE PROJECTS

INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ENGINEERING AND TECHNOLOGY (IJARET)

Challenges For the Future of Chinese Economic Growth. Jane Haltmaier* Board of Governors of the Federal Reserve System. August 2011.

Predicting the Success of a Retirement Plan Based on Early Performance of Investments

Statistics 101: Section L - Laboratory 6

FORECASTING OF VALUE AT RISK BY USING PERCENTILE OF CLUSTER METHOD

RISK POOLING IN THE PRESENCE OF MORAL HAZARD

1035. Pursuant to Article 95 item 3 of the Constitution of Montenegro, I hereby pass ORDINANCE PROMULGATING THE LAW ON METROLOGY

1 Introduction. Domonkos F Vamossy. Whitworth University, United States

Data Analysis. BCF106 Fundamentals of Cost Analysis

Testing Methodologies for Credit Score Models to Identify Statistical Bias toward Protected Classes

The Golub Capital Altman Index

ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA

The mathematical model of portfolio optimal size (Tehran exchange market)

Using Monte Carlo Analysis in Ecological Risk Assessments

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

Robust Critical Values for the Jarque-bera Test for Normality

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

Non-Inferiority Tests for the Odds Ratio of Two Proportions

PIPELINE RISK ASSESSMENT

Power of t-test for Simple Linear Regression Model with Non-normal Error Distribution: A Quantile Function Distribution Approach

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION

Possible Legislative Guidelines for Constructing the Corporate Bond Yield Curve

CORVINUS ECONOMICS WORKING PAPERS. Quota bonuses as localized sales bonuses. by Barna Bakó, András Kálecz-Simon CEWP 1/2016

Prince2 Foundation.exam.160q

FINANCE 2011 TITLE: RISK AND SUSTAINABLE MANAGEMENT GROUP WORKING PAPER SERIES

Notes on Estimating the Closed Form of the Hybrid New Phillips Curve

Mean Variance Analysis and CAPM

Transcription:

Managing Calibration Confidence in the Real World David Deaver Fluke Corporation Everett, Washington ABSTRACT: Previous papers have investigated the risk of making false test decisions as a function of Test Uncertainty Ratio, Confidence Interval, and Guardbands for fairly idealized distributions. This paper continues that pursuit by assessing the impact less than ideal distributions have on the Consumer Risk and Producer Risk. Specifically, the increase in risk associated with systematic bias in the unit under test and the measurement standard is investigated. INTRODUCTION: Standards that govern the operations of a calibration or standards lab are written by experts in the calibration and the academic communities to control and improve the quality of the calibration process and to provide a means of audit or critique of that process. Of particular concern is the adequacy, maintenance, and traceability of standards which are used to calibrate subject units under test. One of the tests for adequacy is the extent to which the accuracy of the calibration standard (STD) exceeds that of the unit under test (UUT). Many trees have been sacrificed to record the efforts of those who have labored to analyze the risks associated with making an imperfect determination of the state of UUT. The impact of declaring a defective unit to be performing within its specifications (false accept) must be weighed against the costs of performing the calibrations themselves and the cost of declaring a properly performing instrument as exceeding its specifications (false reject). It is a difficult task for the authors of these documents to anticipate the impact of false test decisions in every application of calibration. 995 NCSL Workshop & Symposium Session A

THE ADEQUACY REQUIREMENT: The adequacy clause in Z540- [] is found in section 0..b and 0.3: The laboratory shall ensure that the calibration uncertainties are sufficiently small so that the adequacy of the measurement is not affected. Well defined and documented measurement assurance techniques or uncertainty analyses may be used to verify the adequacy of a measurement process. If such techniques are not used, then the collective uncertainty of the measurement standards shall not exceed 5% of the acceptable tolerance (e.g., manufacturer s specification) for each characteristic of the measuring and test equipment being calibrated or verified. Where methods are not specified, the laboratory shall, wherever practical, select methods that have been published in international or national standards, those published by reputable technical organizations or in relevant scientific texts or journals. Similar requirements are found in other standards. It is the observation of the author that, in the majority of the calibration and standards laboratories, that the test uncertainty ratio (TUR) provision of these standards (...shall not exceed 5% of the acceptable tolerance...) receives far more attention than the sections that surround it. This has resulted in the widespread presumption that the risks associated with a TUR of 4: are acceptable and those less than 4: are unacceptable. Though the statisticians and mathematicians have determined that a test uncertainty ratio of 4: (or 0: or 3:) provides adequacy in most instances, the responsibility still rests with the laboratory to determine that the associated risks implied by the procedure are acceptable. Metrology costs must be weighed against the consequences of false test decisions. In some critical applications, huge false reject losses may be acceptable in order to reduce the probability of a false accept. In others, where the cost of false test decisions is not particularly high, considerable savings in calibration costs may be realized by using lower accuracy standards. Numerous technical papers [-0] have been written to allow for risk assessment to be performed as a function of many more parameters than just TUR. Previous papers by this author [,] provided charts, figures, and tables to allow risk to be estimated and managed as a function of confidence interval and guardband factor as well as TUR. This paper adds charts and figures to show how systematic bias in the UUT and the STD affects the risk of false test decisions. This analysis was undertaken in response to concerns that the risk assessments and guardbanding recommendations presented in those papers may be too idealized (assuming normal distributions with means centered within the specification limits, etc.) and do not reflect the actual characteristics of the standards and units under test likely to be encountered. It is interesting that the same concern does not seem to be directed to the similar assumptions that surround the determination of adequacy based on a TUR of 4:. 995 NCSL Workshop & Symposium Session A

Standard v UUT u -3 - - 0 3 -L -kl s t +kl L UUT: The distribution of possible values for the unit under test. STD: The distribution of possible values for the Standard. t: Local variable for the UUT distribution t: A possible value of the UUT s: Local variable for the Standard s: A possible value of the Standard with the UUT at t L: Specification Limit kl: Test Limit [Guardband Factor (k) times the Specification Limit (L) ] u: Offset bias of the UUT v: Offset bias of the Standard R: Test Uncertainty Ratio CR: Consumer Risk or False Accept Risk PR: Producer Risk or False Reject Risk Figure : Nomenclature IMPLIED RIS: Figure summarizes some of the work done in the previous papers [,] and shows the level of risk implied by Z540, MIL-STD-4566A, and other standards which use 4: as the recommended minimum TUR if we assume no offset bias in the UUT nor the standard and normal distributions with confidence intervals of ±σ. With these assumptions, the risk of false accepts, or consumer risk is %. As the chart shows, however, that level of risk can be maintained at lower test uncertainty ratios if the confidence interval is higher (more conservative specs) or through the use of guardband techniques (dashed lines). The curves were generated using MathCAD [3] to solve: Eq. CR = π L s + t Rt ( k L) e dsdt Rt ( + k L) 995 NCSL Workshop & Symposium 3 Session A

CONSUMER RIS WITH UUT BIAS: The number of UUTs out of tolerance (OOT) rises significantly with UUT bias. Figure 3 shows OOT probabilities from σ to 6σ with offset biases of 0%, 5%, 50%, 75%, and 00%. The Motorola 6σ paper [4] concurs that the OOT probability is only parts per billion if the confidence interval is ±6σ with no offset bias but indicates that a more reasonable expectation is that there would be some bias. They picked.5σ of bias (5%) which yields and OOT probability of 3.4ppm. At a ±σ confidence interval, the OOT probability increases from 4.6% with no bias to 7.3% with 5% bias and 6% with 50% bias. At 00% bias, the mean of the distribution is aligned with the specification limit and half the units will be out of tolerance as can be seen in the figure. The equation to calculate the OOT condition of a normal distribution with bias is: Eq. z L ( t u) L OOT = e dt π The significant increase in the number of units out of tolerance due to UUT bias will increase the probability of false test decisions since there is a higher probability of units being at or near the specification limit. Figure 4 shows the effects of the UUT bias on the probability of accepting defective units (Consumer Risk) at the ±σ confidence level. Equation 3 was evaluated with UUT bias (u) to generate the curves. Charts are shown for TURs of 4: and : and with guardbands. First of all, it can be seen that for small biases, there is very little effect on the risk. All the curves have zero slope at the zero bias point because the slopes of the normal distribution at the specification limits are equal; the increase in OOT risk at one specification limit is canceled by a decrease in OOT risk at the other. As the amount of bias increases, however, the slopes are unequal and the increase in risk with bias at one limit is more than the reduction in risk at the other. Note also that guardbanding seems to still work with bias; the lines for the different guardband factors (k) are reasonably parallel. Figure 5 plots each of the ±σ curves relative to its value with no bias. Here it can be seen fairly easily that the risk is reasonably tolerant of modest biases, doubling for 40% bias. We can see that the increase in risk with bias is a little higher for lower TURs but that guardbanding (k values less than ) reduces the risk slightly. For example, if a calibration were being made with a 4: TUR at the ±σ confidence level and an offset bias of 80% were encountered, from the relative curves of Figure 4, we see that the risk of falsely accepting an out of tolerance unit is 4. times greater than with no bias. (The absolute risks of 3.4% vs. % can be seen on the upper charts of Figure 4.) What if we had only a : TUR and but were using a guardbands to maintain the risk of %? From the previous papers [,] we found a guardband factor (k) of would be required. With an 80% bias, the risk would be 4.8 times greater than with no bias (TUR=:, k= curve at 80% bias). Thus, with 80% bias, using guardbands and a 995 NCSL Workshop & Symposium 4 Session A

: TUR increased the risk only slightly (4.75/4.4 = %) more than the factor of 4. that would have been experienced with a 4: TUR. Figure 5 also contains plots consumer risk with UUT bias but at the ±3σ confidence level. Their shapes are similar to those of Figure 4, the ±σ relative curves, but the risk increases by a much larger factor than for the ±σ case. With 40% bias the ±σ risk doubled, but the same 40% bias causes a ten-fold increase in the risk in the ±3σ case. Comparing the absolute risk charts, we see that ±3σ confidence level gives much more improvement at low biases than at high biases. Indeed we saw in Figure 3, in the extreme case of 00% bias, that 50% of the units are out of tolerance regardless of the confidence interval. Though the high confidence units continue to out-perform the lower less conservatively specified units, the tremendous improvement in risk level is diminished in the presence of bias. Equation 3 was used to calculate the consumer risk for UUT bias (u 0) and for bias in the standard (v 0): Eq. 3 ( t u s v ) + ( ) L Rt ( kl ) CR = e ds dt π Rt ( + kl ) + π L Rt ( kl ) Rt ( + kl ) e ( t u s v ) + ( ) ds dt CONSUMER RIS WITH STANDARD BIAS: By varying v, in Equation 3, we can investigate the effect that bias in the standard has on the consumer risk. Figure 6 shows the consumer risk for confidence intervals of ±σ and Figure 7 for ±3σ, but this time as a function of offset bias in the standard. These figures show somewhat different behavior for standard bias vs. UUT bias. Whereas we saw larger relative increases in the TUR=: curves for UUT bias, for standard bias, the larger relative increases are in the TUR=4: curves. The relative increase is worse with guardbanding instead of less as we saw in the UUT bias curves. In fact if we use aggressive guardbanding with TUR=4:, the tremendous reductions in risk are much less if we have bias in the standard. For example, the risk at 80% bias with TUR=4: increases to 0.3% from 0.006% with no bias, a factor of about 0. For the ±3σ curves the increase is a whopping factor of 550 but only to a risk of 0.005%. 995 NCSL Workshop & Symposium 5 Session A

For the areas of practical metrology, however, the results are again quite reasonable. If we again consider our strategy of using a k= guardband with a TUR of, with 80% bias we could expect the consumer risk to twice the no-bias risk vs. increase a factor of.7 if we were using a 4: TUR. PRODUCER RIS WITH BIAS: Equation 4 was used to evaluate producer risk with UUT bias (u 0) and standard bias (v 0) similarly to the way Equation 3 was used to evaluate the consumer risk. Eq. 4 ( t u s v ) + ( ) L Rt ( + kl ) PR = e ds dt π L + π L L Rt ( kl ) e ( t u s v ) + ( ) ds dt Figures 8 & 9 document the absolute and relative producer risks associated with UUT bias and Figures 0 &, standard bias, for the same conditions considered for consumer risk. For both UUT and standard bias, the relative increase in risk is lower as guardbands are applied. With UUT bias (Figures 8 & 9), the relative increase for a TUR=: is lower than with the TUR=4:, for the same guardband factor, meaning there is no producer risk UUT bias penalty when using guardbands for risk management. With standard bias, however, the TUR=: exhibit the higher relative increases in risk. For our practical situation of TUR=: and k=, however, we still have a lower increase in risk than the TUR=4: and no guardband for the ±σ case (Figure 0) and only slightly higher at ±3σ (Figure ). 995 NCSL Workshop & Symposium 6 Session A

SUMMARY: The table below summarizes the results of offset bias for the example of managing calibration risks using a : TUR and a guardband factor (k) of. The incidence of false test decisions will increase with bias regardless of TUR. The guardbanding method of maintaining the same consumer risk as a 4: TUR nearly does so in the presence of bias. For conditions where bias is suspected, the last column of the table (k=) shows that a little more guardbanding can reduce the risk below that of a 4: TUR. TUR 4: : k No CR 0%.% 6% 0.4% Bias PR.5% 4.% 7.0% % 80% UUT CR 3.4% 6.% 3.6%.9% Bias PR 3.8% 7.7% % 8% 80% STD CR.3%.8%.5%.% Bias PR 3.8% % 7% % Table : Consumer Risk and Producer Risk with bias for L=±σ CONCLUSION: An analysis of the degradation in the ability to make quality calibrations in the presence of offset bias in the UUT and the standard shows that the guardbanding techniques continue to perform quite well as an means of managing the risk of accepting units which are not performing to their specifications when it is difficult to maintain the desired 4: test uncertainty ratio. With the tools available to the metrologist today, the ability exists to use the justifications and other methods allowed in the standards to significantly improve the quality and reduce the cost of calibrations performed in calibration and standards laboratories. 995 NCSL Workshop & Symposium 7 Session A

REFERENCES: [] ANSI/NCSL Z540--994, National Conference of Standards Laboratories, August, 994 [] Capell, Frank, "How Good is Your TUR?", Evaluation Engineering, January, 99, pp. 80-84 [3] Castrup, Howard, Evaluation of Customer and Manufacturer Risk vs. Acceptance Test Instrument In-Tolerance Level, NCSL Workshop & Symposium, 980. [4] Eagle, Alan R., A Method for Handling Errors in Testing and Measuring, Industrial Quality Control, March, 954, pp. 0-5 [5] Ferling, John A., The Role of Accuracy Ratios in Test and Measurement Processes, Measurement Science Conference, 984 [6] Grubbs, Frank E. and Coons, Helen J., On setting Test Limits Relative to Specification Limits, Industrial Quality Control, March, 954, pp. 5-0 [7] Hayes, Jerry L., Calibration and Maintenance of Test and Measuring Equipment, Encyclopedia of Applied Physics, Vol 3, VCH Publishers, Inc., -5608-06- 9/9, 99 [8] Hutchinson, Bill, Setting Guardband Test Limits to Satisfy MIL-STD-4566A Requirements, 99 NCSL Workshop & Symposium, pp. 305-309 [9] Schumacher, Rolf B. F., Decision Making and Customizing Decision Limits in Measurement Assurance, 99 NCSL Workshop & Symposium, pp. 30-30 [0] Weber, Stephen F. and Hillstrom, Anne P., "Economic Model of Calibration Improvements for Automatic Test Equipment", NBS Special Publication 673, 984 [] Deaver, David, Maintaining Your Confidence (In a World of Declining Test Uncertainty Ratios), 993 NCSL Workshop & Symposium, pp. 33-53 [] Deaver, David, Guardbanding with Confidence, 994 NCSL Workshop & Symposium, pp. 383-394 [3] MathCAD is a registered trademark of MathSoft, Inc. The author has no interest nor affiliation with MathSoft except that of a satisfied user of the MathCAD software program. [4] Harry, Mikel J., "The Nature of Six Sigma Quality", Motorola Inc., Government Electronics Group 995 NCSL Workshop & Symposium 8 Session A

0 % False Accept Risk L ± σ 0. Risk Management with Guardbands ±.5 σ ± σ ±.5 σ ±3 σ 0.0 3 4 5 5 6 7 8 9 0 Test Uncertainty Ratio Figure : False Accept Risk as a Function of TUR, Confidence Interval and Guardbands 00 Bias (% of SL) At σ: At 6σ: 00% 50% 50% 0 75% 3% 6.7% Out of Tolerance Probability (%) 0. 0.0 0.00 0.000 50% 5% 6% 7.3% 0.4% 3.4ppm 0.0000 0.00000 0% 0.000000 σ σ 3σ 4σ 5σ 6σ Specification Lim it (L) 4.6% ppb Figure 3: Out of Tolerance Probability of a Normal Distribution with Offset Bias. 995 NCSL Workshop & Symposium 9 Session A

0 0% L= ± σ, TUR= 4: 3.40% 0 6% L = ±σ, TUR= : 3.60% 0. 0.0 0. 0.0 0.00 0 0 40 60 80 00 0.00 0 0 40 60 80 00 6 4.8 Risk Relative to no Bias Condition 5 4 3 4. TUR= 0 0 0 40 60 80 00 TUR= 4 Figure 4: False Accept (Consumer) Risk with UUT Bias for L= ±σ 995 NCSL Workshop & Symposium 0 Session A

0 L= ± 3σ, TUR= 4: 0 L= ± 3σ, TUR= : 0. 0.0 0. 0.0 0.00 0 0 40 60 80 00 0.00 0 0 40 60 80 00 Risk Relative to no Risk Condition 90 80 70 60 50 40 30 0 0 TUR= 0 0 0 40 60 80 00 TUR= 4 Figure 5: False Accept (Consumer) Risk with UUT Bias for L= ±3σ 995 NCSL Workshop & Symposium Session A

L= ±σ, TUR= 4: L= ±σ, TUR= : 0 0. 0.0 0%.30% 0 0. 0.0 6%.50% 0.00 0 0 40 60 80 00 0.00 0 0 40 60 80 00 0 9 TUR= 4 Risk Relative to No Bias Condition 8 7 6 5 4 3.0.6 TUR= 0 0 0 40 60 80 00 Figure 6: False Accept (Consumer) Risk with STD Bias for L= ±σ 995 NCSL Workshop & Symposium Session A

L= ±3σ, TUR= 4: L= ±3σ, TUR= : 0 0 0. 0.0 0. 0.0 0.00 0 0 40 60 80 00 0.00 0 0 40 60 80 00 0 9 Risk Relative to No Bias Condition 8 7 6 5 4 3 0 0 0 40 60 80 00 TUR= 4 TUR= Figure 7: False Accept (Consumer) Risk with STD for L= ±3σ 995 NCSL Workshop & Symposium 3 Session A

L= ±σ, TUR= 4: L= ±σ, TUR= : 00 00 Producer Risk (%) 0.50% 3.80% Producer Risk (%) 0 7% % 0. 0 0 40 60 80 00 0. 0 0 40 60 80 00 3.5 Risk Relative to No Bias Condition.5.5 0.5.7 TUR= 4 0 TUR= 0 0 40 60 80 00 Figure 8: False Reject (Producer) Risk with UUT Bias for L= ±σ 995 NCSL Workshop & Symposium 4 Session A

L= ±3σ, TUR= 4: L= ±3σ, TUR= : 00 00 Producer Risk (%) 0 Producer Risk (%) 0 0. 0 0 40 60 80 00 0. 0 0 40 60 80 00 30 Risk Relative to No Bias Condition 5 0 5 0 5 0 0 0 40 60 80 00 TUR= 4 TUR= Figure 9: False Reject (Producer) Risk with UUT Bias for L= ±3σ 995 NCSL Workshop & Symposium 5 Session A

L= ±σ, TUR= 4: L= ±σ, TUR= : 00 00 Producer Risk (%) 0.5% 3.8% Producer Risk (%) 0 7% 7% 0. 0 0 40 60 80 00 0. 0 0 40 60 80 00 4.5 4 Risk Relative to No Bias Condition 3.5 3.5.5.6.4 0.5 0 0 0 40 60 80 00 TUR= 4 TUR= Figure 0: False Reject (Producer) Risk with STD Bias for L= ±σ 995 NCSL Workshop & Symposium 6 Session A

L= ±3σ, TUR= 4: L= ±3σ, TUR= : 00 00 Producer Risk (%) 0 Producer Risk (%) 0 0. 0 0 40 60 80 00 0. 0 0 40 60 80 00 8 Risk Relative to No Bias Condition 6 4 0 8 6 4 TUR= 4 0 0 0 40 60 80 00 TUR= Figure : False Reject (Producer) Risk with STD Bias for L= ±3σ 995 NCSL Workshop & Symposium 7 Session A