ASTM D02, Dec./ Orlando Statistics Seminar

Similar documents
Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

4.2 Probability Distributions

Chapter 8 Estimation

R & R Study. Chapter 254. Introduction. Data Structure

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Chapter 8 Statistical Intervals for a Single Sample

Some Characteristics of Data

MATH 10 INTRODUCTORY STATISTICS

Conover Test of Variances (Simulation)

Chapter ! Bell Shaped

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

1. Variability in estimates and CLT

A point estimate is a single value (statistic) used to estimate a population value (parameter).

The Central Limit Theorem (Solutions) COR1-GB.1305 Statistics and Data Analysis

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Stat 139 Homework 2 Solutions, Fall 2016

Value (x) probability Example A-2: Construct a histogram for population Ψ.

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

SLIDES. BY. John Loucks. St. Edward s University

Math 140 Introductory Statistics

Statistics for Managers Using Microsoft Excel 7 th Edition

Basic Procedure for Histograms

Section The Sampling Distribution of a Sample Mean

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics

Statistics for Business and Economics

Practical Hedging: From Theory to Practice. OSU Financial Mathematics Seminar May 5, 2008

Discrete Random Variables

Discrete Random Variables

Experimental Design and Statistics - AGA47A

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Numerical Descriptive Measures. Measures of Center: Mean and Median

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Lecture 1: Review and Exploratory Data Analysis (EDA)

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

σ e, which will be large when prediction errors are Linear regression model

Chapter 7 Study Guide: The Central Limit Theorem

Business Statistics 41000: Probability 3

Chapter 4 Variability

Statistics & Statistical Tests: Assumptions & Conclusions

Statistics 13 Elementary Statistics

Sampling and sampling distribution

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Sampling Distributions and the Central Limit Theorem

Confidence Intervals. σ unknown, small samples The t-statistic /22

Statistics Class 15 3/21/2012

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1

MAS187/AEF258. University of Newcastle upon Tyne

Two-Sample T-Test for Superiority by a Margin

Chapter 4 Probability Distributions

Data Analysis. BCF106 Fundamentals of Cost Analysis

Assessing the reliability of regression-based estimates of risk

Midterm Exam III Review

Two-Sample T-Test for Non-Inferiority

Lecture 39 Section 11.5

We will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed.

FEEG6017 lecture: The normal distribution, estimation, confidence intervals. Markus Brede,

Data Analysis and Statistical Methods Statistics 651

Lecture 10 - Confidence Intervals for Sample Means

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

Every data set has an average and a standard deviation, given by the following formulas,

Regression and Simulation

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Normal Model (Part 1)

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

STAT 157 HW1 Solutions

Chapter 6 Part 3 October 21, Bootstrapping

1 Inferential Statistic

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

Tests for Two Means in a Cluster-Randomized Design

STA Module 3B Discrete Random Variables

Statistics for Managers Using Microsoft Excel/SPSS Chapter 6 The Normal Distribution And Other Continuous Distributions

Institute for the Advancement of University Learning & Department of Statistics

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Economics 345 Applied Econometrics

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

The Assumption(s) of Normality

Overview. Definitions. Definitions. Graphs. Chapter 5 Probability Distributions. probability distributions

Statistics and Probability

Chapter 7. Inferences about Population Variances

Chapter 5: Summarizing Data: Measures of Variation

Normality & confidence intervals. UNT Geog 3190, Wolverton

Random Variables and Probability Functions

Chapter Four: Introduction To Inference 1/50

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Lecture 2. Probability Distributions Theophanis Tsandilas

Chapter 16. Random Variables. Copyright 2010 Pearson Education, Inc.

Chapter 4: Estimation

Review of key points about estimators

Data Analysis and Statistical Methods Statistics 651

Statistics and Measurement. (Section 2.2 of Vardeman and Jobe plus some)

Transcription:

ASTM D02, Dec./ Orlando Statistics Seminar Presented by: Alex Lau, Chairman, D02.94 This brief seminar will provide : a very simple explanation on the use of statistics to estimate population parameters specific focus on the sample standard deviation statistic (s), where "degrees of freedom" will be explained why the minimum degree of freedom (df) of 30 is specified for r and R statistics in ASTM D6300 ASTM D02/Dec_2016_Orlando/A. Lau 2

Within the context of this seminar and ASTM Standard Test Method it is the universe of results obtained for a specific material using the same STM from an infinite number of labs Sample X lab A same method, similar apparatus, 1 result per lab lab ASTM D02/Dec_2016_Orlando/A. Lau 3 Two that we care about the most are: μ, σ μ mean (of the population), aka Consensus value, which we use as ARV σ standard deviation (of the population), which we use to derive Reproducibility Spread of single results Histogram of results from the hypothesized universe of results (in Statistics jargon we call this the target population of interest) ASTM D02/Dec_2016_Orlando/A. Lau 4

We would collect each and every result from the population, then, we can calculate the exact values for μ, and σ but, we don t (have infinite resources.. ) ASTM D02/Dec_2016_Orlando/A. Lau 5 We can take a random sample of adequate size from the target population, do some math, and come up with statistics to estimate the desired population parameters: sample average is an estimator of μ sample standard deviation (s) is an estimator of σ ASTM D02/Dec_2016_Orlando/A. Lau 6

How to use statistics to estimate parameter values target population random sampling of size n do the math = = ASTM D02/Dec_2016_Orlando/A. Lau 7

because only a limited amount of data is used, these estimates have variability themselves due to random sampling which means when you do it again, you will most likely get a numerically different answer this is known as variability of the sample statistics furthermore, the variability is a function of the sample size ASTM D02/Dec_2016_Orlando/A. Lau 9 s The most common question posed to statisticians: What is the minimum no. of data points required to estimate sigma ( )? ASTM D02/Dec_2016_Orlando/A. Lau 10

Re-phrasing that question, how many shots would you like to see before you would be comfortable to stand here? ASTM D02/Dec_2016_Orlando/A. Lau 11 To visualize the variability of s, let s do this: 1. repeatedly take samples of various sizes (n) from the following large (10,000) dataset which came from a Normal process with a true std dev of 1 2. calculate the sample std dev (s) for each sample 3. plot the numerical values of each s for each sample size σ=1 ASTM D02/Dec_2016_Orlando/A. Lau 12

variability of sample std dev statistic (s) versus sample size n 2 n= 6 n= 10 2 1.5 1.5 1 1 0.5 0.5 0 0 50 100 150 200 250 0 0 50 100 150 200 250 2 n = 20 obs 2 n = 50 obs 1.5 1.5 1 1 0.5 0.5 0 0 0 50 100 150 200 250 0 50 100 150 200 250 obs obs ASTM D02/Dec_2016_Orlando/A. Lau 13 Numerical variation of the std dev statistic (s), calculated from repeated sampling, is related to the sample size n: the larger the n, the less the variation Therefore, std dev estimate that is calculated from a small dataset is highly variable from dataset to dataset! Hence, decisions using a std dev estimate based on small dataset is highly unreliable. ASTM D02/Dec_2016_Orlando/A. Lau 14

this is a metric that can be viewed as the quality or reliability associated with the sample standard deviation statistic (s) the reliability or quality is gauged by the margin of error of the statistic in estimating the desired parameter (σ) mathematically, df = n - 1 ASTM D02/Dec_2016_Orlando/A. Lau 15 multiplier of sample std dev (s) to construct an interval estimate that will contain the true standard deviation with 95% confidence vs. df of (s) multiplier of s 2.60 2.50 2.40 2.30 2.20 2.10 2.00 1.90 1.80 1.70 1.60 1.50 1.40 1.30 1.20 1.10 1.00 0.90 0.80 0.70 0.60 0.50 0.40 0.30 0.20 0.10 0.00 0 20 40 60 80 100 120 df (degree of freedom)=n-1 lower limit multiplier upper limit multiplier ASTM D02/Dec_2016_Orlando/A. Lau 16

sample standard deviation statistics obtained from repeated sampling of the same target population can be combined via a process called pooling to improve the reliability of the estimate pooling is just a fancy term for weighted average fundamental assumption the target population parameter remains constant over multiple samplings ASTM D02/Dec_2016_Orlando/A. Lau 17 this is just a one-time snapshot sampling of the target population to arrive at estimates for r and R these estimates have the margin of errors as presented in the previous slides ASTM D02/Dec_2016_Orlando/A. Lau 18

ASTM D02/Dec_2016_Orlando/A. Lau 19 30 df is the de facto accepted point of diminishing return the nominal upper margin of error is no more than 35% multiplier of sample std dev (s) to construct an interval estimate that will contain the true standard deviation with 95% confidence vs. df of (s) multiplier of s 2.60 2.50 2.40 2.30 2.20 2.10 2.00 1.90 1.80 1.70 1.60 1.50 1.40 1.30 1.20 1.10 1.00 0.90 0.80 0.70 0.60 0.50 0.40 0.30 0.20 0.10 0.00 0 20 40 60 80 100 120 df (degree of freedom)=n-1 lower limit multiplier upper limit multiplier ASTM D02/Dec_2016_Orlando/A. Lau 20

for r, it s no. of samples for R, it depends ASTM D02/Dec_2016_Orlando/A. Lau 21 the ILS design and analysis technique in D6300 enables break out of the between-lab bias component if between-lab bias is dominant, there will be a significant loss of df for R in the limiting case, the df for R will approach (no. of labs 1) ASTM D02/Dec_2016_Orlando/A. Lau 22

do not be a minimalist and roll the dice strive for at least 10 labs (16 is my preference) if you can t find enough willing participants, it speaks to the real need (or, lack of) for the STM ASTM D02/Dec_2016_Orlando/A. Lau 23 ASTM D02/Dec_2016_Orlando/A. Lau 24

contribution of between-lab bias towards R can be roughly judged by the R/r ratio ASTM D02/Dec_2016_Orlando/A. Lau 25 Visualizing R/r ratio low R/r Lab A high R/r Lab B Lab C Lab B Lab C Lab D Lab E Lab D Lab E Lab A ARV ARV ASTM D02/Dec_2016_Orlando/A. Lau 26

It means: the degree of agreement in the execution of a Standard Test Method (STM) within a lab is significantly better than the degree of agreement in the execution of the test method between laboratories But, what does it really mean? ASTM D02/Dec_2016_Orlando/A. Lau 27 it means the method developers did not do a very good job in the S of TM, where S stands for Standard in other words, the test method needs more between-lab standardization to bring the between-lab agreement closer ASTM D02/Dec_2016_Orlando/A. Lau 28

any the precision quality of any Standard Test Methods can be judged by the following 2 KPI s used together (i.e.: not individually) : 1. signal-to-noise ratio (S/N) for r 2. R/r ratio ASTM D02/Dec_2016_Orlando/A. Lau 29 this metric judges the ability of the test method to be repeated in a single lab relative to the level (signal) of measurand: it is the simple ratio of : r min. acceptable S/N is 3.6 ASTM D02/Dec_2016_Orlando/A. Lau 30

Because when you do the math, this is the LOQ ASTM D02/Dec_2016_Orlando/A. Lau 31 this metric judges the adequacy of between-lab standardization, or, agreement between labs the ideal case for R/r is 1: which means the dominant component of disagreement (withinlab or between-lab) is similar in magnitude what this really means is the method is so wellstandardized that the dominant contributor towards R and r are within-lab noise ASTM D02/Dec_2016_Orlando/A. Lau 32

reality: R/r ratio is rarely 1 strive for 2 or less; raise your eyebrow if it s 3.5-5; frown if it s > 5 back to the drawing board if it s >10 ASTM D02/Dec_2016_Orlando/A. Lau 33 Qualification protocol an effective approach to improve between-lab agreement Some examples of Qualification protocol: TSF in octane test methods reference oil in D2887 reference standards in D5191 ASTM D02/Dec_2016_Orlando/A. Lau 34

Have I gotten your attention? ASTM D02/Dec_2016_Orlando/A. Lau 35 Insanity is: keep doing the same thing and expect a different outcome Only you, the paying customer, can drive change So, if you want a better test method, speak up, and, pitch in if you want a helping hand, look in your wrist actively participate in method development task groups actively support ILS s by participation ASTM D02/Dec_2016_Orlando/A. Lau 36

(CS94 is doing our part) ASTM D02/Dec_2016_Orlando/A. Lau 37