Nonparametric Statistics Notes

Chapter 3: Some Tests Based on the Binomial Distribution
Jesse Crawford
Department of Mathematics
Tarleton State University

(Tarleton State University) Ch 3: Tests Based on the Binomial Dist. 1 / 29

Quantiles

Definition. Let $X$ be a random variable and $0 \le p \le 1$. The number $x_p$ is the quantile of order $p$ of $X$ if
$P(X < x_p) \le p$ and $P(X > x_p) \le 1 - p$.
If more than one number satisfies these conditions, let $x_p$ be the midpoint of the interval of numbers satisfying them. $x_p$ is also called the $(100p)$th percentile.

Notation. The $p$th quantile of the $N(0, 1)$ distribution is $z_p$, so
$P(Z < z_p) = p$ and $P(Z > z_p) = 1 - p$.
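As a quick numerical check of the notation above, the following sketch looks up $z_p$ for the standard normal distribution with Python's standard library. The helper name `z_quantile` and the choice $p = 0.975$ are illustrative assumptions, not part of the slides.

```python
from statistics import NormalDist

def z_quantile(p: float) -> float:
    """Return z_p, the quantile of order p of N(0, 1), for 0 < p < 1."""
    return NormalDist(mu=0.0, sigma=1.0).inv_cdf(p)

p = 0.975                              # illustrative choice of p
z = z_quantile(p)
lower_tail = NormalDist().cdf(z)       # P(Z < z_p), should equal p
upper_tail = 1.0 - lower_tail          # P(Z > z_p), should equal 1 - p
print(z, lower_tail, upper_tail)
```

Because the normal distribution is continuous, exactly one number satisfies the definition, so no midpoint rule is needed here.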

Outline
1. Section 3.1: The Binomial Test and Estimation of $p$
2. Section 3.2: The Quantile Test and Estimation of $x_p$
3. Section 3.4: The Sign Test
4. Section 3.5: Some Variations on the Sign Test

The Binomial Test

Example. A machine manufactures parts.
- $p$ = probability that a part is defective.
- Assume parts are statistically independent.
- Take a sample of $n = 10$ parts.
- The sample contains 4 defective parts.

Testing problem: $H_0: p \le 0.05$ vs. $H_1: p > 0.05$.

Items to determine:
- Test statistic $T$
- Null distribution of $T$
- Decision rule/critical region
- p-value
- Power
- Confidence intervals

The Binomial Test

Data and Assumptions
- $n$ statistically independent trials.
- Each trial results in class 1 or class 2.
- $p = P(\text{class 1})$ for a single trial.
- $O_1$ = number of observations in class 1.

Hypothesis Tests
- Two-tailed: $H_0: p = p^*$ vs. $H_1: p \ne p^*$
- Lower-tailed: $H_0: p \ge p^*$ vs. $H_1: p < p^*$
- Upper-tailed: $H_0: p \le p^*$ vs. $H_1: p > p^*$

Test Statistic and Null Distribution
- Test statistic: $T = O_1$
- Null distribution: $T \sim \text{binomial}(n, p^*)$

Upper-Tailed Binomial Test

$H_0: p \le p^*$ vs. $H_1: p > p^*$
Test statistic: $T = O_1$
Null distribution: $T \sim \text{binomial}(n, p^*)$
Decision rule: choose $t$ such that $P(T \le t \mid p = p^*) \approx 1 - \alpha$ (use Table A3 or a normal approximation), and reject $H_0$ if $T > t$.
Critical region: $[T > t]$

Null distribution: binomial($n = 10$, $p = 0.05$)
Significance level $= P(T > 2 \mid p = 0.05) = 1 - 0.9885 = 0.0115$
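The critical value and significance level for the machine-parts example can be reproduced without Table A3. This is a minimal sketch under two assumptions that the slides do not state: a nominal level of $\alpha = 0.05$, and the conservative rule "smallest $t$ with $P(T \le t \mid p = p^*) \ge 1 - \alpha$". The function names are hypothetical.

```python
from math import comb

def binom_cdf(t: int, n: int, p: float) -> float:
    """P(T <= t) for T ~ binomial(n, p); returns 0.0 when t < 0."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(0, t + 1))

def upper_critical_value(n: int, p_star: float, alpha: float) -> int:
    """Smallest t with P(T <= t | p = p*) >= 1 - alpha (assumed rule)."""
    for t in range(n + 1):
        if binom_cdf(t, n, p_star) >= 1 - alpha:
            return t
    return n  # fallback for floating-point round-off

n, p_star = 10, 0.05
alpha = 0.05                                   # assumed nominal level
t = upper_critical_value(n, p_star, alpha)
significance_level = 1 - binom_cdf(t, n, p_star)   # P(T > t | p = p*)
print(t, round(significance_level, 4))
```

Under these assumptions the sketch selects $t = 2$, and the attained significance level agrees with the 0.0115 shown on the slide. Because $T$ is discrete, the attained level is usually below the nominal $\alpha$.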

Upper-Tailed Binomial Test: p-value

Given $T = t_{\text{obs}}$, the p-value is $P(T \ge t_{\text{obs}} \mid p = p^*)$.

Null distribution: binomial($n = 10$, $p = 0.05$)
p-value $= P(T \ge 4 \mid p = 0.05) = 1 - 0.9990 = 0.0010$
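The p-value for the observed count of 4 defective parts can be checked by summing the upper tail of the null distribution directly. A small sketch, with `binom_pmf` and `upper_tail_p_value` as hypothetical helper names:

```python
from math import comb

def binom_pmf(k: int, n: int, p: float) -> float:
    """P(T = k) for T ~ binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

def upper_tail_p_value(t_obs: int, n: int, p_star: float) -> float:
    """P(T >= t_obs | p = p*), the upper-tailed binomial p-value."""
    return sum(binom_pmf(k, n, p_star) for k in range(t_obs, n + 1))

p_value = upper_tail_p_value(t_obs=4, n=10, p_star=0.05)
print(round(p_value, 4))
```

Summing the tail directly avoids the small cancellation error of computing $1 - P(T \le 3)$, although both routes agree to four decimal places here.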

Upper-Tailed Binomial Test: power

For any value of $p$, the power is $P(T > t \mid p)$.

Distribution under the alternative: binomial($n = 10$, $p = 0.3$)
Power $= P(T > 2 \mid p = 0.3) = 1 - 0.3828 = 0.6172$

Distribution under the alternative: binomial($n = 10$, $p = 0.95$)
Power $= P(T > 2 \mid p = 0.95) = 1 - 0.0000 = 1.0000$
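The two power values above come from evaluating the same rejection probability $P(T > 2 \mid p)$ at different values of $p$. The sketch below treats power as a function of $p$; the function name `power_upper_tailed` is an illustrative assumption.

```python
from math import comb

def binom_cdf(t: int, n: int, p: float) -> float:
    """P(T <= t) for T ~ binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(0, t + 1))

def power_upper_tailed(p: float, n: int = 10, t: int = 2) -> float:
    """P(T > t | p): probability that the upper-tailed test rejects H0."""
    return 1 - binom_cdf(t, n, p)

for p in (0.05, 0.3, 0.95):
    print(p, round(power_upper_tailed(p), 4))
```

Evaluated at the boundary value $p = 0.05$, the same function returns the significance level of the test, which is why the power curve starts at 0.0115 and rises toward 1 as $p$ increases.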

Example. Normally, at least 50% of men undergoing a prostate cancer operation experience a certain side effect. A new method for performing the operation is tried on a sample of 19 men, 3 of whom experienced the side effect. Is there statistically significant evidence that the new method has a lower chance of producing the side effect?

Lower-Tailed Binomial Test

$H_0: p \ge p^*$ vs. $H_1: p < p^*$
Test statistic: $T = O_1$
Null distribution: $T \sim \text{binomial}(n, p^*)$
Decision rule: choose $t$ such that $P(T \le t \mid p = p^*) \approx \alpha$ (use Table A3 or a normal approximation), and reject $H_0$ if $T \le t$.
Critical region: $[T \le t]$
Given $T = t_{\text{obs}}$, the p-value is $P(T \le t_{\text{obs}} \mid p = p^*)$.
For any value of $p$, the power is $P(T \le t \mid p)$.
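Applied to the side-effect example ($n = 19$, $p^* = 0.5$, 3 observed side effects), the lower-tailed test can be sketched as follows. The level $\alpha = 0.05$ and the conservative rule "largest $t$ with $P(T \le t \mid p = p^*) \le \alpha$" are assumptions made for illustration; the slides do not fix either.

```python
from math import comb

def binom_cdf(t: int, n: int, p: float) -> float:
    """P(T <= t) for T ~ binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(0, t + 1))

def lower_critical_value(n: int, p_star: float, alpha: float) -> int:
    """Largest t with P(T <= t | p = p*) <= alpha; -1 means never reject."""
    t = -1
    for k in range(n + 1):
        if binom_cdf(k, n, p_star) <= alpha:
            t = k
        else:
            break
    return t

n, p_star, t_obs = 19, 0.5, 3
alpha = 0.05                                  # assumed nominal level
t = lower_critical_value(n, p_star, alpha)
p_value = binom_cdf(t_obs, n, p_star)         # P(T <= t_obs | p = p*)
reject_h0 = t_obs <= t
print(t, round(p_value, 4), reject_h0)
```

With these inputs the observed count falls inside the critical region, so the sketch rejects $H_0$ in favor of a lower side-effect rate.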

Normal Approximation for Binomial Quantiles

Suppose $X \sim \text{binomial}(n, p)$. If $np \ge 5$ and $n(1 - p) \ge 5$, then the $q$th quantile of $X$ is approximately
$x_q \approx np + z_q \sqrt{np(1 - p)}$.

Normal Approximation for p-values

$P(T \le t_{\text{obs}} \mid p = p^*) \approx P\left(Z \le \frac{t_{\text{obs}} - np^* + 0.5}{\sqrt{np^*(1 - p^*)}}\right)$

$P(T \ge t_{\text{obs}} \mid p = p^*) \approx P\left(Z \ge \frac{t_{\text{obs}} - np^* - 0.5}{\sqrt{np^*(1 - p^*)}}\right)$
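To see how close the continuity-corrected approximation is, the sketch below compares it with the exact binomial tail for $n = 19$, $p^* = 0.5$, $t_{\text{obs}} = 3$ (the side-effect example, where $np^* = 9.5 \ge 5$). All function names are hypothetical helpers.

```python
from math import comb, sqrt
from statistics import NormalDist

Z = NormalDist()  # standard normal N(0, 1)

def exact_lower_p(t_obs: int, n: int, p_star: float) -> float:
    """Exact P(T <= t_obs | p = p*)."""
    return sum(comb(n, k) * p_star**k * (1 - p_star)**(n - k)
               for k in range(0, t_obs + 1))

def approx_lower_p(t_obs: int, n: int, p_star: float) -> float:
    """Normal approximation to P(T <= t_obs | p = p*), with +0.5 correction."""
    z = (t_obs - n * p_star + 0.5) / sqrt(n * p_star * (1 - p_star))
    return Z.cdf(z)

def approx_upper_p(t_obs: int, n: int, p_star: float) -> float:
    """Normal approximation to P(T >= t_obs | p = p*), with -0.5 correction."""
    z = (t_obs - n * p_star - 0.5) / sqrt(n * p_star * (1 - p_star))
    return 1 - Z.cdf(z)

def approx_quantile(q: float, n: int, p: float) -> float:
    """x_q ~ n p + z_q sqrt(n p (1 - p))."""
    return n * p + Z.inv_cdf(q) * sqrt(n * p * (1 - p))

exact = exact_lower_p(3, 19, 0.5)
approx = approx_lower_p(3, 19, 0.5)
median_approx = approx_quantile(0.5, 19, 0.5)
print(exact, approx, median_approx)
```

The approximation is weakest far out in the tails, so when $n$ is small enough for the exact sum to be computed, the exact value is the safer choice.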

Two-Tailed Binomial Test

$H_0: p = p^*$ vs. $H_1: p \ne p^*$
Test statistic: $T = O_1$
Null distribution: $T \sim \text{binomial}(n, p^*)$
Decision rule: choose $t_1$ and $t_2$ such that
$P(T \le t_1 \mid p = p^*) \approx \alpha/2$ and $P(T \le t_2 \mid p = p^*) \approx 1 - \alpha/2$
(use Table A3 or a normal approximation), and reject $H_0$ if $T \le t_1$ or $T > t_2$.
Critical region: $[T \le t_1 \text{ or } T > t_2]$
The p-value is $2 \min[P(T \le t_{\text{obs}} \mid p^*),\ P(T \ge t_{\text{obs}} \mid p^*)]$.
For any value of $p$, the power is $P(T \le t_1 \text{ or } T > t_2 \mid p)$.

Binomial Distribution: Confidence Interval for $p$

Suppose $Y \sim \text{binomial}(n, p)$.

If $n \le 30$ and the confidence level is 0.9, 0.95, or 0.99, the exact Clopper-Pearson confidence interval is given in Table A4.

If $np \ge 5$ and $n(1 - p) \ge 5$, we can use the normal approximation
$\frac{Y}{n} \pm z_{1 - \alpha/2} \sqrt{\frac{Y(n - Y)}{n^3}}$.
Note that this is the same as
$\hat{p} \pm z_{1 - \alpha/2} \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}$,
where $\hat{p} = Y/n$.
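The equivalence of the two forms of the normal-approximation interval is easy to confirm numerically. In the sketch below, the data ($Y = 40$ successes in $n = 100$ trials) are made up purely for illustration, and both function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def ci_counts_form(y: int, n: int, conf_level: float = 0.95):
    """Y/n +/- z_{1-alpha/2} * sqrt(Y (n - Y) / n^3)."""
    z = NormalDist().inv_cdf(1 - (1 - conf_level) / 2)
    half_width = z * sqrt(y * (n - y) / n**3)
    return y / n - half_width, y / n + half_width

def ci_phat_form(y: int, n: int, conf_level: float = 0.95):
    """p_hat +/- z_{1-alpha/2} * sqrt(p_hat (1 - p_hat) / n), p_hat = Y/n."""
    z = NormalDist().inv_cdf(1 - (1 - conf_level) / 2)
    p_hat = y / n
    half_width = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width

y, n = 40, 100                      # hypothetical data, not from the slides
lower_a, upper_a = ci_counts_form(y, n)
lower_b, upper_b = ci_phat_form(y, n)
print(lower_a, upper_a)
print(lower_b, upper_b)
```

The two forms agree because $\hat{p}(1 - \hat{p})/n = (Y/n)((n - Y)/n)/n = Y(n - Y)/n^3$.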

Section 3.2: The Quantile Test and Estimation of $x_p$

The Quantile Test

Example. Random sample of standardized test scores:
189 233 195 160 212 176 231 185 199 213 202 193 174 166 248

Test whether the 75th percentile of the scores in the population is equal to 193:
$H_0: x_{0.75} = 193$ vs. $H_1: x_{0.75} \ne 193$.

Two-Tailed Quantile Test

Assumption: $X_1, \ldots, X_n$ is a random sample (the observations are IID) from a distribution whose measurement scale is at least ordinal.

$H_0: x_{p^*} = x^*$ vs. $H_1: x_{p^*} \ne x^*$
Test statistics:
$T_1$ = number of $X_i$'s less than or equal to $x^*$
$T_2$ = number of $X_i$'s less than $x^*$
Null distribution for both $T_1$ and $T_2$: binomial$(n, p^*)$
Decision rule: let $Y$ represent a binomial$(n, p^*)$ random variable. Choose $t_1$ and $t_2$ such that
$P(Y \le t_1) \approx \alpha/2$ and $P(Y \le t_2) \approx 1 - \alpha/2$
(use Table A3 or a normal approximation), and reject $H_0$ if $T_1 \le t_1$ or $T_2 > t_2$.
Critical region: $[T_1 \le t_1 \text{ or } T_2 > t_2]$

Example (continued). Random sample of standardized test scores:
189 233 195 160 212 176 231 185 199 213 202 193 174 166 248

$H_0: x_{0.75} = 193$ vs. $H_1: x_{0.75} \ne 193$
$T_1$ = number of $X_i$'s less than or equal to $x^*$
$T_2$ = number of $X_i$'s less than $x^*$
Choose $t_1$ and $t_2$ such that $P(Y \le t_1) \approx \alpha/2$ and $P(Y \le t_2) \approx 1 - \alpha/2$.
Reject $H_0$ if $T_1 \le t_1$ or $T_2 > t_2$.
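The steps of this example can be sketched in code. Two choices below are assumptions that the slide does not state: the level $\alpha = 0.05$, and the conservative selection rules for $t_1$ (largest $t$ with $P(Y \le t) \le \alpha/2$) and $t_2$ (smallest $t$ with $P(Y \le t) \ge 1 - \alpha/2$).

```python
from math import comb

scores = [189, 233, 195, 160, 212, 176, 231, 185, 199, 213,
          202, 193, 174, 166, 248]
x_star, p_star = 193, 0.75
alpha = 0.05                       # assumed level; not given on the slide

n = len(scores)
t1_stat = sum(1 for x in scores if x <= x_star)   # T1: scores <= x*
t2_stat = sum(1 for x in scores if x < x_star)    # T2: scores <  x*

def binom_cdf(t: int, n: int, p: float) -> float:
    """P(Y <= t) for Y ~ binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(0, t + 1))

# Assumed rule: largest t with P(Y <= t) <= alpha/2 (-1 if there is none).
t1 = max((t for t in range(n + 1)
          if binom_cdf(t, n, p_star) <= alpha / 2), default=-1)
# Assumed rule: smallest t with P(Y <= t) >= 1 - alpha/2.
t2 = min((t for t in range(n + 1)
          if binom_cdf(t, n, p_star) >= 1 - alpha / 2), default=n)

reject_h0 = t1_stat <= t1 or t2_stat > t2
print(t1_stat, t2_stat, t1, t2, reject_h0)
```

Using two statistics matters only when some observation equals $x^*$, as the score 193 does here; otherwise $T_1 = T_2$.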

Lower-Tailed Quantile Test

$H_0: x_{p^*} \le x^*$ vs. $H_1: x_{p^*} > x^*$
Test statistic: $T_1$ = number of $X_i$'s less than or equal to $x^*$
Null distribution: $T_1 \sim \text{binomial}(n, p^*)$
Decision rule: let $Y$ represent a binomial$(n, p^*)$ random variable. Choose $t_1$ such that $P(Y \le t_1) \approx \alpha$ (use Table A3 or a normal approximation), and reject $H_0$ if $T_1 \le t_1$.
Critical region: $[T_1 \le t_1]$

Upper-Tailed Quantile Test

$H_0: x_{p^*} \ge x^*$ vs. $H_1: x_{p^*} < x^*$
Test statistic: $T_2$ = number of $X_i$'s less than $x^*$
Null distribution: $T_2 \sim \text{binomial}(n, p^*)$
Decision rule: let $Y$ represent a binomial$(n, p^*)$ random variable. Choose $t_2$ such that $P(Y \le t_2) \approx 1 - \alpha$ (use Table A3 or a normal approximation), and reject $H_0$ if $T_2 > t_2$.
Critical region: $[T_2 > t_2]$

Section 3.4: The Sign Test

The Sign Test

$H_0: P(+) = P(-)$ vs. $H_1: P(+) \ne P(-)$

Example. 100 people tested two products.
- 15 people preferred product A to product B.
- 4 people preferred product B to product A.
- 81 people had no preference.

Summary: 15 +'s, 4 −'s, 81 ties

Two-Tailed Sign Test

$H_0: P(+) = P(-)$ vs. $H_1: P(+) \ne P(-)$
$n$ = [number of +'s] + [number of −'s]
Test statistic: $T$ = [number of +'s]
Null distribution: $T \sim \text{binomial}(n, \tfrac{1}{2})$
Decision rule: let $Y$ represent a binomial$(n, \tfrac{1}{2})$ random variable. Choose $t_1$ and $t_2$ such that
$P(Y \le t_1) \approx \alpha/2$ and $P(Y \le t_2) \approx 1 - \alpha/2$
(use Table A3 or a normal approximation), and reject $H_0$ if $T \le t_1$ or $T > t_2$.
Critical region: $[T \le t_1 \text{ or } T > t_2]$
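For the product-preference example (15 +'s, 4 −'s, ties discarded), the sign test reduces to a two-tailed binomial test with $p^* = 1/2$. The sketch below computes its p-value as $2 \min[P(T \le t_{\text{obs}}),\ P(T \ge t_{\text{obs}})]$. Capping the result at 1 is an added safeguard assumed here, and the function name is hypothetical.

```python
from math import comb

def binom_cdf(t: int, n: int, p: float) -> float:
    """P(T <= t) for T ~ binomial(n, p); returns 0.0 when t < 0."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(0, t + 1))

def sign_test_p_value(n_plus: int, n_minus: int) -> float:
    """Two-tailed sign test p-value; ties must already be excluded."""
    n = n_plus + n_minus
    t_obs = n_plus
    lower = binom_cdf(t_obs, n, 0.5)            # P(T <= t_obs)
    upper = 1 - binom_cdf(t_obs - 1, n, 0.5)    # P(T >= t_obs)
    return min(1.0, 2 * min(lower, upper))      # cap at 1 (assumed safeguard)

p_value = sign_test_p_value(n_plus=15, n_minus=4)
print(round(p_value, 4))
```

The 81 ties carry no information about the direction of preference, so only the $n = 19$ untied responses enter the calculation.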

Section 3.5: Some Variations on the Sign Test

The McNemar Test for Significance of Changes

Example: Presidential debate
Summary of voter intentions
Test whether a statistically significant difference in voter intentions exists before and after the debate.

Cox-Stuart Test for Trend

Example. Precipitation readings for 19 years:
45.2 45.8 41.7 36.2 45.3 52.2 35.3 57.1 35.3 57.1 41.0 33.7 45.7 37.9 41.7 36.0 49.8 36.2 39.9

Test whether a trend in this data exists.
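The slide gives only the data, so the following is a sketch under stated assumptions rather than the lecture's own worked solution. It assumes the usual Cox-Stuart pairing: each observation $X_i$ is paired with $X_{i+c}$, where $c = (n+1)/2$ for odd $n$ (the middle observation is dropped) and $c = n/2$ for even $n$. A + is recorded when the later value is larger, ties are discarded, and the two-tailed sign test is then applied. The readings are used exactly as transcribed above.

```python
from math import comb

precip = [45.2, 45.8, 41.7, 36.2, 45.3, 52.2, 35.3, 57.1, 35.3, 57.1,
          41.0, 33.7, 45.7, 37.9, 41.7, 36.0, 49.8, 36.2, 39.9]

n_obs = len(precip)
# Assumed pairing offset: (n + 1) / 2 for odd n, n / 2 for even n.
c = (n_obs + 1) // 2 if n_obs % 2 else n_obs // 2
pairs = [(precip[i], precip[i + c]) for i in range(n_obs - c)]

n_plus = sum(1 for earlier, later in pairs if later > earlier)
n_minus = sum(1 for earlier, later in pairs if later < earlier)
n = n_plus + n_minus               # tied pairs are discarded

def binom_cdf(t: int, n: int, p: float) -> float:
    """P(T <= t) for T ~ binomial(n, p); returns 0.0 when t < 0."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(0, t + 1))

lower = binom_cdf(n_plus, n, 0.5)              # P(T <= t_obs)
upper = 1 - binom_cdf(n_plus - 1, n, 0.5)      # P(T >= t_obs)
p_value = min(1.0, 2 * min(lower, upper))      # two-tailed, capped at 1
print(n_plus, n_minus, p_value)
```

With nearly equal numbers of +'s and −'s among the pairs, this sketch finds no evidence of a trend in the transcribed readings.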