Chapter 7.2: Large-Sample Confidence Intervals for a Population Mean and Proportion. Instructor: Elvan Ceyhan

Similar documents
Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Chapter 7. Sampling Distributions and the Central Limit Theorem

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

Chapter 8 Statistical Intervals for a Single Sample

Confidence Intervals Introduction

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

STAT Chapter 7: Confidence Intervals

Statistical Intervals (One sample) (Chs )

8.1 Estimation of the Mean and Proportion

Statistics Class 15 3/21/2012

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance

1 Inferential Statistic

Chapter Seven: Confidence Intervals and Sample Size

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

Statistics 6 th Edition

χ 2 distributions and confidence intervals for population variance

Confidence Intervals. σ unknown, small samples The t-statistic /22

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

Chapter 7. Sampling Distributions and the Central Limit Theorem

Review of the Topics for Midterm I

Statistical Tables Compiled by Alan J. Terry

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

5.3 Statistics and Their Distributions

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

4.3 Normal distribution

Estimation Y 3. Confidence intervals I, Feb 11,

Two Populations Hypothesis Testing

1 Introduction 1. 3 Confidence interval for proportion p 6

Elementary Statistics Lecture 5

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

MA131 Lecture 9.1. = µ = 25 and σ X P ( 90 < X < 100 ) = = /// σ X

Statistics 13 Elementary Statistics

Confidence Intervals and Sample Size

Chapter 5. Sampling Distributions

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

Chapter 9: Sampling Distributions

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Sampling Distribution

Lecture 9 - Sampling Distributions and the CLT

Learning Objectives for Ch. 7

. (i) What is the probability that X is at most 8.75? =.875

SAMPLING DISTRIBUTIONS. Chapter 7

Contents. 1 Introduction. Math 321 Chapter 5 Confidence Intervals. 1 Introduction 1

Chapter 7 Sampling Distributions and Point Estimation of Parameters

MATH 3200 Exam 3 Dr. Syring

Statistics for Business and Economics

Measures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

CHAPTER 5 SAMPLING DISTRIBUTIONS

Chapter 8. Introduction to Statistical Inference

Statistics, Their Distributions, and the Central Limit Theorem

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics

15.063: Communicating with Data Summer Recitation 4 Probability III

What was in the last lecture?

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

GETTING STARTED. To OPEN MINITAB: Click Start>Programs>Minitab14>Minitab14 or Click Minitab 14 on your Desktop

Lecture 6: Chapter 6

Simulation Lecture Notes and the Gentle Lentil Case

7 THE CENTRAL LIMIT THEOREM

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

MidTerm 1) Find the following (round off to one decimal place):

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Statistics for Managers Using Microsoft Excel 7 th Edition

Section 7-2 Estimating a Population Proportion

CHAPTER 5 Sampling Distributions

Chapter 7: Point Estimation and Sampling Distributions

WebAssign Math 3680 Homework 5 Devore Fall 2013 (Homework)

6 Central Limit Theorem. (Chs 6.4, 6.5)

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Tutorial 11: Limit Theorems. Baoxiang Wang & Yihan Zhang bxwang, April 10, 2017

Uniform Probability Distribution. Continuous Random Variables &

BIO5312 Biostatistics Lecture 5: Estimations

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

Chapter 5 Discrete Probability Distributions. Random Variables Discrete Probability Distributions Expected Value and Variance

Point Estimation. Principle of Unbiased Estimation. When choosing among several different estimators of θ, select one that is unbiased.

The binomial distribution p314

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

CSC Advanced Scientific Programming, Spring Descriptive Statistics

Central Limit Theorem (cont d) 7/28/2006

The normal distribution is a theoretical model derived mathematically and not empirically.

Statistics for Business and Economics

Homework: (Due Wed) Chapter 10: #5, 22, 42

Simple Random Sampling. Sampling Distribution

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

STA 320 Fall Thursday, Dec 5. Sampling Distribution. STA Fall

Section 7.2. Estimating a Population Proportion

Transcription:

1 Chapter 7.2: Large-Sample Confidence Intervals for a Population Mean and Proportion Instructor: Elvan Ceyhan Outline of this chapter: Large-Sample Interval for µ Confidence Intervals for Population Proportion

Review of Chapter 7.1 2 1. Definition of a confidence interval 2. Interpretation of a confidence interval 3. A confidence interval for population mean µ when population distribution is normal population standard deviation σ is known 4. Interval width and sample size calculation Typically it is not reasonable to assume that the population standard deviation σ is known. Also, assuming that the population distribution is normal may not be realistic as well. In this chapter, we shall learn a method to construct confidence intervals where these two assumptions are not required. Assumption: Sample size n is large. We need to use the following result: The Central Limit Theorem (CLT): Let X 1, X 2,, X n be a random sample from a distribution with mean µ and variance σ 2. Then if n is sufficiently large, X has approximately a normal distribution with a mean of µ and a variance of σ 2 /n. The parent distribution of X s may not be normal (e.g., could be even discrete) The larger the value of n, the better the approximation. Also, we will use the following property: Result: If n is sufficiently large, then the sample standard deviation S becomes very close to the population standard deviation σ. Formally, we write as n grows to infinity. S p σ

3 Large sample confidence interval for µ Based on the two results (the CLT and the result about S), we can write that, if n is sufficiently large, then we have X µ S/ n approximately has a N(0, 1) distribution. A large sample 100(1 α)% confidence interval for µ can be formed as ( X z α/2 S n, X + z α/2 S n ) Note that we replaced σ by S in the above formula This formula is valid regardless of the shape of the population distribution (for sufficiently large n). Text book suggest n 40 as our rule of thumb. Interval width: 2z α/2 S n 1. Allowable mechanical properties for structural design of metallic aerospace vehicles requires an approved method for statistically analyzing empirical test data. The article Establishing Mechanical Property Allowables for Metals (J. of Testing and Evaluation, 1998: 293-299) used the data provided in Exercises 1.13 on tensile ultimate strength (ksi) as a basis for addressing the difficulties in developing such a method. Use the accompanying descriptive statistics output to calculate a 98% confidence interval for the true average ultimate tensile strength. n = 153, x = 135.39, s = 4.59, Minimum = 122.20, Maximum = 147.70.

2. The alternating current (AC) breakdown voltage (the minimum voltage that makes an insulator react as a conductor) of an insulating liquid indicates its dielectric strength. One random sample of 48 observations on breakdown voltage (kv) gives 4 48 i=1 x i = 2626, 48 i=1 x 2 i = 144, 950. Construct a 95% confidence interval for the mean breakdown voltage.

5 Estimation of population proportion p Often estimation of a population proportion is of interest. Typically, we observe the number of success X among n trials, where the probability of a trial being a success is p. Thus, Parameter: p Point estimator: X/n (sample proportion of success) Point estimate: x/n (x = observed number of success, once a sample is taken) Example: In 48 trials in a laboratory, 16 trials resulted in ignition of a particular type of substrate by a lighted cigarette. Let p be the long-run proportion of all such trials that would result in ignition. Parameter: n = x = Point estimate: Question: How to construct a confidence interval for p? Assumption: The number of success X out of n trials is assumed to have a Binomial distribution B(n, p) Population mean: E(X) = Population standard deviation: sd(x) = Normal approximation to Binomial: If n (number of trials) is sufficiently large, then the sample proportion P = X/n approximate has a normal distribution with mean p and variance p(1 p)/n. In other words, if n is large enough P N ( p, p(1 p) n Mean of the estimated proportion E( P ) = Standard deviation σ P = sd( P ) = ). Estimated standard deviation S p =

6 A large-sample confidence Interval for a Population Proportion p If n is sufficiently large, we can write P p P (1 P ) n N (0, 1). A large sample 100(1 α)% confidence interval for p can be formed as P (1 P P ) P (1 zα/2, P P ) + zα/2 n n What do we mean by large n : our rule of thumb is n P 10 and n(1 P ) 10. Pros and cons of the large sample interval Advantage: Simple to calculate, only need to have P and n Disadvantage: Performs poorly when the true proportion towards the boundary, that is, closer to 0 or 1. Also we need to check n P 10 and n(1 P ) 10. Score interval for a Population Proportion p The 100(1 α)% score interval for p can be formed as P + zα/2 2 /(2n) P (1 P )/n + z 2 α/2 1 + zα/2 2 /n ± z /(4n2 ) α/2 1 + zα/2 2 /n, where the corresponds to the lower confidence limit and the + to the upper confidence limit. (Textbook page 280, display (7.10)) The score interval can be used with nearly all n and p values (no need to check large sample conditions!) and performs much better.

3. In 48 trials in a laboratory, 16 trials resulted in ignition of a particular type of substrate by a lighted cigarette. Let p be the long-run proportion of all such trials that would result in ignition. Construct an approximate 95% confidence interval for p using both the large sample and score intervals. 7

8 Minimum sample size necessary to ensure an interval width w is (a) Always round n up n 4zα/2 2 p(1 p). w 2 (b) Note that the above formula involves the unknown p (c) Conservative approach: use p = 0.5 while calculating n, that is n 4zα/2 2 0.5 2 w = z2 α/2 2 w. 2 4. In the previous problem, what is the minimum sample size required to ensure a width of 0.10, and confidence level 0.95?