STA215 Confidence Intervals for Proportions

Similar documents
χ 2 distributions and confidence intervals for population variance

1. Variability in estimates and CLT

Statistics 13 Elementary Statistics

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41

Section The Sampling Distribution of a Sample Mean

Confidence Intervals Introduction

1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by. Cov(X, Y ) = E(X E(X))(Y E(Y ))

Unit 5: Sampling Distributions of Statistics

GPCO 453: Quantitative Methods I Review: Hypothesis Testing

Unit 5: Sampling Distributions of Statistics

Statistics Class 15 3/21/2012

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Chapter 8: Sampling distributions of estimators Sections

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

Central Limit Theorem (cont d) 7/28/2006

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

Sampling Distribution

σ 2 : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

Lecture 9 - Sampling Distributions and the CLT

1 Introduction 1. 3 Confidence interval for proportion p 6

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

STA258 Analysis of Variance

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Stat 139 Homework 2 Solutions, Fall 2016

Stat 213: Intro to Statistics 9 Central Limit Theorem

BIO5312 Biostatistics Lecture 5: Estimations

Chapter 5. Sampling Distributions

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

Confidence Intervals: Review

5.3 Interval Estimation

Lecture 9 - Sampling Distributions and the CLT. Mean. Margin of error. Sta102/BME102. February 6, Sample mean ( X ): x i

Lecture 10 - Confidence Intervals for Sample Means

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

The Bernoulli distribution

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Sampling and sampling distribution

MATH 3200 Exam 3 Dr. Syring

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Elementary Statistics Lecture 5

1. Statistical problems - a) Distribution is known. b) Distribution is unknown.

Lecture 6: Confidence Intervals

MATH 10 INTRODUCTORY STATISTICS

Chapter Four: Introduction To Inference 1/50

Data Analysis and Statistical Methods Statistics 651

Chapter 7. Sampling Distributions

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance

Confidence Interval and Hypothesis Testing: Exercises and Solutions

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Statistics for Business and Economics

4.2 Probability Distributions

Midterm Exam III Review

Discrete Random Variables

Chapter 7 - Lecture 1 General concepts and criteria

Chapter Seven: Confidence Intervals and Sample Size

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 42

Section 7-2 Estimating a Population Proportion

Chapter 9 Chapter Friday, June 4 th

Final/Exam #3 Form B - Statistics 211 (Fall 1999)

Chapter 3 - Lecture 4 Moments and Moment Generating Funct

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

Chapter 8. Introduction to Statistical Inference

Confidence Intervals. σ unknown, small samples The t-statistic /22

Chapter 7: Point Estimation and Sampling Distributions

This is very simple, just enter the sample into a list in the calculator and go to STAT CALC 1-Var Stats. You will get

Sampling & populations

MATH 10 INTRODUCTORY STATISTICS

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER

STA218 Analysis of Variance

Chapter 7: Estimation Sections

= 0.35 (or ˆp = We have 20 independent trials, each with probability of success (heads) equal to 0.5, so X has a B(20, 0.5) distribution.

Experimental Design and Statistics - AGA47A

Chapter 5. Statistical inference for Parametric Models

Section 0: Introduction and Review of Basic Concepts

Random Variables Handout. Xavier Vilà

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples

Standard Normal, Inverse Normal and Sampling Distributions

Chapter 8 Estimation

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

Chapter 7. Sampling Distributions and the Central Limit Theorem

1 Small Sample CI for a Population Mean µ

Contents. 1 Introduction. Math 321 Chapter 5 Confidence Intervals. 1 Introduction 1

Module 4: Probability

Mean GMM. Standard error

Discrete Random Variables

Discrete Random Variables

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

Discrete Probability Distribution

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

Central Limit Theorem 11/08/2005

4 Random Variables and Distributions

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

Section 1.4: Learning from data

WebAssign Math 3680 Homework 5 Devore Fall 2013 (Homework)

Chapter 8 Statistical Intervals for a Single Sample

INFERENTIAL STATISTICS REVISION

Transcription:

STA215 Confidence Intervals for Proportions Al Nosedal. University of Toronto. Summer 2017 June 14, 2017

Pepsi problem A market research consultant hired by the Pepsi-Cola Co. is interested in determining the proportion of UTM students who favor Pepsi-Cola over Coke Classic. A random sample of 100 students shows that 40 students favor Pepsi over Coke. Use this information to construct a 95% confidence interval for the proportion of all students in this market who prefer Pepsi.

Bernoulli Distribution x i = { 1 i-th person prefers Pepsi 0 i-th person prefers Coke µ = E(x i ) = p σ 2 = V (x i ) = p(1 p) n i=1 Let ˆp be our estimate of p. Note that ˆp = x i n = x. If n is large, by the Central Limit Theorem, we know that: σ x is roughly N(µ, n ), that is, ( ) ˆp is roughly N p, p(1 p) n

Interval Estimate of p Draw a simple random sample of size n from a population with unknown proportion p of successes. An (approximate) confidence interval for p is: ( ) ˆp(1 ˆp) ˆp ± z n where z is a number coming from the Standard Normal that depends on the confidence level required. Use this interval only when: 1) n is large and 2) n ˆp 10 and n(1 ˆp) 10.

Problem A simple random sample of 400 individuals provides 100 Yes responses. a. What is the point estimate of the proportion of the population that would provide Yes responses? b. What is the point estimate of the standard error of the proportion, σˆp? c. Compute the 95% confidence interval for the population proportion.

Solution a. ˆp = 100 400 = 0.25 b. Standard error of ˆp = ( ) c. ˆp ± z (ˆp)(1 ˆp) n 0.25 ± 1.96(0.0216) (0.2076, 0.2923) (ˆp)(1 ˆp) (0.25)(0.75) n = 400 = 0.0216

Problem A simple random sample of 800 elements generates a sample proportion ˆp = 0.70. a. Provide a 90% confidence interval for the population proportion. b. Provide a 95% confidence interval for the population proportion.

Solution ( a. ˆp ± z ( 0.70 ± 1.65 (ˆp)(1 ˆp) n ) (0.70)(1 0.70) 800 0.70 ± 1.65(0.0162) (0.6732, 0.7267) b. 0.70 ± 1.96(0.0162) (0.6682, 0.7317) )

Problem A survey of 611 office workers investigated telephone answering practices, including how often each office worker was able to answer incoming telephone calls and how often incoming telephone calls went directly to voice mail. A total of 281 office workers indicated that they never need voice mail and are able to take every telephone call. a. What is the point estimate of the proportion of the population of office workers who are able to take every telephone call? b. At 90% confidence, what is the margin of error? c. What is the 90% confidence interval for the proportion of the population of office workers who are able to take every telephone call?

Solution a. ˆp = 281 611 = 0.46 b. Margin of error = (ˆp)(1 ˆp) z n = 1.65 c. ˆp ± z ( 0.46 ±.0332 (0.4268, 0.4932) (ˆp)(1 ˆp) n ) (0.46)(0.54) 611 = 1.65(0.0201) = 0.0332

R Code prop.test(281,611,conf.level=0.90); ## ## 1-sample proportions test with continuity correction ## ## data: 281 out of 611, null probability 0.5 ## X-squared = 3.7709, df = 1, p-value = 0.05215 ## alternative hypothesis: true p is not equal to 0.5 ## 90 percent confidence interval: ## 0.4261763 0.4939896 ## sample estimates: ## p ## 0.4599018

Problem In a survey, the planning value for the population proportion is p = 0.35. How large a sample should be taken to provide a 95% confidence interval with a margin of error of 0.05? Solution. n = ( ) z 2 E p (1 p ) = ( 1.96 2 0.05) (0.35)(1 0.35) = 350 (Always round up).

Determining the Sample Size Sample Size for an Interval Estimate of a Population Proportion. ( z ) 2 n = p (1 p ) E In practice, the planning value p can be chosen by one of the following procedures. 1. Use the sample proportion from a previous sample of the same or similar units. 2. Use a planning value of p = 0.5.

Problem At 95% confidence, how large a sample should be taken to obtain a margin of error of 0.03 for the estimation of a population proportion? Assume that past data are not available for developing a planning value for p. Solution. n = ( z ) 2 E p (1 p ) = ( 1.96 2 0.03) (0.5)(1 0.5) = 1068 (Always round up).

Problem The percentage of people not covered by health care insurance in 2003 was 15.6%. A congressional committee has been charged with conducting a sample survey to obtain more current information. a. What sample size would you recommend if the committee s goal is to estimate the current proportion of individuals without health care insurance with a margin of error of 0.03? Use a 95% confidence level. b. Repeat part a) using a 99% confidence level.

Solution a. n = ( z E ) 2 p (1 p ) = ( 1.96 0.03) 2 (0.156)(1 0.156) = 563 b. n = ( z E ) 2 p (1 p ) = ( 2.58 0.03) 2 (0.156)(1 0.156) = 974