Section 7-2 Estimating a Population Proportion

Similar documents
Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Review. Preview This chapter presents the beginning of inferential statistics. October 25, S7.1 2_3 Estimating a Population Proportion

Statistics Class 15 3/21/2012

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

Statistics 13 Elementary Statistics

Statistics for Business and Economics

Chapter 7. Sampling Distributions

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1

AMS7: WEEK 4. CLASS 3

5.3 Interval Estimation

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Chapter Seven: Confidence Intervals and Sample Size

MidTerm 1) Find the following (round off to one decimal place):

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

1 Sampling Distributions

STAT Chapter 7: Confidence Intervals

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance

Chapter 4 and Chapter 5 Test Review Worksheet

Confidence Intervals Introduction

Confidence Intervals for the Mean. When σ is known

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

6. THE BINOMIAL DISTRIBUTION

Density curves. (James Madison University) February 4, / 20

χ 2 distributions and confidence intervals for population variance

Chapter 8 Estimation

The "bell-shaped" curve, or normal curve, is a probability distribution that describes many real-life situations.

Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) Estimating Population Parameters

Homework: (Due Wed) Chapter 10: #5, 22, 42

1 Inferential Statistic

Estimation Y 3. Confidence intervals I, Feb 11,

Chapter 5. Sampling Distributions

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Chapter 7. Confidence Intervals and Sample Size. Bluman, Chapter 7. Friday, January 25, 13

Chapter 4 Probability Distributions

Chapter 8 Statistical Intervals for a Single Sample

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Chapter 9 & 10. Multiple Choice.

University of California, Los Angeles Department of Statistics

5.1 Mean, Median, & Mode

Confidence Interval and Hypothesis Testing: Exercises and Solutions

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

Confidence Intervals and Sample Size

The Normal Distribution. (Ch 4.3)

Key Concept. 155S6.6_3 Normal as Approximation to Binomial. March 02, 2011

Abraham Baldwin Agricultural College Math 2000 Practice Test 3

HOMEWORK: Due Mon 11/8, Chapter 9: #15, 25, 37, 44

Sampling Distributions

Central Limit Theorem

Chapter Four: Introduction To Inference 1/50

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

University of California, Los Angeles Department of Statistics. The central limit theorem The distribution of the sample mean

Confidence Intervals. σ unknown, small samples The t-statistic /22

Chapter 7.2: Large-Sample Confidence Intervals for a Population Mean and Proportion. Instructor: Elvan Ceyhan

Time Observations Time Period, t

10/1/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1

Data Analysis and Statistical Methods Statistics 651

Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions

Lecture 9. Probability Distributions. Outline. Outline

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean.

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

8.1 Estimation of the Mean and Proportion

ECON 214 Elements of Statistics for Economists 2016/2017

Sampling and sampling distribution

Lecture 9. Probability Distributions

1. Find the area under the standard normal distribution curve for each. (a) Between z = 0.19 and z = 1.23.

Chapter 9 Chapter Friday, June 4 th

Two Populations Hypothesis Testing

Part 1 In which we meet the law of averages. The Law of Averages. The Expected Value & The Standard Error. Where Are We Going?

Determine whether the given procedure results in a binomial distribution. If not, state the reason why.

Chapter 6 Confidence Intervals

Sampling Distributions For Counts and Proportions

8.3 CI for μ, σ NOT known (old 8.4)

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

MATH 3200 Exam 3 Dr. Syring

7 THE CENTRAL LIMIT THEOREM

Learning Objectives for Ch. 7

Midterm Test 1 (Sample) Student Name (PRINT):... Student Signature:... Use pencil, so that you can erase and rewrite if necessary.

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

Confidence Intervals 8.6

Elementary Statistics

Statistics for Managers Using Microsoft Excel 7 th Edition

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

CIVL Confidence Intervals

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

Statistical Methods in Practice STAT/MATH 3379

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

1. State Sales Tax. 2. Baggage Check

SLIDES. BY. John Loucks. St. Edward s University

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

ECON 214 Elements of Statistics for Economists

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

If the distribution of a random variable x is approximately normal, then

Transcription:

Section 7- Estimating a Population Proportion 1 Key Concept In this section we present methods for using a sample proportion to estimate the value of a population proportion. The sample proportion is the best point estimate of the population proportion. We can use a sample proportion to construct a confidence interval to estimate the true value of a population proportion, and we should know how to interpret such confidence intervals. We should know how to find the sample size necessary to estimate a population proportion.

Definition A point estimate is a single value (or point) used to approximate a population parameter. 3 Definition The sample proportion p is the best point estimate of the population proportion p. ˆ 4

Example: In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming. Q: What is the proportion of the adult population that believe in global warming? Notation: p is the population proportion (an unknown parameter). ˆp is the sample proportion (computed). From the poll data ˆp = 0.70. Apparently, 0.70 will be the best estimate of the proportion of all adults who believe in global warming. 5 Example In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming. The sample proportion ˆp = 0.70 is the best estimate of the population proportion p. A 95% confidence interval for the unknown population parameter is 0.677 < p < 0.73 will know later where these numbers come from What does it mean, exactly? 6

Interpreting a Confidence Interval We are 95% confident that the interval from 0.677 to 0.73 actually does contain the true value of the population proportion p. This means that if we were to select many different samples of size 1501 and construct the corresponding confidence intervals, then 95% of them would actually contain the value of the population proportion p. 7 Caution Know the correct interpretation of a confidence interval. It is wrong to say the probability that the population parameter belongs to the confidence interval is 95% because the population parameter is not a random variable, it does not change its value. 8

Caution Do not confuse two percentages: the proportion may be represented by percents (like 70% in the example), and the confidence level may be represented by percents (like 95% in the example). Proportion may be any number from 0% to 100%. Confidence level is usually 90% or 95% or 99%. 9 Next we learn how to construct confidence intervals 10

Critical Values A z score can be used to distinguish between sample statistics that are likely to occur and those that are unlikely to occur. Such a z score is called a critical value. The standard normal distribution is divided into three regions: middle part has area 1-α and two tails (left and right) have area α/ each: 11 Critical Values The z scores separate the middle interval (likely values) from the tails (unlikely values). They are z α/ and z α/, found from Table A-. 1

Definition A critical value is the number on the borderline separating sample statistics that are likely to occur from those that are unlikely to occur. 13 Notation for Critical Value The critical value z α/ separates an area of α/ in the right tail of the standard normal distribution. The value of z α/ separates an area of α/ in the left tail. The subscript α/ is simply a reminder that the z score separates an area of α/ in the tail. 14

Finding z α/ for a 95% Confidence Level α = 5% α/ =.5% =.05 -z α/ z α/ Critical Values 15 Definition Margin of error, denoted by E, is the maximum likely difference (with probability 1 α, such as 0.95) between the observed proportion ˆp and the true value of the population proportion p. The margin of error E is also called the maximum error of the estimate. 16

Margin of Error for Proportions E = z α pq ˆ ˆ n 17 Notation E = margin of error ^ p = sample proportion ^ q = 1 ^p n = number of sample values 18

Example (continued) We say that 0.70, or 70% is be the best point estimate of the proportion of all adults who believe in global warming. But how reliable (accurate) is this estimate? We will see that its margin of error is.3%. This means the true proportion of adults who believe in global warming is between 67.7% and 7.3%. 0.677 < p < 0.73.This gives an interval (from 67.7% to 7.3%) containing the true (but unknown) value of the population proportion. 19 Confidence Interval for a Population Proportion p ˆ p E < p < pˆ + E where E = z α pq ˆ ˆ n 0

Confidence Interval for a Population Proportion p ˆ p E < p < ˆ p+ E ˆ p + E (p ˆ E, ˆ p + E) 1 Finding the Point Estimate and E from a Confidence Interval Point estimate of p: ˆ Margin of Error: ˆ p = (upper confidence limit) + (lower confidence limit) E = (upper confidence limit) (lower confidence limit)

Round-Off Rule for Confidence Interval Estimates of p Round the confidence interval limits for p to three significant digits. 3 Sample Size Suppose we want to collect sample data in order to estimate some population proportion. The question is how many sample items must be obtained? 4

Determining Sample Size E = z α / ˆ p ˆ q n (solve for n by algebra) n = ˆ ( Z α / ) p ˆq E 5 Sample Size for Estimating Proportion p When an estimate of p is known: n = ˆ ˆ ˆ ( ) p q zα α / E ˆ When no estimate of p is known: n = ( zα α / ) 0.5 E 6

Round-Off Rule for Determining Sample Size If the computed sample size n is not a whole number, round the value of n up to the next larger whole number. Examples: n=310.67 round up to 311 n=310.3 round up to 311 n=310.01 round up to 311 7 Example: A manager for E-Bay wants to determine the current percentage of U.S. adults who now use the Internet. How many adults must be surveyed in order to be 95% confident that the sample percentage is in error by no more than three percentage points? a) In 006, 73% of adults used the Internet. b) No known possible value of the proportion. 8

Example: a) Use pˆ = 0.73 and qˆ = 1 pˆ = 0.7 α = 0.05 so z = 1.96 E = 0.03 α n = = ( zα ) E ( 1.96) ( 0.73)( 0.7) ( 0.03) pq ˆ ˆ = 841.3104 = 84 To be 95% confident that our sample percentage is within three percentage points of the true percentage for all adults, we should obtain a random sample of 84 adults. 9 Example: b) Use α = 0.05 so z α = 1.96 E = 0.03 n = = ( zα ) ( ) ( 0.03) E 0.5 1.96 0.5 = 1067.1111 = 1068 To be 95% confident that our sample percentage is within three percentage points of the true percentage for all adults, we should obtain a random sample of 1068 adults. 30

Analyzing Polls When analyzing polls consider: 1. The sample should be a simple random sample, not an inappropriate sample (such as a voluntary response sample).. The confidence level should be provided. (It is often 95%, but media reports often neglect to identify it.) 3. The sample size should be provided. (It is usually provided by the media, but not always.) 4. Except for relatively rare cases, the quality of the poll results depends on the sampling method and the size of the sample, but the size of the population is usually not a factor. 31 Caution Never follow the common misconception that poll results are unreliable if the sample size is a small percentage of the population size. The population size is usually not a factor in determining the reliability of a poll. 3

Recap In this section we have discussed: Point estimates. Confidence intervals. Confidence levels. Critical values. Margin of error. Determining sample sizes. 33 Section 7-3 Estimating a Population Mean: σ Known 34

Key Concept This section presents methods for estimating a population mean. In addition to knowing the values of the sample data or statistics, we must also know the value of the population standard deviation, σ. Here are three key concepts that should be learned in this section: 35 Key Concept 1. We should know that the sample mean is the best point estimate of the population mean µ.. We should learn how to use sample data to construct a confidence interval for estimating the value of a population mean, and we should know how to interpret such confidence intervals. 3. We should develop the ability to determine the sample size necessary to estimate a population mean. x 36

Point Estimate of the Population Mean The sample mean x is the best point estimate of the population mean µ. 37 Confidence Interval for Estimating a Population Mean (with σ Known) µ = population mean σ = population standard deviation x = sample mean n = number of sample values E = margin of error z α/ = z score separating an area of a/ in the right tail of the standard normal distribution 38

Confidence Interval for Estimating a Population Mean (with σ Known) 1. The sample is a simple random sample. (All samples of the same size have an equal chance of being selected.). The value of the population standard deviation σ is known. 3. Either or both of these conditions is satisfied: The population is normally distributed or n > 30. 39 Confidence Interval for Estimating a Population Mean (with σ Known) x E < µ < x + E where E = z α σ n or or x ± E ( x E,x + E) 40

Definition The two values x E and x + E are called confidence interval limits. 41 Sample Mean 1. For all populations, the sample mean x is an unbiased estimator of the population mean µ, meaning that the distribution of sample means tends to center about the value of the population mean µ.. For many populations, the distribution of sample means x tends to be more consistent (with less variation) than the distributions of other sample statistics. 4

Procedure for Constructing a Confidence Interval for µ (with Known σ) 1. Verify that the requirements are satisfied.. Refer to Table A- to find the critical value z α/ that corresponds to the desired confidence level. 3. Evaluate the margin of error 4. Find the values of x E and x + E. Substitute those values in the general format of the confidence interval: 5. Round using the confidence intervals round-off rules. x E < µ < x + E E = z σ n α 43 Round-Off Rule for Confidence Intervals Used to Estimate µ 1. When using the original set of data, round the confidence interval limits to one more decimal place than used in original set of data.. When the original set of data is unknown and only the summary statistics (n, x, s) are used, round the confidence interval limits to the same number of decimal places used for the sample mean. 44

Example: page 347 People have died in boat and aircraft accidents because an obsolete estimate of the mean weight of men was used. In recent decades, the mean weight of men has increased considerably, so we need to update our estimate of that mean so that boats, aircraft, elevators, and other such devices do not become dangerously overloaded. Using the weights of men from Data Set 1 in Appendix B, we obtain these sample statistics for the simple random sample: n = 40 and x = 17.55 lb. Research from several other sources suggests that the population of weights of men has a standard deviation given by σ = 6 lb. 45 Example: a. Find the best point estimate of the mean weight of the population of all men. b. Construct a 95% confidence interval estimate of the mean weight of all men. c. What do the results suggest about the mean weight of 166.3 lb that was used to determine the safe passenger capacity of water vessels in 1960 (as given in the National Transportation and Safety Board safety recommendation M-04-04)? 46

Example page 347 Since : 1- Sample is random - the value of σ is known 3 n >30 i.e. Conditions are satisfied Then: a) The sample mean 17.55 is the best point estimate for the population mean 47 Example: a. The sample mean of 17.55 lb is the best point estimate of the mean weight of the population of all men. b. A 95% confidence interval or 0.95 implies σ = 0.05, so z α/ = 1.96. Calculate the margin of error. σ 6 E = zα = 1.96 = 8.0574835 n 40 Construct the confidence interval. x E < µ < x E 17.55 8.0574835 < µ < 17.55 + 8.0574835 164.49 < µ < 180.61 48

Example: c. Based on the confidence interval, it is possible that the mean weight of 166.3 lb used in 1960 could be the mean weight of men today. However, the best point estimate of 17.55 lb suggests that the mean weight of men is now considerably greater than 166.3 lb. Considering that an underestimate of the mean weight of men could result in lives lost through overloaded boats and aircraft, the mean weight of men has been increased. 49 Finding a Sample Size for Estimating a Population Mean µ = population mean σ = population standard deviation x = population standard deviation E = desired margin of error z α/ = z score separating an area of α/ in the right tail of the standard normal distribution n = (z α/ ) σ E 50

Round-Off Rule for Sample Size n If the computed sample size n is not a whole number, round the value of n up to the next larger whole number. 51 Example: Assume that we want to estimate the mean IQ score for the population of statistics students. How many statistics students must be randomly selected for IQ tests if we want 95% confidence that the sample mean is within 3 IQ points of the population mean? α = 0.05 α / = 0.05 z α/ = 1.96 E = 3 σ = 15 n = (z α/ ) σ E n = 1.96 15 = 96.04 = 97 3 With a simple random sample of only 97 statistics students, we will be 95% confident that the sample mean is within 3 IQ points of the true population mean µ. 5

Recap In this section we have discussed: Margin of error. Confidence interval estimate of the population mean with σ known. Round off rules. Sample size for estimating the mean µ. 53