Today s plan: Section 4.4.2: Capture-Recapture method revisited and Section 4.4.3: Public Opinion Polls

Similar documents
Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

Chapter 5. Sampling Distributions

Math 140 Introductory Statistics. Next midterm May 1

STAT Chapter 7: Confidence Intervals

AP Statistics: Chapter 8, lesson 2: Estimating a population proportion

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance

Homework: (Due Wed) Chapter 10: #5, 22, 42

Chapter 10 Estimating Proportions with Confidence

HOMEWORK: Due Mon 11/8, Chapter 9: #15, 25, 37, 44

Oligopoly Games and Voting Games. Cournot s Model of Quantity Competition:

5.3 Interval Estimation

Lecture 5: Sampling Distributions

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

Chapter 9 Chapter Friday, June 4 th

FINAL REVIEW W/ANSWERS

Lecture 9 - Sampling Distributions and the CLT. Mean. Margin of error. Sta102/BME102. February 6, Sample mean ( X ): x i

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 9 & 10. Multiple Choice.

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

STA215 Confidence Intervals for Proportions

Chapter 8 Answers: Section 8.1: Section 8.2:

Statistics 13 Elementary Statistics

Data Analysis and Statistical Methods Statistics 651

Pssst! Coffee helps!

Random Sampling & Confidence Intervals

Chapter 7. Sampling Distributions and the Central Limit Theorem

Lecture 9 - Sampling Distributions and the CLT

Stats SB Notes 6.3 Completed.notebook April 03, Mar 23 5:22 PM. Chapter Outline. 6.1 Confidence Intervals for the Mean (σ Known)

This is very simple, just enter the sample into a list in the calculator and go to STAT CALC 1-Var Stats. You will get

University of California, Los Angeles Department of Statistics

Lecture 6: Confidence Intervals

3. Probability Distributions and Sampling

SAMPLING DISTRIBUTIONS. Chapter 7

STAT 1220 FALL 2010 Common Final Exam December 10, 2010

We will use an example which will result in a paired t test regarding the labor force participation rate for women in the 60 s and 70 s.

MBF2263 Portfolio Management. Lecture 8: Risk and Return in Capital Markets

Arbitrage Pricing. What is an Equivalent Martingale Measure, and why should a bookie care? Department of Mathematics University of Texas at Austin

Section Sampling Distributions for Counts and Proportions

Introduction to Statistics I

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

1 Sampling Distributions

Sampling Distributions

INSIGHTS WEST Survey on Surrey Municipal Issues - November 10, 2014

Confidence Interval and Hypothesis Testing: Exercises and Solutions

Sampling Distribution Models. Copyright 2009 Pearson Education, Inc.

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

Chapter Four: Introduction To Inference 1/50

Michigan Statewide Poll Results

The Ohio State University Department of Economics Second Midterm Examination Answers

Relationship between Correlation and Volatility. in Closely-Related Assets

Section 7-2 Estimating a Population Proportion

ECO220Y Sampling Distributions of Sample Statistics: Sample Proportion Readings: Chapter 10, section

Chapter 3 Discrete Random Variables and Probability Distributions

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

G5212: Game Theory. Mark Dean. Spring 2017

CUR 412: Game Theory and its Applications, Lecture 4

GPCO 453: Quantitative Methods I Review: Hypothesis Testing

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

Review. Preview This chapter presents the beginning of inferential statistics. October 25, S7.1 2_3 Estimating a Population Proportion

The Binomial Distribution

The Mathematics of Normality

The Binomial Distribution

MATH 3200 Exam 3 Dr. Syring

Elementary Statistics

LIKELY VOTERS GIVE BOOKER LARGE LEAD, MOST EXPECT HIM TO WIN; LONEGAN WIDELY UNKNOWN

Statistic Midterm. Spring This is a closed-book, closed-notes exam. You may use any calculator.

Confidence Intervals and Sample Size

Sampling & populations

University of Hong Kong

Multinomial Coefficient : A Generalization of the Binomial Coefficient

Sampling and sampling distribution

Section 0: Introduction and Review of Basic Concepts

Intro to Probability Instructor: Alexandre Bouchard

An Introduction to Bayesian Inference and MCMC Methods for Capture-Recapture

A) The first quartile B) The Median C) The third quartile D) None of the previous. 2. [3] If P (A) =.8, P (B) =.7, and P (A B) =.

3.2 Hypergeometric Distribution 3.5, 3.9 Mean and Variance

Section 7.2. Estimating a Population Proportion

June 11, Dynamic Programming( Weighted Interval Scheduling)

Back to estimators...

Chapter 8 Probability Models

MESSAGING GUIDANCE ON TRUMP & REPUBLICAN TAX CUTS As of August 10, 2017

Math 124: Module 8 (Normal Distribution) Normally Distributed Random Variables. Solving Normal Problems with Technology

Preference Networks in Matching Markets

Chapter 6: Discrete Probability Distributions

Chapter 11. Data Descriptions and Probability Distributions. Section 4 Bernoulli Trials and Binomial Distribution

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean.

Alabama Senate Poll Results Moore 48%, Jones 40%, McBride 1% (11% undecided) Generic Republican 49%, Generic Democrat 45% (6% undecided)

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Statistics, Measures of Central Tendency I

MAKING SENSE OF DATA Essentials series

Math Tech IIII, May 7

Econ 250 Fall Due at November 16. Assignment 2: Binomial Distribution, Continuous Random Variables and Sampling

Chapter 8 Statistical Intervals for a Single Sample

3.5 Applying the Normal Distribution (Z-Scores)

5.3 Statistics and Their Distributions

1. Confidence Intervals (cont.)

5.1 Sampling Distributions for Counts and Proportions. Ulrich Hoensch MAT210 Rocky Mountain College Billings, MT 59102

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

MATH 10 INTRODUCTORY STATISTICS

Transcription:

1 Today s plan: Section 4.4.2: Capture-Recapture method revisited and Section 4.4.3: Public Opinion Polls

2 Section 4.4.2: Capture-Recapture method revisited

3 Let s use statistical inference to get a better estimate of a population size.

4 Example Estimate the population of fish in a lake.

4 Example Estimate the population of fish in a lake. Catch a sample of 150 fish. Tag and release them.

4 Example Estimate the population of fish in a lake. Catch a sample of 150 fish. Tag and release them. A week later, catch a new sample of 100 fish. The number of tagged fish is 12.

4 Example Estimate the population of fish in a lake. Catch a sample of 150 fish. Tag and release them. A week later, catch a new sample of 100 fish. The number of tagged fish is 12. Get a 95% confidence level estimate of the fish population.

5 The second sample is a repeated two-outcome experiment, done 100 times:

5 The second sample is a repeated two-outcome experiment, done 100 times: Take a fish and check for a tag

5 The second sample is a repeated two-outcome experiment, done 100 times: Take a fish and check for a tag The two outcomes are: tagged and not tagged

6 The number k of successes is the number of tagged fish in the sample.

6 The number k of successes is the number of tagged fish in the sample. The statistic ˆp is ˆp = k n = 12 100 = 0.12

7 With ˆp = 0.12 and n = 100 in hand, we compute: st.err. 0.12 (1 0.12) 100 0.0325

7 With ˆp = 0.12 and n = 100 in hand, we compute: st.err. 0.12 (1 0.12) 100 0.0325 So what s p, with 95% confidence?

8 ˆp (2 σ n ) p ˆp + (2 σ n )

8 ˆp (2 σ n ) p ˆp + (2 σ n ) 0.12 (2 0.0325) p 0.12 + (2 0.032

8 ˆp (2 σ n ) p ˆp + (2 σ n ) 0.12 (2 0.0325) p 0.12 + (2 0.032 0.055 150 N 0.185

8 ˆp (2 σ n ) p ˆp + (2 σ n ) 0.12 (2 0.0325) p 0.12 + (2 0.032 0.055 150 N 0.185 0.055 150 1 N 0.185 150

8 ˆp (2 σ n ) p ˆp + (2 σ n ) 0.12 (2 0.0325) p 0.12 + (2 0.032 0.055 150 N 0.185 0.055 150 1 N 0.185 150 150 0.055 N 150 0.185

8 ˆp (2 σ n ) p ˆp + (2 σ n ) 0.12 (2 0.0325) p 0.12 + (2 0.032 0.055 150 N 0.185 0.055 150 1 N 0.185 150 150 0.055 N 150 0.185 2727.27 N 810.81

9 We can say with 95% confidence that the population is somewhere between 811 and 2,727.

10 This interval is very wide

10 This interval is very wide We can narrow the interval at the cost of reducing the confidence level.

10 This interval is very wide We can narrow the interval at the cost of reducing the confidence level. or increasing the sample size

11 With 68% confidence, we conclude the population is between 984 and 1,714.

11 With 68% confidence, we conclude the population is between 984 and 1,714. The original estimate 1250 (when st.err. = 0) is not the middle of the interval [811,2,727]

11 With 68% confidence, we conclude the population is between 984 and 1,714. The original estimate 1250 (when st.err. = 0) is not the middle of the interval [811,2,727] This is an artifact of estimating 1/N to get N.

12 Section 4.4.3: Public opinion polls

13 Example The results of a poll (of 1350 people) for a mayoral election are 648 in favor of Candidate A

13 Example The results of a poll (of 1350 people) for a mayoral election are 648 in favor of Candidate A 702 in favor of Candidate B

13 Example The results of a poll (of 1350 people) for a mayoral election are 648 in favor of Candidate A 702 in favor of Candidate B What predictions can we make about the election?

14 Let s begin with Candidate A. Sample size n = 1350

14 Let s begin with Candidate A. Sample size n = 1350 Favorable voters k = 648

14 Let s begin with Candidate A. Sample size n = 1350 Favorable voters k = 648 Therefore ˆp = 648 = 0.48 or 1350 48%

14 Let s begin with Candidate A. Sample size n = 1350 Favorable voters k = 648 Therefore ˆp = 648 = 0.48 or 1350 48% σ 1350 0.48 (1 0.48) 18.3565

15 so the standard error is st.err. 18.3565 0.0136 1350 or 1.36%

15 so the standard error is st.err. 18.3565 0.0136 1350 or 1.36% Thus, the 95% confidence interval is [48 2 1.36, 48 + 2 1.36] or [45.28%, 50.72%]

16 Similarly, for Candidate B: Sample size n = 1350

16 Similarly, for Candidate B: Sample size n = 1350 favorable voters k = 702

16 Similarly, for Candidate B: Sample size n = 1350 favorable voters k = 702 Therefore ˆp = 702 = 0.52 or 1350 52%

16 Similarly, for Candidate B: Sample size n = 1350 favorable voters k = 702 Therefore ˆp = 702 = 0.52 or 1350 52% σ 1350 0.52 (1 0.52) 18.3565

17 so the standard error is st.err. 18.3565 0.0136 1350 or 1.36%

17 so the standard error is st.err. 18.3565 0.0136 1350 or 1.36% Thus, the 95% confidence interval is [52 2 1.36, 52 + 2 1.36] or [49.28%, 54.72%]

18 When we draw these two intervals we clearly see they overlap. A B Overlap 45.28 48 50.72 49.28 52 54.72 49.28 50.72

19 So with 95% confidence, we can t say who will win.

19 So with 95% confidence, we can t say who will win. We call this a statistical tie, or we say the difference is not statistically significant.

20 Remarks: For both candidates the standard error was exactly the same.

20 Remarks: For both candidates the standard error was exactly the same. That is always the case when there are only two options.

20 Remarks: For both candidates the standard error was exactly the same. That is always the case when there are only two options. σ 1350 0.48 (1 0.48) = 1350 0.52 (1 0.52)

21 Even with three options, say, A, B and No preference, if not many people pick the third option then the standard error for both candidates will be almost the same.

21 Even with three options, say, A, B and No preference, if not many people pick the third option then the standard error for both candidates will be almost the same. In such cases we can get away with only computing one standard error.

22 Example Now a new poll is taken, and the numbers are: 581 in favor of Candidate A 769 in favor of Candidate B Is the difference statistically significant now?

23 The sample size is n = 1350, and the poll has only two options, so there is a common standard error.

24 For Candidate A, we have k = 581

24 For Candidate A, we have k = 581 so ˆp = 581 0.4303 or 43.03%. 1350

25 For Candidate B, we have k = 769

25 For Candidate B, we have k = 769 so ˆp = 769 0.5696 or 56.96%. 1350

26 The standard error is 0.4304 (1 0.4304) st.err. 1350 0.0135 or 1.35%

27 The 95% confidence interval for Candidate A is [43.03 2 1.35, 43.03 + 2 1.35]

27 The 95% confidence interval for Candidate A is [43.03 2 1.35, 43.03 + 2 1.35] or [40.33%, 45.73%]

28 The 95% confidence interval for Candidate B [56.96 2 1.35, 56.96 + 2 1.35]

28 The 95% confidence interval for Candidate B [56.96 2 1.35, 56.96 + 2 1.35] or [54.26%, 59.66%]

29 A 40.33 43.03 45.73 B 54.26 56.96 59.66

30 Remarks: Now they don t overlap at all.

30 Remarks: Now they don t overlap at all. Candidate B now has a statistically significant advantage over Candidate A.

31 Another way to see whether the difference between the candidates is statistically significant is whether their levels of support in the poll differ by more than 4 standard errors.

31 Another way to see whether the difference between the candidates is statistically significant is whether their levels of support in the poll differ by more than 4 standard errors. ˆp B ˆp A 57% 43% = 14%

31 Another way to see whether the difference between the candidates is statistically significant is whether their levels of support in the poll differ by more than 4 standard errors. ˆp B ˆp A 57% 43% = 14% whereas 4 st.err. = 4 1.35% = 5.4%

32 Next time: Section 4.4.4: Clinical Studies