Sampling & populations

Similar documents
8.4: The Binomial Distribution

Discrete probability distributions

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Chapter 5 Discrete Probability Distributions. Random Variables Discrete Probability Distributions Expected Value and Variance

Chapter Five. The Binomial Probability Distribution and Related Topics

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Chapter 8: The Binomial and Geometric Distributions

Overview. Definitions. Definitions. Graphs. Chapter 5 Probability Distributions. probability distributions

4.2 Bernoulli Trials and Binomial Distributions

This is very simple, just enter the sample into a list in the calculator and go to STAT CALC 1-Var Stats. You will get

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) Estimating Population Parameters

Chapter 4 Probability Distributions

PROBABILITY DISTRIBUTIONS

Section 8.4 The Binomial Distribution

Review of the Topics for Midterm I

6. THE BINOMIAL DISTRIBUTION

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Chapter 3. Discrete Probability Distributions

MATH 264 Problem Homework I

Sampling and sampling distribution

Probability Theory. Mohamed I. Riffi. Islamic University of Gaza

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

Useful Probability Distributions

Chapter 3 - Lecture 3 Expected Values of Discrete Random Va

Discrete Probability Distribution

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Chapter 6 Confidence Intervals

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

5.1 Sampling Distributions for Counts and Proportions. Ulrich Hoensch MAT210 Rocky Mountain College Billings, MT 59102

TOPIC: PROBABILITY DISTRIBUTIONS

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance

STA Module 3B Discrete Random Variables

Chapter 16. Random Variables. Copyright 2010 Pearson Education, Inc.

Chapter 5. Discrete Probability Distributions. McGraw-Hill, Bluman, 7 th ed, Chapter 5 1

BIO5312 Biostatistics Lecture 5: Estimations

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

AP Stats Review. Mrs. Daniel Alonzo & Tracy Mourning Sr. High

Section Distributions of Random Variables

Statistics (This summary is for chapters 18, 29 and section H of chapter 19)

8.1 Binomial Distributions

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

Math Week in Review #10. Experiments with two outcomes ( success and failure ) are called Bernoulli or binomial trials.

CVE SOME DISCRETE PROBABILITY DISTRIBUTIONS

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

Central Limit Theorem (cont d) 7/28/2006

Answer Key: Quiz2-Chapter5: Discrete Probability Distribution

Binomial Random Variables. Binomial Random Variables

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

Standard Normal, Inverse Normal and Sampling Distributions

Chapter 8 Solutions Page 1 of 15 CHAPTER 8 EXERCISE SOLUTIONS

Chapter 7. Sampling Distributions and the Central Limit Theorem

Random Variable: Definition

AP Statistics Test 5

Sampling Distribution

Midterm Test 1 (Sample) Student Name (PRINT):... Student Signature:... Use pencil, so that you can erase and rewrite if necessary.

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

Applications of the Central Limit Theorem

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics

Math 14 Lecture Notes Ch. 4.3

The Binomial Distribution

CD Appendix F Hypergeometric Distribution

Binomial and Normal Distributions. Example: Determine whether the following experiments are binomial experiments. Explain.

Continuous Probability Distributions & Normal Distribution

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

MATH 3200 Exam 3 Dr. Syring

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

Chapter 7. Sampling Distributions

SOME MOST POPULAR DISTRIBUTIONS OF RANDOM VARIABLES

Engineering Statistics ECIV 2305

PROBABILITY DISTRIBUTIONS. Chapter 6

Section Distributions of Random Variables

The binomial distribution p314

Chapter 3 Discrete Random Variables and Probability Distributions

Chapter 8 Homework Solutions Compiled by Joe Kahlig. speed(x) freq 25 x < x < x < x < x < x < 55 5

Chapter 14 - Random Variables

3. Continuous Probability Distributions

Discrete Random Variables and Their Probability Distributions

Estimation. Focus Points 10/11/2011. Estimating p in the Binomial Distribution. Section 7.3

Section Random Variables and Histograms

3.2 Hypergeometric Distribution 3.5, 3.9 Mean and Variance

STT315 Chapter 4 Random Variables & Probability Distributions AM KM

Statistical Tables Compiled by Alan J. Terry

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

Chapter 7 1. Random Variables

Part 1 In which we meet the law of averages. The Law of Averages. The Expected Value & The Standard Error. Where Are We Going?

Chapter 7. Sampling Distributions and the Central Limit Theorem

A.REPRESENTATION OF DATA

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

Chapter 5. Sampling Distributions

Statistics 6 th Edition

STAT Chapter 7: Confidence Intervals

AMS7: WEEK 4. CLASS 3

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

1. Steve says I have two children, one of which is a boy. Given this information, what is the probability that Steve has two boys?

Transcription:

Sampling & populations Sample proportions Sampling distribution - small populations Sampling distribution - large populations Sampling distribution - normal distribution approximation Mean & variance of a sample proportion Sampling distribution - comparing approximations Mean & variance of the sample proportion Confidence intervals Margin of error 1

Sample proportions A sample of size n is taken from a population. The number of positive outcomes in the sample is recorded to find the sample proportion. The population proportion can be estimated from the sample proportion. ˆp The sample proportions are the values of the random variable. ˆP number of positive outcomes in population p = population size number of positive outcomes in sample ˆp = sample size ˆP = The set of possible outcomes of ˆp. (A population statistic.) (A sample statistic.) 2

Sample proportions (Black is the positive outcome here) p = ˆp = number of positive outcomes in population population size number of positive outcomes in sample sample size p = 54 100 = 0.54 ˆp = 5 10 = 0.5 ˆP = 0,0.1,0.2,0.3,0.4,0.5,0.6,0.7,0.8,0.9,1 { } 3

Sampling distribution - small populations If a population is small, then the probability of a selection changes depending on the previous selections. (Conditional probability.) For example, a group of 5 students is to be randomly selected from 12 boys and 10 girls. What is the sampling distribution for the proportion of boys selected? Pr(X = x )= Pr(X =2)= D C x N D C n x N C n 12 C 2 10 C 3 22 C 5 Pr(X =2)= 0.3008 Pr( ˆP = 0.4)= 0.3008 This is known as a hypergeometric distribution. Pr( ˆP = 0) Pr( ˆP = 0.2) Pr( ˆP = 0.4) Pr( ˆP = 0.6) Pr( ˆP = 0.8) Pr( ˆP =1) = 0.0096 = 0.0957 = 0.3008 = 0.3759 = 0.1880 = 0.0300 4

Sampling distribution - large populations If a population is sufficiently large, the probability of selection remains constant. (Independent probability.) For example, a group of 5 students is to be randomly selected from a large population at the school. (1000+ students, where 6/11 of the students are boys and 5/11 girls. ) Pr(X = x )= n C ( p) x ( 1 p) n x x Pr(X =2)= 5 C 6 2 2 11 Pr(X =2)= 0.2794 Pr( ˆP = 0.4)= 0.2794 This is known as a binomial distribution. 5 11 3 Pr( ˆP = 0) Pr( ˆP = 0.2) Pr( ˆP = 0.4) Pr( ˆP = 0.6) Pr( ˆP = 0.8) Pr( ˆP =1) = 0.0194 = 0.1164 = 0.2794 = 0.3353 = 0.2012 = 0.0483 5

Sampling distribution - normal distribution approximation If a population is sufficiently large and the value of p is not too far from 0.5, the binomial distribution can be approximated by a normal distribution The binomial mean and standard deviation can be used with a normal distribution. For a binomial distribution: µ = E(X )= np sd = np(1 p ) 6

Sampling distribution - normal distribution approximation Binomial distribution; 10 trials, p = 0.6 Normal distribution; mean = 6, σ = 1.55 7

Mean & variance of a sample proportion If a sample of n is taken from a population with a proportion p: E(X )= np E( ˆP )= E X n E( ˆP )= p (Binomial Mean) The expected value of the sample distribution is: E( ˆP )= p Var(X )= np(1 p ) Var( ˆP )=Var X n Var( ˆP )= 1 n 2 Var(x ) Var( ˆP )= p(1 p ) n (Binomial Variance) The standard deviation of the sample distribution is: sd( ˆP )= p(1 p ) n 8

Sampling distribution - comparing approximations 60% of people in a town are overweight. If a group of 100 people was to be randomly selected for a health survey, what is the probability that less than 55% of those surveyed are overweight? Binomial distribution: Pr( ˆp < 0.55) Pr(0 < x <54) Normal distribution: µ = E( ˆp )= 0.6 binomcdf(100,0.6,0,54) =0.1311 σ = 0.6 0.4 100 = 0.0490 normcdf(,0.55,0.6,0.0490) = 0.1537 9

Sample proportions p = number of positive outcomes in population population size ˆp = 54 100 = 0.54 10

Mean & variance of the sample proportion As the sample size increases, the binomial distribution approaches a normal distribution. From the previous example: Expected value = E( ˆP )= p E( ˆP )= 0.54 Standard deivation= p(1 p ) sd = sd = 0.17 0.54 0.46 10 n We can expect with around 68% certainty that the sample proportion will be within one standard deviation of the population proportion. We can expect with around 95% certainty that the sample proportion will be within two standard deviations of the sample proportion. (0.37 < ˆp < 0.71) (0.20 < ˆp < 0.88) 11

Sample proportions number of positive outcomes in sample ˆp = 5 ˆp = 10 = 0.5 sample size What is the uncertainty of any estimates of the population proportion p? What sample size is needed to be confident of correctly estimating p? 12

Confidence intervals Actually the point estimate of the sample proportion was 0.5. ˆp = 0.5 sd = ˆp(1 ˆp ) n sd = 0.5 0.5 10 sd = 0.16 We can expect with about 68% certainty that the population proportion is within one standard deviation of the sample proportion. (0.34 < p < 0.66) We can expect with about 95% certainty that the population proportion is within two standard deviations of the sample proportion. (0.18 < p < 0.82) 13

Margin of error The distance between the sample estimate and the end-points of the confidence interval is called the margin of error. To reduce the margin of error, the sample size needs to be increased. From a sample of 10, the margin of error at 95% confidence was ~0.32. To half the margin of error, the sample size should be four times greater. Margin of error: M 2 0.5 0.5 40 0.16 14

Margin of error The multiplier of the standard deviation needs to be found from the inverse normal distribution. For a 90% confidence: Find the value of z that has 95% of values below it. 90% confidence interval: Pr(Z>z)=95% z = invnorm(0.95,0,1)=1.65 90% 80% : z=1.28 90% : z=1.65 95% : z=1.96 98% : z=2.33 5% 5% z=1.65 15

Margin of error A survey is to be taken of voters to find the proportion that have not yet decided on who they are voting for. How many people need to be surveyed for a 2% or 5% margin of error in the results with 95% confidence? Firstly, the sample proportion quick survey. must be estimated from prior data or a Assume that ˆp is around 0.35 from preliminary data ˆp 0.02=1.96 0.35 0.65 n 0.02 1.96 2 = 0.35 0.65 n n == 0.35 0.65 0.02 1.96 2 n =2185 (for 2% margin of error) n = 350 (for 5% margin of error) 16