Central Limit Theorem (cont d) 7/28/2006

Similar documents
MA : Introductory Probability

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Chapter 5. Sampling Distributions

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability Distributions

Central Limit Theorem 11/08/2005

2011 Pearson Education, Inc

Chapter 3 Discrete Random Variables and Probability Distributions

Probability Theory. Mohamed I. Riffi. Islamic University of Gaza

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

The binomial distribution p314

Statistical Methods in Practice STAT/MATH 3379

MATH 264 Problem Homework I

Midterm Exam III Review

Continuous Probability Distributions & Normal Distribution

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

PROBABILITY DISTRIBUTIONS

Chapter 7. Sampling Distributions and the Central Limit Theorem

ECON 214 Elements of Statistics for Economists 2016/2017

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Bernoulli and Binomial Distributions

The normal distribution is a theoretical model derived mathematically and not empirically.

Engineering Statistics ECIV 2305

AMS7: WEEK 4. CLASS 3

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

Central Limit Theorem, Joint Distributions Spring 2018

MATH 3200 Exam 3 Dr. Syring

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables

Estimation. Focus Points 10/11/2011. Estimating p in the Binomial Distribution. Section 7.3

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

Random Variables and Probability Functions

4.2 Bernoulli Trials and Binomial Distributions

Sampling Distribution

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

Probability Distributions for Discrete RV

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

1/2 2. Mean & variance. Mean & standard deviation

Chapter 7. Sampling Distributions and the Central Limit Theorem

Math489/889 Stochastic Processes and Advanced Mathematical Finance Homework 5

4 Random Variables and Distributions

Lecture 9. Probability Distributions. Outline. Outline

Homework Assignments

Section Distributions of Random Variables

Lecture 9. Probability Distributions

Part 10: The Binomial Distribution

8.1 Estimation of the Mean and Proportion

Tutorial 11: Limit Theorems. Baoxiang Wang & Yihan Zhang bxwang, April 10, 2017

MA 1125 Lecture 12 - Mean and Standard Deviation for the Binomial Distribution. Objectives: Mean and standard deviation for the binomial distribution.

Confidence Intervals Introduction

Chapter 3 Discrete Random Variables and Probability Distributions

Random Variables Handout. Xavier Vilà

Data Analysis and Statistical Methods Statistics 651

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41

Statistics 6 th Edition

Chapter 5. Statistical inference for Parametric Models

CH 5 Normal Probability Distributions Properties of the Normal Distribution

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

CHAPTER 5 SOME DISCRETE PROBABILITY DISTRIBUTIONS. 5.2 Binomial Distributions. 5.1 Uniform Discrete Distribution

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

University of California, Los Angeles Department of Statistics. Normal distribution

Chapter 7: Point Estimation and Sampling Distributions

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics

IEOR 3106: Introduction to OR: Stochastic Models. Fall 2013, Professor Whitt. Class Lecture Notes: Tuesday, September 10.

Math 361. Day 8 Binomial Random Variables pages 27 and 28 Inv Do you have ESP? Inv. 1.3 Tim or Bob?

MidTerm 1) Find the following (round off to one decimal place):

Binomial Distribution. Normal Approximation to the Binomial

Math 14 Lecture Notes Ch The Normal Approximation to the Binomial Distribution. P (X ) = nc X p X q n X =

Chapter 8. Introduction to Statistical Inference

6. THE BINOMIAL DISTRIBUTION

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

Math 227 Elementary Statistics. Bluman 5 th edition

Statistics 13 Elementary Statistics

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

8.1 Binomial Distributions

Chapter 4 Continuous Random Variables and Probability Distributions

The Binomial distribution

Statistical Tables Compiled by Alan J. Terry

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

Chapter 5. Discrete Probability Distributions. McGraw-Hill, Bluman, 7 th ed, Chapter 5 1

Business Statistics. Chapter 5 Discrete Probability Distributions QMIS 120. Dr. Mohammad Zainal

Commonly Used Distributions

S = 1,2,3, 4,5,6 occurs

E509A: Principle of Biostatistics. GY Zou

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

CH 6 Review Normal Probability Distributions College Statistics

STA 220H1F LEC0201. Week 7: More Probability: Discrete Random Variables

Binomial Random Variables. Binomial Random Variables

Chapter 9 & 10. Multiple Choice.

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

Chapter 7. Sampling Distributions

Part V - Chance Variability

Math Week in Review #10. Experiments with two outcomes ( success and failure ) are called Bernoulli or binomial trials.

Transcription:

Central Limit Theorem (cont d) 7/28/2006

Central Limit Theorem for Binomial Distributions Theorem. For the binomial distribution b(n, p, j) we have lim npq b(n, p, np + x npq ) = φ(x), n where φ(x) is the standard normal density. 1

Recall: The standardized sum S n = S n np npq. Then P (a S n b) = P ( a np S npq n b np ). npq 2

Central Limit Theorem for Bernoulli Trials Theorem. Let S n be the number of successes in n Bernoulli trials with probability p for success, and let a and b be two fixed real numbers. Define a = a np npq and Then b = b np npq. lim P (a S n b) = n b a φ(x) dx. 3

How to use this theorem? The integral on the right side of this equation is equal to the area under the graph of the standard normal density φ(x) between a and b. We denote this area by NA(a, b ). Unfortunately, there is no simple way to integrate the function e x2 /2. 4

NA (0,z) = area of shaded region 0 z z NA(z) z NA(z) z NA(z) z NA(z).0.0000 1.0.3413 2.0.4772 3.0.4987.1.0398 1.1.3643 2.1.4821 3.1.4990.2.0793 1.2.3849 2.2.4861 3.2.4993.3.1179 1.3.4032 2.3.4893 3.3.4995.4.1554 1.4.4192 2.4.4918 3.4.4997.5.1915 1.5.4332 2.5.4938 3.5.4998.6.2257 1.6.4452 2.6.4953 3.6.4998.7.2580 1.7.4554 2.7.4965 3.7.4999.8.2881 1.8.4641 2.8.4974 3.8.4999.9.3159 1.9.4713 2.9.4981 3.9.5000 5

Approximation of Binomial Probabilities Suppose that S n is binomially distributed with parameters n and p. ( i 1 2 P (i S n j) NA np, j + 1 2 np ) npq npq. 6

Example A coin is tossed 100 times. Estimate the probability that the number of heads lies between 40 and 60. 7

Example A coin is tossed 100 times. Estimate the probability that the number of heads lies between 40 and 60. The expected number of heads is 100 1/2 = 50, and the standard deviation for the number of heads is 100 1/2 1/2 = 5. n = 100 is reasonably large. 7

P (40 S n 60) ( 39.5 50 P Sn 5 = P ( 2.1 Sn 2.1) N A( 2.1, 2.1) = 2N A(0, 2.1).9642. ) 60.5 50 5 8

Dartmouth College would like to have 1050 freshmen. This college cannot accommodate more than 1060. Assume that each applicant accepts with probability.6 and that the acceptances can be modeled by Bernoulli trials. If the college accepts 1700, what is the probability that it will have too many acceptances? 9

Dartmouth College would like to have 1050 freshmen. This college cannot accommodate more than 1060. Assume that each applicant accepts with probability.6 and that the acceptances can be modeled by Bernoulli trials. If the college accepts 1700, what is the probability that it will have too many acceptances? If it accepts 1700 students, the expected number of students who matriculate is.6 1700 = 1020. The standard deviation for the number that accept is 1700.6.4 20. Thus we want to estimate the probability P (S 1700 > 1060) = P (S 1700 1061) 9

P (S 1700 > 1060) = P (S 1700 1061) ( ) = P S1700 1060.5 1020 20 = P (S 1700 2.025). 10

Exercise Let S 100 be the number of heads that turn up in 100 tosses of a fair coin. Use the Central Limit Theorem to estimate 1. P (S 100 45). 2. P (45 < S 100 < 55). 3. P (S 100 > 63). 4. P (S 100 < 57). 11

Exercise A true-false examination has 48 questions. June has probability 3/4 of answering a question correctly. April just guesses on each question. A passing score is 30 or more correct answers. Compare the probability that June passes the exam with the probability that April passes it. 12

Applications to Statistics Suppose that a poll has been taken to estimate the proportion of people in a certain population who favor one candidate over another in a race with two candidates. We pick a subset of the population, called a sample, and ask everyone in the sample for their preference. Let p be the actual proportion of people in the population who are in favor of candidate A and let q = 1 p. If we choose a sample of size n from the population, the preferences of the people in the sample can be represented by random variables X 1, X 2,..., X n, where X i = 1 if person i is in favor of candidate A, and X i = 0 if person i is in favor of candidate B. 13

Let S n = X 1 + X 2 + + X n. If each subset of size n is chosen with the same probability, then S n is hypergeometrically distributed. If n is small relative to the size of the population, then S n is approximately binomially distributed, with parameters n and p. The pollster wants to estimate the value p. An estimate for p is provided by the value p = S n /n. 14

The mean of p is just p, and the standard deviation is pq n. The standardized version of p is p = p p pq/n. 15

The distribution of the standardized version of p is approximated by the standard normal density. 95% of its values will lie within two standard deviations of its mean, and the same is true of p. ( ) pq pq P p 2 n < p < p + 2.954. n The pollster does not know p or q, but he can use p and q = 1 p in their place ( ) p q p q P p 2 n < p < p + 2.954. n 16

The resulting interval ( p 2 p q, p + 2 p q ) n n is called the 95 percent confidence interval for the unknown value of p. The pollster has control over the value of n. Thus, if he wants to create a 95% confidence interval with length 6%, then he should choose a value of n so that 2 p q n.03. 17

Exercise A restaurant feeds 400 customers per day. On the average 20 percent of the customers order apple pie. 1. Give a range (called a 95 percent confidence interval) for the number of pieces of apple pie ordered on a given day such that you can be 95 percent sure that the actual number will fall in this range. 2. How many customers must the restaurant have to be at least 95 percent sure that the number of customers ordering pie on that day falls in the 19 to 21 percent range? 18

Central Limit Theorem for Discrete Independent Trials Let S n = X 1 + X 2 + + X n be the sum of n independent discrete random variables of an independent trials process with common distribution function m(x) defined on the integers, with mean µ and variance σ 2. Standardized Sums S n = S n nµ nσ 2. This standarizes S n to have expected value 0 and variance 1. 19

If S n = j, then S n has the value x j with x j = j nµ nσ 2. 20

Approximation Theorem Let X 1, X 2,..., X n be an independent trials process and let S n = X 1 + X 2 + + X n. Assume that the greatest common divisor of the differences of all the values that the X j can take on is 1. Let E(X j ) = µ and V (X j ) = σ 2. Then for n large, P (S n = j) φ(x j) nσ 2, where x j = (j nµ)/ nσ 2, and φ(x) is the standard normal density. 21

Central Limit Theorem for a Discrete Independent Trials Process Let S n = X 1 + X 2 + + X n be the sum of n discrete independent random variables with common distribution having expected value µ and variance σ 2. Then, for a < b, lim P n ( a < S n nµ nσ 2 ) < b = 1 2π b a e x2 /2 dx. 22

Example A die is rolled 420 times. What is the probability that the sum of the rolls lies between 1400 and 1550? 23

Example A die is rolled 420 times. What is the probability that the sum of the rolls lies between 1400 and 1550? The sum is a random variable S 420 = X 1 + X 2 + + X 420. We have seen that µ = E(X) = 7/2 and σ 2 = V (X) = 35/12. Thus, E(S 420 ) = 420 7/2 = 1470, σ 2 (S 420 ) = 420 35/12 = 1225, and σ(s 420 ) = 35. 23

P (1400 S 420 1550) ( 1399.5 1470 P S420 35 = P ( 2.01 S420 2.30) NA( 2.01, 2.30) =.9670. ) 1550.5 1470 35 24