The Central Limit Theorem

Similar documents
Sampling Distributions and the Central Limit Theorem

The Binomial Distribution

The normal distribution is a theoretical model derived mathematically and not empirically.

Random variables The binomial distribution The normal distribution Other distributions. Distributions. Patrick Breheny.

The Binomial Distribution

Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny.

Part V - Chance Variability

The Binomial Distribution

We use probability distributions to represent the distribution of a discrete random variable.

Math 140 Introductory Statistics

Probability. An intro for calculus students P= Figure 1: A normal integral

Chapter 7 Sampling Distributions and Point Estimation of Parameters

2011 Pearson Education, Inc

Normal Approximation to Binomial Distributions

Statistics 6 th Edition

Theoretical Foundations

5.7 Probability Distributions and Variance

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

Comparing Estimators

Section The Sampling Distribution of a Sample Mean

Sampling Distributions Chapter 18

Business Statistics 41000: Probability 4

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

4: Probability. What is probability? Random variables (RVs)

Sampling Distributions For Counts and Proportions

Chapter 7: Point Estimation and Sampling Distributions

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

Data Analysis and Statistical Methods Statistics 651

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

STAT 241/251 - Chapter 7: Central Limit Theorem

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

1 Sampling Distributions

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

Chapter 14. From Randomness to Probability. Copyright 2010 Pearson Education, Inc.

STA 320 Fall Thursday, Dec 5. Sampling Distribution. STA Fall

CHAPTER 5 Sampling Distributions

BIO5312 Biostatistics Lecture 5: Estimations

MA 1125 Lecture 12 - Mean and Standard Deviation for the Binomial Distribution. Objectives: Mean and standard deviation for the binomial distribution.

Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7

Chapter 5: Statistical Inference (in General)

4.3 Normal distribution

Math 227 Elementary Statistics. Bluman 5 th edition

1. Variability in estimates and CLT

Value (x) probability Example A-2: Construct a histogram for population Ψ.

Chapter 17. The. Value Example. The Standard Error. Example The Short Cut. Classifying and Counting. Chapter 17. The.

MidTerm 1) Find the following (round off to one decimal place):

The Binomial and Geometric Distributions. Chapter 8

STAT 157 HW1 Solutions

MATH 264 Problem Homework I

QQ PLOT Yunsi Wang, Tyler Steele, Eva Zhang Spring 2016

Unit 5: Sampling Distributions of Statistics

Data Analysis and Statistical Methods Statistics 651

Unit 5: Sampling Distributions of Statistics

E509A: Principle of Biostatistics. GY Zou

Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

Chapter 5 Probability Distributions. Section 5-2 Random Variables. Random Variable Probability Distribution. Discrete and Continuous Random Variables

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

Chapter Four: Introduction To Inference 1/50

Chapter 8 Additional Probability Topics

Section 6.5. The Central Limit Theorem

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

CH 5 Normal Probability Distributions Properties of the Normal Distribution

***SECTION 8.1*** The Binomial Distributions

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Lecture 6: Chapter 6

Probability Models. Grab a copy of the notes on the table by the door

4 Random Variables and Distributions

4.2 Probability Distributions

Chapter 6: Random Variables

Chapter 6: Random Variables

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

ECON 214 Elements of Statistics for Economists 2016/2017

STA 6166 Fall 2007 Web-based Course. Notes 10: Probability Models

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1

Elementary Statistics Lecture 5

Contents. The Binomial Distribution. The Binomial Distribution The Normal Approximation to the Binomial Left hander example

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

Model Paper Statistics Objective. Paper Code Time Allowed: 20 minutes

23.1 Probability Distributions

Introduction to Statistical Data Analysis II

CSSS/SOC/STAT 321 Case-Based Statistics I. Random Variables & Probability Distributions I: Discrete Distributions

STAT 201 Chapter 6. Distribution

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Probability Basics. Part 1: What is Probability? INFO-1301, Quantitative Reasoning 1 University of Colorado Boulder. March 1, 2017 Prof.

Chapter 4 and 5 Note Guide: Probability Distributions

MLLunsford 1. Activity: Mathematical Expectation

Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES

The Binomial Distribution

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Lean Six Sigma: Training/Certification Books and Resources

Shifting our focus. We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?

Lecture 8 - Sampling Distributions and the CLT

Lecture 2. Probability Distributions Theophanis Tsandilas

Discrete Probability Distributions

Learning Objectives for Ch. 7

Binomial Random Variables. Binomial Random Variables

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean.

Transcription:

The Central Limit Theorem Patrick Breheny March 1 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 1 / 29

Kerrich s experiment Introduction The law of averages Mean and SD of the binomial distribution A South African mathematician named John Kerrich was visiting Copenhagen in 1940 when Germany invaded Denmark Kerrich spent the next five years in an internment camp To pass the time, he carried out a series of experiments in probability theory One of them involved flipping a coin 10,000 times Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 2 / 29

The law of averages Introduction The law of averages Mean and SD of the binomial distribution We know that a coin lands heads with probability 50% Thus, after many tosses, the law of averages says that the number of heads should be about the same as the number of tails...... or does it? Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 3 / 29

Kerrich s results Introduction The law of averages Mean and SD of the binomial distribution Number of Number of Heads - tosses (n) heads 0.5 Tosses 10 4-1 100 44-6 500 255 5 1,000 502 2 2,000 1,013 13 3,000 1,510 10 4,000 2,029 29 5,000 2,533 33 6,000 3,009 9 7,000 3,516 16 8,000 4,034 34 9,000 4,538 38 10,000 5,067 67 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 4 / 29

Kerrich s results plotted The law of averages Mean and SD of the binomial distribution Number of heads minus half the number of tosses 100 50 0 50 100 0 400 1600 3600 6400 10000 Number of tosses Instead of getting closer, the numbers of heads and tails are getting farther apart Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 5 / 29

Repeating the experiment 50 times The law of averages Mean and SD of the binomial distribution Number of heads minus half the number of tosses 100 50 0 50 100 0 400 1600 3600 6400 10000 Number of tosses This is not a fluke instead, it occurs systematically and consistently in repeated simulated experiments Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 6 / 29

Where s the law of averages? The law of averages Mean and SD of the binomial distribution So where s the law of averages? Well, the law of averages does not say that as n increases the number of heads will be close to the number of tails What it says instead is that, as n increases, the average number of heads will get closer and closer to the long-run average (in this case, 0.5) The technical term for this is that the sample average, which is an estimate, converges to the population mean, which is a parameter Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 7 / 29

The law of averages Mean and SD of the binomial distribution Repeating the experiment 50 times, Part II 80 70 Percentage of heads 60 50 40 30 20 0 400 1600 3600 6400 10000 Number of tosses Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 8 / 29

Trends in Kerrich s experiment The law of averages Mean and SD of the binomial distribution There are three very important trends going on in this experiment We ll get to those three trends in a few minutes, but first, I want to introduce two additional, important facts about the binomial distribution: its mean (expected value) and standard deviation Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 9 / 29

The law of averages Mean and SD of the binomial distribution The expected value of the binomial distribution Recall that the probability of an event is the long-run percent of time it occurs An analogous idea exists for random variables: if we were to measure a random variable over and over again an infinite number of times, the average of those measurements would be the expected value of the random variable For example, the expected value of a random variable X following a binomial distribution with n trials and probability π is nπ: E(X) = nπ This makes sense; if you flip a coin 10 times, you can expect 5 heads Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 10 / 29

The law of averages Mean and SD of the binomial distribution The standard deviation of the binomial distribution Of course, you won t always get 5 heads Because of variability, we are also interested in the standard deviation of random variables For the binomial distribution, the standard deviation is SD(X) = nπ(1 π) To continue our example of flipping a coin 10 times, here the SD is 10(0.5)(0.5) = 1.58, so we can expect the number of heads to be 5 ± 3 about 95% of the time (by the 95% rule of thumb) Note that the SD is highest when π = 0.5 and gets smaller as π is close to 0 or 1 this makes sense, as if π is close to 0 or 1, the event is more predictable and less variable Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 11 / 29

Trends in Kerrich s experiment Trend #1: The expected value of the average Trend #2: The standard error Trend #3: The distribution of the average As I said a few minutes ago, there are three very important trends going on in this experiment These trends can be observed visually from the computer simulations or proven via the binomial distribution We ll work with both approaches so that you can get a sense of how they both work and how they reinforce each other Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 12 / 29

The expected value of the mean Trend #1: The expected value of the average Trend #2: The standard error Trend #3: The distribution of the average The expected value of the binomial distribution is nπ; what about the expected value of its mean? The mean (i.e., the sample proportion) is ˆπ = X n, so its expected value is E(ˆπ) = E(X) n = nπ n = π In other words, for any sample size, the expected value of the sample proportion is equal to the true proportion (i.e., it is not biased) Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 13 / 29

The standard error of the mean Trend #1: The expected value of the average Trend #2: The standard error Trend #3: The distribution of the average Likewise, but the standard deviation of the binomial distribution is nπ(1 π), but what about the SD of the mean? As before, SD(ˆπ) = SD(X) n nπ(1 π) = n π(1 π) = n Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 14 / 29

Standard errors Introduction Trend #1: The expected value of the average Trend #2: The standard error Trend #3: The distribution of the average Note that, as n goes up, the variability of the # of heads goes up, but the variability of the average goes down just as we saw in our simulation Indeed, the variability goes to 0 as n gets larger and larger this is the law of averages The standard deviation of the average is given a special name in statistics to distinguish it from the sample standard deviation of the data The standard deviation of the average is called the standard error The term standard error refers to the variability of any estimate, to distinguish it from the variability of individual tosses or people Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 15 / 29

The square root law Introduction Trend #1: The expected value of the average Trend #2: The standard error Trend #3: The distribution of the average The relationship between the variability of an individual (toss) and the variability of the average (of a large number of tosses) is a very important relationship, sometimes called the square root law: SE = SD n, where SE is the standard error of the mean and SD is the standard deviation of an individual (toss) We saw that this is true for tosses of a coin, but it is in fact true for all averages Once again, we see this phenomenon visually in our simulation results Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 16 / 29

The distribution of the mean Trend #1: The expected value of the average Trend #2: The standard error Trend #3: The distribution of the average Finally, let s look at the distribution of the mean by creating histograms of the mean in our simulation 2 flips 9 flips 25 flips 0.0 0.2 0.4 0.6 0.8 1.0 Mean 0.0 0.2 0.4 0.6 0.8 1.0 Mean 0.0 0.2 0.4 0.6 0.8 1.0 Mean Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 17 / 29

The theorem How large does n have to be? In summary, there are three very important phenomena going on here concerning the sampling distribution of the sample average: #1 The expected value is always equal to the population average #2 The standard error is always equal to the population standard deviation divided by the square root of n #3 As n gets larger, the sampling distribution looks more and more like the normal distribution Furthermore, these three properties of the sampling distribution of the sample average hold for any distribution not just the binomial Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 18 / 29

(cont d) The theorem How large does n have to be? This result is called the central limit theorem, and it is one of the most important, remarkable, and powerful results in all of statistics In the real world, we rarely know the distribution of our data But the central limit theorem says: we don t have to Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 19 / 29

(cont d) The theorem How large does n have to be? Furthermore, as we have seen, knowing the mean and standard deviation of a distribution that is approximately normal allows us to calculate anything we wish to know with tremendous accuracy and the sampling distribution of the mean is always approximately normal The only caveats: Observations must be independently drawn from and representative of the population applies to the sampling distribution of the mean not necessarily to the sampling distribution of other statistics How large does n have to be before the distribution becomes close enough in shape to the normal distribution? Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 20 / 29

How large does n have to be? The theorem How large does n have to be? Rules of thumb are frequently recommended that n = 20 or n = 30 is large enough to be sure that the central limit theorem is working There is some truth to such rules, but in reality, whether n is large enough for the central limit theorem to provide an accurate approximation to the true sampling distribution depends on how close to normal the population distribution is If the original distribution is close to normal, n = 2 might be enough If the underlying distribution is highly skewed or strange in some other way, n = 50 might not be enough Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 21 / 29

Example #1 Introduction The theorem How large does n have to be? Population n=10 0.20 0.5 0.15 0.4 Density 0.10 Density 0.3 0.2 0.05 0.1 0.00 0.0 6 4 2 0 2 4 6 x 3 2 1 0 1 2 3 Sample means Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 22 / 29

Example #2 Introduction The theorem How large does n have to be? Now imagine an urn containing the numbers 1, 2, and 9: n=20 0.6 Density 0.4 0.2 0.0 1 2 3 4 5 6 7 Sample mean Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 23 / 29

Example #2 (cont d) Introduction The theorem How large does n have to be? 1.2 n=50 1.0 0.8 Density 0.6 0.4 0.2 0.0 2 3 4 5 6 Sample mean Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 24 / 29

Example #2 (cont d) Introduction The theorem How large does n have to be? n=100 1.5 Density 1.0 0.5 0.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 Sample mean Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 25 / 29

Example #3 Introduction The theorem How large does n have to be? Weight tends to be skewed to the right (far more people are overweight than underweight) Let s perform an experiment in which the NHANES sample of adult men is the population I am going to randomly draw twenty-person samples from this population (i.e. I am re-sampling the original sample) Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 26 / 29

Example #3 (cont d) Introduction The theorem How large does n have to be? n=20 0.04 0.03 Density 0.02 0.01 0.00 160 180 200 220 240 260 Sample mean Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 27 / 29

Why do so many things follow normal distributions? We can see now why the normal distribution comes up so often in the real world: any time a phenomenon has many contributing factors, and what we see is the average effect of all those factors, the quantity will follow a normal distribution For example, there is no one cause of height thousands of genetic and environmental factors make small contributions to a person s adult height, and as a result, height is normally distributed On the other hand, things like eye color, cystic fibrosis, broken bones, and polio have a small number of (or a single) contributing factors, and do not follow a normal distribution Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 28 / 29

Introduction Central limit theorem: The expected value of the average is always equal to the population average SE = SD/ n As n gets larger, the sampling distribution looks more and more like the normal distribution Generally speaking, the sampling distribution looks pretty normal by about n = 20, but this could happen faster or slower depending on the population and how skewed it is Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 29 / 29