Stat 213: Intro to Statistics 9 Central Limit Theorem

1 Stat 213: Intro to Statistics 9 Central Limit Theorem H. Kim Fall 2007

2 unknown parameters
Example: A pollster is sure that the responses to his agree/disagree questions will follow a binomial distribution, but $p$, the proportion of those who agree in the population, is unknown.
In practice, the parameters of the distribution are unknown.
We must rely on the sample to learn about the parameter.
We want the sample to provide reliable information about the population.

3 statistic
A statistic is a numerical descriptive measure calculated from a sample, for example $\hat{p}$ and $\bar{X}$.
A statistic is a random variable: its value varies from sample to sample, so a statistic has a probability distribution.
Does my sample represent the population?
The sampling distribution of a statistic is the probability distribution of all possible values of the statistic that result when random samples of size $n$ are repeatedly drawn from the population.
The expected value (mean) of the sampling distribution is the true parameter, i.e. $E(\bar{X}) = \mu$ or $E(\hat{p}) = p$.

4 simulation 1
If we draw 100 repeated random samples of the same size 30 from a uniform population with mean $\mu = 0.5$ and standard deviation $\sigma = 1/\sqrt{12}$,
[Figure: histograms of four of the samples (sample9, sample24, sample48, sample84); horizontal axis from 0.0 to 1.0; vertical axis: Frequency]

5 simulation 1
measure the means ($\bar{X}$) for each sample, and draw a histogram:
[Figure: histogram of mean; horizontal axis: mean, from about 0.35 to 0.65; vertical axis: Frequency]

6 simulation 2
If we draw 100 repeated random samples of the same size 30 from a normal population with mean $\mu = 1$ and standard deviation $\sigma = 0.1$, and measure the means ($\bar{X}$) for each sample, and draw a histogram:
[Figure: histogram of mean; Mean 1.001, StDev 0.01645, N 100; horizontal axis: mean, from about 0.97 to 1.04; vertical axis: Frequency]

7 simulation 3
If we draw 100 repeated random samples of the same size 30 from a Bernoulli population with $p = 0.4$, and measure the means ($\bar{X}$) for each sample, and draw a histogram:
[Figure: histogram of mean; Mean 0.3893, StDev 0.08696, N 100; horizontal axis: mean, from about 0.20 to 0.55; vertical axis: Frequency]
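The three simulations can be repeated with a short script. This is a minimal sketch using only the Python standard library, not the software used for the slides; the helper name `simulate_sample_means`, the seed, and the choice of a uniform population on (0, 1) are assumptions made for illustration. It compares the standard deviation of the 100 sample means with $\sigma/\sqrt{n}$ for each population.

```python
import random
import statistics
from math import sqrt


def simulate_sample_means(draw, n=30, reps=100, seed=0):
    """Draw `reps` samples of size `n` with `draw(rng)`; return the sample means."""
    rng = random.Random(seed)  # fixed seed (an arbitrary choice) for repeatability
    return [statistics.fmean(draw(rng) for _ in range(n)) for _ in range(reps)]


# (label, sampler for one observation, population mean, population sd)
populations = [
    ("uniform(0, 1)", lambda rng: rng.random(), 0.5, 1 / sqrt(12)),
    ("normal(1, 0.1)", lambda rng: rng.gauss(1.0, 0.1), 1.0, 0.1),
    ("Bernoulli(0.4)", lambda rng: 1.0 if rng.random() < 0.4 else 0.0,
     0.4, sqrt(0.4 * 0.6)),
]

results = {}
for label, draw, mu, sigma in populations:
    means = simulate_sample_means(draw)
    mean_of_means = statistics.fmean(means)
    sd_of_means = statistics.stdev(means)
    theoretical_sd = sigma / sqrt(30)  # sigma / sqrt(n) with n = 30
    results[label] = (mu, mean_of_means, sd_of_means, theoretical_sd)
    print(f"{label}: mean of means = {mean_of_means:.4f} (mu = {mu}), "
          f"sd of means = {sd_of_means:.4f} (sigma/sqrt(n) = {theoretical_sd:.4f})")
```

The printed values change with the seed, but the mean of the sample means should sit close to $\mu$ and their standard deviation close to $\sigma/\sqrt{n}$, as in the StDev values reported on the histograms above.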

8 mean and variance for sample mean, $\bar{X}$
Random variables $X_1, X_2, \dots, X_n$ are independent with mean $E(X_i) = \mu$ and variance $V(X_i) = \sigma^2$, $i = 1, 2, \dots, n$:
$$\bar{X} = \frac{1}{n} \sum_{i=1}^{n} X_i$$
$E(\bar{X})$ and $V(\bar{X})$?
Sampling distribution of the random variable $\bar{X}$?
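The two quantities asked for on this slide follow from linearity of expectation and, for the variance, from independence. A short worked derivation, using only the definitions given above:

```latex
\begin{align*}
E(\bar{X}) &= E\!\left(\frac{1}{n}\sum_{i=1}^{n} X_i\right)
            = \frac{1}{n}\sum_{i=1}^{n} E(X_i)
            = \frac{1}{n}\, n\mu = \mu, \\
V(\bar{X}) &= V\!\left(\frac{1}{n}\sum_{i=1}^{n} X_i\right)
            = \frac{1}{n^2}\sum_{i=1}^{n} V(X_i)
            = \frac{1}{n^2}\, n\sigma^2 = \frac{\sigma^2}{n}.
\end{align*}
```

The second line uses independence of the $X_i$, so that the variance of the sum is the sum of the variances. The standard deviation of the sample mean is therefore $\sigma/\sqrt{n}$.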

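9 mean and variance for sample proportion, $\hat{p}$
If $X_1, \dots, X_n$ are independent Bernoulli random variables with mean $E(X_i) = p$ and variance $V(X_i) = p(1 - p)$, $i = 1, 2, \dots, n$:
$$X_i = \begin{cases} 1 & \text{if success} \\ 0 & \text{if failure} \end{cases} \qquad Y = \sum_{i=1}^{n} X_i \sim \text{Binomial}(n, p)$$
the sample mean, $\bar{X} = \frac{1}{n} \sum_{i=1}^{n} X_i = \frac{Y}{n} = \hat{p}$: proportion
$E(\hat{p})$ and $V(\hat{p})$?
Sampling distribution of the random variable $\hat{p}$?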
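Since the sample proportion is the binomial count divided by $n$, its mean and variance follow from the standard binomial results $E(Y) = np$ and $V(Y) = np(1 - p)$. A short worked derivation:

```latex
\begin{align*}
E(\hat{p}) &= E\!\left(\frac{Y}{n}\right) = \frac{E(Y)}{n} = \frac{np}{n} = p, \\
V(\hat{p}) &= V\!\left(\frac{Y}{n}\right) = \frac{V(Y)}{n^2}
            = \frac{np(1 - p)}{n^2} = \frac{p(1 - p)}{n}.
\end{align*}
```

For instance, with $p = 0.4$ and $n = 30$ the standard deviation of the sample proportion is $\sqrt{0.4 \cdot 0.6 / 30} \approx 0.089$.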

10 sampling distributions of $\bar{X}$ and $\hat{p}$ = Normal?
The collection of mean values will pile up around the underlying mean ($\mu$) in such a way that a histogram of the sample means ($\bar{X}$) can be modeled well by a Normal model:
sampling distribution of the mean
$$\bar{X} \sim N\!\left(\mu, \frac{\sigma^2}{n}\right)$$
$$\hat{p} \sim N\!\left(p, \frac{p(1 - p)}{n}\right), \quad np > 5, \; n(1 - p) > 5$$

11 Central Limit Theorem

12 Central Limit Theorem
When a random sample is drawn from any population with mean $\mu$ and standard deviation $\sigma$, its sample mean, $\bar{X}$, has a sampling distribution with mean $\mu$ and standard deviation $\sigma/\sqrt{n}$, and the shape of the sampling distribution is approximately Normal as long as the sample size is large enough (at least 30).
Sampling distribution models tame the variation in statistics ($\bar{X}$) enough to allow us to measure how close our computed statistic values are likely to be to the unknown underlying parameters ($\mu$).
standard error (se): estimated standard deviation of the sampling distribution, $\hat{\sigma}/\sqrt{n}$
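The standard error can be computed directly from a single sample. The sketch below assumes that $\hat{\sigma}$ is the usual sample standard deviation (with the $n - 1$ denominator); the function name `standard_error` and the data values are hypothetical and only illustrate the calculation.

```python
import statistics
from math import sqrt


def standard_error(sample):
    """Estimated sd of the sample mean: sample sd divided by sqrt(n)."""
    return statistics.stdev(sample) / sqrt(len(sample))


# Hypothetical measurements, made up for illustration only.
sample = [4.1, 3.8, 4.4, 4.0, 4.3, 3.9, 4.2, 4.1, 4.5]
print(f"mean = {statistics.fmean(sample):.3f}, se = {standard_error(sample):.3f}")
```

Note that the standard error shrinks like $1/\sqrt{n}$: quadrupling the sample size halves it, all else being equal.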

13 the real world and the model world
We never actually get to see the sampling distribution; we imagine repeated samples to develop the theory and our own intuition about sampling distribution models.
Sampling distributions act as a bridge from the real world to the imaginary model of the statistic and enable us to say something about the population when all we have is data from the real world.

14 example 1
The length of stay of patients in a chronic health facility is normally distributed with a mean of 40 days and a standard deviation of 12 days. Suppose that a sample of n = 16 patients is randomly selected. Of interest is the mean length of stay of the sample of n = 16 patients.
a. Specify the distribution for the mean length of stay of the sample of 16 patients.
b. What is the probability that the mean length of stay for the 16 patients is less than 34 days?

15 example 1
c. What is the probability that the mean length of stay for the 16 patients is between 34 and 46 days?
d. What is the probability that the length of stay of one of the 16 patients is less than 34 days?
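One way to work example 1 numerically is sketched below with `statistics.NormalDist` from the Python standard library. Because the population is normal, the sample mean is exactly normal with mean 40 and standard deviation $12/\sqrt{16} = 3$ (part a). Part d is read here as the probability for a single randomly chosen patient, which is an interpretation of the question; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 40, 12, 16

# a. Sampling distribution of the mean: Normal(40, 12 / sqrt(16)) = Normal(40, 3)
xbar_dist = NormalDist(mu, sigma / sqrt(n))
# Distribution of one patient's length of stay: Normal(40, 12)
single_dist = NormalDist(mu, sigma)

p_b = xbar_dist.cdf(34)                      # b. P(mean < 34)
p_c = xbar_dist.cdf(46) - xbar_dist.cdf(34)  # c. P(34 < mean < 46)
p_d = single_dist.cdf(34)                    # d. P(one patient's stay < 34)

print(f"b: {p_b:.4f}  c: {p_c:.4f}  d: {p_d:.4f}")
```

The contrast between b and d is the point of the exercise: 34 days is two standard deviations below the mean for an average of 16 patients, but only half a standard deviation below the mean for a single patient.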

16 example 2
The population of healthy females in Canada has a mean potassium concentration of 4.36 mEq/l and a standard deviation of 0.12 mEq/l. Suppose that a sample of 50 females is selected.
a. Specify the distribution for the mean potassium concentration of the sample of 50 females. What is the standard error of this sample mean?
b. What is the probability that the mean potassium concentration for 50 females is below 4.4 mEq/l?
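A sketch of how example 2 can be worked, assuming the sample is random so that the Central Limit Theorem applies with $n = 50$. The probability is rounded from a standard normal table.

```latex
\begin{align*}
\text{a.}\quad & \bar{X} \approx N\!\left(4.36, \frac{0.12^2}{50}\right),
  \qquad \text{se} = \frac{0.12}{\sqrt{50}} \approx 0.0170 \text{ mEq/l} \\
\text{b.}\quad & P(\bar{X} < 4.4)
  = P\!\left(Z < \frac{4.4 - 4.36}{0.0170}\right)
  \approx P(Z < 2.36) \approx 0.991
\end{align*}
```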

17 example 3
The duration of Alzheimer's disease from the onset of symptoms until death ranges from 3 to 20 years: the average is 8 years with a standard deviation of 4 years. The administrator of a large medical center randomly selects the medical records of 36 deceased Alzheimer's patients from the medical center's database and records the average duration. Find the approximate probability for these events:
a. the average duration is less than 7 years

18 example 3
b. the average duration lies within 1 year of the population mean, $\mu = 8$.
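A sketch of how example 3 can be worked with the Central Limit Theorem. The population is not stated to be normal, but $n = 36$ meets the sample-size guideline, so the sample mean is treated as approximately normal. Probabilities are rounded from a standard normal table.

```latex
\begin{align*}
\text{se} &= \frac{\sigma}{\sqrt{n}} = \frac{4}{\sqrt{36}} = \frac{2}{3} \approx 0.667 \\
\text{a.}\quad P(\bar{X} < 7) &= P\!\left(Z < \frac{7 - 8}{2/3}\right)
  = P(Z < -1.5) \approx 0.0668 \\
\text{b.}\quad P(7 < \bar{X} < 9) &= P(-1.5 < Z < 1.5)
  \approx 0.9332 - 0.0668 = 0.8664
\end{align*}
```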

19 example 4
Statistics Canada reported that 33.1% of all 1997 family incomes in New Brunswick were below 30,000. Suppose a random sample of 80 family incomes from 1997 in New Brunswick is selected. What is the probability that the percentage of incomes in the sample that are under 30,000 is over 30 percent?
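One way to work example 4 is with the normal model for the sample proportion, after checking the conditions $np > 5$ and $n(1 - p) > 5$. The sketch below uses `statistics.NormalDist` from the Python standard library; the variable names are illustrative, and no continuity correction is applied.

```python
from math import sqrt
from statistics import NormalDist

p, n = 0.331, 80

# Conditions for the normal approximation to the sample proportion.
conditions_ok = n * p > 5 and n * (1 - p) > 5

# Standard deviation of the sample proportion: sqrt(p(1 - p) / n)
sd_phat = sqrt(p * (1 - p) / n)

# P(sample proportion > 0.30) under the Normal(p, sd_phat) model
prob_over_30 = 1 - NormalDist(p, sd_phat).cdf(0.30)

print(f"conditions ok: {conditions_ok}, sd = {sd_phat:.4f}, "
      f"P(p_hat > 0.30) = {prob_over_30:.3f}")
```

Here $np \approx 26.5$ and $n(1 - p) \approx 53.5$, so both conditions hold, and the probability comes out at roughly 0.72.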