University of California, Los Angeles Department of Statistics. The central limit theorem The distribution of the sample mean

Similar documents
University of California, Los Angeles Department of Statistics

University of California, Los Angeles Department of Statistics. Normal distribution

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

Review of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet...

Density curves. (James Madison University) February 4, / 20

Chapter 7 Sampling Distributions and Point Estimation of Parameters

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

I. Standard Error II. Standard Error III. Standard Error 2.54

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

χ 2 distributions and confidence intervals for population variance

Sampling Distribution

Math 251, Test 2 Wednesday, May 19, 2004

b) What is a good approximation of S and why can you use it? d) What is the approximate probability exactly 38 plants sprout?

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

Chapter 7 Study Guide: The Central Limit Theorem

Chapter 5. Sampling Distributions

What was in the last lecture?

7 THE CENTRAL LIMIT THEOREM

Name PID Section # (enrolled)

Chapter 7. Sampling Distributions and the Central Limit Theorem

X P(X=x) E(X)= V(X)= S.D(X)= X P(X=x) E(X)= V(X)= S.D(X)=

STAT 3090 Test 2 - Version B Fall Student s Printed Name: PLEASE READ DIRECTIONS!!!!

BIOL The Normal Distribution and the Central Limit Theorem

Chapter Seven: Confidence Intervals and Sample Size

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Statistics, Their Distributions, and the Central Limit Theorem

Section 6.5. The Central Limit Theorem

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Math 251: Practice Questions Hints and Answers. Review II. Questions from Chapters 4 6

The Binomial Probability Distribution

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Central Limit Theorem 11/08/2005

Section 7-2 Estimating a Population Proportion

Uniform Probability Distribution. Continuous Random Variables &

Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions

The Central Limit Theorem

Part 1 In which we meet the law of averages. The Law of Averages. The Expected Value & The Standard Error. Where Are We Going?

c. What is the probability that the next car tuned has at least six cylinders? More than six cylinders?

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

6.1 Discrete & Continuous Random Variables. Nov 4 6:53 PM. Objectives

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 4 Random Variables & Probability. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

MA131 Lecture 9.1. = µ = 25 and σ X P ( 90 < X < 100 ) = = /// σ X

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Confidence Intervals: Review

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

*****CENTRAL LIMIT THEOREM (CLT)*****

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Problem Set 07 Discrete Random Variables

Key Concept. 155S6.6_3 Normal as Approximation to Binomial. March 02, 2011

Math 243 Lecture Notes

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS MIDTERM EXAM - STATISTICS FALL 2014, SECTION 005

Module 4: Probability

Lecture 6: Normal distribution

Chapter 4 Continuous Random Variables and Probability Distributions

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

FINAL REVIEW W/ANSWERS

Part V - Chance Variability

SAMPLING DISTRIBUTIONS. Chapter 7

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

ECON 214 Elements of Statistics for Economists 2016/2017

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

EXERCISES FOR PRACTICE SESSION 2 OF STAT CAMP

Binomial formulas: The binomial coefficient is the number of ways of arranging k successes among n observations.

QM353: Business Statistics. Chapter 2

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

University of California, Los Angeles Department of Statistics. Final exam 07 June 2013

Section Introduction to Normal Distributions

Central Limit Theorem (cont d) 7/28/2006

Chapter Six Probability Distributions

Math 160 Professor Busken Chapter 5 Worksheets

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

PROBABILITY DISTRIBUTIONS

STAT 201 Chapter 6. Distribution

Binomial and Normal Distributions. Example: Determine whether the following experiments are binomial experiments. Explain.

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

Cover Page Homework #8

6.1 Discrete and Continuous Random Variables. 6.1A Discrete random Variables, Mean (Expected Value) of a Discrete Random Variable

Problem Set 08 Sampling Distribution of Sample Mean

( ) P = = =

Chapter 7 - Lecture 1 General concepts and criteria

Confidence Intervals Introduction

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

STA 103: Final Exam. Print clearly on this exam. Only correct solutions that can be read will be given credit.

CH 5 Normal Probability Distributions Properties of the Normal Distribution

Activity #17b: Central Limit Theorem #2. 1) Explain the Central Limit Theorem in your own words.

Review. What is the probability of throwing two 6s in a row with a fair die? a) b) c) d) 0.333

CHAPTER 5 SAMPLING DISTRIBUTIONS

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

Chapter 6: Random Variables and Probability Distributions

5.3 Interval Estimation

Answer Key: Quiz2-Chapter5: Discrete Probability Distribution

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

WebAssign Math 3680 Homework 5 Devore Fall 2013 (Homework)

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

AMS7: WEEK 4. CLASS 3

Transcription:

University of California, Los Angeles Department of Statistics Statistics 12 Instructor: Nicolas Christou First: Population mean, µ: The central limit theorem The distribution of the sample mean Sample mean, X: Very important: The sample mean, X, follows the normal distribution with mean µ, and standard deviation σ n, where µ and σ are the mean and standard deviation of the population from where the sample of size n is selected. Requirements: large n, and independent sample values. What does the above statement mean? 1

We can write: ( ) σ X N µ, n Therefore Z = X µ σ n As a result of the previous statement the following is also true (and very useful): T N ( nµ, σ n ) Therefore Z = T nµ σ n How can we use the above statement? Here is an example: A large freight elevator can transport a maximum of 9800 pounds. Suppose a load of cargo containing 49 boxes must be transported via the elevator. Experience has shown that the weight of boxes of this type of cargo follows a distribution with mean µ = 205 pounds and standard deviation σ = 15 pounds. Based on this information, what is the probability that all 49 boxes can be safely loaded onto the freight elevator and transported? Important: The population mean µ is a parameter, it is a fixed number. It is not random! The sample mean X is not fixed. It is random! 2

Here are two simulation experiments from the SOCR sample mean experiment: A. Population N(4, 1.5). Sample size n = 16. We observe that because the population from where the samples are selected is normal even a small sample (here n = 16) will produce a bell-shaped distribution for the sample mean X. B. Population exp(λ = 1). Sample size n = 100: 3

Distribution of the sample mean - Sampling from normal distribution If we sample from normal distribution N(µ, σ) then X follows exactly the normal distribution with mean µ and standard deviation σ n regardless of the sample size n. In the next figure we see the effect of the sample size on the shape of the distribution of X. The first figure is the N(5, 2) distribution. The second figure represents the distribution of X when n = 4. The third figure represents the distribution of X when n = 16. N(5, 2) f(x) 0.0 0.2 0.4 0.6 0.8 3 1 1 3 5 7 9 11 13 x N(5, 2 4 ) f(x) 0.0 0.2 0.4 0.6 0.8 1 2 3 4 5 6 7 8 9 x N(5, 2 16 ) f(x) 0.0 0.2 0.4 0.6 0.8 3 4 5 6 7 x 4

Sampling distributions - Examples Example 1 A large freight elevator can transport a maximum of 9800 pounds. Suppose a load of cargo containing 49 boxes must be transported via the elevator. Experience has shown that the weight of boxes of this type of cargo follows a distribution with mean µ = 205 pounds and standard deviation σ = 15 pounds. Based on this information, what is the probability that all 49 boxes can be safely loaded onto the freight elevator and transported? Example 2 From past experience, it is known that the number of tickets purchased by a student standing in line at the ticket window for the football match of UCLA against USC follows a distribution that has mean µ = 2.4 and standard deviation σ = 2.0. Suppose that few hours before the start of one of these matches there are 100 eager students standing in line to purchase tickets. If only 250 tickets remain, what is the probability that all 100 students will be able to purchase the tickets they desire? Example 3 Suppose that you have a sample of 100 values from a population with mean µ = 500 and with standard deviation σ = 80. a. What is the probability that the sample mean will be in the interval (490, 510)? b. Give an interval that covers the middle 95% of the distribution of the sample mean. Example 4 The amount of mineral water consumed by a person per day on the job is normally distributed with mean 19 ounces and standard deviation 5 ounces. A company supplies its employees with 2000 ounces of mineral water daily. The company has 100 employees. a. Find the probability that the mineral water supplied by the company will not satisfy the water demanded by its employees. b. Find the probability that in the next 4 days the company will not satisfy the water demanded by its employees on at least 1 of these 4 days. Assume that the amount of mineral water consumed by the employees of the company is independent from day to day. c. Find the probability that during the next year (365 days) the company will not satisfy the water demanded by its employees on more than 15 days. Example 5 Supply responses true or false with an explanation to each of the following: a. The probability that the average of 20 values will be within 0.4 standard deviations of the population mean exceeds the probability that the average of 40 values will be within 0.4 standard deviations of the population mean. b. P ( X > 4) is larger than P (X > 4) if X N(8, σ). c. If X is the average of n values sampled from a normal distribution with mean µ and if c is any positive number, then P (µ c X µ + c) decreases as n gets large. Example 6 An insurance company wants to audit health insurance claims in its very large database of transactions. In a quick attempt to assess the level of overstatement of this database, the insurance company selects at random 400 items from the database (each item represents a dollar amount). Suppose that the population mean overstatement of the entire database is $8, with population standard deviation $20. a. Find the probability that the sample mean of the 400 would be less than $6.50. b. The population from where the sample of 400 was selected does not follow the normal distribution. Why? c. Why can we use the normal distribution in obtaining an answer to part (a)? d. For what value of ω can we say that P (µ ω < X < µ + ω) is equal to 80%? e. Let T be the total overstatement for the 400 randomly selected items. Find the number b so that P (T > b) = 0.975. Example 7 A telephone company has determined that during nonholidays the number of phone calls that pass through the main branch office each hour follows the normal distribution with mean µ = 80000 and standard deviation σ = 35000. Suppose that a random sample of 60 nonholiday hours is selected and the sample mean x of the incoming phone calls is computed. a. Describe the distribution of x. b. Find the probability that the sample mean x of the incoming phone calls for these 60 hours is larger than 91970. c. Is it more likely that the sample average x will be greater than 75000 hours, or that one hour s incoming calls will be? Example 8 Assume that the daily S&P return follows the normal distribution with mean µ = 0.00032 and standard deviation σ = 0.00859. a. Find the 75 th percentile of this distribution. b. What is the probability that in 2 of the following 5 days, the daily S&P return will be larger than 0.01? c. Consider the sample average S&P of a random sample of 20 days. i. What is the distribution of the sample mean? ii. What is the probability that the sample mean will be larger than 0.005? iii. Is it more likely that the sample average S&P will be greater than 0.007, or that one day s S&P return will be? 5

Example 9 You are going to play roulette at a casino. As a reminder a roulette has 38 numbers (1-36 plus 0 and 00). Let s say that you want to bet on number 13. If you win it pays 35 : 1 which means that you get your $1 back plus $35. a. In 10000 plays, what is the probability that the casino will make more than $400? b. What would the probability be if you play 90000 games? Example 10 Suppose you select a sample of size n = 16 from a population that follows the normal distribution with µ = 5 and σ = 2. a. Describe the distribution of the sample mean X. b. Find the probability that the sample mean will exceed 6.5. c. Describe the distribution of the total T. d. What is the probability that the total will fall between 80 to 100. 6

Sampling distributions - Examples Answers EXERCISE 1 0.0099 EXERCISE 2 0.6915 EXERCISE 3 a. 0.7888 b. (484.32, 515.68) EXERCISE 4 a. 0.0228 b. 0.0881 c. 0.0059 EXERCISE 5 a. False b. True c. False EXERCISE 6 a. 0.0668 b. X > 0, σ > µ c. large n d. 1.28 e. 2416 EXERCISE 7 a. X N(80000, 35000 60 ) b. 0.0040 c. P ( X > 75000) is larger.. EXERCISE 8 a. 0.0061 b. 0.1102 c. i. x N(0.00032, 0.00859 20 ) ii. 0.0073 iii. One day s S&P will be larger EXERCISE 9 a. 0.5871 b. 0.9940 EXERCISE 10 a. X N(5, 2 16 b. 0.0013 c. T N(16(5), 2 16) d. 0.4938 7

Review problems Example 1: The length of time required for the periodic maintenance of an automobile is a random variable that follows the normal distribution with mean µ = 1.4 hours and standard deviation σ = 0.7 hour. a. Find the 95 th percentile of the above distribution. b. Suppose that the service department plans to service 50 automobiles per day. The department has 10 employees and each one works for 8 hours per day. What is the probability that the service department will have to work overtime? Example 2: An airline, believing that 5% of passengers fail to show up for flights, overbooks (sells more tickets than there are seats). Suppose a plane will hold 265 passengers, and the airline sells 275 seats. Let X be the number of passengers that show up. a. What is the distribution of X? b. Write an expression for the exact probability that the airline will not have enough seats for the passengers that show up. c. Approximate the above probability using the normal distribution. Example 3: We have discussed in class that if we select two samples of size n 1 and n 2 from two normal populations with known variances σ1 2 and σ2 2 then the difference between the two sample means ( X 1 X 2 ) follows the normal distribution as follows. X 1 X 2 N µ 1 µ 2, σ1 2 + σ2 2 n 1 The corresponding z score is: Z = X 1 X 2 (µ 1 µ 2 ) σ 2 1 n 1 + σ2 2 n 2 n 2 a. Let X and Y denote the wing lengths (in mm) of a male and female Hawaiian gallinule. Assume that the respective distributions of X and Y are N(184, 40) and N(172, 51). Note: σ X = 40, σ Y = 51. If X and Ȳ are the sample means of random samples of sizes n 1 = n 2 = 16 birds of each sex, find the probability that the sample mean of the wing length of the 16 male birds exceeds that one of the female birds by 9 mm. In other words, find P ( X Ȳ > 9). b. Find the 5 th percentile of the distribution of X Ȳ. c. What is the probability that among 5 female Hawaiian gallinules 2 will have wing length more than 200 mm? 8