University of California, Los Angeles Department of Statistics

Similar documents
University of California, Los Angeles Department of Statistics. The central limit theorem The distribution of the sample mean

University of California, Los Angeles Department of Statistics. Normal distribution

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

χ 2 distributions and confidence intervals for population variance

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

Math 251, Test 2 Wednesday, May 19, 2004

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

Review of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet...

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

Chapter 5. Sampling Distributions

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

Density curves. (James Madison University) February 4, / 20

I. Standard Error II. Standard Error III. Standard Error 2.54

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Sampling Distribution

Section 7-2 Estimating a Population Proportion

Central Limit Theorem (cont d) 7/28/2006

b) What is a good approximation of S and why can you use it? d) What is the approximate probability exactly 38 plants sprout?

Chapter 7 Study Guide: The Central Limit Theorem

STAT 3090 Test 2 - Version B Fall Student s Printed Name: PLEASE READ DIRECTIONS!!!!

What was in the last lecture?

6.1 Discrete & Continuous Random Variables. Nov 4 6:53 PM. Objectives

7 THE CENTRAL LIMIT THEOREM

Stat 213: Intro to Statistics 9 Central Limit Theorem

BIOL The Normal Distribution and the Central Limit Theorem

STAT 1220 FALL 2010 Common Final Exam December 10, 2010

Name PID Section # (enrolled)

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Chapter 7. Sampling Distributions and the Central Limit Theorem

QM353: Business Statistics. Chapter 2

Math 251: Practice Questions Hints and Answers. Review II. Questions from Chapters 4 6

X P(X=x) E(X)= V(X)= S.D(X)= X P(X=x) E(X)= V(X)= S.D(X)=

Chapter Seven: Confidence Intervals and Sample Size

5.3 Interval Estimation

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Statistics, Their Distributions, and the Central Limit Theorem

Section 6.5. The Central Limit Theorem

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS MIDTERM EXAM - STATISTICS FALL 2014, SECTION 005

SAMPLING DISTRIBUTIONS. Chapter 7

Elementary Statistics Lecture 5

Chapter 6: Random Variables and Probability Distributions

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

The Central Limit Theorem

Uniform Probability Distribution. Continuous Random Variables &

The Binomial Probability Distribution

Introduction to Statistics I

Central Limit Theorem 11/08/2005

Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions

Part 1 In which we meet the law of averages. The Law of Averages. The Expected Value & The Standard Error. Where Are We Going?

AMS7: WEEK 4. CLASS 3

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

Math 160 Professor Busken Chapter 5 Worksheets

Answer Key: Quiz2-Chapter5: Discrete Probability Distribution

Activity #17b: Central Limit Theorem #2. 1) Explain the Central Limit Theorem in your own words.

6.1 Discrete and Continuous Random Variables. 6.1A Discrete random Variables, Mean (Expected Value) of a Discrete Random Variable

Chapter 7. Sampling Distributions and the Central Limit Theorem

c. What is the probability that the next car tuned has at least six cylinders? More than six cylinders?

Problem Set 08 Sampling Distribution of Sample Mean

Chapter 9 & 10. Multiple Choice.

MA131 Lecture 9.1. = µ = 25 and σ X P ( 90 < X < 100 ) = = /// σ X

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Chapter 4 Random Variables & Probability. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

*****CENTRAL LIMIT THEOREM (CLT)*****

Confidence Intervals: Review

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

STAT 201 Chapter 6. Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Problem Set 07 Discrete Random Variables

Key Concept. 155S6.6_3 Normal as Approximation to Binomial. March 02, 2011

Test 2 Version A STAT 3090 Fall 2016

Lecture 6: Normal distribution

Module 4: Probability

Math 243 Lecture Notes

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Chapter 4 Continuous Random Variables and Probability Distributions

MATH 10 INTRODUCTORY STATISTICS

STA 103: Final Exam. Print clearly on this exam. Only correct solutions that can be read will be given credit.

Part V - Chance Variability

WebAssign Math 3680 Homework 5 Devore Fall 2013 (Homework)

FINAL REVIEW W/ANSWERS

Exercise Questions. Q7. The random variable X is known to be uniformly distributed between 10 and

ECON 214 Elements of Statistics for Economists 2016/2017

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

15.063: Communicating with Data Summer Recitation 4 Probability III

Discrete Probability Distributions

Binomial formulas: The binomial coefficient is the number of ways of arranging k successes among n observations.

EXERCISES FOR PRACTICE SESSION 2 OF STAT CAMP

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Chapter Six Probability Distributions

Section Introduction to Normal Distributions

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

MATH 3200 Exam 3 Dr. Syring

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

PROBABILITY DISTRIBUTIONS

University of California, Los Angeles Department of Statistics. Final exam 07 June 2013

Transcription:

University of California, Los Angeles Department of Statistics Statistics 13 Instructor: Nicolas Christou The central limit theorem The distribution of the sample proportion The distribution of the sample mean The distribution of the sample proportion: First: Population proportion, p: Sample proportion, ˆp. Very important: The sample proportion, ˆp, follows the normal distribution with mean p, and standard deviation p(1 p), where p is the population proportion, and n is the n sample size. Requirements: large n, and independent sample values. What does the above statement mean? 1

We can write: Therefore p(1 p) ˆp N p, n Z = ˆp p p(1 p) n ) How can we use the above statement? Here is an example: Suppose the population proportion of Democtrats among UCLA students is 55%. Find the probability that the sample proportion of 200 students randomly selected will exceed 60%. Important: The population proportion p is a parameter, it is a fixed number between 0 and 1. It is not random! The sample proportion ˆp is not fixed. It is random! 2

The distribution of the sample mean: First: Population mean, µ: Sample mean, X: Very important: The sample mean, X, follows the normal distribution with mean µ, and standard deviation σ n, where µ and σ are the mean and standard deviation of the population from where the sample of size n is selected. Requirements: large n, and independent sample values. What does the above statement mean? 3

We can write: ( ) σ X N µ, n Therefore Z = X µ σ n As a result of the previous statement the following is also true (and very useful): T N ( nµ, σ n ) Therefore Z = T nµ σ n How can we use the above statement? Here is an example: A large freight elevator can transport a maximum of 9800 pounds. Suppose a load of cargo containing 49 boxes must be transported via the elevator. Experience has shown that the weight of boxes of this type of cargo follows a distribution with mean µ = 205 pounds and standard deviation σ = 15 pounds. Based on this information, what is the probability that all 49 boxes can be safely loaded onto the freight elevator and transported? Important: The population mean µ is a parameter, it is a fixed number. It is not random! The sample mean X is not fixed. It is random! 4

Here are two simulation experiments from the SOCR sample mean experiment: A. Population N(4, 1.5). Sample size n = 16. We observe that because the population from where the samples are selected is normal even a small sample (here n = 16) will produce a bell-shaped distribution for the sample mean X. B. Population exp(λ = 1). Sample size n = 100: 5

Distribution of the sample mean - Sampling from normal distribution If we sample from normal distribution N(µ, σ) then X follows exactly the normal distribution with mean µ and standard deviation σ n regardless of the sample size n. In the next figure we see the effect of the sample size on the shape of the distribution of X. The first figure is the N(5, 2) distribution. The second figure represents the distribution of X when n = 4. The third figure represents the distribution of X when n = 16. N(5, 2) f(x) 0.0 0.2 0.4 0.6 0.8 3 1 1 3 5 7 9 11 13 x N(5, 2 4 ) f(x) 0.0 0.2 0.4 0.6 0.8 1 2 3 4 5 6 7 8 9 x N(5, 2 16 ) f(x) 0.0 0.2 0.4 0.6 0.8 3 4 5 6 7 x 6

Sampling distributions - Examples Example 1 A large freight elevator can transport a maximum of 9800 pounds. Suppose a load of cargo containing 49 boxes must be transported via the elevator. Experience has shown that the weight of boxes of this type of cargo follows a distribution with mean µ = 205 pounds and standard deviation σ = 15 pounds. Based on this information, what is the probability that all 49 boxes can be safely loaded onto the freight elevator and transported? Example 2 From past experience, it is known that the number of tickets purchased by a student standing in line at the ticket window for the football match of UCLA against USC follows a distribution that has mean µ = 2.4 and standard deviation σ = 2.0. Suppose that few hours before the start of one of these matches there are 100 eager students standing in line to purchase tickets. If only 250 tickets remain, what is the probability that all 100 students will be able to purchase the tickets they desire? Example 3 Suppose that you have a sample of 100 values from a population with mean µ = 500 and with standard deviation σ = 80. a. What is the probability that the sample mean will be in the interval (490, 510)? b. Give an interval that covers the middle 95% of the distribution of the sample mean. Example 4 The amount of mineral water consumed by a person per day on the job is normally distributed with mean 19 ounces and standard deviation 5 ounces. A company supplies its employees with 2000 ounces of mineral water daily. The company has 100 employees. a. Find the probability that the mineral water supplied by the company will not satisfy the water demanded by its employees. b. Find the probability that in the next 4 days the company will not satisfy the water demanded by its employees on at least 1 of these 4 days. Assume that the amount of mineral water consumed by the employees of the company is independent from day to day. c. Find the probability that during the next year (365 days) the company will not satisfy the water demanded by its employees on more than 15 days. Example 5 Supply responses true or false with an explanation to each of the following: a. The probability that the average of 20 values will be within 0.4 standard deviations of the population mean exceeds the probability that the average of 40 values will be within 0.4 standard deviations of the population mean. b. P ( X > 4) is larger than P (X > 4) if X N(8, σ). c. If X is the average of n values sampled from a normal distribution with mean µ and if c is any positive number, then P (µ c X µ + c) decreases as n gets large. Example 6 An insurance company wants to audit health insurance claims in its very large database of transactions. In a quick attempt to assess the level of overstatement of this database, the insurance company selects at random 400 items from the database (each item represents a dollar amount). Suppose that the population mean overstatement of the entire database is $8, with population standard deviation $20. a. Find the probability that the sample mean of the 400 would be less than $6.50. b. The population from where the sample of 400 was selected does not follow the normal distribution. Why? c. Why can we use the normal distribution in obtaining an answer to part (a)? d. For what value of ω can we say that P (µ ω < X < µ + ω) is equal to 80%? e. Let T be the total overstatement for the 400 randomly selected items. Find the number b so that P (T > b) = 0.975. Example 7 A telephone company has determined that during nonholidays the number of phone calls that pass through the main branch office each hour follows the normal distribution with mean µ = 80000 and standard deviation σ = 35000. Suppose that a random sample of 60 nonholiday hours is selected and the sample mean x of the incoming phone calls is computed. a. Describe the distribution of x. b. Find the probability that the sample mean x of the incoming phone calls for these 60 hours is larger than 91970. c. Is it more likely that the sample average x will be greater than 75000 hours, or that one hour s incoming calls will be? Example 8 Assume that the daily S&P return follows the normal distribution with mean µ = 0.00032 and standard deviation σ = 0.00859. a. Find the 75 th percentile of this distribution. b. What is the probability that in 2 of the following 5 days, the daily S&P return will be larger than 0.01? c. Consider the sample average S&P of a random sample of 20 days. i. What is the distribution of the sample mean? ii. What is the probability that the sample mean will be larger than 0.005? iii. Is it more likely that the sample average S&P will be greater than 0.007, or that one day s S&P return will be? 7

Example 9 Assume that 30% of students at a university wear contact lenses. a. We randomly pick 100 students. Describe the distribution of ˆp (the sample proportion of the 100 students who wear contact lenses). Do the requirements hold? b. What is the probability that more than 1 of this sample wear contact lenses? 3 Example 10 A restaurant owner anticipates serving about 180 people on Friday evening, and believing that about 20% of the customers will order the special of the day. How many of those special meals should he plan on serving in order to be very sure of having enough specials on hand to meet customer demand? First, what very sure means to you? Example 11 You are going to play roulette at a casino. As a reminder a roulette has 38 numbers (1-36 plus 0 and 00). Let s say that you want to bet on number 13. If you win it pays 35 : 1 which means that you get your $1 back plus $35. a. In 10000 plays, what is the probability that the casino will make more than $400? b. What would the probability be if you play 90000 games? Example 12 Suppose you select a sample of size n = 16 from a population that follows the normal distribution with µ = 5 and σ = 2. a. Describe the distribution of the sample mean X. b. Find the probability that the sample mean will exceed 6.5. c. Describe the distribution of the total T. d. What is the probability that the total will fall between 80 to 100. Example 13 Write a summary of the sampling distribution of ˆp, the requirements? X, and T. What distribution do they follow? Draw these distributions. What are 8

Sampling distributions - Examples Answers EXERCISE 1 0.0099 EXERCISE 2 0.6915 EXERCISE 3 a. 0.7888 b. (484.32, 515.68) EXERCISE 4 a. 0.0228 b. 0.0881 c. 0.0059 EXERCISE 5 a. False b. True c. False EXERCISE 6 a. 0.0668 b. X > 0, σ > µ c. large n d. 1.28 e. 2416 EXERCISE 7 a. X N(80000, 35000 60 ) b. 0.0040 c. P ( X > 75000) is larger.. EXERCISE 8 a. 0.0061 b. 0.1102 c. i. x N(0.00032, 0.00859 20 ) ii. 0.0073 iii. EXERCISE 9 One day s S&P will be larger 0.30(1 0.30) a. ˆp N(0.30, ) 100 b. 0.2327 EXERCISE 10 45 EXERCISE 11 a. 0.5871 b. 0.9940 EXERCISE 12 a. X N(5, 2 16 b. 0.0013 c. T N(16(5), 2 16) d. 0.4938 EXERCISE 13 Please see your classnotes! 9

Review problems Example 1: The length of time required for the periodic maintenance of an automobile is a random variable that follows the normal distribution with mean µ = 1.4 hours and standard deviation σ = 0.7 hour. a. Find the 95 th percentile of the above distribution. b. Suppose that the service department plans to service 50 automobiles per day. The department has 10 employees and each one works for 8 hours per day. What is the probability that the service department will have to work overtime? Example 2: An airline, believing that 5% of passengers fail to show up for flights, overbooks (sells more tickets than there are seats). Suppose a plane will hold 265 passengers, and the airline sells 275 seats. Let X be the number of passengers that show up. a. What is the distribution of X? b. Write an expression for the exact probability that the airline will not have enough seats for the passengers that show up. c. Approximate the above probability using the normal distribution. Example 3: We have discussed in class that if we select two samples of size n 1 and n 2 from two normal populations with known variances σ1 2 and σ2 2 then the difference between the two sample means ( X 1 X 2 ) follows the normal distribution as follows. X 1 X 2 N µ 1 µ 2, σ1 2 + σ2 2 n 1 The corresponding z score is: Z = X 1 X 2 (µ 1 µ 2 ) σ 2 1 n 1 + σ2 2 n 2 n 2 a. Let X and Y denote the wing lengths (in mm) of a male and female Hawaiian gallinule. Assume that the respective distributions of X and Y are N(184, 40) and N(172, 51). Note: σ X = 40, σ Y = 51. If X and Ȳ are the sample means of random samples of sizes n 1 = n 2 = 16 birds of each sex, find the probability that the sample mean of the wing length of the 16 male birds exceeds that one of the female birds by 9 mm. In other words, find P ( X Ȳ > 9). b. Find the 5 th percentile of the distribution of X Ȳ. c. What is the probability that among 5 female Hawaiian gallinules 2 will have wing length more than 200 mm? 10