Chapter 9: Sampling Distributions

Similar documents
Chapter 9. Sampling Distributions. A sampling distribution is created by, as the name suggests, sampling.

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

The Binomial Probability Distribution

The Normal Approximation to the Binomial

chapter 13: Binomial Distribution Exercises (binomial)13.6, 13.12, 13.22, 13.43

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

Business Statistics 41000: Probability 4

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

ECON 214 Elements of Statistics for Economists 2016/2017

8.1 Estimation of the Mean and Proportion

The normal distribution is a theoretical model derived mathematically and not empirically.

Statistics 6 th Edition

Stat 213: Intro to Statistics 9 Central Limit Theorem

Elementary Statistics Lecture 5

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

15.063: Communicating with Data Summer Recitation 4 Probability III

Distribution of the Sample Mean

AMS7: WEEK 4. CLASS 3

The binomial distribution p314

Sampling and sampling distribution

Binomial Random Variables. Binomial Random Variables

Stat511 Additional Materials

Review: Population, sample, and sampling distributions

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

Confidence Intervals and Sample Size

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Statistics 431 Spring 2007 P. Shaman. Preliminaries

FEEG6017 lecture: The normal distribution, estimation, confidence intervals. Markus Brede,

MA 1125 Lecture 12 - Mean and Standard Deviation for the Binomial Distribution. Objectives: Mean and standard deviation for the binomial distribution.

Statistics for Managers Using Microsoft Excel 7 th Edition

Chapter 7: Point Estimation and Sampling Distributions

Class 13. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

Part V - Chance Variability

Statistical Methods in Practice STAT/MATH 3379

Statistics 13 Elementary Statistics

The Normal Probability Distribution

Data Analysis and Statistical Methods Statistics 651

Value (x) probability Example A-2: Construct a histogram for population Ψ.

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

Making Sense of Cents

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Lecture 5: Sampling Distributions

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean.

Confidence Intervals Introduction

Lecture 8 - Sampling Distributions and the CLT

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

ECON 214 Elements of Statistics for Economists

Chapter 6. The Normal Probability Distributions

Sampling. Marc H. Mehlman University of New Haven. Marc Mehlman (University of New Haven) Sampling 1 / 20.

Chapter 5. Sampling Distributions

MAKING SENSE OF DATA Essentials series

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

AP Statistics Test 5

Chapter 7. Sampling Distributions

2011 Pearson Education, Inc

Module 4: Probability

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

AP Statistics Ch 8 The Binomial and Geometric Distributions

CHAPTER 5 SAMPLING DISTRIBUTIONS

Chapter 8: Binomial and Geometric Distributions

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 8. Variables. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc.

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 7 Sampling Distributions and Point Estimation of Parameters

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION

Section Sampling Distributions for Counts and Proportions

Midterm Exam III Review

Statistics, Their Distributions, and the Central Limit Theorem

Statistics for Business and Economics

6.5: THE NORMAL APPROXIMATION TO THE BINOMIAL AND

STAT Chapter 7: Confidence Intervals

Chapter 5: Statistical Inference (in General)

Chapter 6 Continuous Probability Distributions. Learning objectives

MATH 264 Problem Homework I

Lecture 9. Probability Distributions. Outline. Outline

Discrete Probability Distribution

Theoretical Foundations

Lecture 9. Probability Distributions

Name PID Section # (enrolled)

PROBABILITY DISTRIBUTIONS

BIOL The Normal Distribution and the Central Limit Theorem

10/1/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

ECO220Y Sampling Distributions of Sample Statistics: Sample Proportion Readings: Chapter 10, section

Tutorial 6. Sampling Distribution. ENGG2450A Tutors. 27 February The Chinese University of Hong Kong 1/6

Review of the Topics for Midterm I

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Sampling Distributions For Counts and Proportions

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

Engineering Statistics ECIV 2305

HOMEWORK: Due Mon 11/8, Chapter 9: #15, 25, 37, 44

***SECTION 8.1*** The Binomial Distributions

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

1. Variability in estimates and CLT

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

ECE 295: Lecture 03 Estimation and Confidence Interval

Transcription:

Chapter 9: Sampling Distributions 9. Introduction This chapter connects the material in Chapters 4 through 8 (numerical descriptive statistics, sampling, and probability distributions, in particular) with statistical inference, which is introduced in Chapter 0. At the completion of this chapter, you are expected to know the following:. How the sampling distribution of the mean is created and the shape and parameters of the distribution.. How to calculate probabilities using the sampling distribution of the mean. 3. Understand how the normal distribution can be used to approximate the binomial distribution. 4. How to calculate probabilities associated with a sample proportion. 5. How to calculate probabilities associated with the difference between two sample means. 9. Sampling Distribution of the Mean The most important thing to learn from this section is that if we repeatedly draw samples from any population, the values of x sampling will have three important characteristics:. x is approximately normally. calculated in each sample will vary. This new random variable created by. The mean of x will equal the mean of the original random variable. That is, µ x = µ x. 3. The variance of x will equal the variance of the original random variable divided by n. That is, σ x = σ x / n. The sampling distribution of x allows us to make probability statements about x based on knowing the values of the sample size n and the population parameters µ and σ. Example 9. A random variable possesses the following probability distribution: x p(x)..5 3.3 06

a) Find all possible samples of size that can be drawn from this population. b) Using the results in part a), find the sampling distribution of x. c) Confirm that µ x = µ x and σ x = σ x / n. a) There are nine possible samples of size. They are (,), (,), (,3), (,), (,), (,3), (3,), (3,), and (3,3). b) The samples, the values of x, and the probability of each sample outcome are shown below: Sample x Probability (,).0 (.)(.) =.04 (,).5 (.)(.5) =.0 (,3).0 (.)(.3) =.06 (,).5 (.5)(.) =.0 (,).0 (.5)(.5) =.5 (,3).5 (.5)(.3) =.5 (3,).0 (.3)(.) =.06 (3,).5 (.3)(.5) =.5 (3,3) 3.0 (.3)(.3) =.09 The sampling distribution of x follows: x p( x ).0.04.5.0.0.37.5.30 3.0.09 c) Using our definitions of expected value and variance, we find the mean and variance of the random variable x: µ x = E ( x ) = x p ( x ) = (. ) + (. 5) + 3(. 3 ) =. σ x = ( x µ) p ( x ) = (.) (. ) + (.) (. 5 ) + (3.) (. 3) = 0.49 07

The mean and variance of the random variable x are computed as follows: µ x = E ( x ) = x p( x ) =.0 (. 04 ) +.5 (. 0 ) +.0 (. 37 ) +.5(. 30 ) + 3.0 (. 09 ) =. σ x ( ) p( x ) = x µ x = (.0. ) (. 04 ) + (.5.) (. 0 ) + (.0.) (. 37 ) = 0.45 As you can see, and µ x = µ x =. + (.5. ) (.30 ) + ( 3.0.) (. 09 ) σ x = σ x / n = 0.49 / = 0.45 Example 9. Suppose a random sample of 00 observations is drawn from a normal population whose mean is 600 and whose variance is,500. Find the following probabilities: a) P(590 x 60) b) P(590 x 60) c) P(x > 650) d) P( x > 650) a) X is normally distributed with mean µ x = 600 and variance σ x =,500. We standardize x by subtracting µ x = 600 and dividing by σ x = 50. Therefore, P(590 x 60) = P 590 600 50 x µ x σ x 60 600 50 = P(. z.) =.586 b) We know that x is normally distributed with µ x = µ x = 600 and σ x = σ x / n =, 500 /00 = 5. Thus, σ x = 5. Hence, P(590 x 60) = P 590 600 5 x µ x 60 600 σ x 5 = P( z ) =.9544 08

c) P(x > 650) = P x µ x > σ x 650 600 50 = P(z > ) =.587 d) P( x > 650) = P x µ x > σ x = P(z > 0) = 0 650 600 5 Example 9.3 Refer to Example 9.. Suppose a random sample of 00 observations produced a mean of x = 650. What does this imply about the statement that µ = 600 and σ =,500? From Example 9. part d), we found that P( x > 650) = 0 Therefore, it is quite unlikely that we could observe a sample mean of 650 in a sample of 00 observations drawn from a population whose mean is 600 and whose variance is,500. Question: Answer: What purpose does the sampling distribution serve? In particular, why do we need to calculate probabilities associated with the sample mean? (in reverse order) We are not terribly interested in making probability statements about x. Since knowledge of µ and σ is required in order to compute the probability that x falls into some specific interval, we acknowledge that this procedure is quite unrealistic. However, the sampling distribution will eventually allow us to infer something about an unknown population mean from a sample mean. This process, called statistical inference, will be the main topic throughout the rest of the textbook. EXERCISES 9. If 64 observations are taken from a population with µ = 00 and σ = 40, find P(0 x ). 09

9. A normally distributed random variable has a mean of 0 and a standard deviation of 0. If a random sample of 5 is drawn from this population, find P( x > 3). 9.3 Given the probability distribution of x below, find all samples of size 3, the sampling distribution of x, the mean, and the variance of x. x p(x) 0.7.3 0

9.3 Creating the Sampling Distribution by Computer Simulation In Section 9., we described how the sampling distribution was created theoretically. We also pointed out that sampling distributions can be created empirically, but that the effort can be extremely timeconsuming. In this section, we used the computer and our two software packages to create several sampling distributions empirically. The concept of the sampling distribution is critical to the development of statistical inference. It is important that you understand that the sampling distribution of any statistic is created theoretically or empirically by repeated sampling from a population. In each sample we calculate the statistic and thus create the distribution of that statistic. Throughout the book we will introduce about 0 different sampling distributions. 9.4 Sampling Distribution of a Proportion The sampling distribution of a sample proportion is actually based on the binomial distribution. However, the primary purpose of creating the sampling distribution is for inference and the binomial distribution, which is discrete, makes inference somewhat difficult. Consequently, we use the normal approximation to the binomial distribution. The details are not particularly important to the applied statistician (that's you). What is important is that you understand how the sampling distribution is used. The sampling distribution of pˆ is approximately normal with mean p and variance np(-p). Thus z = pˆ p p( p ) / n is approximately standard normally distributed. Example 9.4 A fair coin is flipped 400 times. Find the probability that the proportion of heads falls between.48 and.5. We wish to find P(.48 pˆ.5). We employ the approximate normal sampling distribution. Because the coin is fair, p =.5.

P(.48 pˆ.5) = P. 48. 5 (. 5 )(. 5) / 400 pˆ p p( p ) / n. 5. 5 (. 5 )(. 5) / 400 = P(-.8 z.8) = (.5 -.88) =.438 Example 9.5 Repeat Exercise 9.4 changing the number of flips to 000. P(.48 pˆ.5) = P. 48. 5 (. 5 )(. 5) / 000 pˆ p p( p ) / n. 5. 5 (. 5)(. 5 ) / 000 = P(-.6 z.6) = (.5 -.396) =.076 Question: Answer: Why don't we use the /-correction factor? When the sample size is large the effect of the correction factor is negligible. Omitting it only slightly affects the approximation but simplifies our calculation. EXERCISES 9.4 The proportion of defective units coming off a production line is 5%. Find the probability that in a random sample of 00 units more than 0% are defective? 9.5 Repeat Exercise 9.4 with a sample of 400 units.

9.6 In the last election a local counselor received 5% of the vote. If her popularity level is unchanged what is the probability that in a random sample of 00 voters less than 50% would vote for her? 9.5 Sampling Distribution of the Difference between Two Means The sampling distribution of the difference between two means is developed by extending the Central Limit Theorem. That is, the sampling distribution of is approximately normally distributed (If the two variables are normal then x x is normally distributed as well.) with mean µ µ and standard deviation Thus, σ σ + n n ( x x ) ( µ µ z = σ σ + n n ) x x is either normally distributed or approximately normally distributed. We can use this sampling distribution in the same way we employed the sampling distribution of the sample mean, to make probability statements about the difference between two sample means. Example 9.6 Suppose that we draw random samples of size 5 from two normal populations. The mean and standard deviation of population are 00 and 5. The mean and standard deviation of population are 90 and 40. Find the probability that the mean of sample exceeds the mean of sample. 3

We want to determine P[( x x > 0]. The mean and standard deviation of the sampling distribution are Thus, µ µ = 00-90 = 0 σ σ 5 40 + = + =. n n 5 5 P[( x x ) > 0] = EXERCISES ( x x ) ( µ µ P σ σ + n n ) 0 0 > = (P(z > -.47) =.5+.808 =.6808.. 9.7 The assistant dean of a business school claims that the number of job offers received by MBA's whose major is finance is normally distributed with a mean of and a standard deviation of.5. Furthermore he states that job offers to marketing majors is normally distributed with a mean of 0 and a standard deviation of 3. Find the probability that in a random sample of 0 finance and 0 marketing majors the average finance major receives more job offers than the average marketing major. 9.8 Repeat Exercise 9.7 using sample sizes of 5. 4