CHAPTER 5 Sampling Distributions

Similar documents
Central Limit Theorem

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Midterm Exam III Review

ECON 214 Elements of Statistics for Economists 2016/2017

Math 227 Elementary Statistics. Bluman 5 th edition

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

Expected Value of a Random Variable

The Normal Distribution

CHAPTER 5 SAMPLING DISTRIBUTIONS

Binomial and Normal Distributions

Probability Distribution Unit Review

Making Sense of Cents

Examples of continuous probability distributions: The normal and standard normal

Section 5 3 The Mean and Standard Deviation of a Binomial Distribution!

The normal distribution is a theoretical model derived mathematically and not empirically.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 8 Estimation

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

The Normal Probability Distribution

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

Sampling Distributions For Counts and Proportions

Chapter 7 Study Guide: The Central Limit Theorem

Part V - Chance Variability

Sampling Distributions and the Central Limit Theorem

The Binomial Probability Distribution

Business Statistics 41000: Probability 4

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

CH 5 Normal Probability Distributions Properties of the Normal Distribution

Chapter 6. The Normal Probability Distributions

ECON 214 Elements of Statistics for Economists

2011 Pearson Education, Inc

11.5: Normal Distributions

Contents. The Binomial Distribution. The Binomial Distribution The Normal Approximation to the Binomial Left hander example

5.3 Statistics and Their Distributions

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

Section Introduction to Normal Distributions

Chapter 5. Sampling Distributions

1 Sampling Distributions

Module 4: Probability

Sampling & Confidence Intervals

Statistics and Probability

Normal Probability Distributions

(# of die rolls that satisfy the criteria) (# of possible die rolls)

Section The Sampling Distribution of a Sample Mean

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean.

4: Probability. Notes: Range of possible probabilities: Probabilities can be no less than 0% and no more than 100% (of course).

3.3-Measures of Variation

The probability of having a very tall person in our sample. We look to see how this random variable is distributed.

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

Lecture 6: Chapter 6

Introduction to Statistical Data Analysis II

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 7 1. Random Variables

5.1 Personal Probability

Midterm Test 1 (Sample) Student Name (PRINT):... Student Signature:... Use pencil, so that you can erase and rewrite if necessary.

Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES

Chapter Seven. The Normal Distribution

Theoretical Foundations

The Binomial Distribution

MA131 Lecture 8.2. The normal distribution curve can be considered as a probability distribution curve for normally distributed variables.

Unit 2 Statistics of One Variable

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x)

The Central Limit Theorem

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

Sampling Distributions Chapter 18

Fundamentals of Statistics

SAMPLING DISTRIBUTIONS. Chapter 7

the number of correct answers on question i. (Note that the only possible values of X i

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

A Derivation of the Normal Distribution. Robert S. Wilson PhD.

Sampling Distributions

4.2 Bernoulli Trials and Binomial Distributions

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

Statistical Methods in Practice STAT/MATH 3379

Chapter 3: Probability Distributions and Statistics

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Unit 2: Statistics Probability

Consider the following examples: ex: let X = tossing a coin three times and counting the number of heads

The Binomial Distribution

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

Chapter 5 Basic Probability

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions

Math 14 Lecture Notes Ch The Normal Approximation to the Binomial Distribution. P (X ) = nc X p X q n X =

MAKING SENSE OF DATA Essentials series

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION

Chapter 6: Random Variables

Chapter 8: Binomial and Geometric Distributions

Math 140 Introductory Statistics

Chapter 7 Sampling Distributions and Point Estimation of Parameters

STAB22 section 1.3 and Chapter 1 exercises

CHAPTER 6 Random Variables

5: Several Useful Discrete Distributions

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

DO NOT POST THESE ANSWERS ONLINE BFW Publishers 2014

= 0.35 (or ˆp = We have 20 independent trials, each with probability of success (heads) equal to 0.5, so X has a B(20, 0.5) distribution.

Transcription:

CHAPTER 5 Sampling Distributions 5.1 The possible values of p^ are 0, 1/3, 2/3, and 1. These correspond to getting 0 persons with lung cancer, 1 with lung cancer, 2 with lung cancer, and all 3 with lung cancer. 5.2 (a) Pr{p^ 0} Pr{no mutants} Pr{all are non-mutants} (1 -.39) 3.227. (b) Pr{p^ 1/3} Pr{1 mutant} 3 C 1 p 1 (1 - p) 2, where p.39. This is (3)(.39 1 )(.61 2 ).435. 5.3 (a) (i).08; (ii).27; (iii).35; (iv).22; (v).07; (vi).01 (b).40 Probability.30.20.10 0.2.4.6.8 1.0 p^ 5.4 We are concerned with the sampling distribution of p^, which is governed by a binomial distribution. Letting "success" "responder," we have p.2 and 1 - p.8. The number of trials is n 15. (a) The event p^.2 occurs if there are 3 successes in the 15 trials (because 3/15.2). Thus, to find the probability that p^.2, we can use the binomial formula n C j p j (1 - p) n - j with j 3, so n - j 12: Pr{p^.2} 15 C 3 p 3 (1 - p) 12 (455)(.2 3 )(.8 12 ).2501. (b) The event p^ 0 occurs if there are 0 successes in the 15 trials (because 0/15 0). Thus, to find the probability that p^ 0, we can use the binomial formula with j 0, so n - j 15: Pr{p^ 0} 15 C 0 p 0 (1 - p) 15 (1)(1)(.8 15 ).0352. 5.5 (a) Letting "success" "infected," we have p.25 and 1 - p.75. The number of trials is n 4. We then use the binomial formula n C j p j (1 - p) n - j with n 4 and p.25. The values of p^ correspond to numbers of successes and failures as follows: p^ Number of successes (j) Number of failures (n - j) 0 0/4 0 4.25 1/4 1 3.50 2/4 2 2.75 3/4 3 1 1 4/4 4 0

65 Thus, we find (i) Pr{p^ 0} 4C 0 p 0 (1 - p) 4 (1)(1)(.75 4 ).3164 (ii) Pr{p^.25} 4C 1 p 1 (1 - p) 3 (4)(.25)(.75 3 ).4219 (iii) Pr{p^.50} 4C 2 p 2 (1 - p) 2 (6)(.25 2 )(.75 2 ).2109 (iv) Pr{p^.75} 4C 3 p 3 (1 - p) 1 (4)(.25 3 )(.75).0469 (v) Pr{p^ 1} 4C 4 p 4 (1 - p) 0 (1)(.25 4 )(1).0039 (b) The distribution is displayed in the following histogram:.50.40 Probability.30.20.10 0.25.5.75 1.0 p^ 5.6 (a) p^ Probability.000.75 8.1001.125 (8)(.25)(.75 7 ).2670.250 (28)(.25 2 )(.75 6 ).3115.375 (56)(.25 3 )(.75 5 ).2076.500 (70)(.25 4 )(.75 4 ).0865.625 (56)(.25 5 )(.75 3 ).0231.750 (28)(.25 6 )(.75 2 ).0039.875 (8)(.25 7 )(.75).0004 1.000.25 8.0000

66 (b) Probability.50.40.30.20.10.40 0.25.5.75 1.0 p^.30 Probability.20.10 0.25.5.75 1.0 p^ The distribution for n 8 is narrower than the distribution for n 4. 5.7 (a) (252)(.6 5 )(.4 5 ).2007 (b) (210)(.6 6 )(.4 4 ).2508 (c) (120)(.6 7 )(.4 3 ).2150 (d).2007 +.2508 +.2150.6665 (e).6665 (from part (d)) 5.8 (a) p^ Probability.0.7 5.1681.2 (5)(.3)(.7 4 ).3602.4 (10)(.3 2 )(.7 3 ).3087.6 (10)(.3 3 )(.7 2 ).1323.8 (5)(.3 4 )(.7).0284 1.0.3 5.0024

(b).40 67 Probability.30.20.10 0.2.4.6.8 1.0 p^ Compared with Figure 5.4, this distribution is more spread out (more dispersed) and is more skewed. 5.9 Because p.40, the event E occurs if p^ is within ±.05 of.40; this happens if there are 7, 8, or 9 successes, as follows: Number of successes (j) p^ 7.35 8.40 9.45 We can calculate the probabilities of these outcomes using the binomial formula with n 20 and p.4: Pr{p^.35} 20 C 7 p 7 (1 - p) 13 (77,520)(.4 7 )(.6 13 ).1659 Pr{p^.40} 20 C 8 p 8 (1 - p) 12 (125,970)(.4 8 )(.6 12 ).1797 Pr{p^.45} 20 C 9 p 9 (1 - p) 11 (167,960)(.4 9 )(.6 11 ).1597 Finally, we calculate Pr{E} by adding these results: Pr{E}.1659 +.1797 +.1597.5053. 5.10 The sample percentage, p^, of students who smoke varies from one sample to the next. The sampling distribution of the sample percentage is the distribution of p^ -- the proportion of smokers in a sample -- across repeated samples. That is, the sampling distribution of the sample percentage is the distribution of sample percentages of smokers in samples of size 10. 5.11-5.13 See Section III of this Manual. 5.14 Under the proposed sampling scheme, the chance that an ellipse will be selected is proportional to its area. The scheme is biased toward larger ellipses, and will thus tend to produce a y that is too large.

68 5.15 (a) In the population, µ 176 and σ 30. For y 186, y - µ 186-176 σ 30.33. From Table 3, the area below.33 is.6293. For y 166, y - µ 166-176 σ 30 -.33. From Table 3, the area below -.33 is.3707. Thus, the percentage with 166 y 186 is.6293 -.3707.2586, or 25.86%. (b) We are concerned with the sampling distribution of Y for n 9. From Theorem 5.1, the mean of the sampling distribution of Y is µ Ȳ µ 176, the standard deviation is σ Ȳ σ n 30 9 10, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1). We need to find the shaded area in the figure. For y 186, y - µ Ȳ 186-176 σ 10 Ȳ 1.00. From Table 3, the area below 1.00 is.8413. For y 166, y - µ Ȳ 166-176 σ 10 Ȳ -1.00. From Table 3, the area below -1.00 is.1587. Thus, the percentage with 166 y 186 is.8413 -.1587.6826, or 68.26%. (c) The probability of an event can be interpreted as the long-run relative frequency of occurrence of the event (Section 3.3). Thus, the question in part (c) is just a rephrasing of the question in part (b). It follows from part (b) that Pr{166 Y 186}.6826. 5.16 (a) µ 3000; σ 400. The event E occurs if Y is between 2900 and 3100. We are concerned with the sampling distribution of Y for n 15. From Theorem 5.1, the mean of the sampling distribution of Y is µ Ȳ µ 3000, the standard deviation is

σ Ȳ σ n 400 15 103.3, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1). For y 3100, 69 y - µ Ȳ 3100-3000 σ 103.3 Ȳ.97. From Table 3, the area below.97 is.8340. For y 2900, y - µ Ȳ 2900-3000 σ 103.3 Ȳ -.97. From Table 3, the area below -.97 is.1660. Thus, Pr{2900 Y 3100} Pr{E}.8340 -.1660.6680. (b) n 60; σ Ȳ 400/ 60 51.64 ±100 51.64 ±1.94; Table 3 gives.9738 and.0262, so Pr{E}.9738 -.0262.9476. (c) As n increases, Pr{E} increases. 5.17 σ Ȳ 400/ 15 103.3 (a) (b) 2900-2800 103.3 2700-2800 103.3.97. From Table 3, the area below.97 is.8340. -.97. From Table 3, the area below -.97 is.1660. Thus, Pr{E}.8340 -.1660.6680 2700-2600 103.3 2500-2600 103.3.97. From Table 3, the area below.97 is.8340. -.97. From Table 3, the area below -.97 is.1660. Thus, Pr{E}.8340 -.1660.6680 (c) For fixed n and σ, Pr{E} does not depend on µ. 5.18 µ 145; σ 22. 155-145 (a) 22.45; Table 3 gives.6736. 135-145 22 -.45; Table 3 gives.3264. Thus,.6736 -.3264.3472 or 34.72% of the plants.

70 (b) n 16; σ Ȳ 22/ 16 5.5. 155-145 5.5 135-145 5.5 1.82; Table 3 gives.9656. -1.82; Table 3 gives.0344. Thus,.9656 -.0344.9312 or 93.12% of the groups. (c) Pr{135 Ȳ 155}.9312 (from part (b)). (d) n 36; σ Ȳ 22/ 36 3.67. 155-145 3.67 135-145 3.67 2.72; Table 3 gives.9967. -2.72; Table 3 gives.0033. Thus,.9967 -.0033.9934 or 99.34% of the groups. 5.19 (a) σ Ȳ 1.4/ 25.28; 5-4.2.28 4-4.2.28 2.86; Table 3 gives.9979. -.71; Table 3 gives.2389. Pr{4 Ȳ 5}.9979 -.2389.7590. (b) The answer is approximately correct because the Central Limit Theorem says that the sampling distribution of Ȳ is approximately normal if n is large. The same approach is not valid for n 2, because the Central Limit Theorem does not apply when the sample size is small. 5.20 (a) In the population, 65.68% of the fish are between 51 and 60 mm long. To find the probability that four randomly chosen fish are all between 51 and 60 mm long, we let "success" be "between 51 and 60 mm long" and use the binomial distribution with n 4 and p.6568, as follows: Pr{all 4 are between 51 and 60} 4 C 4 p 4 (1 - p) 0 (1).6568 4 (1).1861. (b) The mean length of four randomly chosen fish is Ȳ. Thus, we are concerned with the sampling distribution of Ȳ for a sample of size n 4 from a population with µ 54 and σ 4.5. From Theorem 5.1, the mean of the sampling distribution of Ȳ is µ Ȳ µ 54, the standard deviation is σ Ȳ σ n 4.5 4 2.25, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1).

For y 60, 71 y - µ Ȳ 60-54 σ 2.25 Ȳ 2.67. From Table 3, the area below 2.67 is.9962. For y 51, y - µ Ȳ 51-54 σ 2.25 Ȳ -1.33. From Table 3, the area below -1.33 is.0918. Thus, Pr{51 Y 60}.9962 -.0918.9044. 5.21 Let E 1 be the event that all four fish are between 51 and 60 mm long and let E 2 be the event that Ȳ is between 51 and 60 mm long. If E 1 occurs, then E 2 must also occur -- the mean of four numbers, each of which is between 51 and 60, must be between 51 and 60 -- but E 2 can occur without E 1 occurring. Thus, in the long run, E 2 will happen more often than E 1, which shows that Pr{E 2 } > Pr{E 1 }. 5.22 µ Ȳ 50 and σ Ȳ σ/ n 9/ n An area of.68 corresponds to ±1 on the z scale; therefore 51.1-50 1.0 9/ n which yields n 36. 5.23 The distribution of repeated assays of the patient's specimen is a normal distribution with mean µ 35 (the true concentration) and standard deviation σ 4. (a) The result of a single assay is like a random observation Y from the population of assays. A value Y 40 will be flagged as "unusually high." For y 40, y - µ 40-35 σ 4 1.25. From Table 3, the area below 1.25 is.8944, so the area beyond 1.25 is 1 -.8944.1056. Thus, Pr{specimen will be flagged as "unusually high"}.1056. (b) The reported value is the mean of three independent assays, which is like the mean Ȳ of a sample of size n 3 from the population of assays. A value Ȳ 40 will be flagged as "unusually high." We are concerned with the sampling distribution of Ȳ for a sample of size n 3 from a population with mean µ 35 and standard deviation σ 4. From Theorem 5.1, the mean of the sampling distribution of Ȳ is µ Ȳ µ 35, the standard deviation is

72 σ Ȳ σ n 4 3 2.309, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1). For y 40, y - µ Ȳ 40-35 σ 2.309 2.17. Ȳ From Table 3, the area below 2.17 is.9850, so the area beyond 2.17 is 1 -.9850.0150. Thus, Pr{mean of three assays will be flagged as "unusually high"} 1 -.9850.0150. 5.24 (a) µ Ȳ µ 41.5. (b) σ Ȳ 4.7/ 4 2.35 5.25 (a) Because the sample size of 2 is small, we would expect the histogram of the sample means to be skewed to the right, as is the histrogram of the data. However, the histogram of the sample means will be somewhat symmetric (more so than the histogram of the data). (b) Because the sample size of 25 is fairly large, we would expect the histogram to have a bell shape. 5.26 The sample mean is just an individual observation when n1. Thus, the histogram of the sample means will be the same as the histogram of the data (and therefore be skewed to the right). 5.27 No. The histogram shows the distribution of observations in the sample. Such a distribution would look more like the population distribution for n 400 than for n 100, and the population distribution is apparently rather skewed. The Central Limit Theorem applies to the sampling distribution of Ȳ, which is not what is shown in the histogram. 5.28 µ Ȳ 38 and σ Ȳ 9/ 25 1.8. (a) (b) 36-38 1.8 41-38 1.8-1.11. Table 3 gives.1335, so Pr{Ȳ > 36} 1 -.1335.8665. 1.67. Table 3 gives.9525, so Pr{Ȳ > 41} 1 -.9525.0475. 5.29 For each thrust, the probability is.9 that the thrust is good and the probability is.1 that the thrust is fumbled. Letting "success" "good thrust," and assuming that the thrusts are independent, we apply the binomial formula with n 4 and p.9. (a) The area under the first peak is approximately equal to the probability that all four thrusts are good. To find this probability, we set j 4; thus, the area is approximately 4C 4 p 4 (1 - p) 0 (1)(.9 4 )(1).66. (b) The area under the second peak is approximately equal to the probability that three thrusts are good and one is fumbled. To find this probability, we set j 3; thus, the area is approximately

4C 3 p 3 (1 - p) 1 (4)(.9 3 )(.1).29. 73 5.30 (a) The first peak is at 115, the second peak is at 1 2 (115 + 450) 282.5, and the third peak is at 450. (b) First peak:.9 2.81 Second peak: (2)(.9)(.1).18 Third peak:.1 2.01 5.31 When n1 the sample mean is just an individual observation. Thus, the sampling distribution of the sample mean is the same as the distribution of the individual time scores, as shown in Figure 5.15. There are two peaks, one at 115 ms and one at 450 ms.

74 5.32 Letting "success" "heads," the probability of ten heads and ten tails is determined by the binomial distribution with n 20 and p.5. (a) We apply the binomial formula with j 10: Pr{10 heads, 10 tails} 20 C 10 p 10 (1 - p) 10 (184,756)(.5 10 )(.5 10 ).1762. (b) According to part (a) of Theorem 5.2, the binomial distribution can be approximated by a normal distribution with mean np (20)(.5) 10 and standard deviation np(1 - p) (20)(.5)(.5) 2.236. Applying continuity correction, we wish to find the area under the normal curve between 10 -.5 9.5 and 10 +.5 10.5. The desired area is shaded in the figure. The boundary 10.5 corresponds to 10.5-10 2.236.22. From Table 3, the area below.22 is.5871. The boundary 9.5 corresponds to 9.5-10 2.236 -.22. From Table 3, the area below -.22 is.4129.

Thus, the normal approximation to the binomial probability is Pr{10 heads, 10 tails}.5871 -.4129.1742. 75 5.33 Letting "success" "type O blood," the probability that 6 of the persons will have type O blood is determined by the binomial distribution with n 12 and p.44. (a) We apply the binomial formula with j 6: Pr{6 type O blood} 12 C 6 p 6 (1 - p) 6 (924)(.44 6 )(.56 6 ).2068. (b) According to part (a) of Theorem 5.2, the binomial distribution can be approximated by a normal distribution with mean np (12)(.44) 5.28 and standard deviation np(1 - p) (12)(.44)(.56) 1.72. Applying continuity correction, we wish to find the area under the normal curve between 6 -.5 5.5 and 6 +.5 6.5. Thus, Pr{6 type O blood} Pr{ 5.5 5.28 1.72 < Z < 6.5 5.28 1.72 5.34 (a) Because p.12, the event that p^ will be within ±.03 of p is the event.09 p^.15, which, if n 100, is equivalent to the event 9 number of success 15. } Pr{.13 < Z <.71}.7580 -.5517 2063. Letting "success" "oral contraceptive user," the probability of this event is determined by the binomial distribution with mean np (100)(.12) 12 and standard deviation np(1 - p) (100)(.12)(.88) 3.250. Applying continuity correction, we wish to find the area under the normal curve between 9 -.5 8.5 and 15 +.5 15.5. The desired area is shaded in the figure. The boundary 15.5 corresponds to 15.5-12 3.250 1.08. From Table 3, the area below 1.08 is.8599. The boundary 8.5 corresponds to 8.5-12 3.250-1.08. From Table 3, the area below -1.08 is.1401. Thus, the normal approximation to the binomial probability is Pr{p^ will be within ±.03 of p}.8599 -.1401.7198. (Note: An alternative method of solution is to use part (b) of Theorem 5.2 rather than part (a). Such a method is illustrated in the solutions to Exercises 5.30 and 5.41.) (b) With n 200, p^ is within ±.03 of p if and only if the number of successes is between (200)(.09) 18 and (200)(.15) 30. The mean is (200)(.12) 24 and the standard deviation is (200)(.12)(.88) 4.60. Applying continuity correction, we wish to find the area under the normal curve between 18 -.5 17.5 and 30 +.5 30.5.

76 30.5-24 4.60 1.41; Table 3 gives.9207. 17.5-24 4.60-1.41; Table 3 gives.0793..9207 -.0783.8414. 5.35 (b) p.5 For n 45, 60% boys means 27 boys. For the normal approximation to the binomial, the mean is np (45)(.5) 33.5 and the SD is np(1 - p) (45)(.5)(.5) 3.354. 27-22.5 3.354 1 -.9099.0901. 1.34; Table 3 gives.9099. For n 15, 60% boys means 9 boys. For the normal approximation to the binomial, the mean is np (15)(.5) 7.5 and the SD is np(1 - p) (15)(.5)(.5) 1.936. 9-7.5 1.936 1 -.7794.2206..77; Table 3 gives.7794. In the larger hospital, 9% of days have 60% or more boys. In the smaller hospital, 22% of days have 60% or more boys. The smaller hospital recorded more such days. 5.36 p ±.05 is.25 to.35. The normal approximation to the sampling distribution of p^ has mean p.3 and standard deviation p(1 - p) (.3)(.7) n 400.02291..35 -.3.02291.25 -.3.02291.9854 -.0146.9708. 2.18; Table 3 gives.9854. -2.18; Table 3 gives.0146. 5.37 (a) Because p.3, the event E, that p^ will be within ±.05 of p, is equivalent to.25 p^.35. The sample size is n 40. According to part (b) of Theorem 5.2, the sampling distribution of p^ can be approximated by a normal distribution with mean p.3 and p(1 - p) (.3)(.7) standard deviation n 40.07246. To apply continuity correction, we first calculate the half-width of a histogram bar (on the p^ scale) as ( 1 2 )( 1 40 ).1025.

Thus, we wish to find the area under the normal curve between.25 -.0125.2375 and.35 +.0125.3625. The desired area is shaded in the figure. 77 The boundary.3625 corresponds to.3625 -.25.07246.86. From Table 3, the area below.86 is.8051. The boundary.2375 corresponds to.2375 -.25.07246 -.86. From Table 3, the area below -.86 is.1949. Thus, the normal approximation to the probability is Pr{E}.8051 -.1949.6102. (Note: An alternative method of solution is to use part (a) of Theorem 5.2 rather than part (b). Such a method is illustrated in the solution to Exercise 5.27.).35 -.25 (b).07246.69; Table 3 gives.7549..25 -.25.07246 -.69; Table 3 gives.2451..7549 -.2451.5098. 5.38 Let E be the event that p^ is closer to 1 2 than to 9 16. (a) n 1. E occurs if the number of purple plants is 0. Pr{E} 7/16.4375. (b) n 64. E occurs if the number of purple plants is less than or equal to 33. The normal approximation to the binomial has mean np (64)(9/16) 36 and standard deviation np(1 - p) (64(9/16)(7/16)) 3.969. 33-36 3.969 -.76; Table 3 gives.2236 Pr{E}. (c) n 320. E occurs if the number of purple plants is less than or equal to 169. The normal approximation to the binomial has mean np (320)(9/16) 180 and standard deviation np(1 - p) (320(9/16)(7/16)) 8.874. 169-180 8.874-1.24; Table 3 gives.1075 Pr{E}. 5.39 (a) Pr{3 heads} (120)(.5 3 )(.5 7 ).1172. Pr{4 heads} (210)(.5 4 )(.5 6 ).2051..1172 +.2051.3223. (b) For the normal approximation to the binomial, the mean is np (10)(.5) 5 and the SD is np(1 - p) (10(.5)(.5)) 1.581. 4.5-5 1.581 2.5-5 1.581 -.32; Table 3 gives.3745. -1.58; Table 3 gives.0571.

78.3745 -.0571.3174. 5.40 For the normal approximation to the binomial, the mean is np (100)(.8) 80 and the SD is np(1 - p) (100(.8)(.2)) 4. 85-80 (a) 4 1.25; Table 3 gives.8944. 1 -.8944.1056. 84.5-80 (b) 4 1.13; Table 3 gives.8708. 1 -.8708.1292. 5.41 For the normal approximation to the binomial, the mean is np (50)(.8) 40 and the SD is np(1 - p) (50(.8)(.2)) 2.83. 35-40 (a) 2.83-1.77; Table 3 gives.0384. (b) 35.5-40 2.83 5.42 µ 88; σ 7. -1.59; Table 3 gives.0559. We are concerned with the sampling distribution of Y for n 5. From Theorem 5.1, the mean of the sampling distribution of Y is µ Ȳ µ 88, the standard deviation is σ Ȳ σ n 7 5 3.13, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1). For y 90, y - µ Ȳ 90-88 σ 3.13 Ȳ.64. From Table 3, the area below.64 is.7389. Thus, Pr{Y > 90} 1 -.7389.2611. 5.43 (a) (45)(.83 8 )(.17 2 ).2929 (b) (10)(.83 9 )(.17).3178 72-69.7 5.44 (a) 2.8.82; Table 3 gives.8939. 1 -.8939.2061. (b) (i) Using the binomial distribution, Pr{both are > 72}.2061 2.0425. (ii) n 2; σ σ/ n 2.8/ 2 1.980. Ȳ

72-69.7 1.980 1 -.8770.1230 1.16; Table 3 gives.8770. 79 5.45 µ 800; σ 90. 850-900 (a) 90.56; Table 3 gives.7123. 750-900 90 -.56; Table 3 gives.2877..7123 -.2877.4246 or 42.46% of the plants. (b) n 4; σ σ/ n 90/ 4 45. Ȳ 850-900 45 1.11; Table 3 gives.8665. 750-900 45-1.11; Table 3 gives.1335..8665 -.1335.7330 or 73.30% of the groups will have means in this range. 5.46 Two possible factors are: (a) environmental variation from one pot (or location) to another; (b) competition between plants in a pot (for instance, overlapping leaves). 5.47 We are concerned with the sampling distribution of p^, which is governed by a binomial distribution. Letting "success" "adult," we have p.2 and 1 - p.8. The number of trials is n 20. (a) The event p^ p occurs if there are 4 successes in the 20 trials (because 4/20.2). Thus, to find the probability that p^ p, we can use the binomial formula with j 4, so n - j 16: Pr{p^ p} 20 C 4 p 4 (1 - p) 16 (4,845)(.2 4 )(.8 16 ).2182. (b) The event p -.05 p^ p +.05 is equivalent to the event.15 p^.25. This event occurs if there are 3, 4, or 5 successes in the 20 trials, as follows: Number of successes (j) p^ 3.25 4.30 5.35 We can calculate the probabilities of these outcomes using the binomial formula with n 20 and p.2: Pr{p^.25} 20 C 3 p 3 (1 - p) 17 (1,140)(.2 3 )(.8 17 ).20536 Pr{p^.30} 20 C 4 p 4 (1 - p) 16 (4,845)(.2 4 )(.8 16 ).21820 Pr{p^.35} 20 C 5 p 5 (1 - p) 15 (15,504)(.2 5 )(.8 15 ).17456 Thus, Pr{p -.05 p^ p +.05} Pr{.15 p^.35}.20536 +.21820 +.17456.59812.5981. 5.48 We are concerned with the sampling distribution of p^ for n 20 and p.2. According to part (b) of Theorem 5.2, this sampling distribution can be approximated by a normal distribution with

80 mean p.2 and standard deviation p(1 - p) (.2)(.8) n 20.08944. To apply continuity correction, we first calculate the half-width of a histogram bar (on the p^ scale) as ( 1 2 )( 1 20 ).025. (a) We wish to find Pr{p^.2}. Thus, we wish to find the area under the normal curve between.2 -.025.175 and.2 +.025.225. The desired area is shaded in the figure. The boundary.225 corresponds to.225 -.200.08944.28. From Table 3, the area below.28 is.6103. The boundary.175 corresponds to.175 -.200.08944 -.28. From Table 3, the area below -.28 is.3897. Thus, the normal approximation to the probability is Pr{p^ p}.6103 -.3897.2206. Note that this agrees well with the exact value (.2182) found in Exercise 5.40(a). (b) The event p -.05 p^ p +.05 is equivalent to the event.15 p^.25. Thus, we wish to find the area under the normal curve between.15 -.025.125 and.25 +.025.275. The desired area is shaded in the figure. The boundary.275 corresponds to.275 -.200.08944.84. From Table 3, the area below.84 is.7995. The boundary.175 corresponds to.125 -.200.08944 -.84. From Table 3, the area below -.84 is.2005. Thus, the normal approximation to the probability is Pr{.15 p^.25}.7995 -.2005.5990. Note that this agrees quite well with the exact value (.5981) found in Exercise 5.40(b). 5.49 For the normal approximation to the sampling distribution of p^, the mean is p.42 and the SD is p(1 - p) (.42)(.58) n 25.0987.

Continuity correction: ( 1 2 )( 1 25 ).02..46.42.405; Table 3 gives.6590..0987 1 -.6590.3410. 81 5.50 µ 1,200; σ 35. For Pr{1175 Y 1225} 1225-1200 35.71; Table 3 gives.7611. 1175-1200 35 -.71; Table 3 gives.2389..7611 -.2389.5222. For Pr{1175 Ȳ 1225}, σ Ȳ σ/ n 35/ 6 14.29. 1225-1200 14.29 1.75; Table 3 gives.9599. 1175-1200 14.29-1.75; Table 3 gives.0401..9599 -.0401.9198. Comparison:.9189 >.5222; this shows that the mean of 6 counts is more precise, in that it is more likely to be near the correct value (1200) than is a single count. 5.51 µ 8.3; σ 1.7. If the total weight of 10 mice is 90 gm, then their mean weight is 90 10 9.0 gm. Thus, we wish to find the percentage of litters for which y 9.0 gm. We are concerned with the sampling distribution of Y for n 10. From Theorem 5.1, the mean of the sampling distribution of Y is µ µ 8.3, Ȳ the standard deviation is σ Ȳ σ n 1.7 10.538, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1). We need to find the shaded area in the figure. For y 9.0, y - µ Ȳ 9.0-8.3 σ.538 Ȳ 1.30. From Table 3, the area below 1.30 is.9032. Thus, the percentage with y 9.0 is 1 -.9032.0968, or 9.68%.

82 5.52 Two possible factors are: (a) environmental and genetic differences between litters; (b) competition between mice in a litter. 5.53 (a) p^ Probability.0.8 5.3277.2 (5)(.2)(.8 4 ).4096.4 (10)(.2 2 )(.8 3 ).2048.6 (10)(.2 3 )(.8 2 ).0512.8 (5)(.2 4 )(.8).0064 1.0.2 5.0003

(b).50 83 Probability.40.30.20.10 0.2.4.6.8 1.0 p^ 5.54 The sample average, Y, of the heights of the plants varies from one sample to the next. The sampling distribution of the sample average is the distribution of Y -- the average height of plants in a sample -- across repeated samples. That is, the sampling distribution of the sample average is the distribution of sample average plant heights in samples of size 28. 5.55 σ Ȳ σ n 4 28 0.76. 5.56 σ Ȳ σ n 10 64 1.25. 52-50 (a) 1.25 1.6; Table 3 gives.9452. 48-50 1.25-1.6; Table 3 gives.0548..9452 -.0548.8904. 102-100 (b) 1.25 1.6; Table 3 gives.9452. 98-100 1.25-1.6; Table 3 gives.0548..9452 -.0548.8904. (c) µ + 2 - µ 1.25 1.6; Table 3 gives.9452. µ - 2 - µ 1.25-1.6; Table 3 gives.0548..9452 -.0548.8904.