Section 15.0: The Normal Distribution

Size: px
Start display at page:

Download "Section 15.0: The Normal Distribution"

Transcription

1 Section 15.0: The Normal Distribution The Normal distribution is the most widely recognized of all probability distributions. It is a continuous distribution, which means its graph has no gaps. The shape of its graph is classically described as Bellshaped. It is symmetric around the mean with long tails. It was originally discovered in the early part of the 18 th century by the French mathematician, De Moivre and independently rediscovered in the 19 th century by the German mathematician, Gauss. The distribution is described by a complex formula that cannot be used analytically to answer probability questions. Instead, all probability questions are answered by inserting values into a calculator or spreadsheet. The Normal Distribution is the most important probability distribution that is used in statistics. This is due to a number of desirable mathematical properties that Normal Distribution has. Statisticians have been able to solve many problems in statistics only by assuming a Normal Distribution and using its mathematical properties. This is because there are many variables, such as intelligence or height, that when plotted appear similar to it. There might be an infinite number of variables of interest that have this characteristic. However, the Normal Distribution is only a model. There are, in fact, no variables that completely match it. Furthermore, there are many other models, such as exponential growth (e.g., bacteria) or exponential decay (e.g., radioactivity), each also having possibly an infinite number of important variables that when plotted appear similar to them as well. The existence of other models has a major impact on the consequences of assuming the Normal Distribution if it is not reasonable to do so. The Normal Distribution arises naturally when the random variable is the sum or mean of several of other independent random variables. The distribution has two independent parameters: the mean μ (mu) and the standard deviation σ (sigma). Sometimes the variance is used instead of the standard deviation. In this chapter, the graphing calculator is the primary tool used to calculate probabilities North Carolina State University Chapter 15 Page 1

2 Section 15.1: Cutting Fabric for Parachutes Skydiving equipment has advanced considerably over the last several years. Round parachutes are seldom seen these days and have been replaced by modern, rectangular ram-air canopies that have better directional control and offer softer landings. Ram-air canopies are made of a series of inflatable tubes or cells, connected side-by-side along their length. Each cell is designed to form the cross section of an airfoil, so when the parachute inflates, it forms a wing-shaped canopy, ready for flight. The front of each cell is open to the air, and the back is sewn closed. Once inflated, the ram-air canopy is a semi-rigid, rectangular plane, similar to an airplane wing. Figure shows a ram-air canopy parachute with seven cells. Figure : A ram-air parachute The fabric most often used to make parachutes is called rip stop nylon. It is commercially available in 20-meter rolls that are 150 centimeters wide. Depending on the final dimensions of the parachute, the size of each cell will vary as will the length of fabric that must be cut. Regardless of the size, the fabric must be cut with great precision. If the pieces are cut too small, the material cannot be used. If they are cut too large, the material can be used, but some of the fabric will be wasted and may need to be re-cut down to size Sampling the Cut Pieces Sky s the Limit is a small company that makes parachutes. Twin brothers Tom and Ken Speedy work in the cutting department at Sky s the Limit. They cut the large panels of fabric used in making the parachutes. Ray Monitor, the production manager at Sky s the Limit, is studying Tom s work and Ken s work to determine if one or both of them should receive a pay raise. Tom and Ken are cutting pieces of fabric that are supposed to be 50 cm wide and 2 m long. Administrators at Sky s the Limit established that the margin of error for cutting the fabric is 5 millimeters. This means that the fabric can still be used if it is within 5 mm of 50 cm wide and within 5 mm of 2 m long. Q1. What is the acceptable range of widths of fabric (in centimeters)? Q2. What is the acceptable range of lengths of fabric (in meters)? Ray wants to decide if one, neither, or both of the Speedy brothers should get a pay raise. He is basing his decision on the quality of the work they do. He wants to know how often Tom s and Ken s cut pieces of fabric are within specifications: 50 cm wide ± 5 mm and 2 m long ± 5 mm. However, Ray cannot measure 2013 North Carolina State University Chapter 15 Page 2

3 every single piece of fabric they cut because they cut many every day. Instead, Ray takes a random sample of 10 pieces of Tom s work and 10 pieces of Ken s work, to see if they are within specifications. The measurements for all cut pieces of fabric would make up the population of cut pieces of fabric. Since it is usually impossible to collect data for an entire population, a random sample is often used instead. A sample refers to a portion of the population. By choosing the sample randomly, Ray expects that the sample pieces of data are representative of the pieces of data for the entire population. That is, Ray can use the sample data to make predictions and gain a better understanding of the population data. When a sample is representative of the population, the sample data closely match the characteristics of the population data. This helps Ray to generalize his results from the small sample to the large population. Ray takes random samples of 10 pieces of Tom s work and 10 pieces of Ken s work, to see if they are within the given specifications (2 m long ± 5 mm and 50 cm wide ± 5 mm). The length and width of each piece of work that was sampled appear in Table Worker Tom Ken Sample Length (m) Width (cm) Length (m) Width (cm) Table : Lengths and widths of Tom s and Ken s samples measured to the nearest mm Q3. Based on the data presented in Table , would you recommend that Tom or Ken should receive a raise? Explain your reasoning. Ray Monitor decides he needs to compute some statistics to help him make sense of the data. A statistic is a fact or a piece of data, usually calculated from a sample. The first statistics Ray decides to calculate are the means of the lengths of the cloth pieces in each of the two samples. The Mean of a Sample The mean is one way to describe the center of a set of data. The mean of a sample is just the arithmetic average: Add each of the sampled items and divide by the number of items. Note: If the elements of a data set are represented by x i and there are n elements in the set, the usual notation for the mean of the data set is: n i 1 where the term is used to represent the mean of a sample. n 2013 North Carolina State University Chapter 15 Page 3

4 Q4. Compute the mean of the lengths of each worker s sample. What do you observe? Ray looks at the two means and does not find this information very helpful. He decides to look at the data a more closely. Q5. Do the two sets of lengths look different to you? If so, how are they different? If not, what similarities did you see? Ray realizes that Tom s lengths varied between and 2.004; while Ken s varied between and Ray uses a statistic called the range to express this realization mathematically. The Range of a Sample The range is one way to measure how spread-out the individual data points are. To find the range, subtract the smallest number from the largest number in the data set. Q6. Compute the range of the lengths for each worker s sample. What do you observe? At this point, Ray is thinking that maybe Ken is more deserving of a pay raise than Tom. He wonders what he will observe about the width data. Q7. Compute the mean and the range for each worker s sample of widths. What do you observe? Ray is unsure what to do with this information. He sees similarities and differences in the two data sets. Q8. Do you see any differences in the two width data sets? If yes, how are they different? If not, what similarities do they have? Ray decides that the range was not helpful for the width data. He needs another way to measure the spread of the data. He decides to use a statistic called the standard deviation. Note: For the remainder of his analysis, Ray only considers the sample widths to make his decision. The Standard Deviation of a Sample The standard deviation is another way to measure how spread-out the individual data points are. The symbol S x is used to represent standard deviation of a sample. The formula for S x is: 1 This formula is explained in more detail below. The standard deviation uses the difference, or deviation, of each data point from the mean to measure the spread of the data. To find the standard deviation, complete the following steps North Carolina State University Chapter 15 Page 4

5 Step 1: Calculate the difference between each data point and the mean For each data point, x i, compute the difference. For example, using Tom s data for width of pieces, the data points refer to each individual width collected by Ray. Thus, the first step would be as follows: x x 1 = = -0.3 x x 2 = = -0.3 x x 3 = = -0.2 x x 4 = = -0.3 x x 5 = = 0.3 x x 6 = = 0.2 x x 7 = = 0.3 x x 8 = = -0.2 x x 9 = = 0.2 x x 10 = = 0.3 Step 2: Square each difference Next, finding an average difference would make sense. However, some of the differences are positive, and some are negative, so if they were just added together, the positives and negatives would cancel each other. To avoid that, each of the differences is squared. This is the typical way for mathematicians to address an issue like this. For example, using Tom s width data, the second step would be as follows: (x x 1 ) 2 = ( ) 2 = (-0.3) 2 = 0.09 (x x 2 ) 2 = ( ) 2 = (-0.3) 2 = 0.09 (x x 3 ) 2 = ( ) 2 = (-0.2) 2 = 0.04 (x x 4 ) 2 = ( ) 2 = (-0.3) 2 = 0.09 (x x 5 ) 2 = ( ) 2 = (0.3) 2 = 0.09 (x x 6 ) 2 = ( ) 2 = (0.2) 2 = 0.04 (x x 7 ) 2 = ( ) 2 = (0.3) 2 = 0.09 (x x 8 ) 2 = ( ) 2 = (-0.2) 2 = 0.04 (x x 9 ) 2 = ( ) 2 = (0.2) 2 = 0.04 (x x 10 ) 2 = ( ) 2 = (0.3) 2 = 0.09 Step 3: Sum these squares Next, the squared differences are added. For Tom s width data, this sum would be: = = 0.7 Q9. Why does adding the squared differences avoid the problem of positives and negatives canceling each other? Step 4: Divide this sum by n 1, where n is the number of points in the data set Next, divide the sum from Step 3 by n 1. In this example, n is 10 because there were 10 pieces of fabric in the sample. Therefore, continuing with Tom s width data set, this step yields: 2013 North Carolina State University Chapter 15 Page 5

6 Step 5: Take the square root of the results Lastly, take the square root of this quotient. In this example, this would be: This step is important because it allows the units of measurement to be consistent. In this example, the width of the fabric is measured in cm. Ray would like the standard deviation also to be measured in cm. Taking the square root allows the units of the standard deviation to be the same as the units of the points in the data set. These steps together give the formula for standard deviation: 1 The standard deviation tells Ray how spread-out the data points are. It measures the typical amount each data point deviates from the mean. Therefore, the more spread-out the data points are, the greater the standard deviation. Q10. Using this process, find the standard deviation for Ken s width data. What do you observe about the two standard deviations? Why do you think that happened? Q11. Based on Tom s and Ken s standard deviations, should Tom, Ken, both, or neither get a pay raise? Explain your reasoning. Sometimes it is useful to work with a statistic called the variance. The variance is simply the square of the standard deviation. Therefore, it is denoted by S x 2. Notice that the square root of the variance is the standard deviation, which is as it should be: S 2 x S x n i x x i n 1 Q12. Calculate the variances of Tom s and Ken s width samples. Recall that Ray wanted to know the mean, standard deviation, and variance for the widths of all of the panels cut. That information would not be available unless every panel had been measured after it was cut. That would have been very costly. Instead, Ray decides to use the means and standard deviations from the samples that were measured. He reasons that these statistics are probably good estimates of the actual mean and standard deviation. In other words, Ray cannot know the mean or the standard deviation of the widths of all of the panels Tom and Ken have ever cut (i.e., the population of cut panels). However, he does know the mean and the standard deviation of the sample that they cut North Carolina State University Chapter 15 Page 6

7 Sample vs. Population Different representations are used to represent population and sample means, variances, and standard deviations. These are detailed in Table Ray can use the sample statistics, x and S x, as estimates of the actual parameters, and x, of Tom s and Ken s widths. Sample Population Mean x μ Variance S 2 x n i x x i n 1 Standard Deviation Sx n i 1 x x i 2 n 1 σ 2 x n i μ x i n σ x n i 1 μ x i 2 n Table : Sample statistics and population parameters Ray notices also the variance is always computed as part of the computation of the standard deviation. He wonders, Why, then, are there two different statistics? One reason is that the units of measure for the variance might not be meaningful, and even when they are meaningful, they are not appropriate to the problem context. For example, in the width example, the units in which the widths were measured are centimeters. However, the variance squares differences that are measured in centimeters. The units of the variance are square centimeters. Centimeters are used to measure distances, as in the width example, but square centimeters are used to measure area. It is not appropriate to use a unit of area to measure variation in a distance. There is another reason why there are two different statistics, the variance and the standard deviation. The standard deviation, which is the more meaningful statistic, lacks an important property that the less meaningful statistic, the variance, has. This property is explored in Appendix A. This Appendix also explains how to find several statistics using the graphing calculator Hiring a New Worker Sky s the Limit has been quite successful lately. Sales have increased, and they need to hire a new worker for the cutting department. They want this new worker to cut with precision and consistency. Ray Monitor has been tasked with hiring this new worker. Five potential workers have been interviewed for the job. These potential workers were asked to cut several pieces of fabric. Ray will use these samples to decide which worker to hire. Ray decides to compare the samples from the potential workers to the samples from the current workers (including Tom and Ken Speedy), who he knows are precise and consistent. However, he is unsure how to proceed. He asks his brother, a mathematician named Jacob, to help him with the next steps. There are currently 20 workers in the cutting department. Jacob asks Ray to collect five samples from each worker. Then, rather than consider all 100 collected samples, Jacob finds the mean for each worker, based on the five collected samples North Carolina State University Chapter 15 Page 7

8 Jacob explains to Ray that by using the means, they can assume the data are Normally Distributed. He says this is due to something called The Central Limit Theorem. When data are Normally Distributed, this allows them to make assumptions, predictions, and conclusions that they would not typically be able to make. Therefore, this is a very useful property. Ray is convinced by his brother s argument. So he sets out to collect data from the 20 workers and finds their means. For example, Worker A provided the sample widths shown in Table Sample Width (cm) Table : Sample widths provided by worker A Then, the mean of Worker A s sample would be: Ray repeats this process for all 20 workers to obtain a total of 20 means. These means are shown in Table Mean Sample Widths (cm) Worker A Worker K Worker B Worker L Worker C Worker M Worker D Worker N Worker E Worker O 50 Worker F Worker P Worker G Worker Q Worker H Worker R Worker I Worker S Worker J Worker T Table : Mean sample widths for all 20 workers Jacob wants to provide Ray a picture of the data. He starts by taking the means shown in Table and creating a histogram. He creates bins based on the data. For example, he knows his bins need to range from approximately 49.6 to approximately 50.3 (based on the range of means in Table ). He also knows that in order to gain a good picture of the data, the bins should be neither too small nor too large. His bins are listed in Table Jacob also needs to show how tall the bars of the histogram are as well. Rather than simply counting the number of means that fit into each bin, Jacob uses relative frequencies. To find a relative frequency, Jacob divides the count by the total number of means (in this case, there are 20 means). Table begins this process. For example, there is only one mean that fits in the 49.5 T 49.7 bin (Worker S). Therefore, the relative frequency of a mean fitting into this bin is 1 20 = North Carolina State University Chapter 15 Page 8

9 Bin Total Count Relative Frequency 49.5 T = < T = < T < T < T 50.5 Total 20 1 Table : Calculating the relative frequency for each bin Q13. Complete Table by counting the number of means (from Table ) and calculating the relative frequency for each bin. Q14. Create a histogram by drawing bars on Figure The heights of each bar should be the relative frequencies found in Q13. Figure : Creating a histogram for the relative frequencies of the workers mean sample widths 2013 North Carolina State University Chapter 15 Page 9

10 Q15. Describe the general shape of this histogram. Then, Jacob points out the area of the bars to Ray. He notes that the width of each bar is one unit, and the height of each bar is the relative frequency of a panel width being within the respective bin. For example, the height of the first bar (the 49.5 T 49.7 bin) is 0.05 because the relative frequency of a panel width being within 49.5 and 49.7 cm is Therefore, the area of this bar is: Area = (width of bar)(height of bar) = (1)(0.05) = 0.05 Q16. Using this same logic, calculate the areas of the remaining bars of the histogram. Q17. Calculate the sum of all the areas of the bars (don t forget to include the first area). Why does this make sense? Now that Ray has a basic idea of what these means look like on a histogram, Jacob introduces him to the Normal Distribution curve. The shape of the curve is similar to the shape of the histogram, but it is continuous and smooth. To draw the curve, Jacob uses his graphing calculator. He instructs Ray to complete the following steps. Step 1: Identify the mean and standard deviation In this case, the mean of the data listed in Table is 50. The standard deviation is Step 2: Input the normalpdf function in the Y= menu Jacob presses the Y= button on his calculator. Then, he accesses the distribution menu by pressing 2 nd and then VARS. In this menu, Ray chooses 1:normalpdf(, as shown in Figure Figure : Choosing the normalpdf function on the graphing calculator After pressing ENTER, Jacob is brought back to the Y= screen. Now, he types in the syntax for the normalpdf function: normalpdf(x, x, S x ) The graph for the mean width data has the syntax shown in Figure Figure : Graphing the Normal Distribution curve for the mean width data 2013 North Carolina State University Chapter 15 Page 10

11 Step 3: Set up the appropriate window settings Before pressing Graph, Jacob changes his window settings. To do so, he presses the Window button. Then, based on what he knows about the Normal Distribution, he chooses the settings shown in Figure Figure : Window settings for the Normal Distribution curve for the mean width data Step 4: Press Graph and analyze as appropriate Finally, Jacob presses Graph and sees the graph shown in Figure a. He uses the Pen function in the Draw menu ( 2 nd then PRGM ) to draw the vertical line along the mean, as shown in Figure b. (Note: to clear this vertical line, go back to the Draw menu and choose 1:ClrDraw ). Figure a: Graph of the Normal Distribution curve for the mean width data Figure b: Graph of the Normal Distribution curve with a vertical line along the mean Q18. What proportion of the area under the bell-shaped curve and above the x-axis appears to be to the left of the vertical line? What proportion appears to be to the right? Q19. What do you think is the probability that a value of x is greater than 50 cm? What do you think is the probability that a value of x is less than 50 cm? Q20. Based on your response to Q17, what do you think the area under this curve is? How do you know? Now that Ray has spent some time thinking about the Normal Distribution curve, he comes back to his original problem. He is hiring a new worker, and he needs to see how the potential workers compare to his current workers. Recall that Ray collected data from five potential workers. He asked each potential worker to cut five pieces of fabric. Then, he found the mean for each potential worker. These means are listed in Table North Carolina State University Chapter 15 Page 11

12 Mean Sample Widths (cm) Potential Worker Potential Worker Potential Worker Potential Worker Potential Worker Table : Mean sample widths for the five potential workers Now, Jacob instructs Ray to locate each of these means on the Normal Distribution curve shown in Figure a. This will allow Ray to see how these potential workers compare to the current workers because the curve was created based on the current workers data. Jacob again uses his graphing calculator. While on the main screen, he chooses the Vertical option from the Draw menu. Then, he types in the mean sample width for Potential Worker 1. This is shown in Figure Figure : Graphing at vertical line at x = (the sample mean for Potential Worker 1) When Jacob hits Enter, a vertical line is drawn on the Normal Distribution curve (created previously). This is shown in Figure Figure : Graph of the Normal Distribution curve with a vertical line at x = Q21. Consider the location of this vertical line. a. How do you think Potential Worker 1 compares to the current workers in the cutting department? b. Would you recommend that Ray hire this worker? Why or why not? Q22. Graph vertical lines on the curve for the remaining four potential workers. (Remember: to clear previous drawings, choose ClrDraw from the Draw menu.) Q23. Based on these graphs, which potential worker should Ray hire? Explain your reasoning North Carolina State University Chapter 15 Page 12

13 Ray feels confident that he knows who the best worker will be. However, he remembers from his previous analysis with Tom and Ken that he also needs to consider the spread of the data. Therefore, he calculates the standard deviation for each potential worker, shown in Table Mean Sample Widths (cm) Standard Deviations Potential Worker Potential Worker Potential Worker Potential Worker Potential Worker Table : Mean sample widths for the five potential workers Q24. Graph the Normal Distribution curve for each of the potential workers using the normalpdf function on the graphing calculator. Q25. Describe how these five graphs differ from one another. Q26. Based on these graphs, which potential worker should Ray hire? Explain your reasoning North Carolina State University Chapter 15 Page 13

14 Section 15.2: Automobile Battery Warranties Alex Murphy is a senior at Chesterfield High School in Chesterfield Township, Michigan. In a case of what he likes to call Murphy s Law, the battery in his recently purchased used car just died and he needs to replace it as soon as possible. He intends to keep this car through his senior year of high school as well as through his four years of college, so a reliable car battery with a high-quality warranty is essential. After doing some research, Alex considers purchasing a car battery that costs $110 and includes a 36- month full replacement warranty with an additional 6-month pro-rated warranty. This means that if the battery fails after only 36 months or less, he will get all his money back. If the battery fails between 36 and 42 months, he will only get part of his money back. If the battery fails after 42 months or more, he will not get any money back. He lets T represent the time, in months, from the purchase of the car battery until it fails. Then, he creates Table , which shows the replacement values for the warranty period. Time, in months, from the purchase of the car battery that it fails Replacement Value (% of original purchase price) T < T < T < T 42 0 Table : Replacement values for battery failure within warranty period Replacement Value (amount of money he will get back) Q1. The car battery costs $110. Complete Table to determine how much money Alex will get back if his car battery fails within each time frame Using a Normal Distribution to Find the Probability of Battery Failure After doing some additional research, Alex found an article in an automotive magazine with some data about battery life for the same battery that he is considering purchasing. The magazine collected data on a sample of 50 such batteries. They analyzed their data and found that the battery life is Normally Distributed with a mean of 48 months and a standard deviation of 6 months. By assuming the battery life is Normally Distributed, Alex can make many predictions about when the battery may fail. Q2. In your own words, interpret the statement, the battery life is Normally Distributed with a mean of 48 months and a standard deviation of 6 months. Alex decides to look at the graph of this Normal Distribution curve. He does this on his graphing calculator. Figure shows the calculator screens for the steps he takes. Note: these steps are explained in greater detail in Section North Carolina State University Chapter 15 Page 14

15 (a) (b) (c) Figure : Steps to graphing a Normal Distribution on a graphing calculator The most challenging step in graphing the Normal Distribution on the graphing calculator is setting up an appropriate viewing window. In order to set the Xmin and Xmax, Alex reasons that the mean should be in the middle, half-way between the Xmin and Xmax. He decides to set the x-axis from 18 months below the mean (30 months) to 18 months above the mean (66 months). He also thinks that it makes sense to scale the x-axis using the standard deviation, which is 6 months. His real challenge comes in setting-up the y-axis. Fortunately, he remembers learning that the area between the graph of a Normal Distribution curve and the x-axis is always 1. Since his x-values were fairly large, he decides to use fairly small values for y. The viewing window Alex uses and his graph of this Normal Distribution curve are given in Figure (a) (b) Figure : Alex s viewing window and graph of the Normal Distribution curve with μ = 48 and σ = 6 Q3. What are some features that you notice about the graph of this Normal Distribution curve? The Battery Failing Between 36 and 39 Months of Purchase Alex now wants to find the probability that this battery will fail within the ranges given in the terms of the battery warranty (see Table ). He is not too concerned about the probability of the battery failing within the first 36 months. In that case, he would get 100% of his money back. Instead, he considers the first range in the pro-rated warranty period. If the car battery fails between 36 and 39 months of his purchase, Alex would be refunded 75% of the original purchase price. Alex wants to know the probability that the battery will fail between 36 and 39 months of purchase. That is, he wants to find the total probabilities of the battery failing at 36 months, 39 months, or any time in between. Figure shows a picture of this North Carolina State University Chapter 15 Page 15

16 Figure : A graph with the shaded region showing the probabilities of interest This pictures helps Alex visualize the probability he is looking for. He knows that the area (i.e., the total probability) under the curve is 1. He is interested in the proportion of the area that is between 36 and 39 months. Q4. Based on the picture shown in Figure , do you think it is likely that the battery will fail between 36 and 39 months? Why or why not? Now, Alex actually finds the probability of the battery failing between 36 and 39 months. He does so using his graphing calculator. There are two methods for finding this probability on the graphing calculator. The first method is to use the normalcdf function on the calculator. This function, known as the Normal Cumulative Density Function, calculates the cumulative probability between two values. Since Alex is interested in finding the probability between 36 and 39 months, this function makes sense. The syntax for this function is: normalcdf(lower limit, upper limit, mean, standard deviation) Recall that for this particular car battery, Alex assumes that the mean is 48 months and the standard deviation is 6 months. Figure shows the calculator screenshot for this example. The lower limit is 36 months, and the upper limit is 39 months. Figure : Calculating a probability using the Normal Cumulative Density Function Q5. Use the normalcdf function to calculate the probability that the battery will fail between 36 and 39 months of purchase North Carolina State University Chapter 15 Page 16

17 Q6. On average, out of every 100 of these batteries that are made, how many fail between 36 and 39 months after purchase? How many out of every 1,000? The second way to calculate this probability on the graphing calculator is done using the graph of the Normal Distribution curve. This is done by first choosing the ShadeNorm function. This function is found by pressing DISTR (2 nd VARS) and scrolling over to the DRAW menu, as shown in Figure Figure : Choosing the ShadeNorm function Then, the following syntax is used: ShadeNorm(lower limit, upper limit, mean, standard deviation) This is shown in Figure Figure : Calculating a probability using the ShadeNorm function Q7. Use the ShadeNorm function to calculate the probability that the battery will fail between 36 and 39 months of purchase. Describe what happens when you use this function. Q8. Based on the probability that the battery will fail between 36 and 39 months of purchase, would you recommend Alex purchase this battery? Explain your reasoning. The Battery Failing After 42 Months of Purchase Next, Alex decides to calculate the probability that the battery fails beyond the warranty period. Since he would not get any money back if this happens, he is very interested in the likelihood that the battery will fail after 42 months. He wants to use the normalcdf command, but he needs to know both the lower and upper limits of that range. The lower limit is 42 months, but he does not initially know what the upper limit should be. Figure shows the area under the curve that shows the probability of interest North Carolina State University Chapter 15 Page 17

18 Figure : The area showing the probability that the battery will fail after 42 months of purchase Alex realizes that, in theory, the upper limit for the range of months when he would not get any money back is infinite. Since there is no infinity button on the calculator, he wonders how to tell the calculator that the upper limit is without bound. So, Alex uses the largest number he can possibly tell the calculator. In scientific notation, the largest number the calculator can interpret is , which is usually written as 1E99. Alex accesses the EE command by pressing 2 nd comma. The command is shown in Figure Figure : Alex s use of scientific notation to represent infinity Q9. What is the probability that the car battery will fail beyond the warranty period? Note: Alex does not need to use an absurdly large number. With any Normal Distribution, there is less than a one in ten thousand chance of observing a value that is more than four standard deviations above the mean (this idea will be discussed in more detail later in this section). Q10. What value do you obtain when you multiply the standard deviation by 4 and add it to the mean? Input this value as the upper limit and compare your answer to that of the previous question. Q11. Use the ShadeNorm function on the calculator to find the probability that the battery will fail beyond the warranty period. (Recall: To clear previous drawings, choose ClrDraw from the Draw menu.) a. What was your lower limit? b. What was your upper limit? c. Did you get the same answer as when you used the normalcdf function? 2013 North Carolina State University Chapter 15 Page 18

19 d. Which method do you prefer (ShadeNorm or normalcdf)? Explain your reasoning. Q12. Use either technique (ShadeNorm or normalcdf) to find the probabilities for all the ranges in the warranty period. Record your answers in Table Time, in months, from the purchase of the car battery that it fails Probability of battery failure at time T T 36 P(T 36) 36 < T 39 P(36 < T 39) 39 < T < 42 P(39 < T < 42) T 42 P(T 42) Table : Recording the probabilities of battery failure at time T Q13. Do your values make sense? Why or why not? Q14. What happens when you add all the probabilities together? Why? Q15. What is the probability that the battery will last to the average? That is, find P(T > 48). Q16. Since Alex intends to keep his car for 5 years, what is the probability that the battery will fail before 60 months has passed? Q17. Do you think he will need to buy another car battery before 5 years have passed? Explain your reasoning. Since the average life of the car battery is 48 months and Alex intends to keep this car for 5 years, he is concerned about what may happen outside of the warranty period. Q18. What is the probability that the battery will fail after the warranty period ends? Q19. What is the probability that the battery will fail before the warranty period ends? Q20. Based on these probabilities, do you think this battery is the right choice for Alex? Why or why not? Expected Value of the Warranty EverLast, Inc. is the company that produces the battery Alex is considering. The president of EverLast, Inc., Caroline Myers, is interested in determining how much money the company is paying, on average, for the car battery warranty. She can use the probabilities (from the previous section) to calculate the expected value of the warranty. Caroline has experience with the Normal Distribution. She has already calculated the probabilities found in Table Since she knows the probabilities for each range, she can calculate the expected value of the warranty. Recall that the expected value of a random variable is the long-term average value. So, the expected value of a battery warranty is the long-term average amount of money that the company pays out when a battery fails. It is calculated by multiplying each probability by the amount of money the company would pay out and then adding them all together North Carolina State University Chapter 15 Page 19

20 Q21. What is the expected value of the car battery warranty? (Hint: use the replacement values from Q1 and the probabilities from Q12.) Q22. From the company s perspective, what does that value mean? Note: the expected value can also be thought of in terms of the customer. However, a customer will only purchase one car battery, in which case the notion of expected value referring to the long run average does not really make sense. For example, in the long run, Alex still only has the one car battery that either fails during the warranty period or fails outside the warranty period. On the other hand, the company that sells the battery would be interested in the expected value simply because of the large number of car batteries they have sold. From the company s perspective, they would want to have the expected value of the warranty as small as possible. The warranty requires them to pay in the event of their battery s failure. Q23. Suggest changes to the warranty policy to reduce the expected value of the warranty cost to less than $10? Q24. What happens to the expected value of the warranty cost if the company can improve the life of the battery so it lasts, on average: a. 49 months? b. 50 months? The Oldest Batteries Suppose the federal government is trying to decrease the number of older, more hazardous car batteries in the public. They have developed a new way to recycle and reuse toxic batteries. To remove some of these older batteries from the public, they now require car battery companies to offer exchange vouchers to the customers with the oldest batteries. Specifically, the federal government will offer exchange vouchers to those with batteries in the top 10% age range. They choose to look at the top 10% age range for all batteries, rather than a certain age (e.g., 10 years old), because the life of a battery depends on the quality of the battery, which varies by company and type. Caroline Myers, the president of EverLast, Inc., needs to determine at what age of the battery s life customers would qualify for this exchange program. That is, she needs to know at what age there is only a 10% chance that the battery is still alive. In other words, at what point in the battery s life will only 10% of them still be alive? Q25. How could the company contact those people with older batteries? Caroline needs to know the age at which there is only a 10% chance of surpassing. Another way of phrasing this is: Caroline needs to know the age at which 90% of the batteries will have failed before reaching this age. That is, if she wants to find the age at which only 10% of the batteries are still alive, she can instead find the age at which 90% of the batteries have failed. Mathematically, Caroline is looking for the value of t such that P(T > t ) = 0.1 and P(T < t ) = 0.9. This is shown graphically in Figure North Carolina State University Chapter 15 Page 20

21 Figure : A graphical representation of P(T < t ) = 0.9 and P(T > t ) = 0.1 Caroline knows the probability of interest, but does not know the value of t that will give this probability. This is the reverse of what was done earlier. Caroline now needs to invert the Normal Distribution. When given the probability, she can use her calculator to find the corresponding value on the normal curve. To do this, she uses the distribution menu from her graphing calculator. Since she is inverting the Normal Distribution, she uses the inverse normal function. The command is option 3, invnorm(, and the parameters are the probability, the mean, and the standard deviation. The syntax is as follows: invnorm(probability, mean, standard deviation) Note: when using the inverse normal function, the calculator only reads probabilities (or areas) from left to right. Therefore, in this example, Caroline Myers needs to use 0.90 as the probability rather than Figure shows the calculator screens. Figure : Using the invnorm command Q26. There a 90% chance that the battery will fail before how many months? Q27. Suppose the federal government instead offers vouchers to the 5% oldest batteries. There a 95% chance that the battery will fail before how many months? Q28. Suppose the federal government instead offers vouchers to the 1% oldest batteries. There a 99% chance that the battery will fail before how many months? 2013 North Carolina State University Chapter 15 Page 21

22 The Empirical Rule Returning to the original problem, Alex is trying to determine whether he should purchase this particular car battery made by EverLast, Inc. He makes a list of everything he knows at this point: The battery s life follows the Normal Distribution, with an average battery life of 48 months and a standard deviation of 6 months. When given two months, he can calculate the probability the battery will fail between the two months using the Normal Distribution. When given a probability P(T < t), he can calculate the month, t, at which the battery fails using the inverse normal function on his calculator. The last thing Alex wants to know is between which two months the battery is most likely to fail. One of the important features of any Normal Distribution is that the probabilities of being ±1, ±2, ±3 or ±4 standard deviations from the mean are well-known. Those probabilities are given in Table , where (mu) is the mean and (sigma) is the standard deviation. Domain of x Probability X % 2 X % 3 X % 4 X % Table : Probabilities based on standard deviations in a Normal Distribution Thus, 0.68 is a good estimate of the probability of being within 1 standard deviation from the mean, 0.95 is a good estimate of the probability of being within 2 standard deviations from the mean, and 0.99 is a good estimate of the probability being within 3 standard deviations. The table also shows that there is very little chance of being 4 standard deviations or more from the mean. Therefore, nearly all values lie within 3 standard deviations from the mean. This is an important fact known as the empirical rule. Figure shows a graph of the Normal Distribution including these probabilities for the car battery problem North Carolina State University Chapter 15 Page 22

23 Figure : Probabilities based on standard deviations in a Normal Distribution Q29. Interpret each of the probabilities in Table or Figure in terms of the problem context. Earlier in this section, Alex used the value 4 standard deviations above the mean as an upper limit rather than choosing an extremely large number (like ). He stated that there was less than a one in ten thousand chance of observing a value that is more than four standard deviations above the mean. The empirical rule shows this in more detail because there is a 99.99% chance of a value being within 4 standard deviations from the mean. Alex recalls that the Normal Distribution curve is symmetric about the mean. This fact, together with the probabilities in Table , can be used to determine that the probability of a value, x, being between the mean and 1 standard deviation above the mean. This range has a probability of about In terms of the problem context, there is approximately a 34% chance that the battery will fail between 48 and 54 months. This is shown in Figure North Carolina State University Chapter 15 Page 23

24 Figure : The probability that a value is between and + is approximately 34.13% Q30. What is the approximate probability of a value, x, being between the mean and 2 standard deviations above the mean? Interpret this question in terms of the car battery problem context. Q31. What is the approximate probability of x being between the mean and 2 standard deviations below the mean? Interpret this question in terms of the car battery problem context. Q32. What is the approximate probability of x being between 1 and 2 standard deviations above the mean? Interpret this question in terms of the car battery problem context. Q33. What is the approximate probability of x being between 1 and 2 standard deviations below the mean? Interpret this question in terms of the car battery problem context. Q34. What is the approximate probability of x being between 2 and 3 standard deviations below the mean? Interpret this question in terms of the car battery problem context. Alex can also use the empirical rule to calculate probabilities at the ends of the curve. For example, he can find the probability that the battery will fail before 36 months have passed. The probability that the battery fails between 42 and 54 months is 68.26%. Thus, the probability that the battery fails outside this range is = However, this value includes the probability that the battery fails before 42 months have passed as well as the probability that the battery fails after 54 months. Since the normal curve is symmetric about the mean, Alex can simply divide in half to determine these two pieces. This is shown in Figure North Carolina State University Chapter 15 Page 24

25 Figure : The probability that the battery fails before 42 months is approximately 15.87% Q35. Use the empirical rule and the fact that the area under the curve is 1 to complete the following. a. What is the probability that the battery fails before 36 months have passed? b. What is the probability that the battery fails after 66 months have passed? c. What is the probability that the battery fails between 36 and 54 months? Q36. Fill in the blank probabilities of using the empirical rule Figure : Calculating probabilities using the empirical rule 2013 North Carolina State University Chapter 15 Page 25

26 Note: A common error in this situation involves subtracting in the incorrect order. But there is an easy way to tell if the order is wrong. If the values are subtracted in the wrong order, the result is negative. Q37. How can you tell that a negative number cannot be the correct answer to a question regarding probability (or area)? 2013 North Carolina State University Chapter 15 Page 26

27 Section 15.3: Rappin Skoop Dogg Seasonal Demand The Pineapple Technology Corp. has developed a new item just in time for the holiday season. Rappin Skoop Dogg is an upright stuffed hound dog, dressed in a leather-like jacket and pants costume. It sings and dances while standing on a replica of a stage and holding a microphone. There are ten different rap songs that Rappin Skoop Dogg can sing as well as dance to. Additional songs can be downloaded from the Pineapple Technology Corp. web site. There are three settings for activating the dog. First, Rappin Skoop Dogg has a motion detector and can be set to activate when anyone approaches. Second, it can be activated by clapping your hands. Third, it can be activated by pushing a button. Chillioz Toys, a toy store chain, is planning on placing a large order in anticipation of the holiday season. The Pineapple Technology Corp. charges $45 per item for purchases of more than 10,000 units. Chillioz Toys plans to price Rappin Skoop Dogg at $99. However, items left over after the holiday season will be steeply discounted to $30. There is an inventory carrying cost of $2 per item left over after the holiday season ends. These data are summarized in Table Variable Description Amount P Purchase Price $45 S Selling Price $99 D After-Christmas Selling Price $30 H Inventory Carry-Over Cost $2 Table : Chillioz Toys data regarding Rappin Skoop Dogg Therefore, the net profit on a regular pre-holiday sale is: Net Profit = $99 $45 = $54 The net cost at the end of the season for leftover products is: Net Cost = $ = -$17 Based on past experience with similar products, executives at Chillioz Toy know there is high potential, but also a great deal of uncertainty. Due to their extensive experience, they can assume that the demand for this item is Normally Distributed with a mean of 21,000 and a standard deviation of 4,200 (one-fifth of the mean). Q1. Sketch a Normal Distribution curve for the demand of the Rappin Skoop Dogg toy. Roy Charles, the Vice President for marketing at Chillioz Toys, is interested in analyzing the impact of different ordering policies. He starts by wondering how much to order so that there is only a 40% chance that the demand will be less than this number. He therefore wants to determine the value of x such that: P(X < x) = 0.40 Q2. Use the invnorm function on the graphing calculator to find the value of x such that P(X < x) = Interpret this value in terms of the problem context. Q3. Use the normalcdf function on the graphing calculator to verify your answer to Q2 by doing the problem in reverse. (Hint: Choose 4200 as your lower limit because it is 4 standard deviations below the mean.) 2013 North Carolina State University Chapter 15 Page 27

28 Roy Charles is considering ordering more than the average demand. This time, he wants to find out how many to order so that there is only a 40% chance that the demand will be more than this number. That is, he is looking for the value of x such that: P(X > x) = 0.40 This is shown in Figure % Figure : Determining the value of x such that P(X > x) = 0.40 However, the calculator only provides the left hand tail of the Normal Distribution. He must first apply the principle of the complement: P(X > x) = 1 P(X < x) Q4. If the area to right of the x is going to be 0.4, what should the area to the left have to equal? Q5. Use the inverse normal function to determine how many toys Roy Charles should order. Roy Charles wants to look at the decision from a different point of view. He is hoping to sell as many units as possible and does not want to run out of stock. Q6. Use the normalcdf or the ShadeNorm function on the calculator to determine the following probabilities. Interpret the probability in terms of the problem context for each case. a. P(X > 22,000) b. P(X < 22,000) c. P(20,000 < X < 22,000) d. P(16,800 < X < 25,200) Q7. Consider the empirical rule. a. There is approximately a 68% chance that the demand will be between what two values? (Assume the interval is symmetric about the mean.) b. There is approximately a 95% chance that the demand will be between what two values? (Assume the interval is symmetric about the mean.) c. There is approximately a 99.7% chance that the demand will be between what two values? (Assume the interval is symmetric about the mean.) x 2013 North Carolina State University Chapter 15 Page 28

Continuous Random Variables and the Normal Distribution

Continuous Random Variables and the Normal Distribution Chapter 6 Continuous Random Variables and the Normal Distribution Continuous random variables are used to approximate probabilities where there are many possible outcomes or an infinite number of possible

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

Normal Probability Distributions

Normal Probability Distributions C H A P T E R Normal Probability Distributions 5 Section 5.2 Example 3 (pg. 248) Normal Probabilities Assume triglyceride levels of the population of the United States are normally distributed with a mean

More information

The Normal Probability Distribution

The Normal Probability Distribution 102 The Normal Probability Distribution C H A P T E R 7 Section 7.2 4Example 1 (pg. 71) Finding Area Under a Normal Curve In this exercise, we will calculate the area to the left of 5 inches using a normal

More information

3. Continuous Probability Distributions

3. Continuous Probability Distributions 3.1 Continuous probability distributions 3. Continuous Probability Distributions K The normal probability distribution A continuous random variable X is said to have a normal distribution if it has a probability

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Statistics (This summary is for chapters 18, 29 and section H of chapter 19)

Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x n =

More information

Confidence Intervals and Sample Size

Confidence Intervals and Sample Size Confidence Intervals and Sample Size Chapter 6 shows us how we can use the Central Limit Theorem (CLT) to 1. estimate a population parameter (such as the mean or proportion) using a sample, and. determine

More information

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions SGSB Workshop: Using Statistical Data to Make Decisions Module 2: The Logic of Statistical Inference Dr. Tom Ilvento January 2006 Dr. Mugdim Pašić Key Objectives Understand the logic of statistical inference

More information

Unit 2: Statistics Probability

Unit 2: Statistics Probability Applied Math 30 3-1: Distributions Probability Distribution: - a table or a graph that displays the theoretical probability for each outcome of an experiment. - P (any particular outcome) is between 0

More information

Normal Sampling and Modelling

Normal Sampling and Modelling 8.3 Normal Sampling and Modelling Many statistical studies take sample data from an underlying normal population. As you saw in the investigation on page 422, the distribution of the sample data reflects

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes. Standardizing normal distributions The Standard Normal Curve

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes. Standardizing normal distributions The Standard Normal Curve 6.1 6.2 The Standard Normal Curve Standardizing normal distributions The "bell-shaped" curve, or normal curve, is a probability distribution that describes many reallife situations. Basic Properties 1.

More information

AP Statistics Chapter 6 - Random Variables

AP Statistics Chapter 6 - Random Variables AP Statistics Chapter 6 - Random 6.1 Discrete and Continuous Random Objective: Recognize and define discrete random variables, and construct a probability distribution table and a probability histogram

More information

Introduction to Business Statistics QM 120 Chapter 6

Introduction to Business Statistics QM 120 Chapter 6 DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 6 Spring 2008 Chapter 6: Continuous Probability Distribution 2 When a RV x is discrete, we can

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

The "bell-shaped" curve, or normal curve, is a probability distribution that describes many real-life situations.

The bell-shaped curve, or normal curve, is a probability distribution that describes many real-life situations. 6.1 6.2 The Standard Normal Curve The "bell-shaped" curve, or normal curve, is a probability distribution that describes many real-life situations. Basic Properties 1. The total area under the curve is.

More information

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution Section 7.6 Application of the Normal Distribution A random variable that may take on infinitely many values is called a continuous random variable. A continuous probability distribution is defined by

More information

The topics in this section are related and necessary topics for both course objectives.

The topics in this section are related and necessary topics for both course objectives. 2.5 Probability Distributions The topics in this section are related and necessary topics for both course objectives. A probability distribution indicates how the probabilities are distributed for outcomes

More information

Numerical Descriptive Measures. Measures of Center: Mean and Median

Numerical Descriptive Measures. Measures of Center: Mean and Median Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where

More information

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability. An intro for calculus students P= Figure 1: A normal integral Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided

More information

Statistical Methods in Practice STAT/MATH 3379

Statistical Methods in Practice STAT/MATH 3379 Statistical Methods in Practice STAT/MATH 3379 Dr. A. B. W. Manage Associate Professor of Mathematics & Statistics Department of Mathematics & Statistics Sam Houston State University Overview 6.1 Discrete

More information

The normal distribution is a theoretical model derived mathematically and not empirically.

The normal distribution is a theoretical model derived mathematically and not empirically. Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.

More information

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed. The Central Limit Theorem The central limit theorem (clt for short) is one of the most powerful and useful ideas in all of statistics. The clt says that if we collect samples of size n with a "large enough

More information

7 THE CENTRAL LIMIT THEOREM

7 THE CENTRAL LIMIT THEOREM CHAPTER 7 THE CENTRAL LIMIT THEOREM 373 7 THE CENTRAL LIMIT THEOREM Figure 7.1 If you want to figure out the distribution of the change people carry in their pockets, using the central limit theorem and

More information

Data Analysis. BCF106 Fundamentals of Cost Analysis

Data Analysis. BCF106 Fundamentals of Cost Analysis Data Analysis BCF106 Fundamentals of Cost Analysis June 009 Chapter 5 Data Analysis 5.0 Introduction... 3 5.1 Terminology... 3 5. Measures of Central Tendency... 5 5.3 Measures of Dispersion... 7 5.4 Frequency

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

Statistics for Business and Economics: Random Variables:Continuous

Statistics for Business and Economics: Random Variables:Continuous Statistics for Business and Economics: Random Variables:Continuous STT 315: Section 107 Acknowledgement: I d like to thank Dr. Ashoke Sinha for allowing me to use and edit the slides. Murray Bourne (interactive

More information

Measure of Variation

Measure of Variation Measure of Variation Variation is the spread of a data set. The simplest measure is the range. Range the difference between the maximum and minimum data entries in the set. To find the range, the data

More information

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4 Week 7 Oğuz Gezmiş Texas A& M University Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4 Oğuz Gezmiş (TAMU) Topics in Contemporary Mathematics II Week7 1 / 19

More information

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes. Introduction In the previous chapter we discussed the basic concepts of probability and described how the rules of addition and multiplication were used to compute probabilities. In this chapter we expand

More information

Section Introduction to Normal Distributions

Section Introduction to Normal Distributions Section 6.1-6.2 Introduction to Normal Distributions 2012 Pearson Education, Inc. All rights reserved. 1 of 105 Section 6.1-6.2 Objectives Interpret graphs of normal probability distributions Find areas

More information

Chapter 7 1. Random Variables

Chapter 7 1. Random Variables Chapter 7 1 Random Variables random variable numerical variable whose value depends on the outcome of a chance experiment - discrete if its possible values are isolated points on a number line - continuous

More information

Part V - Chance Variability

Part V - Chance Variability Part V - Chance Variability Dr. Joseph Brennan Math 148, BU Dr. Joseph Brennan (Math 148, BU) Part V - Chance Variability 1 / 78 Law of Averages In Chapter 13 we discussed the Kerrich coin-tossing experiment.

More information

5-1 pg ,4,5, EOO,39,47,50,53, pg ,5,9,13,17,19,21,22,25,30,31,32, pg.269 1,29,13,16,17,19,20,25,26,28,31,33,38

5-1 pg ,4,5, EOO,39,47,50,53, pg ,5,9,13,17,19,21,22,25,30,31,32, pg.269 1,29,13,16,17,19,20,25,26,28,31,33,38 5-1 pg. 242 3,4,5, 17-37 EOO,39,47,50,53,56 5-2 pg. 249 9,10,13,14,17,18 5-3 pg. 257 1,5,9,13,17,19,21,22,25,30,31,32,34 5-4 pg.269 1,29,13,16,17,19,20,25,26,28,31,33,38 5-5 pg. 281 5-14,16,19,21,22,25,26,30

More information

Lecture 9. Probability Distributions. Outline. Outline

Lecture 9. Probability Distributions. Outline. Outline Outline Lecture 9 Probability Distributions 6-1 Introduction 6- Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7- Properties of the Normal Distribution

More information

Expected Value of a Random Variable

Expected Value of a Random Variable Knowledge Article: Probability and Statistics Expected Value of a Random Variable Expected Value of a Discrete Random Variable You're familiar with a simple mean, or average, of a set. The mean value of

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

Lecture 9. Probability Distributions

Lecture 9. Probability Distributions Lecture 9 Probability Distributions Outline 6-1 Introduction 6-2 Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7-2 Properties of the Normal Distribution

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic

More information

6.1 Graphs of Normal Probability Distributions:

6.1 Graphs of Normal Probability Distributions: 6.1 Graphs of Normal Probability Distributions: Normal Distribution one of the most important examples of a continuous probability distribution, studied by Abraham de Moivre (1667 1754) and Carl Friedrich

More information

Example: Histogram for US household incomes from 2015 Table:

Example: Histogram for US household incomes from 2015 Table: 1 Example: Histogram for US household incomes from 2015 Table: Income level Relative frequency $0 - $14,999 11.6% $15,000 - $24,999 10.5% $25,000 - $34,999 10% $35,000 - $49,999 12.7% $50,000 - $74,999

More information

MAKING SENSE OF DATA Essentials series

MAKING SENSE OF DATA Essentials series MAKING SENSE OF DATA Essentials series THE NORMAL DISTRIBUTION Copyright by City of Bradford MDC Prerequisites Descriptive statistics Charts and graphs The normal distribution Surveys and sampling Correlation

More information

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management BA 386T Tom Shively PROBABILITY CONCEPTS AND NORMAL DISTRIBUTIONS The fundamental idea underlying any statistical

More information

We use probability distributions to represent the distribution of a discrete random variable.

We use probability distributions to represent the distribution of a discrete random variable. Now we focus on discrete random variables. We will look at these in general, including calculating the mean and standard deviation. Then we will look more in depth at binomial random variables which are

More information

Chapter ! Bell Shaped

Chapter ! Bell Shaped Chapter 6 6-1 Business Statistics: A First Course 5 th Edition Chapter 7 Continuous Probability Distributions Learning Objectives In this chapter, you learn:! To compute probabilities from the normal distribution!

More information

The Central Limit Theorem for Sample Means (Averages)

The Central Limit Theorem for Sample Means (Averages) The Central Limit Theorem for Sample Means (Averages) By: OpenStaxCollege Suppose X is a random variable with a distribution that may be known or unknown (it can be any distribution). Using a subscript

More information

Lecture 6: Chapter 6

Lecture 6: Chapter 6 Lecture 6: Chapter 6 C C Moxley UAB Mathematics 3 October 16 6.1 Continuous Probability Distributions Last week, we discussed the binomial probability distribution, which was discrete. 6.1 Continuous Probability

More information

Section 3.4 The Normal Distribution

Section 3.4 The Normal Distribution Section 3.4 The Normal Distribution Properties of the Normal Distribution Curve 1. We denote the normal random variable with X = x. 2. The curve has a peak at x = µ. 3. The curve is symmetric about the

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at

Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at mailto:msfrisbie@pfrisbie.com. 1. Let X represent the savings of a resident; X ~ N(3000,

More information

Discrete Random Variables

Discrete Random Variables Discrete Random Variables In this chapter, we introduce a new concept that of a random variable or RV. A random variable is a model to help us describe the state of the world around us. Roughly, a RV can

More information

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x)

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x) Section 6-2 I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x) to represent a probability density

More information

The Binomial and Geometric Distributions. Chapter 8

The Binomial and Geometric Distributions. Chapter 8 The Binomial and Geometric Distributions Chapter 8 8.1 The Binomial Distribution A binomial experiment is statistical experiment that has the following properties: The experiment consists of n repeated

More information

Chapter 4 Random Variables & Probability. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Chapter 4 Random Variables & Probability. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Chapter 4.5, 6, 8 Probability for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random variable =

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Descriptive Statistics (Devore Chapter One)

Descriptive Statistics (Devore Chapter One) Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf

More information

Chapter 8 Statistical Intervals for a Single Sample

Chapter 8 Statistical Intervals for a Single Sample Chapter 8 Statistical Intervals for a Single Sample Part 1: Confidence intervals (CI) for population mean µ Section 8-1: CI for µ when σ 2 known & drawing from normal distribution Section 8-1.2: Sample

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions

More information

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Announcements: There are some office hour changes for Nov 5, 8, 9 on website Week 5 quiz begins after class today and ends at

More information

A Derivation of the Normal Distribution. Robert S. Wilson PhD.

A Derivation of the Normal Distribution. Robert S. Wilson PhD. A Derivation of the Normal Distribution Robert S. Wilson PhD. Data are said to be normally distributed if their frequency histogram is apporximated by a bell shaped curve. In practice, one can tell by

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

2 General Notions 2.1 DATA Types of Data. Source: Frerichs, R.R. Rapid Surveys (unpublished), NOT FOR COMMERCIAL DISTRIBUTION

2 General Notions 2.1 DATA Types of Data. Source: Frerichs, R.R. Rapid Surveys (unpublished), NOT FOR COMMERCIAL DISTRIBUTION Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 2 General Notions 2.1 DATA What do you want to know? The answer when doing surveys begins first with the question,

More information

STA Module 3B Discrete Random Variables

STA Module 3B Discrete Random Variables STA 2023 Module 3B Discrete Random Variables Learning Objectives Upon completing this module, you should be able to 1. Determine the probability distribution of a discrete random variable. 2. Construct

More information

Chapter 4. The Normal Distribution

Chapter 4. The Normal Distribution Chapter 4 The Normal Distribution 1 Chapter 4 Overview Introduction 4-1 Normal Distributions 4-2 Applications of the Normal Distribution 4-3 The Central Limit Theorem 4-4 The Normal Approximation to the

More information

5.1 Mean, Median, & Mode

5.1 Mean, Median, & Mode 5.1 Mean, Median, & Mode definitions Mean: Median: Mode: Example 1 The Blue Jays score these amounts of runs in their last 9 games: 4, 7, 2, 4, 10, 5, 6, 7, 7 Find the mean, median, and mode: Example 2

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

Chapter 4 and 5 Note Guide: Probability Distributions

Chapter 4 and 5 Note Guide: Probability Distributions Chapter 4 and 5 Note Guide: Probability Distributions Probability Distributions for a Discrete Random Variable A discrete probability distribution function has two characteristics: Each probability is

More information

4: Probability. Notes: Range of possible probabilities: Probabilities can be no less than 0% and no more than 100% (of course).

4: Probability. Notes: Range of possible probabilities: Probabilities can be no less than 0% and no more than 100% (of course). 4: Probability What is probability? The probability of an event is its relative frequency (proportion) in the population. An event that happens half the time (such as a head showing up on the flip of a

More information

Statistics, Measures of Central Tendency I

Statistics, Measures of Central Tendency I Statistics, Measures of Central Tendency I We are considering a random variable X with a probability distribution which has some parameters. We want to get an idea what these parameters are. We perfom

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

22.2 Shape, Center, and Spread

22.2 Shape, Center, and Spread Name Class Date 22.2 Shape, Center, and Spread Essential Question: Which measures of center and spread are appropriate for a normal distribution, and which are appropriate for a skewed distribution? Eplore

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

starting on 5/1/1953 up until 2/1/2017.

starting on 5/1/1953 up until 2/1/2017. An Actuary s Guide to Financial Applications: Examples with EViews By William Bourgeois An actuary is a business professional who uses statistics to determine and analyze risks for companies. In this guide,

More information

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment

More information

STAT 201 Chapter 6. Distribution

STAT 201 Chapter 6. Distribution STAT 201 Chapter 6 Distribution 1 Random Variable We know variable Random Variable: a numerical measurement of the outcome of a random phenomena Capital letter refer to the random variable Lower case letters

More information

CHAPTERS 5 & 6: CONTINUOUS RANDOM VARIABLES

CHAPTERS 5 & 6: CONTINUOUS RANDOM VARIABLES CHAPTERS 5 & 6: CONTINUOUS RANDOM VARIABLES DISCRETE RANDOM VARIABLE: Variable can take on only certain specified values. There are gaps between possible data values. Values may be counting numbers or

More information

Chapter 6 Confidence Intervals

Chapter 6 Confidence Intervals Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) VOCABULARY: Point Estimate A value for a parameter. The most point estimate of the population parameter is the

More information

Consumer Guide Dealership Word of Mouth Internet

Consumer Guide Dealership Word of Mouth Internet 8.1 Graphing Data In this chapter, we will study techniques for graphing data. We will see the importance of visually displaying large sets of data so that meaningful interpretations of the data can be

More information

The Normal Distribution

The Normal Distribution 5.1 Introduction to Normal Distributions and the Standard Normal Distribution Section Learning objectives: 1. How to interpret graphs of normal probability distributions 2. How to find areas under the

More information

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS Part 1: Introduction Sampling Distributions & the Central Limit Theorem Point Estimation & Estimators Sections 7-1 to 7-2 Sample data

More information

2 DESCRIPTIVE STATISTICS

2 DESCRIPTIVE STATISTICS Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

EXERCISES FOR PRACTICE SESSION 2 OF STAT CAMP

EXERCISES FOR PRACTICE SESSION 2 OF STAT CAMP EXERCISES FOR PRACTICE SESSION 2 OF STAT CAMP Note 1: The exercises below that are referenced by chapter number are taken or modified from the following open-source online textbook that was adapted by

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

5.1 Personal Probability

5.1 Personal Probability 5. Probability Value Page 1 5.1 Personal Probability Although we think probability is something that is confined to math class, in the form of personal probability it is something we use to make decisions

More information

Chapter 8. Binomial and Geometric Distributions

Chapter 8. Binomial and Geometric Distributions Chapter 8 Binomial and Geometric Distributions Lesson 8-1, Part 1 Binomial Distribution What is a Binomial Distribution? Specific type of discrete probability distribution The outcomes belong to two categories

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION In Inferential Statistic, ESTIMATION (i) (ii) is called the True Population Mean and is called the True Population Proportion. You must also remember that are not the only population parameters. There

More information

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables STA 2023 Module 5 Discrete Random Variables Learning Objectives Upon completing this module, you should be able to: 1. Determine the probability distribution of a discrete random variable. 2. Construct

More information

What s Normal? Chapter 8. Hitting the Curve. In This Chapter

What s Normal? Chapter 8. Hitting the Curve. In This Chapter Chapter 8 What s Normal? In This Chapter Meet the normal distribution Standard deviations and the normal distribution Excel s normal distribution-related functions A main job of statisticians is to estimate

More information

Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean)

Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean) Statistics 16_est_parameters.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean) Some Common Sense Assumptions for Interval Estimates

More information

3. Probability Distributions and Sampling

3. Probability Distributions and Sampling 3. Probability Distributions and Sampling 3.1 Introduction: the US Presidential Race Appendix 2 shows a page from the Gallup WWW site. As you probably know, Gallup is an opinion poll company. The page

More information

The Central Limit Theorem for Sums

The Central Limit Theorem for Sums OpenStax-CNX module: m46997 1 The Central Limit Theorem for Sums OpenStax College This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 Suppose X is a random

More information

Chapter 7 Study Guide: The Central Limit Theorem

Chapter 7 Study Guide: The Central Limit Theorem Chapter 7 Study Guide: The Central Limit Theorem Introduction Why are we so concerned with means? Two reasons are that they give us a middle ground for comparison and they are easy to calculate. In this

More information

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative STAT:10 Statistical Methods and Computing Normal Distributions Lecture 4 Feb. 6, 17 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowa.edu 1 2 Using density curves to describe the distribution of values of

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information