Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual.

Similar documents
7.1 Graphs of Normal Probability Distributions

The Normal Probability Distribution

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

The Normal Distribution

The probability of having a very tall person in our sample. We look to see how this random variable is distributed.

6.1 Graphs of Normal Probability Distributions:

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. The Normal Probability Distributions

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

ECON 214 Elements of Statistics for Economists

Chapter 3. Lecture 3 Sections

STAB22 section 1.3 and Chapter 1 exercises

CH 5 Normal Probability Distributions Properties of the Normal Distribution

ECON 214 Elements of Statistics for Economists 2016/2017

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

The normal distribution is a theoretical model derived mathematically and not empirically.

Expected Value of a Random Variable

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1

1. Confidence Intervals (cont.)

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Statistics 511 Supplemental Materials

MA 1125 Lecture 05 - Measures of Spread. Wednesday, September 6, Objectives: Introduce variance, standard deviation, range.

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

The Normal Distribution

STAT 201 Chapter 6. Distribution

5.1 Mean, Median, & Mode

Normal Probability Distributions

MAKING SENSE OF DATA Essentials series

The Binomial Distribution

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

Section Introduction to Normal Distributions

Applications of Data Dispersions

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr.

Every data set has an average and a standard deviation, given by the following formulas,

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

CS 237: Probability in Computing

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

The graph of a normal curve is symmetric with respect to the line x = µ, and has points of

What s Normal? Chapter 8. Hitting the Curve. In This Chapter

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

Measures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc.

Introduction to Statistics I

Standard Normal Calculations

The Binomial Distribution

Numerical Descriptive Measures. Measures of Center: Mean and Median

Collaborative Statistics Using Spreadsheets. Chapter 6

The Assumption(s) of Normality

Chapter 4 Continuous Random Variables and Probability Distributions

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Section 5 3 The Mean and Standard Deviation of a Binomial Distribution!

Unit 2: Statistics Probability

Set up a normal distribution curve, to help estimate the percent of the band that, on average, practices a greater number of hours than Alexis.

11.5: Normal Distributions

5-1 pg ,4,5, EOO,39,47,50,53, pg ,5,9,13,17,19,21,22,25,30,31,32, pg.269 1,29,13,16,17,19,20,25,26,28,31,33,38

BIOL The Normal Distribution and the Central Limit Theorem

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

Linear Modeling Business 5 Supply and Demand

Review. What is the probability of throwing two 6s in a row with a fair die? a) b) c) d) 0.333

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

Chapter 5 Normal Probability Distributions

Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Chapter 7 Study Guide: The Central Limit Theorem

2 DESCRIPTIVE STATISTICS

In a moment, we will look at a simple example involving the function f(x) = 100 x

Probability. An intro for calculus students P= Figure 1: A normal integral

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Data Analysis and Statistical Methods Statistics 651

3.1 Measures of Central Tendency

Elementary Statistics

Density curves. (James Madison University) February 4, / 20

Prob and Stats, Nov 7

Chapter Seven: Confidence Intervals and Sample Size

A Derivation of the Normal Distribution. Robert S. Wilson PhD.

3 3 Measures of Central Tendency and Dispersion from grouped data.notebook October 23, 2017

Descriptive Statistics (Devore Chapter One)

1 Describing Distributions with numbers

Normal Model (Part 1)

Math 227 Elementary Statistics. Bluman 5 th edition

Lecture 9. Probability Distributions. Outline. Outline

Confidence Intervals. σ unknown, small samples The t-statistic /22

We use probability distributions to represent the distribution of a discrete random variable.

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Lecture 9. Probability Distributions

The Normal Model The famous bell curve

Solutions for practice questions: Chapter 9, Statistics

UNIT 4 NORMAL DISTRIBUTION: DEFINITION, CHARACTERISTICS AND PROPERTIES

Continuous Probability Distributions

But suppose we want to find a particular value for y, at which the probability is, say, 0.90? In other words, we want to figure out the following:

Statistical Intervals (One sample) (Chs )

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

Transcription:

Chapter 06: The Standard Deviation as a Ruler and the Normal Model This is the worst chapter title ever! This chapter is about the most important random variable distribution of them all the normal distribution. Measuring Position (again) Earlier (several chapters ago) we talked about quartiles as a method of measuring position. It turns out that there is another one: standardized scores. Let s define standardized scores in a very generic way for the moment, and then we ll see how they really get used a little later. Equation 1 - Standardized Scores datum mean standardized score= standard deviation Standardized scores measure the position of a datum relative to the mean, but in units of the standard deviation. Thus, these scores tell how many standard deviations above (or below) a datum is located. There are two things to know about standardized scores for now: the sign (positive or negative) tells whether a datum is above or below the mean, and the value (close to zero, or far from zero) tells how unusual the datum is (closer to zero means more typical). You can always use a standardized score to measure position but there are other things we can do with them that only work under certain conditions! Specifically, we can only attach probabilities to them when they are used with a normally distributed variable more on that after a few examples. Examples [1.] Zeke s parents are told that their son s IQ has a standardized score of 2.3. Explain what this means about Zeke s IQ relative to the others in his class. Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual. [2.] In his PreCalculus class, Abe earned a final grade of 92. For his class as a whole, the mean final grade was 80 with a standard deviation of 7. Carol also just finished a PreCalculus class, where the class mean was 80 with a standard deviation of 3. Carol s final grade was 89. Which student performed better relative to their peers? Abe s standardized score is 92 80 = 1.714 and Carol s standardized score is 89 80 = 3. 7 3 Both are above the mean, but Carol s score is farther from zero, so her score is more unusual she did better than Abe when compared to her peers. HOLLOMAN S AP STATISTICS BVD CHAPTER 06, PAGE 1 OF 6

The Normal Model The Idea There is one continuous random variable that is more important than the others that is the Normal Distribution. This is also referred to as a Gaussian Distribution, or sometimes as a bell curve. Unlike the discrete cases we studied where you can define the model in terms of a process the normal model really can really only be defined in terms of its graph. Figure 1 - A Normal Distribution Properties You need two numbers to describe a normal distribution the mean and the standard deviation. In the picture above, µ= 0 and σ = 1. The shape is perfectly symmetric about the mean. The standard deviation in a normal curve is actually not too hard to see. There is a special point on the curve (most people can pick out its general location with little difficulty) that helps to find the standard deviation. Figure 2 - The Inflection Point of a Normal Curve This special point is where the curve stops dropping more than sliding, and starts sliding more than dropping. It s called an inflection point but you don t need to remember that. What you do need to know is that the horizontal distance between this point and the mean is the value of the standard deviation. The domain of a normal distribution is all real numbers it continues infinitely in both directions. For this reason, most applications use models that are only approximately normal. In the discrete cases (binomial and geometric) I defined a probability mass function a function that produced probability values. The continuous variables require a slightly different approach. In the discrete case, the function value (analogous to the y-value on a graph like the HOLLOMAN S AP STATISTICS BVD CHAPTER 06, PAGE 2 OF 6

one above) is the probability. In the continuous case, the function value is not the probability rather, probability is found through area. You ve probably never found the area of a shape like the normal curve! That s OK we ve got ways to do that without getting too mathy. Here s the first The Empirical Rule First of all, know that the total area under a normal curve is 1 (since it represents all possible outcomes, the total probability is 1 and probability is now area). The area under a normal curve between µ σ and µ + σ is about 0.68; between µ 2σ and µ + 2σ is about 0.95; and between µ 3σ and µ + 3σ is about 0.997. This is called the Empirical Rule, and can be used to approximate probabilities for normal distributions. Figure 3 - The Empirical Rule you may also want to use the facts that the distribution is symmetric, and that the total area is 1. It s also a really good idea to draw a picture of what you re trying to find! Examples [3.] The size of the frontal lobe of members of the Leptograpsus variegatus species (Australian crab) varies normally with mean 15.6mm and standard deviation 3.5mm. (a) What proportion of these crabs have frontal lobe measurements between 12.1mm and 19.1mm? (b) What proportion of these crabs have frontal lobe measurements greater than 22.6mm? (a) We are given µ= 15.6 and σ = 3.5. Notice that 12.1 = µ σ, and 19.1 = µ + σ. Thus, the proportion of crabs with frontal lobes between these measurements is about 0.68, or 68%. (b) 22.6= µ + 2σ 95% of these measurements are between 8.6 and 22.6; so 5% must be lower than 8.6 OR higher than 22.6. Since the normal curve is symmetric about µ, the proportion above 22.6 must be about 2.5%. HOLLOMAN S AP STATISTICS BVD CHAPTER 06, PAGE 3 OF 6

[4.] PSAT Math scores for Sophomores who took the test in 2005 were approximately normally distributed with mean 44 and standard deviation 11. (a) What proportion of these scores were between 55 and 66? (b) What proportion of these scores were lower than 33? (c) Approximately what score would put a student in the top 2.5% of scores? (a) Note that 55 is at µ + σ and 66 is at µ + 2σ. I know that the proportion between µ σ and µ + σ is about 0.68 and that the proportion between µ 2σ and µ + 2σ is about 0.95. If we subtract these, we get the proportion that are either between µ 2σ and µ σ or between µ + σ and µ + 2σ. Since we want only one of those, and since the distribution is symmetric, we can divide the difference by two to obtain the answer. Thus, the proportion is 0.95 0.68 = 0.135. 2 (b) First note that 33 is at µ σ. I know that the proportion between µ σ and µ + σ is about 0.68, so the proportion that are either lower than µ σ or greater than µ + σ must be 0.32. Since I only want one of those, and since we have symmetry, divide by two! The proportion is 0.16. (c) This is a little different we know the area and we need to work that back to a value of the variable. A top area of 0.025 gets doubled to make a top and bottom area of 0.05 and that makes a middle area of 0.95. Great! I know that the proportion of scores between µ 2σ and µ + 2σ is about 0.95 so the upper score (the one marking the top 2.5%) must be at µ + 2σ. Thus, the score that puts you at the top 2.5% must be 44+ 2( 11) = 66. More Accurate Methods It is easy to see that this method has limited application we need something a bit more powerful. Did you notice that when we found that measurements were at µ + σ or µ 2σ that we were really finding standardized scores? No? Well, we were! If you are dealing with a normal distribution, then standardized scores are the gateway to more accurate probability. Let s get one bit of notation out of the way now that we are distinctly dealing with a particular distribution (the normal), let s adjust that formula for standardized scores: Equation 2 - Z-scores x µ z= σ Really, every standardized score is a z-score just remember that you can only turn a z-score into a probability if you know that the distribution is (at least approximately) normal. Normal Probability To find the area under any normal curve, first convert your measurements to z-scores. Once you ve converted any values of your original variable (x) into standard normal values (z), then it s time to use the Z Chart. The chart lists its z values (from about -3.4 to 3.4) to two decimal places. The units and tenths are found on the left margin of the page; the hundredths digit is HOLLOMAN S AP STATISTICS BVD CHAPTER 06, PAGE 4 OF 6

found along the top row. At the intersection of the row and column for your z, you will find the area under the normal curve to the left of that z. If you want the area to the right, or the area between two z values, you ll have to use two important properties of normal curves symmetry, and the total area under the curve is 1. or you could use the handy function in your calculator! Inverse Normal Calculations Of course, you don t always start with a variable and then find out how often it occurs. Sometimes, you know how often something occurs (or how often you want it to occur), and you need to find the value of the variable that ll make it happen. This is called an Inverse Normal Calculation. It works exactly backwards from a regular problem. In a regular problem, we take an x and make it a z; then we look that z up in the chart; then we read off the left-hand area. For an Inverse problem, we start by finding the left-hand area in the chart; then we read z, then we change z back into x. You should be able to (easily) do the algebra required to change z back to x. Here s a little graphic that I use to remember how to move between the various parts of a normal problem: Figure 4 - Normal Calculation Summary Examples [5.] Back to our crab example what proportion of crabs have frontal lobes larger than 18mm? First, the z-score: 18 15.6 = 0.6857. Now use the chart or your calculator to find the right 3.5 hand area of 0.2464. [6.] The heights of 15 year old American males are approximately normally distributed with mean 170 cm and standard deviation 7.6 cm. What proportion of these young men have heights greater than 185 cm? Start with the z-score: 185 170 = 1.9737. Now use your preferred tool to find the right hand 7.6 area: 0.0242. [7.] The maximum oxygen uptake of middle-aged American men is approximately normally distributed with mean 42.2 ml/kg/min and standard deviation 3.7 ml/kg/min. (a) What proportion of middle-aged men have maximum oxygen uptake levels lower than 35 ml/kg/min? HOLLOMAN S AP STATISTICS BVD CHAPTER 06, PAGE 5 OF 6

(b) Higher levels of maximum oxygen uptake are indicative of better health. What level would put a middle-aged man in the top 5% of healthy individuals? (a) z-score: 35 42.2 = 1.9459 ; the proportion is 0.0258. 3.7 (b) The z-score for the top 5% is 1.6449; that makes the variable value 1.6449 3.7 + 42.2= 48.286 ml/kg/min. ( )( ) [8.] Most measures of IQ are normally distributed, and scaled to have mean 100 and standard deviation 15. (a) What percentage of people will have IQs that are lower than 88? (b) The organization MENSA only accepts members who score in the top 2% of IQ tests. What range of scores would enable one to join MENSA? (c) People who score in the lowest 10% of IQ scores are often eligible for additional services in school. What range of IQ scores would enable a student to receive special services? (a) The z-score is 88 100 = 0.8 and the proportion is 0.2119. 15 (b) The z-score for the top 2% is 2.0537; that makes the IQ score 2.0537 15 + 100= 130.8062. ( )( ) (c) The z-score for the lowest 10% is -1.2816; that makes the IQ score 1.2816 15 + 100= 80.7767. ( )( ) The Normal Probability Plot There are times when you have some data and you want to check to see if the data are approximately normally distributed. There are ways to do that and the textbooks describes one of them at this point. I m going to wait and talk about this graph much later in the year (it isn t that important, really). HOLLOMAN S AP STATISTICS BVD CHAPTER 06, PAGE 6 OF 6