STAB22 section 1.3 and Chapter 1 exercises

Size: px
Start display at page:

Download "STAB22 section 1.3 and Chapter 1 exercises"

Transcription

1 STAB22 section 1.3 and Chapter 1 exercises Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and (2)(51) = Same idea as the previous exercise, but go up and down 3 times the SD: from 572 (3)(51) = 419 to (3)(51) = 725. In a normal distribution, going up and down 3 times the SD includes almost all the data; this range has length 3+3 = 6 times the SD (which is the origin of the name six sigma in statistical process control) z = ( )/51 = A z-score says how your given value, here 510, compares to the mean. Here, the z-score is negative because the value is below the mean (it doesn t matter that the mean is positive). If you want another example, consider a city where the mean temperature in January is 10, with an SD of 6. A temperature of 5 has a z-score of z = (( 5) ( 10))/6 = 0.83 (positive because it is above the mean), and a temperature of 19 has a z-score of z = (( 19) ( 10))/6 = 1.5, negative because 19 is below the mean (and not just because 19 itself is negative) Figure out z and look it up in Table A. z = ( )/51 = 0.94; a proportion of the normal curve is less than this. To find how much is greater, do the same calculation but subtract the answer from 1: = (The table always gives you less than.) Turn your two values (620 and 660) into z- scores, look them both up in the table, and subtract. As we found in the previous question, 620 has a z-score of 0.94, which corresponds to in the table. 660 has z = ( )/51 = 1.73, which goes with proportion The proportion between 620 and 660 is = Another way to see this is: the proportion of students scoring less than 620 is ; the proportion scoring more than 660 is = ; everyone else scores between, so the proportion is = This way is perhaps easier to understand, but the first way is easier to do If 25% of students are going to score bigger than this score (whatever it is), 75% will score less than this score. Look up in the body of Table A; it s between and , so z is between 0.67 and 0.68, slightly closer to Then unstandardize this z value using the formula on page 64 (just above the question for 1.107) and the given mean and SD to get a score 1

2 of (0.67)(51) = 606. (If you are going to be in the top 25%, you ll need a score a bit bigger than the mean.) It s a density curve, so the area under it has to be 1. The area of the shape shown is its width times its height; for the area to be 1, the height must be 1 as well. For (b), draw the (vertical) line x = 0.35 on the picture; the piece on the left is what you want. The width is and the height is 1, so the area is = (c) is the same idea: draw the area that has x between 0.35 and 0.65; the width is = 0.3 and the height is 1, so the area is 0.3. You might guess that the proportion of the uniform distribution between a and b is b a, and you would be correct. Use a = 0 or b = 1 if you don t have a lower or upper limit, as in (b) This density curve is also a rectangle, so the area, width times height, has to be 1. Since the width is 4, the height must be 1. In (b), the width 4 is 1 0 = 1, so the proportion is 1 1 = 1. (If you 4 4 draw a picture, the area you want is obviously a quarter of the rectangle). For (c), the width is = 2 so the proportion is 2 1 = For the median and quartiles, you want the values that cut off area 0.5, 0.25 and These are 0.5 (median), 0.25 (Q1) and 0.75 (Q3). For the mean, note that the density curve has a symmetric shape, so the mean and median must be equal, For a skewed density curve, like a skewed histogram, the mean is pulled farthest into the tail, and the median lies between the peak and the median. Thus for (a), C is the mean and B the median. For (c), A is the mean and B the median. For a symmetric density curve with one peak, the mean and median are at that peak, so in (b), mean and median are both A It s easiest to draw the bell-curve first and then put the numbers below it. The mean is at the peak, and the shoulders of the curve are one standard deviation above and below the mean (that is, where the curve stops curving downwards and starts curving outwards). My rough sketch is in Figure For (a), go up and down 2 standard deviations from the mean, that is from 266 (2)(16) = 234 to (2)(16) = 298. Because the normal distribution is symmetric, these values cut off half of 5%, that is, 2.5%, on each end. So the shortest 2.5% of pregnancies last 234 days or less and the longest 2.5% last 298 days or more For (a), go up and down 3 times the SD: between 336 (3)(3) = 327 and (3)(3) = 345 2

3 days. (The SD for horses is less than for humans, so we can make more precise statements about pregnancy lengths for horses as compared to humans.) (b) is tricky: the middle 68% are between 333 and 339 days (1 SD), so of the other 32%, half (16%) are below 333 days and half (16%) are above 339 days. Figure 1: Normal density curve for ex It s easiest to get the data into software first (on the disk, look for the data acidrain). Calling for the Descriptive Statistics gets you the mean, , and the SD, The normal probability plot was fairly straight, indicating that a normal distribution is a good fit to these data. So the rule should be fairly accurate. The 68% limits are = and = Then go to the data in the Minitab worksheet and count how many of the values are within these two limits. This is made easier by the fact that the data values are sorted in order. The values in rows 18 to 88 inclusive are between these values; there are = 71 of them, which is 71/ % = 67.6%. This is very close to 68%. For the others, go up and down 2 (and 3) times the SD from the mean, and count how many data values fall between those limits. The 95% limits are (2)(0.5379) =

4 and (2)(0.5379) = The values in rows 2 to 101 are between these limits; these are 100/105 = 95.2% of the total, again very close to 95%. The 99.7% limits are (3)(0.5379) = and (3)(0.5379) = All 105 values, 100%, fall between these limits. This is again very close to 99.7%. A reminder that the rules don t work exactly for actual data, but if the data are close to normal, the rules will be close to correct goes with z = ( )/15 = 2. Thus those more than 2 SDs above the mean would qualify, which should be the same proportion as those further than 2 SDs below the mean. So, 2.5%, or more accurately, Calculate z values to compare the results fairly, using the mean and SD for the test that was taken. Tonya s z is ( )/321 = 0.97, while Jermaine s is ( )/5.4 = Both scored above average, but Jermaine s score is higher in standardized units. Sometimes you will see these scores expressed as percentile ranks, which is the percentage of all people who would score less than the given score. You can get these by looking up the z s in Table A: Tonya is at the 83rd percentile (table gives ) while Jermaine is at the 92nd percentile (table gives ). It is perhaps clearer this way that Jermaine s performance is better. See and Jacob scored z = ( )/5.4 = 1.02, and Emily scored z = ( )/321 = Both scored below average, but Jacob did better relative to the mean on his test. (Jacob is at the 15th percentile and Emily is at the 6th, using the same kind of calculation as in ) Jose s score standardizes to z = ( )/321 = To find out the equivalent ACT score, x, figure out how you would standardize x and put that equal to 1.78: (x 21.5)/5.4 = Then solve for x to get 31.1, or 31 rounded off (since ACT scores are given as whole numbers). Or, if you prefer, do some trial and error to see what ACT score standardizes to 1.78: 31 is a little too low, 32 is too high by more, so 31 is best Same idea: Maria s z is z = ( )/5.4 = 1.57, and the standardized SAT score, say x, has to come to the same thing: (x 1509)/321 = 1.57, so x = , which would presumably be rounded off to Or (in the previous two exercises) you can use the rule for unstandardizing z-values given at 4

5 the top of page 68: x = µ + σz, where µ is the mean and σ the SD. Thus you d get x = (1.78) = 31.1 and x = (1.57) = This is the same Tonya as in 1.132, with z = Look in Table A to find that the proportion less than this is , so the percentile is 83 (rounded off) This is the Jacob of 1.133, with z = The proportion less than this is , so Jacob is at the 15th percentile (rounded off) First, ask yourself what z-value cuts off the top 10% of the standard normal distribution. This same value cuts off the bottom 90%, and thus (Table A) is about z = Then unstandardize according to the mean and SD of SAT scores: those SAT scores above (1.28) = 1920 make up the top 10% First find the quartiles of the standard normal distribution. These are z = 0.67 and z = 0.67 (look up 0.25 and 0.75 in the body of Table A). Then unstandardize them according to the mean and SD of ACT scores: Q1 = ( 0.67) = 17.8 and Q3 = (0.67) = (These don t necessarily need to be rounded off since a quartile of whole numbers doesn t itself have to be a whole number.) Same idea as the previous exercise, though it looks a bit more scary. Find the quintiles of the standard normal distribution by looking up 0.2, 0.4, 0.6 and 0.8 in the body of Table A: this gives z = 0.84, 0.25, 0.25, (Note the symmetry: 0.2 below is the mirror image of 0.8 below, which is 0.2 above.) Then unstandardize onto the SAT scale: ( 0.84) = 1239, ( 0.25) = 1429, (0.25) = 1589, (0.84) = (You can keep one decimal place here, by the same argument as ) Note that the quintiles are equally spaced on the proportion scale but not the score scale: 1239 and 1429 are farther apart than 1429 and 1589, for instance (a) Standardize 40 (using the correct mean and SD), and look it up in Table A. z = (40 55)/15.5 = 0.97, so a proportion of young women have cholesterol level lower than 40. (b) This implies 60 or higher : since HDL can take any value, it has no chance of being exactly 60. Turn 60 into a z-score first: z = (60 55)/15.5 = Table A gives proportion for this, which is the proportion less. So the proportion of women with HDL 60 or higher is = 5

6 or 37%. (c) Everyone between is neither low (part (a)) or high (part (b)), so the answer is = If you didn t recognize that you could use parts (a) and (b), you can churn through the whole thing: turn 60 into a z-score (0.32), turn 40 into a z-score (-0.97), look them both up in Table A ( and ) and subtract ( = ) Same kind of calculations as the previous exercise, but now using the different means and SDs. For 40, z = (40 46)/13.6 = 0.44, proportion less is For 60, z = (60 46)/13.6 = 1.03, proportion less is , proportion more is = Proportion between 40 and 60 is = Compared to the women of 1.142, the men have a lower HDL on average than the women (with a similar spread), so more of them have low HDL and fewer of them have high HDL (a) z = ( )/16 = 1.63, so , or about 5%. (b) For 270 days, z = ( )/16 = 0.25, so proportion are shorter than this. Proportion between is = (c) The z for 0.2 longer is that for 0.8 shorter, which is z = 0.84 (look for in the body of Table A). Unstandardize to get (0.84) = days As found in 1.140: z = 0.67 and z = For (b), unstandardize these to get Q1 = µ 0.67σ and Q3 = µ σ. (Never mind that these are formulas; the unstandardizing works the same way.) For (c), put in the values 266 for µ and 16 for σ to get Q1 = and Q3 = (a) is the difference of the two numbers found in 1.146: 0.67 ( 0.67) = The easiest way to do (b) is to look back at 1.146(b) and take the difference between Q3 and Q1 (using the formulas), which is 1.34σ. The µs cancel out, which makes sense: the IQR, as a measure of spread, depends only on the standard deviation, which is another measure of spread. This way, the answer has to be c = 1.34, no matter what µ and σ are such is the power of mathematics This one is clearly not normal, because the plot is not close to a straight line. If you think about how it fails to be a straight line: there are too many values with emissions close to 0 (the plot is too flat on the left compared to the middle of the plot), and too many values with high emissions (because the plot is too steep on the right compared to the middle). Thus the distribution is skewed to the right. (Or you can memorize which 6

7 shape of curve goes with which kind of skew, but you would do better to be able to work it out from scratch.) The three countries at the top right look like outliers: if you were to make a straight line through the middle of the data, the line would definitely go below these three points Bar graphs for population and for open space would show only that the cities vary in both population and in how much open space they have. To investigate further, type the data into a Minitab worksheet and create a fourth column (using Calculator) with rate of open space per thousand residents. To make a bar chart, select Graph and Bar Chart. Select Bars represent values from a table. Select the Simple option. Select OK. In the next dialog, select your rates into the Graph Variables box, and select City as your categorical variable. Select Bar Chart Options, and order the bars by Increasing Y (for (e)). My bar chart is shown in Figure 2. I prefer a chart with the heights of the bars ordered, because it makes comparison easier. Here, you see that Miami and Chicago have a noticeably small amount of open space per inhabitant, and Washington DC and Minneapolis have a noticeably large amount, with the other cities being very similar. New York, however, has a lot of its open space concentrated in Central Park, so you might imagine that the rest of the city doesn t have much. Figure 2: Bar chart of open space per resident by city The statement given is true on average, but a statement that women score higher than men hides the fact that there is a lot of variability; the majority of both men and women score between 500 and 650, but it s hard to be more precise. The men s scores have a lower mean and a larger SD than the women s, which means that the lowest scores are overwhemingly likely to be men. Surprisingly (because the men s scores have higher variability), the extremely high scores might be either men or women. (Notice that the density curves for scores 700 and higher are about the same height.) If you want to do some calculations: of 7

8 the women will score less than 450, compared to of the men; of the women will score more than 700, compared to of the men. (This is the usual thing: figure out z s and look them up in the table.) This justifies the statement that the lowest scores are mostly men and the highest could be either men or women. The scores in the middle are all mixed up Two things you can do here: compare the mean and the median, and see whether the median is closer to Q1 or Q3 (or about the same distance from both). Here, the mean is quite a bit bigger than the median, and the median is closer to Q1 and further away from Q3. These both suggest that the distribution is right-skewed; there are a few women with large weights compared to the others. (b) How many hours is quantitative, so a histogram or stemplot (or boxplot). For the second part of (b), you have to figure out how you re going to measure this. One way is to have set times during the semester, such as week 1, before midterm, after midterm, week 10, before final and get the number of study hours in those weeks for your students. Then you could do side-by-side boxplots to compare the study times (since time during the semester is categorical). (c) Favourite radio station is a categorical variable, so bar chart (or maybe pie chart if you want to be able to say that 33% of students prefer to listen to radio station XXXX, which is the highest percentage of any radio station ). (d) To assess whether a normal distribution applies to a collection of measurements, you need a normal probability plot, which you then assess for straightness (a) Make is categorical, so a bar chart (or maybe a pie chart, if you are thinking of out of all the students, what fraction drive cars of a certain make? ). For how old, a quantitative variable, a histogram is the thing (or a stemplot, or even a boxplot). 8

The Normal Distribution

The Normal Distribution Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we

More information

Putting Things Together Part 2

Putting Things Together Part 2 Frequency Putting Things Together Part These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for, and are in

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows

More information

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.) Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Categorical. A general name for non-numerical data; the data is separated into categories of some kind. Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number

More information

Frequency Distribution and Summary Statistics

Frequency Distribution and Summary Statistics Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary

More information

2 Exploring Univariate Data

2 Exploring Univariate Data 2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting

More information

Expected Value of a Random Variable

Expected Value of a Random Variable Knowledge Article: Probability and Statistics Expected Value of a Random Variable Expected Value of a Discrete Random Variable You're familiar with a simple mean, or average, of a set. The mean value of

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

We will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed.

We will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed. We will discuss the normal distribution in greater detail in our unit on probability. However, as it is often of use to use exploratory data analysis to determine if the sample seems reasonably normally

More information

Chapter 2. Section 2.1

Chapter 2. Section 2.1 Chapter 2 Section 2.1 Check Your Understanding, page 89: 1. c 2. Her daughter weighs more than 87% of girls her age and she is taller than 67% of girls her age. 3. About 65% of calls lasted less than 30

More information

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center

More information

STAB22 section 2.2. Figure 1: Plot of deforestation vs. price

STAB22 section 2.2. Figure 1: Plot of deforestation vs. price STAB22 section 2.2 2.29 A change in price leads to a change in amount of deforestation, so price is explanatory and deforestation the response. There are no difficulties in producing a plot; mine is in

More information

Descriptive Statistics (Devore Chapter One)

Descriptive Statistics (Devore Chapter One) Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf

More information

Putting Things Together Part 1

Putting Things Together Part 1 Putting Things Together Part 1 These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for 1, 5, and 6 are in

More information

FINALS REVIEW BELL RINGER. Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/ /2 4

FINALS REVIEW BELL RINGER. Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/ /2 4 FINALS REVIEW BELL RINGER Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/3 + 7 + 1/2 4 4) 3 + 4 ( 7) + 3 + 4 ( 2) 1) 36/6 4/6 + 3/6 32/6 + 3/6 35/6

More information

Numerical Descriptive Measures. Measures of Center: Mean and Median

Numerical Descriptive Measures. Measures of Center: Mean and Median Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where

More information

Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual.

Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual. Chapter 06: The Standard Deviation as a Ruler and the Normal Model This is the worst chapter title ever! This chapter is about the most important random variable distribution of them all the normal distribution.

More information

STAT 113 Variability

STAT 113 Variability STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2

More information

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2] 1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter7 Probability Distributions and Statistics Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number of boys in

More information

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Announcements: There are some office hour changes for Nov 5, 8, 9 on website Week 5 quiz begins after class today and ends at

More information

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good

More information

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. STAT 515 -- Chapter 5: Continuous Distributions Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. Continuous distributions typically are represented by

More information

Applications of Data Dispersions

Applications of Data Dispersions 1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given

More information

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82 Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections

More information

Statistics (This summary is for chapters 18, 29 and section H of chapter 19)

Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x n =

More information

NOTES: Chapter 4 Describing Data

NOTES: Chapter 4 Describing Data NOTES: Chapter 4 Describing Data Intro to Statistics COLYER Spring 2017 Student Name: Page 2 Section 4.1 ~ What is Average? Objective: In this section you will understand the difference between the three

More information

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1 8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions For Example: On August 8, 2011, the Dow dropped 634.8 points, sending shock waves through the financial community.

More information

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

NORMAL RANDOM VARIABLES (Normal or gaussian distribution) NORMAL RANDOM VARIABLES (Normal or gaussian distribution) Many variables, as pregnancy lengths, foot sizes etc.. exhibit a normal distribution. The shape of the distribution is a symmetric bell shape.

More information

Example: Histogram for US household incomes from 2015 Table:

Example: Histogram for US household incomes from 2015 Table: 1 Example: Histogram for US household incomes from 2015 Table: Income level Relative frequency $0 - $14,999 11.6% $15,000 - $24,999 10.5% $25,000 - $34,999 10% $35,000 - $49,999 12.7% $50,000 - $74,999

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

Chapter 6. The Normal Probability Distributions

Chapter 6. The Normal Probability Distributions Chapter 6 The Normal Probability Distributions 1 Chapter 6 Overview Introduction 6-1 Normal Probability Distributions 6-2 The Standard Normal Distribution 6-3 Applications of the Normal Distribution 6-5

More information

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability. An intro for calculus students P= Figure 1: A normal integral Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided

More information

Chapter 3. Lecture 3 Sections

Chapter 3. Lecture 3 Sections Chapter 3 Lecture 3 Sections 3.4 3.5 Measure of Position We would like to compare values from different data sets. We will introduce a z score or standard score. This measures how many standard deviation

More information

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x

More information

Description of Data I

Description of Data I Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret

More information

IOP 201-Q (Industrial Psychological Research) Tutorial 5

IOP 201-Q (Industrial Psychological Research) Tutorial 5 IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,

More information

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc.

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc. The Standard Deviation as a Ruler and the Normal Mol Copyright 2009 Pearson Education, Inc. The trick in comparing very different-looking values is to use standard viations as our rulers. The standard

More information

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables You are dealt a hand of 5 cards. Find the probability distribution table for the number of hearts. Graph

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values

More information

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences. STAB22H3 Statistics I Duration: 1 hour and 45 minutes

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences. STAB22H3 Statistics I Duration: 1 hour and 45 minutes UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences STAB22H3 Statistics I Duration: 1 hour and 45 minutes Last Name: First Name: Student number: Aids allowed: - One handwritten

More information

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25 Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example

More information

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. STAT 515 -- Chapter 5: Continuous Distributions Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. Continuous distributions typically are represented by

More information

3.5 Applying the Normal Distribution (Z-Scores)

3.5 Applying the Normal Distribution (Z-Scores) 3.5 Applying the Normal Distribution (Z-Scores) The Graph: Review of the Normal Distribution Properties: - it is symmetrical; the mean, median and mode are equal and fall at the line of symmetry - it is

More information

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12)

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Descriptive statistics: - Measures of centrality (Mean, median, mode, trimmed mean) - Measures of spread (MAD, Standard deviation, variance) -

More information

Examples of continuous probability distributions: The normal and standard normal

Examples of continuous probability distributions: The normal and standard normal Examples of continuous probability distributions: The normal and standard normal The Normal Distribution f(x) Changing μ shifts the distribution left or right. Changing σ increases or decreases the spread.

More information

What s Normal? Chapter 8. Hitting the Curve. In This Chapter

What s Normal? Chapter 8. Hitting the Curve. In This Chapter Chapter 8 What s Normal? In This Chapter Meet the normal distribution Standard deviations and the normal distribution Excel s normal distribution-related functions A main job of statisticians is to estimate

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Chapter 6 Exam A Name The given values are discrete. Use the continuity correction and describe the region of the normal distribution that corresponds to the indicated probability. 1) The probability of

More information

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management BA 386T Tom Shively PROBABILITY CONCEPTS AND NORMAL DISTRIBUTIONS The fundamental idea underlying any statistical

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

The Normal Distribution

The Normal Distribution 5.1 Introduction to Normal Distributions and the Standard Normal Distribution Section Learning objectives: 1. How to interpret graphs of normal probability distributions 2. How to find areas under the

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao The binomial: mean and variance Recall that the number of successes out of n, denoted

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions

More information

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations

More information

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION 1 Day 3 Summer 2017.07.31 DISTRIBUTION Symmetry Modality 单峰, 双峰 Skewness 正偏或负偏 Kurtosis 2 3 CHAPTER 4 Measures of Central Tendency 集中趋势

More information

Unit 2 Statistics of One Variable

Unit 2 Statistics of One Variable Unit 2 Statistics of One Variable Day 6 Summarizing Quantitative Data Summarizing Quantitative Data We have discussed how to display quantitative data in a histogram It is useful to be able to describe

More information

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION We have examined discrete random variables, those random variables for which we can list the possible values. We will now look at continuous random variables.

More information

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet.

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. 1 Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact, be aware that the

More information

The Normal Probability Distribution

The Normal Probability Distribution 1 The Normal Probability Distribution Key Definitions Probability Density Function: An equation used to compute probabilities for continuous random variables where the output value is greater than zero

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information

Math Take Home Quiz on Chapter 2

Math Take Home Quiz on Chapter 2 Math 116 - Take Home Quiz on Chapter 2 Show the calculations that lead to the answer. Due date: Tuesday June 6th Name Time your class meets Provide an appropriate response. 1) A newspaper surveyed its

More information

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative STAT:10 Statistical Methods and Computing Normal Distributions Lecture 4 Feb. 6, 17 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowa.edu 1 2 Using density curves to describe the distribution of values of

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data. -3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data

More information

Announcements. Unit 2: Probability and distributions Lecture 3: Normal distribution. Normal distribution. Heights of males

Announcements. Unit 2: Probability and distributions Lecture 3: Normal distribution. Normal distribution. Heights of males Announcements Announcements Unit 2: Probability and distributions Lecture 3: Statistics 101 Mine Çetinkaya-Rundel First peer eval due Tues. PS3 posted - will be adding one more question that you need to

More information

Chapter ! Bell Shaped

Chapter ! Bell Shaped Chapter 6 6-1 Business Statistics: A First Course 5 th Edition Chapter 7 Continuous Probability Distributions Learning Objectives In this chapter, you learn:! To compute probabilities from the normal distribution!

More information

Continuous Probability Distributions

Continuous Probability Distributions 8.1 Continuous Probability Distributions Distributions like the binomial probability distribution and the hypergeometric distribution deal with discrete data. The possible values of the random variable

More information

Chapter 5 The Standard Deviation as a Ruler and the Normal Model

Chapter 5 The Standard Deviation as a Ruler and the Normal Model Chapter 5 The Standard Deviation as a Ruler and the Normal Model 55 Chapter 5 The Standard Deviation as a Ruler and the Normal Model 1. Stats test. Nicole scored 65 points on the test. That is one standard

More information

Some estimates of the height of the podium

Some estimates of the height of the podium Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40

More information

Solutions for practice questions: Chapter 9, Statistics

Solutions for practice questions: Chapter 9, Statistics Solutions for practice questions: Chapter 9, Statistics If you find any errors, please let me know at mailto:msfrisbie@pfrisbie.com. 1. We know that µ is the mean of 30 values of y, 30 30 i= 1 2 ( y i

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,

More information

Honors Statistics. 3. Discuss homework C2# Discuss standard scores and percentiles. Chapter 2 Section Review day 2016s Notes.

Honors Statistics. 3. Discuss homework C2# Discuss standard scores and percentiles. Chapter 2 Section Review day 2016s Notes. Honors Statistics Aug 23-8:26 PM 3. Discuss homework C2#11 4. Discuss standard scores and percentiles Aug 23-8:31 PM 1 Feb 8-7:44 AM Sep 6-2:27 PM 2 Sep 18-12:51 PM Chapter 2 Modeling Distributions of

More information

3.1 Measures of Central Tendency

3.1 Measures of Central Tendency 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent

More information

Properties of Probability Models: Part Two. What they forgot to tell you about the Gammas

Properties of Probability Models: Part Two. What they forgot to tell you about the Gammas Quality Digest Daily, September 1, 2015 Manuscript 285 What they forgot to tell you about the Gammas Donald J. Wheeler Clear thinking and simplicity of analysis require concise, clear, and correct notions

More information

Lecture 6: Chapter 6

Lecture 6: Chapter 6 Lecture 6: Chapter 6 C C Moxley UAB Mathematics 3 October 16 6.1 Continuous Probability Distributions Last week, we discussed the binomial probability distribution, which was discrete. 6.1 Continuous Probability

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem

More information

Graphical and Tabular Methods in Descriptive Statistics. Descriptive Statistics

Graphical and Tabular Methods in Descriptive Statistics. Descriptive Statistics Graphical and Tabular Methods in Descriptive Statistics MATH 3342 Section 1.2 Descriptive Statistics n Graphs and Tables n Numerical Summaries Sections 1.3 and 1.4 1 Why graph data? n The amount of data

More information

The Normal Distribution & Descriptive Statistics. Kin 304W Week 2: Jan 15, 2012

The Normal Distribution & Descriptive Statistics. Kin 304W Week 2: Jan 15, 2012 The Normal Distribution & Descriptive Statistics Kin 304W Week 2: Jan 15, 2012 1 Questionnaire Results I received 71 completed questionnaires. Thank you! Are you nervous about scientific writing? You re

More information

Chapter 8 Estimation

Chapter 8 Estimation Chapter 8 Estimation There are two important forms of statistical inference: estimation (Confidence Intervals) Hypothesis Testing Statistical Inference drawing conclusions about populations based on samples

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment

More information

MA 1125 Lecture 05 - Measures of Spread. Wednesday, September 6, Objectives: Introduce variance, standard deviation, range.

MA 1125 Lecture 05 - Measures of Spread. Wednesday, September 6, Objectives: Introduce variance, standard deviation, range. MA 115 Lecture 05 - Measures of Spread Wednesday, September 6, 017 Objectives: Introduce variance, standard deviation, range. 1. Measures of Spread In Lecture 04, we looked at several measures of central

More information

1. Confidence Intervals (cont.)

1. Confidence Intervals (cont.) Math 1125-Introductory Statistics Lecture 23 11/1/06 1. Confidence Intervals (cont.) Let s review. We re in a situation, where we don t know µ, but we have a number from a normal population, either an

More information