Putting Things Together Part 2

Size: px
Start display at page:

Download "Putting Things Together Part 2"

Transcription

1 Frequency Putting Things Together Part These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for, and are in the instructor s shared folder on LakerApps: Putting Things Together Part.. Data on the number of laps completed by drivers in a -lap car race. (The race was called off due to rain with laps to go.) n =. a) Obtain a histogram and identify the shape of this distribution. b) Determine the number summary. { } c) Draw a simple boxplot below the histogram. d) Determine the interquartile range and range: IQR = Range = e) Determine the mean and standard deviation: Mean = SD = f) Compute Range Standard Deviation to determine how many times bigger the standard deviation is than the range. The range is times bigger than the standard deviation. g) Mode =. Technically the mode (most common value) is every value in this data set (because all of them appear once; no ties). One way to more meaningfully identify a mode is to use a representative value from the interval that occurs most often in the histogram. For this histogram, values in the interval - occur most often, and it s correct to say is (approximately) the mode. h) For left skewed data (like this), how do the mean, mode and median compare? Which is largest? Which is smallest? Boxplot

2 Frequency Frequency Frequency Frequency. Here are histograms for data sets A, B, C and D. Notice that all are drawn to the same scales in both the data scale (horizontal) and the frequency scale (vertical). A B C D a) Determine the number summary for A. Identify the shape of each distribution. Complete the table below. Data Set Shape Min Q Median Q Max A B..... C..... D..... c) Use these number summaries to construct a simple boxplot for each data set. (Space is provided below. Stack the boxplots one atop the other.

3 A B C D Data d) Look at the boxplots ( # summaries) and histograms. While shape is generally defined in terms of a histogram, you should see that the orientation of a boxplot (the pattern of the # summary) provides a good indication of its shape. For a right skewed distribution (such as A) the # summary has the min, Q and median rather close, then Q and the max are relatively distant. For a symmetric distribution (B), the distance between the min and Q is about equal to that for Q and the max; Q to the median is about the same as the median to Q. The left skewed distribution is a mirror image of the right skewed distribution. A good quantitative key is to compare the distances from the quartiles to the median. If the first quartile is a lot closer to the median than the third, then you re probably looking at right skew. If the third quartile is a lot closer to the median than is the first, then you re probably looking at left skew. Q to median Distances median to Q Shape A:. < <. Right skewed B:.. Symmetric C:. > >. Left skewed D:. > >. Left skewed Of course, you want to see these comparisons through the boxplot you don t want to be computing all this. The < < means a good deal less than ; similarly > > means a good deal greater than. Finally, means approximately equal.

4 e) Here are means and standard deviations for the four data sets: Data Set Mean StDev A.. B.. Data Set Mean StDev C.. D.. For which data set are the mean and median closest? Notice the shape of the distribution for this data set. For sets C and D the mean is a decent amount less than the median. What shape are the distributions for C and D? For set A the mean is a decent amount greater than the median. What shape is the distribution for A? Fill in the blanks below with one of these phrases: less than greater than about equal to. For a right skewed distribution the mean is the median. For a left skewed distribution the mean is the median. For a symmetric distribution the mean is the median. For continuous data, it is rare for the mean and median to be exactly the same. Distributions of real data are virtually never exactly symmetric. Slight discrepancies between mean and median exist for a distribution that is best described as symmetric. (By slight discrepancy is meant: The mean and median plot very close to each other on the horizontal (x) axis of the histogram or boxplot.) f) Identify a reasonable value for the mode for each of the sets A D. A: B: C: D: You probably answered. for set A. Fine. Most statisticians would say. Why? Because the histogram shows a pattern of increasing frequency for values closer to. In the same way, a statistician would say that the mode for set C is. Now, examine the relationship between mean, median and mode for the data sets, keeping in mind the distribution shapes. A: Mode = (or.) Median =. Mean = Right skewed B: Mode =. Median =. Mean =. Symmetric C: Mode = (or.) Median =. Mean =. Left skewed D: Mode =. Median =. Mean =. Left skewed Suppose the mean and mode are quite different which generally happens when there is skew. Where does the median generally fall, relative to the mean and mode?

5 Percent Percent Percent Percent. a) For each histogram, identify the shape of the distribution. Also give reasonable values for the mode of each distribution A B C D b) Match the histograms to the boxplots. c) For each of the distributions identified by histogram letters A, B, C and D, determine how the mean and median compare to each other, as well as to the mode.... Data...

6 . Here s a boxplot of the amounts people paid for an identical model of car. (Different people pay different amounts because automobile prices are usually negotiated.) Answer from the boxplot alone. (Do the best you can. No one can be exact.) a) Determine the -# summary. b) Determine values for the range and IQR. c) About what % of people paid over,? d) About what % of people paid between, and,? e) Should the mean price be less than, more than, or about equal to the median price?

7 Frequency Frequency Frequency Frequency. Consider the four data sets Set A Set B Set C Set D a) Obtain histograms for all four sets. (The preferred method is to use a computer to do this. If you do that, the scales of the histograms might be somewhat different from those shown below which have been forced to be identical. That s OK.) A N B C L D U b) Examine the histograms. Without any computing: i) What approximately are the means of these four sets? ii) Which set do you think has the most variability? The least? Rank the sets A, B, C and D, from least to most variable.

8 c) Obtain the five number summary for data set A. Similar five number summaries are shown for the other three data sets. Also determine the range and interquartile range (IQR). A: {,,,, } IQR = Range = B: {.,.,.,.,. } IQR =. Range =. C: {.,.,.,.,. } IQR =. Range =. D: {.,.,.,.,. } IQR =. Range =. d) Use the Range Rule of Thumb to guess the standard deviations for these four sets of data. e) Obtain mean and standard deviation for set A. f) Make some comparisons: How do the means compare? Set A B C D Mean... SD... Standard deviation, range and IQR are all measures of variability. The aim of this exercise is a finer point demonstrating that Range alone is somewhat flawed. How do the ranges compare? Rank the data sets A, B, C and D from smallest range to largest. How do standard deviations compare? Rank the data sets A, B, C and D from smallest standard deviation to largest. (This is how you want to answer (ii) of part (b) above.) Do these rankings agree with those for the range? Could you use the order of ranges to predict the order of standard deviations? How do IQRs compare? Rank the data sets A, B, C and D from smallest IQR to largest. Do these rankings agree with those for the standard deviation? Comment: The Range Rule of Thumb tends to work better when the shape is near Normal (bell). Data set A is closest to Normal shaped. The Range Rule of Thumb predicts. =. for the standard deviation, and, in fact, for set A the actual standard deviation is quite close to that:..

9 . The histograms below are all drawn to the same scale on the horizontal (x) axis. Min A Max Min B Max Min Max C a) How do the ranges compare for these four distributions? Min L D Max b) The standard deviations for the distributions are:,, and. Which standard deviation goes with each of the four histograms? U N c) Match the boxplots to the histograms. P d) Which of these distributions (A, B, C, D) has largest interquartile range? Second largest? Second smallest? Which has the smallest? How does this ranking comparing to that for the standard deviations? e) For which distribution is the Range Rule of Thumb (Range SD) going to work best? f) Suppose you know the means of these distributions are all. Use your answer to e to guess the values of Min and Max.

10 Solutions. a) The distribution is left skewed. b) The five number summary is {.,.,.,.,. } d) IQR =., Range =.. e) The mean is., the standard deviation is.. f).. h) Mean =.; median =.; mode =. So for left skewed data: mean < median < mode.. a) A is right skewed; B is symmetric; C and D are both left skewed. b) {.,.,.,.,. } is the # summary. The mean is. with standard deviation.. c) A B C D Data e) The mean and median are the closest for the symmetric distribution for data set B. The mean is. and the median is. these are very close when marked on the scale of the

11 histogram or boxplot. When the mean is below the median (as for C and D) we see left skew. When the mean is above the median (as for A) we see right skew. For a right skewed distribution the mean is greater than the median. For a left skewed distribution the mean is less than the median. For a symmetric distribution the mean is about equal to the median. f) The median is generally between the mean and mode. We can extend the results of part e: If Mean < Median < Mode then you are probably looking at a left skewed distribution. If Mean > Median > Mode then you are probably looking at a right skewed distribution. If the three are fairly close to each other, you are probably looking at a fairly symmetric distribution.. a) A and D are right skewed (D is more skewed than is A); B is symmetric; C is left skewed. For the modes, see the table below. b) A-; B-; C-; D-. c) A: Mode =. < Median < Mean B: Mode =. Median Median (so the mean and median are about.) C: Mean < Median < Mode =. D: Mode = < Median < Mean Your modes may be a little different, but should basically be in the same place when you compare to mine with marks under the horizontal (x) axis of the histograms.. a) {,,,, } (if you are fairly close, that s good). b) The range is about and the IQR about. Again: Your values should be close. c) About % from the boxplot. d) From the boxplot: About %. e) This is the sort of boxplot you d see for a right skewed distribution. So, the mean should be larger than the median.

12 . c) and e) Here are some corresponding statistics. Variable Mean StDev Minimum Q Median Q Maximum Range IQR A B C D d) The range rule of thumb does not discriminate the differences in variability among these data sets. The range rule of thumb anticipates a standard deviation of (..) / =. for each of the data sets. f). The means are identical. (The medians are nearly the same, and fairly close to the means which goes hand in hand with the symmetry shown in all these distributions.) The ranges are identical. The standard deviations are somewhat different. C has lowest, then A, then D, with B highest. Standard deviation measures a standard (typical) deviation from the mean. Take C for instance: Almost all the data is very near the mean. So the standard deviation is small relative to the others. For B however, much of the data is at the extremes far from the mean. The standard deviation is large relative to the others. The standard deviation is a more subtle measure of variability than is the range.. is a decent guess for all four but this exercise is designed to reinforce this idea: While standard deviation tends to be about ¼ the range, there is more to it than that. Standard deviation takes into account not only what the largest deviations from the center (mean) are, but also how often these occur relative to smaller deviations. The IQRs also measure variability, and they also discriminate the differences in variability among the data sets better than do the ranges. You can see that the order of IQRs (from small to high: C, A, D, B) is the same as for standard deviations. a) They are about the same. b) C-, A-, B-, D-. c) U-A, N-B, P-C, L-D. d) P has largest; then U; then N; L has smallest. The IQR is the width of the box, so a comparison is simple. Then using the result to part c we can order the histograms by IQR: C has largest; then A; then B; D has smallest. IQRs rank the same as standard deviations. (Which is good. They are just different ways of measuring the variability and generally they discriminate the same way. You can see that the range can, in cases, be unable to make this discrimination. Look at C and D. Clearly there s more variability in C extremes (values far from the mean/center) are very likely for C and very uncommon in D. e) B the one that has a Normal (bell) shape. f) Range rule of thumb works best with Normal shapes. For B we have a standard deviation of, anticipating a range of. With a mean at, we d have Min around and Max around.

Putting Things Together Part 1

Putting Things Together Part 1 Putting Things Together Part 1 These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for 1, 5, and 6 are in

More information

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2] 1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders

More information

Some estimates of the height of the podium

Some estimates of the height of the podium Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

Skewness and the Mean, Median, and Mode *

Skewness and the Mean, Median, and Mode * OpenStax-CNX module: m46931 1 Skewness and the Mean, Median, and Mode * OpenStax This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 Consider the following

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

2 Exploring Univariate Data

2 Exploring Univariate Data 2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting

More information

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25 Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example

More information

Descriptive Statistics

Descriptive Statistics Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs

More information

STAT 113 Variability

STAT 113 Variability STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.

More information

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

IOP 201-Q (Industrial Psychological Research) Tutorial 5

IOP 201-Q (Industrial Psychological Research) Tutorial 5 IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,

More information

Section3-2: Measures of Center

Section3-2: Measures of Center Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

Numerical Descriptive Measures. Measures of Center: Mean and Median

Numerical Descriptive Measures. Measures of Center: Mean and Median Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.) Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop

More information

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12)

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Descriptive statistics: - Measures of centrality (Mean, median, mode, trimmed mean) - Measures of spread (MAD, Standard deviation, variance) -

More information

Unit 2 Statistics of One Variable

Unit 2 Statistics of One Variable Unit 2 Statistics of One Variable Day 6 Summarizing Quantitative Data Summarizing Quantitative Data We have discussed how to display quantitative data in a histogram It is useful to be able to describe

More information

Mini-Lecture 3.1 Measures of Central Tendency

Mini-Lecture 3.1 Measures of Central Tendency Mini-Lecture 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data 3. Explain what it means for a

More information

We will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed.

We will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed. We will discuss the normal distribution in greater detail in our unit on probability. However, as it is often of use to use exploratory data analysis to determine if the sample seems reasonably normally

More information

Empirical Rule (P148)

Empirical Rule (P148) Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall

More information

Chapter 3: Displaying and Describing Quantitative Data Quiz A Name

Chapter 3: Displaying and Describing Quantitative Data Quiz A Name Chapter 3: Displaying and Describing Quantitative Data Quiz A Name 3.1.1 Find summary statistics; create displays; describe distributions; determine 1. Following is a histogram of salaries (in $) for a

More information

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc.

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc. The Standard Deviation as a Ruler and the Normal Mol Copyright 2009 Pearson Education, Inc. The trick in comparing very different-looking values is to use standard viations as our rulers. The standard

More information

Description of Data I

Description of Data I Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret

More information

MATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1)

MATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) Descriptive statistics are ways of summarizing large sets of quantitative (numerical) information. The best way to reduce a set of

More information

Section 6-1 : Numerical Summaries

Section 6-1 : Numerical Summaries MAT 2377 (Winter 2012) Section 6-1 : Numerical Summaries With a random experiment comes data. In these notes, we learn techniques to describe the data. Data : We will denote the n observations of the random

More information

GOALS. Describing Data: Displaying and Exploring Data. Dot Plots - Examples. Dot Plots. Dot Plot Minitab Example. Stem-and-Leaf.

GOALS. Describing Data: Displaying and Exploring Data. Dot Plots - Examples. Dot Plots. Dot Plot Minitab Example. Stem-and-Leaf. Describing Data: Displaying and Exploring Data Chapter 4 GOALS 1. Develop and interpret a dot plot.. Develop and interpret a stem-and-leaf display. 3. Compute and understand quartiles, deciles, and percentiles.

More information

Example: Histogram for US household incomes from 2015 Table:

Example: Histogram for US household incomes from 2015 Table: 1 Example: Histogram for US household incomes from 2015 Table: Income level Relative frequency $0 - $14,999 11.6% $15,000 - $24,999 10.5% $25,000 - $34,999 10% $35,000 - $49,999 12.7% $50,000 - $74,999

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 10 (MWF) Checking for normality of the data using the QQplot Suhasini Subba Rao Review of previous

More information

Review: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59.

Review: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59. Review: Chebyshev s Rule Measures of Dispersion II Tom Ilvento STAT 200 Is based on a mathematical theorem for any data At least ¾ of the measurements will fall within ± 2 standard deviations from the

More information

Numerical Descriptions of Data

Numerical Descriptions of Data Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =

More information

Copyright 2005 Pearson Education, Inc. Slide 6-1

Copyright 2005 Pearson Education, Inc. Slide 6-1 Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is

More information

Math 243 Lecture Notes

Math 243 Lecture Notes Assume the average annual rainfall for in Portland is 36 inches per year with a standard deviation of 9 inches. Also assume that the average wind speed in Chicago is 10 mph with a standard deviation of

More information

Graphical and Tabular Methods in Descriptive Statistics. Descriptive Statistics

Graphical and Tabular Methods in Descriptive Statistics. Descriptive Statistics Graphical and Tabular Methods in Descriptive Statistics MATH 3342 Section 1.2 Descriptive Statistics n Graphs and Tables n Numerical Summaries Sections 1.3 and 1.4 1 Why graph data? n The amount of data

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

Lesson 12: Describing Distributions: Shape, Center, and Spread

Lesson 12: Describing Distributions: Shape, Center, and Spread : Shape, Center, and Spread Opening Exercise Distributions - Data are often summarized by graphs. We often refer to the group of data presented in the graph as a distribution. Below are examples of the

More information

Chapter 4-Describing Data: Displaying and Exploring Data

Chapter 4-Describing Data: Displaying and Exploring Data Chapter 4-Describing Data: Displaying and Exploring Data Jie Zhang, Ph.D. Student Account and Information Systems Department College of Business Administration The University of Texas at El Paso jzhang6@utep.edu

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

4. DESCRIPTIVE STATISTICS

4. DESCRIPTIVE STATISTICS 4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in

More information

Frequency Distribution and Summary Statistics

Frequency Distribution and Summary Statistics Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary

More information

Lecture Week 4 Inspecting Data: Distributions

Lecture Week 4 Inspecting Data: Distributions Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your

More information

3.1 Measures of Central Tendency

3.1 Measures of Central Tendency 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent

More information

STA 248 H1S Winter 2008 Assignment 1 Solutions

STA 248 H1S Winter 2008 Assignment 1 Solutions 1. (a) Measures of location: STA 248 H1S Winter 2008 Assignment 1 Solutions i. The mean, 100 1=1 x i/100, can be made arbitrarily large if one of the x i are made arbitrarily large since the sample size

More information

Chapter 4. The Normal Distribution

Chapter 4. The Normal Distribution Chapter 4 The Normal Distribution 1 Chapter 4 Overview Introduction 4-1 Normal Distributions 4-2 Applications of the Normal Distribution 4-3 The Central Limit Theorem 4-4 The Normal Approximation to the

More information

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line. Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,

More information

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form:

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form: 1 Exercise One Note that the data is not grouped! 1.1 Calculate the mean ROI Below you find the raw data in tabular form: Obs Data 1 18.5 2 18.6 3 17.4 4 12.2 5 19.7 6 5.6 7 7.7 8 9.8 9 19.9 10 9.9 11

More information

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION 1 Day 3 Summer 2017.07.31 DISTRIBUTION Symmetry Modality 单峰, 双峰 Skewness 正偏或负偏 Kurtosis 2 3 CHAPTER 4 Measures of Central Tendency 集中趋势

More information

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical

More information

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017

More information

STAT 157 HW1 Solutions

STAT 157 HW1 Solutions STAT 157 HW1 Solutions http://www.stat.ucla.edu/~dinov/courses_students.dir/10/spring/stats157.dir/ Problem 1. 1.a: (6 points) Determine the Relative Frequency and the Cumulative Relative Frequency (fill

More information

Chapter 15: Sampling distributions

Chapter 15: Sampling distributions =true true Chapter 15: Sampling distributions Objective (1) Get "big picture" view on drawing inferences from statistical studies. (2) Understand the concept of sampling distributions & sampling variability.

More information

Descriptive Analysis

Descriptive Analysis Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable

More information

Chapter 4-Describing Data: Displaying and Exploring Data

Chapter 4-Describing Data: Displaying and Exploring Data Chapter 4-Describing Data: Displaying and Exploring Data Jie Zhang, Ph.D. Student Account and Information Systems Department College of Business Administration The University of Texas at El Paso jzhang6@utep.edu

More information

Today s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation.

Today s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation. 1 Today s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation. 2 Once we know the central location of a data set, we want to know how close things are to the center. 2 Once we know

More information

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative STAT:10 Statistical Methods and Computing Normal Distributions Lecture 4 Feb. 6, 17 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowa.edu 1 2 Using density curves to describe the distribution of values of

More information

AP Statistics Unit 1 (Chapters 1-6) Extra Practice: Part 1

AP Statistics Unit 1 (Chapters 1-6) Extra Practice: Part 1 AP Statistics Unit 1 (Chapters 1-6) Extra Practice: Part 1 1. As part of survey of college students a researcher is interested in the variable class standing. She records a 1 if the student is a freshman,

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences. STAB22H3 Statistics I Duration: 1 hour and 45 minutes

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences. STAB22H3 Statistics I Duration: 1 hour and 45 minutes UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences STAB22H3 Statistics I Duration: 1 hour and 45 minutes Last Name: First Name: Student number: Aids allowed: - One handwritten

More information

Lectures delivered by Prof.K.K.Achary, YRC

Lectures delivered by Prof.K.K.Achary, YRC Lectures delivered by Prof.K.K.Achary, YRC Given a data set, we say that it is symmetric about a central value if the observations are distributed symmetrically about the central value. In symmetrically

More information

Simple Descriptive Statistics

Simple Descriptive Statistics Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency

More information

2CORE. Summarising numerical data: the median, range, IQR and box plots

2CORE. Summarising numerical data: the median, range, IQR and box plots C H A P T E R 2CORE Summarising numerical data: the median, range, IQR and box plots How can we describe a distribution with just one or two statistics? What is the median, how is it calculated and what

More information

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Categorical. A general name for non-numerical data; the data is separated into categories of some kind. Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,

More information

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers. Chapter 3 Section3-: Measures of Center Section 3-3: Measurers of Variation Section 3-4: Measures of Relative Standing Section 3-5: Exploratory Data Analysis Describing Distributions with Numbers The overall

More information

Shifting and rescaling data distributions

Shifting and rescaling data distributions Shifting and rescaling data distributions It is useful to consider the effect of systematic alterations of all the values in a data set. The simplest such systematic effect is a shift by a fixed constant.

More information

LINEAR COMBINATIONS AND COMPOSITE GROUPS

LINEAR COMBINATIONS AND COMPOSITE GROUPS CHAPTER 4 LINEAR COMBINATIONS AND COMPOSITE GROUPS So far, we have applied measures of central tendency and variability to a single set of data or when comparing several sets of data. However, in some

More information

Describing Data: Displaying and Exploring Data

Describing Data: Displaying and Exploring Data Describing Data: Displaying and Exploring Data Chapter 4 McGraw-Hill/Irwin Copyright 2011 by the McGraw-Hill Companies, Inc. All rights reserved. LEARNING OBJECTIVES LO1. Develop and interpret a dot plot.

More information

SOLUTIONS TO THE LAB 1 ASSIGNMENT

SOLUTIONS TO THE LAB 1 ASSIGNMENT SOLUTIONS TO THE LAB 1 ASSIGNMENT Question 1 Excel produces the following histogram of pull strengths for the 100 resistors: 2 20 Histogram of Pull Strengths (lb) Frequency 1 10 0 9 61 63 6 67 69 71 73

More information

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar Measures of Central Tendency 11.220 Lecture 5 22 February 2006 R. Ryznar Today s Content Wrap-up from yesterday Frequency Distributions The Mean, Median and Mode Levels of Measurement and Measures of Central

More information

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) = Solutions to End-of-Section and Chapter Review Problems 225 CHAPTER 6 6.1 (a) P(Z < 1.20) = 0.88493 P(Z > 1.25) = 1 0.89435 = 0.10565 P(1.25 < Z < 1.70) = 0.95543 0.89435 = 0.06108 (d) P(Z < 1.25) or Z

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number

More information

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION Subject Paper No and Title Module No and Title Paper No.2: QUANTITATIVE METHODS Module No.7: NORMAL DISTRIBUTION Module Tag PSY_P2_M 7 TABLE OF CONTENTS 1. Learning Outcomes 2. Introduction 3. Properties

More information

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda, MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, CONTENTS What is measures of dispersion? Why measures of dispersion? How measures of dispersions are calculated? Range Quartile

More information

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 11: BUSINESS STATISTICS I Semester 04 Major Exam #1 Sunday March 7, 005 Please circle your instructor

More information

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data. -3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data

More information

8. From FRED, search for Canada unemployment and download the unemployment rate for all persons 15 and over, monthly,

8. From FRED,   search for Canada unemployment and download the unemployment rate for all persons 15 and over, monthly, Economics 250 Introductory Statistics Exercise 1 Due Tuesday 29 January 2019 in class and on paper Instructions: There is no drop box and this exercise can be submitted only in class. No late submissions

More information

2 DESCRIPTIVE STATISTICS

2 DESCRIPTIVE STATISTICS Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled

More information

Monte Carlo Simulation (Random Number Generation)

Monte Carlo Simulation (Random Number Generation) Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

starting on 5/1/1953 up until 2/1/2017.

starting on 5/1/1953 up until 2/1/2017. An Actuary s Guide to Financial Applications: Examples with EViews By William Bourgeois An actuary is a business professional who uses statistics to determine and analyze risks for companies. In this guide,

More information

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet.

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. 1 Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact, be aware that the

More information

DATA HANDLING Five-Number Summary

DATA HANDLING Five-Number Summary DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest

More information

Data Analysis. BCF106 Fundamentals of Cost Analysis

Data Analysis. BCF106 Fundamentals of Cost Analysis Data Analysis BCF106 Fundamentals of Cost Analysis June 009 Chapter 5 Data Analysis 5.0 Introduction... 3 5.1 Terminology... 3 5. Measures of Central Tendency... 5 5.3 Measures of Dispersion... 7 5.4 Frequency

More information

FINALS REVIEW BELL RINGER. Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/ /2 4

FINALS REVIEW BELL RINGER. Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/ /2 4 FINALS REVIEW BELL RINGER Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/3 + 7 + 1/2 4 4) 3 + 4 ( 7) + 3 + 4 ( 2) 1) 36/6 4/6 + 3/6 32/6 + 3/6 35/6

More information

The Normal Distribution

The Normal Distribution 5.1 Introduction to Normal Distributions and the Standard Normal Distribution Section Learning objectives: 1. How to interpret graphs of normal probability distributions 2. How to find areas under the

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

1.2 Describing Distributions with Numbers, Continued

1.2 Describing Distributions with Numbers, Continued 1.2 Describing Distributions with Numbers, Continued Ulrich Hoensch Thursday, September 6, 2012 Interquartile Range and 1.5 IQR Rule for Outliers The interquartile range IQR is the distance between the

More information

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go

More information