NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS
|
|
- Bertina Cross
- 5 years ago
- Views:
Transcription
1 NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows how the data is set out. To draw a box plot you need the values of: the median, the two quartiles (lower and upper) and the two extremes (highest and lowest). This is called the five figure summary or five number summary. Whisker Interquartile range Whisker Lowest value (minimum) Lower quartile Median Upper quartile Highest value (maximum) Range The number line is drawn so that it covers all the values in the set of data The rectangular 'box' represents the interquartile range (the middle half of the data) The 'box' is drawn from the lower quartile (Q L or Q 1 ) to the upper quartile (Q U or Q 3 ) The vertical line drawn in the 'box' represents the median (middle value) A whisker extends from each end of the 'box' to the lowest and highest values Example 1 Number line (with a suitable scale) This data shows Tom's cricket scores over the last 13 games. Show the data below as a box plot. (We are finding the box plot without outliers. Outliers will be treated later.) Step 1: Find the median Sort the data from lowest to highest Middle value is the 7th value There are 13 scores The middle score = the median = 46 Step 2: Find the upper and lower quartiles. Divide the data in half. If the data set have even number of values, it can be divided equally. However, in this question, there are 13 values in the data set 13 (odd number) scores don't divide the data set evenly into two groups, so we leave the middle score, 46 out.
2 There are 6 scores in the lower half. The median of these scores =!"!!" = 39! We call this the lower quartile Q L or Q 1 There are 6 scores in the upper half. The median of these scores =!"!!" = 55! We call this the lower quartile Q U or Q 3 Five number summary Lowest score (minimum) = 20 runs Lower quartile = 39 runs Median = 46 runs Upper quartile = 55 runs Highest score = 90 runs Use the results to draw a box plot The box plot shows how the scores are spread out. A boxplot divides the data into four groups, each with about the same number of data scores The range of the box is called the interquartile range (IQR) o IQR = Q 3 -Q 1 = = 16 runs o The middle half of the scores fall in this range. o Eliminates the extreme values in the top and bottom of the data o Helps to measure the consistency of the data 25% of scores are less than Q 1 or 75% of scores are above Q 1 50% of scores are less than the median or 50% of scores are above the median 75% of scores are less than Q 3 or 25% of scores are above Q 3 Don't mix up between the range and the interquartile range (IQR). The range is the difference between the Highest and Lowest values. The interquartile range (IQR) is the difference between the Upper and Lower quartiles. Finding the Measures of Centre for Grouped Data
3 Interpreting Information from a Box Plot Example 2 This boxplot represents test results fro Year 8 Science. The test was marked out of 40. a. Read off the approximate values and interpret them. b. What can you say about the test results? c. Comment on the skewness. a. & b. Statistic Approximate Interpretation value Median 9 It is the middle value of the date 50% of students received a result above 9 and 50% of students received a result below 9 Upper Quartile 17 one quarter or 25% of the students received a result greater than 17 Lower Quartile 6 one quarter or 25% of the students received a result less than 6 Range 35 highest score lowest score = 35-0 = 35 the results ranged from 0 to 35 Interquartile 11 IQR = upper quartile lower quartile = 17 6 = 11 Range 50% of the students received a result between 6 and 17 the middle half of the data has a spread of 11 Comment: We can see that 75% of the class got a score below 17. Not a vey pleasing result. Perhaps the test was too hard or too long or the students did not prepare properly. d. The data is positively skewed.
4 Outliers / Limits / Fences A box plot can help us find outliers. Outliers are data values which are found outside the main group of the data. The lines outside the box are called whiskers. If the whiskers are more than 1.5 times the width of the box, then there are outliers in the sample. The first step is to identify the Lower and Upper Limits / Fences by using the following formula: Lower limit / fence = Q 1 (1.5 IQR) Upper limit / fence = Q 3 + (1.5 IQR) Outliers are therefore data values that are: less than the lower limit or upper limit / fence The whiskers are extended to the last actual data values before the outliers. Lower limit IQR 1.5 IQR 1.5 IQR Upper limit Outlier Outliers TESTING FOR OUTLIERS Step 1: Establish the values fo Q 1, Q 3 and IQR. Step 2: Calculate Q IQR. This value is called the lower limit or fence. Step 3: Check to see whether there are any data values less than the lower limit / fence. If so, then each value is an outlier. Outlier < Lower limit value Step 4: Calculate Q IQR. This value is called the upper limit or fence. Step 5: Check to see whether there are any data values greater than the upper limit / fence. If so, then each value is an outlier. Outlier > Upper limit value Note: To receive full marks for a box plot question, you must clearly show all relevant information including IQR, Lower and Upper limit calculations.
5 Example 3 Using the data from Example 1 (Tom s cricket scores over the last 13 games) Lowest score, minimum = 20 runs Lower quartile, Q 1 = 39 runs Median, Q 2 = 46 runs Upper quartile, Q 3 = 55 runs Highest score, maximum = 90 runs Remember: To get full marks, you must show all relevant values e.g. IQR, limit calculations, etc. Step 1: Q 1 =39 and Q 3 = 55 IQR = = 16 Step 2: Lower limit / fence: Q x IQR = x 16 = = 15 Step 3: Check the data. There are no data values less than 15, and so there are no outliers on the left side of the box plot.the whisker will extend to the last data value on this side. Thus it extends to the value 20. Step 4: Upper limit fence: Q x IQR = x 16 = = 79 Step 5: Check the data. There are two data values more than 79. These values are 82 and 90 and so these values are outliers on the right side of the box plot. They will be represented by dots or asterisks. The whisker will extend to the last data value before 82. Thus the whisker extends to 56 (third highest value). Min Q 1 Med Q 3 Max Each outlier is represented by an asterisk *
6 Box Plots for grouped data in frequency tables (GROUPED data) Example 6 Draw a box plot for the goals scored in 100 games as shown in the table. Find the range, interquartile range, and comment on the spread of the data. Goals in a game Total Frequency Cumulative To find the median and the upper and lower quartiles, set up a Goals Frequency frequency cumulative frequency column We have 100 values which divides in half evenly, 50 in each = 20 half. There is no middle value so the median is the average of the 50 th and the 51 st values The 50 th value = 2 goals, the 51 st value = 3 goals Thus the median = = 2!3 5 = 2. 5 goals The lower quartile is the median of the lower 50 values which is the average of the 25 th and 26 th values. The 25 th and 26 th values both equal 2 goals, so Q 1 = 2 goals The upper quartile is the median of the upper 50 values which Total 100 is the average of the 75 th and 76 th values. The 75 th and 76 th values both equal 4 goals, so Q 3 = 4 goals. The box plot shows the range (Max - Min) = 6-0 = 6 goals The interquartile range is: Q 3 Q 1 = 4-2 = 2 goals The upper half of the data is slightly more spread out than the lower half Exercise 2C
7 NOTES TO CONSIDER BEFORE ATTEMPTING EX 2D & 2E Relating A Box Plot To Distribution Shape A box plot gives an idea of the skewness of a distribution. Always look at the box first ie. between Q 1 and Q 3 to see shape of distribution. Whiskers are used to confirm shape. NB: If whiskers contradict shape, use box to decide on shape. Approximately symmetric distribution The box plot is symmetric The median is generally in the middle of the box The whiskers are approximately equal in length If the mean, median and the mode are equal then it is said to be symmetric about the mean. Mode = Median = Mean Positively skewed distribution The box plot has the median off-centre and generally to the left. The left-hand whisker will be short and the right-hand whisker will be long. When the long tail is to the right, the mean is pulled away from the main body of he data set to the right; and it is located to the right of the median. In this case, the data set is positively skewed because of the predominance of the data values towards the positive end of the distribution. Mode < Median < Mean Negatively skewed distribution The box plot has the median off-centre and generally to the right. The right-hand whisker will be short and the left-hand whisker will be long. When the long tail is to the left, the mean is pulled away from the main body of the data set to the left, and it is located to the left of the median. In this case, the data set is negatively skewed because of the predominance of the data values towards the negative end of the distribution. Mean < Median < Mode
8 Comparing boxplots Comparing Box Plots - It is very important to be able to COMPARE all the key features when comparing box plots. Here are the math results for four year 7 classes. To be able to compare they must be PARALLEL box plots, so that they all use the same scale. Centre - Median Class 7B has the highest median mark of approximately 78. 7A has the lowest median mark of approximately 62. Shape - Skew You can describe skew by looking at just the middle 50% or the entire range of data. To be safe make sure you are clear what you have compared: Class 7D is approx. symmetrical and 7B is positively skewed with the median score being a little closer to the lower quartile and also the minimum score. Class 7C could be seen as negatively skewed if we look at just the box (middle 50%) but the upper whisker is longer than the lower. Class 7A has its median equally spaced between the upper and lower quartile scores but the upper whisker is much longer than the lower, meaning there is more spread of data above the median rather than below. So class 7A it is positively skewed. Maximum and Minimum 7B has the highest mark of around 98. All other classes have equal highest marks of 90 7D and 7A share lowest marks of 40. Overall Range Class 7B has and 7C have approximately the same lowest range. Class 7A and 7D have the highest range. Upper and Lower Quartiles The 7B upper quartile is equal to the maximum of the other classes. Interquartile Range Class 7D has the highest IQR. By comparing the middle 50% all other IQR's are similar. In summary Class 7B is the highest performing class because it has 25% of its students achieving higher than any grade of the other classes. Half of its students achieve higher than three quarters of Class 7A and 7C. The lower quartile of 7B is above the median for all other classes, with the bottom 25% of marks being higher than the lower quartile for all other classes. Exercise 2D & 2E
2 Exploring Univariate Data
2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting
More information1 Describing Distributions with numbers
1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write
More informationappstats5.notebook September 07, 2016 Chapter 5
Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.
More informationHandout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25
Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example
More informationHow Wealthy Are Europeans?
How Wealthy Are Europeans? Grades: 7, 8, 11, 12 (course specific) Description: Organization of data of to examine measures of spread and measures of central tendency in examination of Gross Domestic Product
More informationSTAT 113 Variability
STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow
More informationEmpirical Rule (P148)
Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall
More information2CORE. Summarising numerical data: the median, range, IQR and box plots
C H A P T E R 2CORE Summarising numerical data: the median, range, IQR and box plots How can we describe a distribution with just one or two statistics? What is the median, how is it calculated and what
More informationChapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.
-3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data
More informationSome estimates of the height of the podium
Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40
More informationDescribing Data: One Quantitative Variable
STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive
More informationLecture 2 Describing Data
Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms
More informationDATA HANDLING Five-Number Summary
DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest
More informationChapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1
Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.
More informationPercentiles, STATA, Box Plots, Standardizing, and Other Transformations
Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go
More informationSTAB22 section 1.3 and Chapter 1 exercises
STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea
More informationPutting Things Together Part 2
Frequency Putting Things Together Part These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for, and are in
More informationLecture Week 4 Inspecting Data: Distributions
Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your
More informationMeasures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean
Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values
More informationChapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1
Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and
More informationDescription of Data I
Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret
More informationMath 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment
Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class
More informationNumerical Descriptions of Data
Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =
More informationCategorical. A general name for non-numerical data; the data is separated into categories of some kind.
Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,
More informationSOLUTIONS TO THE LAB 1 ASSIGNMENT
SOLUTIONS TO THE LAB 1 ASSIGNMENT Question 1 Excel produces the following histogram of pull strengths for the 100 resistors: 2 20 Histogram of Pull Strengths (lb) Frequency 1 10 0 9 61 63 6 67 69 71 73
More informationA LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]
1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders
More informationSection3-2: Measures of Center
Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number
More informationMonte Carlo Simulation (Random Number Generation)
Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...
More informationIOP 201-Q (Industrial Psychological Research) Tutorial 5
IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,
More informationDescriptive Statistics
Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs
More information4. DESCRIPTIVE STATISTICS
4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in
More informationCopyright 2005 Pearson Education, Inc. Slide 6-1
Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is
More informationWk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12)
Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Descriptive statistics: - Measures of centrality (Mean, median, mode, trimmed mean) - Measures of spread (MAD, Standard deviation, variance) -
More informationMath 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet.
1 Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact, be aware that the
More informationData that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.
Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer
More information3.1 Measures of Central Tendency
3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent
More informationMini-Lecture 3.1 Measures of Central Tendency
Mini-Lecture 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data 3. Explain what it means for a
More informationUnit 2 Statistics of One Variable
Unit 2 Statistics of One Variable Day 6 Summarizing Quantitative Data Summarizing Quantitative Data We have discussed how to display quantitative data in a histogram It is useful to be able to describe
More informationDATA SUMMARIZATION AND VISUALIZATION
APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296
More informationFrequency Distribution and Summary Statistics
Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary
More informationFINALS REVIEW BELL RINGER. Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/ /2 4
FINALS REVIEW BELL RINGER Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/3 + 7 + 1/2 4 4) 3 + 4 ( 7) + 3 + 4 ( 2) 1) 36/6 4/6 + 3/6 32/6 + 3/6 35/6
More informationSTA 248 H1S Winter 2008 Assignment 1 Solutions
1. (a) Measures of location: STA 248 H1S Winter 2008 Assignment 1 Solutions i. The mean, 100 1=1 x i/100, can be made arbitrarily large if one of the x i are made arbitrarily large since the sample size
More informationTi 83/84. Descriptive Statistics for a List of Numbers
Ti 83/84 Descriptive Statistics for a List of Numbers Quiz scores in a (fictitious) class were 10.5, 13.5, 8, 12, 11.3, 9, 9.5, 5, 15, 2.5, 10.5, 7, 11.5, 10, and 10.5. It s hard to get much of a sense
More informationUnit 2 Measures of Variation
1. (a) Weight in grams (w) 6 < w 8 4 8 < w 32 < w 1 6 1 < w 1 92 1 < w 16 8 6 Median 111, Inter-quartile range 3 Distance in km (d) < d 1 1 < d 2 17 2 < d 3 22 3 < d 4 28 4 < d 33 < d 6 36 Median 2.2,
More informationStandardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis
Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem
More informationStat 101 Exam 1 - Embers Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.
More informationSUMMARY STATISTICS EXAMPLES AND ACTIVITIES
Session 6 SUMMARY STATISTICS EXAMPLES AD ACTIVITIES Example 1.1 Expand the following: 1. X 2. 2 6 5 X 3. X 2 4 3 4 4. X 4 2 Solution 1. 2 3 2 X X X... X 2. 6 4 X X X X 4 5 6 5 3. X 2 X 3 2 X 4 2 X 5 2
More informationCenter and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.
Chapter 3 Section3-: Measures of Center Section 3-3: Measurers of Variation Section 3-4: Measures of Relative Standing Section 3-5: Exploratory Data Analysis Describing Distributions with Numbers The overall
More informationExploratory Data Analysis
Exploratory Data Analysis Stemplots (or Stem-and-leaf plots) Stemplot and Boxplot T -- leading digits are called stems T -- final digits are called leaves STAT 74 Descriptive Statistics 2 Example: (number
More informationMath146 - Chapter 3 Handouts. The Greek Alphabet. Source: Page 1 of 39
Source: www.mathwords.com The Greek Alphabet Page 1 of 39 Some Miscellaneous Tips on Calculations Examples: Round to the nearest thousandth 0.92431 0.75693 CAUTION! Do not truncate numbers! Example: 1
More informationBoth the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.
Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of
More informationDescriptive Statistics (Devore Chapter One)
Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf
More informationApplications of Data Dispersions
1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given
More information22.2 Shape, Center, and Spread
Name Class Date 22.2 Shape, Center, and Spread Essential Question: Which measures of center and spread are appropriate for a normal distribution, and which are appropriate for a skewed distribution? Eplore
More informationMeasures of Central Tendency Lecture 5 22 February 2006 R. Ryznar
Measures of Central Tendency 11.220 Lecture 5 22 February 2006 R. Ryznar Today s Content Wrap-up from yesterday Frequency Distributions The Mean, Median and Mode Levels of Measurement and Measures of Central
More informationOverview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution
PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations
More informationToday s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation.
1 Today s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation. 2 Once we know the central location of a data set, we want to know how close things are to the center. 2 Once we know
More informationDot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.
Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,
More informationSTAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model
STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good
More informationReview: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59.
Review: Chebyshev s Rule Measures of Dispersion II Tom Ilvento STAT 200 Is based on a mathematical theorem for any data At least ¾ of the measurements will fall within ± 2 standard deviations from the
More informationWeek 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.
Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.
More informationSTAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model
STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good
More informationChapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)
Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop
More information1) What is the range of the data shown in the box and whisker plot? 2) True or False: 75% of the data falls between 6 and 12.
DO NOW 1) What is the range of the data shown in the box and whisker plot? 2) True or False: 75% of the data falls between 6 and 12. May 21 7:19 AM 1. 3 2. 2 3. 1 4. 4 5. 2 6. Min = 17, Q 1 = 18.5, Med
More informationDescriptive Statistics
Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations
More informationSkewness and the Mean, Median, and Mode *
OpenStax-CNX module: m46931 1 Skewness and the Mean, Median, and Mode * OpenStax This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 Consider the following
More informationThe Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).
We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center
More informationMath 140 Introductory Statistics. First midterm September
Math 140 Introductory Statistics First midterm September 23 2010 Box Plots Graphical display of 5 number summary Q1, Q2 (median), Q3, max, min Outliers If a value is more than 1.5 times the IQR from the
More informationExample - Let X be the number of boys in a 4 child family. Find the probability distribution table:
Chapter8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number
More informationBasic Procedure for Histograms
Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that
More informationDays Traveling Frequency Relative Frequency Percent Frequency % % 35 and above 1 Total %
Math 1351 Activity 1(Chapter 10)(Due by end of class Feb. 15) Group # 1. A business magazine was conducting a study into the amount of travel required for managers across the U.S. Seventy-five managers
More informationDistributions and their Characteristics
Distributions and their Characteristics 1. A distribution of a variable is merely a list of all values the variable can take, and the corresponding frequencies. Distributions may be represented or displayed
More informationMath Take Home Quiz on Chapter 2
Math 116 - Take Home Quiz on Chapter 2 Show the calculations that lead to the answer. Due date: Tuesday June 6th Name Time your class meets Provide an appropriate response. 1) A newspaper surveyed its
More information1.2 Describing Distributions with Numbers, Continued
1.2 Describing Distributions with Numbers, Continued Ulrich Hoensch Thursday, September 6, 2012 Interquartile Range and 1.5 IQR Rule for Outliers The interquartile range IQR is the distance between the
More informationMeasures of Dispersion (Range, standard deviation, standard error) Introduction
Measures of Dispersion (Range, standard deviation, standard error) Introduction We have already learnt that frequency distribution table gives a rough idea of the distribution of the variables in a sample
More informationSTATISTICAL DISTRIBUTIONS AND THE CALCULATOR
STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either
More informationChapter 4. The Normal Distribution
Chapter 4 The Normal Distribution 1 Chapter 4 Overview Introduction 4-1 Normal Distributions 4-2 Applications of the Normal Distribution 4-3 The Central Limit Theorem 4-4 The Normal Approximation to the
More informationNumerical Descriptive Measures. Measures of Center: Mean and Median
Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where
More informationSome Characteristics of Data
Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key
More informationKING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section
KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 11: BUSINESS STATISTICS I Semester 04 Major Exam #1 Sunday March 7, 005 Please circle your instructor
More informationLecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution
More informationExample - Let X be the number of boys in a 4 child family. Find the probability distribution table:
Chapter7 Probability Distributions and Statistics Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number of boys in
More informationMath 227 Elementary Statistics. Bluman 5 th edition
Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find
More information9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives
Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical
More informationNOTES: Chapter 4 Describing Data
NOTES: Chapter 4 Describing Data Intro to Statistics COLYER Spring 2017 Student Name: Page 2 Section 4.1 ~ What is Average? Objective: In this section you will understand the difference between the three
More informationIntroduction to Computational Finance and Financial Econometrics Descriptive Statistics
You can t see this text! Introduction to Computational Finance and Financial Econometrics Descriptive Statistics Eric Zivot Summer 2015 Eric Zivot (Copyright 2015) Descriptive Statistics 1 / 28 Outline
More informationEdexcel past paper questions
Edexcel past paper questions Statistics 1 Chapters 2-4 (Discrete) Statistics 1 Chapters 2-4 (Discrete) Page 1 Stem and leaf diagram Stem-and-leaf diagrams are used to represent data in its original form.
More informationPutting Things Together Part 1
Putting Things Together Part 1 These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for 1, 5, and 6 are in
More informationCHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =
Solutions to End-of-Section and Chapter Review Problems 225 CHAPTER 6 6.1 (a) P(Z < 1.20) = 0.88493 P(Z > 1.25) = 1 0.89435 = 0.10565 P(1.25 < Z < 1.70) = 0.95543 0.89435 = 0.06108 (d) P(Z < 1.25) or Z
More informationKey: 18 5 = 1.85 cm. 5 a Stem Leaf. Key: 2 0 = 20 points. b Stem Leaf. Key: 2 0 = 20 cm. 6 a Stem Leaf. Key: 4 3 = 43 cm.
Answers EXERCISE. D D C B Numerical: a, b, c Categorical: c, d, e, f, g Discrete: c Continuous: a, b C C Categorical B A Categorical and ordinal Discrete Ordinal D EXERCISE. Stem Key: = Stem Key: = $ The
More informationSimple Descriptive Statistics
Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency
More informationSTAT 157 HW1 Solutions
STAT 157 HW1 Solutions http://www.stat.ucla.edu/~dinov/courses_students.dir/10/spring/stats157.dir/ Problem 1. 1.a: (6 points) Determine the Relative Frequency and the Cumulative Relative Frequency (fill
More information2 2 In general, to find the median value of distribution, if there are n terms in the distribution the
THE MEDIAN TEMPERATURES MEDIAN AND CUMULATIVE FREQUENCY The median is the third type of statistical average you will use in his course. You met the other two, the mean and the mode in pack MS4. THE MEDIAN
More informationThe Normal Distribution
Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we
More informationStatistics (This summary is for chapters 18, 29 and section H of chapter 19)
Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x n =
More informationGraphical and Tabular Methods in Descriptive Statistics. Descriptive Statistics
Graphical and Tabular Methods in Descriptive Statistics MATH 3342 Section 1.2 Descriptive Statistics n Graphs and Tables n Numerical Summaries Sections 1.3 and 1.4 1 Why graph data? n The amount of data
More information6683/01 Edexcel GCE Statistics S1 Gold Level G2
Paper Reference(s) 6683/01 Edexcel GCE Statistics S1 Gold Level G Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates
More informationChapter 3. Lecture 3 Sections
Chapter 3 Lecture 3 Sections 3.4 3.5 Measure of Position We would like to compare values from different data sets. We will introduce a z score or standard score. This measures how many standard deviation
More informationThe Normal Distribution & Descriptive Statistics. Kin 304W Week 2: Jan 15, 2012
The Normal Distribution & Descriptive Statistics Kin 304W Week 2: Jan 15, 2012 1 Questionnaire Results I received 71 completed questionnaires. Thank you! Are you nervous about scientific writing? You re
More information