Today s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation.
|
|
- Nathaniel Lucas
- 5 years ago
- Views:
Transcription
1 1 Today s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation.
2 2 Once we know the central location of a data set, we want to know how close things are to the center.
3 2 Once we know the central location of a data set, we want to know how close things are to the center. We ll see two ways to measure dispersion of a data set.
4 3 five-number summary (goes with the median)
5 3 five-number summary (goes with the median) standard deviation (goes with the mean)
6 4 Five-Number Summary
7 5 Five-number Summary: 1. Min 2. Lower Quartile 3. Median 4. Upper Quartile 5. Max
8 6 Definition The Min is the smallest value in the whole data set.
9 6 Definition The Min is the smallest value in the whole data set. The Max is the largest value in the whole data set.
10 6 Definition The Min is the smallest value in the whole data set. The Max is the largest value in the whole data set. The Lower Quartile is the median of the lower half.
11 6 Definition The Min is the smallest value in the whole data set. The Max is the largest value in the whole data set. The Lower Quartile is the median of the lower half. The Upper Quartile is the median of the upper half.
12 7 Example The appraisals of the 10 houses are: [$75K, $96K, $107K, $110K, $110K, $118K, $130K, $135K, $150K, $520K ]
13 7 Example The appraisals of the 10 houses are: [$75K, $96K, $107K, $110K, $110K, $118K, $130K, $135K, $150K, $520K ] Find the five-number summary.
14 8 Solution We already found: the median, Med = $114K
15 8 Solution We already found: the median, Med = $114K the lower half, [$75K, $96K, $107K, $110K, $110K ]
16 8 Solution We already found: the median, Med = $114K the lower half, [$75K, $96K, $107K, $110K, $110K ] the upper half [$118K, $130K, $135K, $150K, $520K ]
17 8 Solution We already found: the median, Med = $114K the lower half, [$75K, $96K, $107K, $110K, $110K ] the upper half [$118K, $130K, $135K, $150K, $520K ] Since each half has size 5, their respective medians will be in the 3rd location.
18 9 Solution Thus the lower quartile is Q1 = $107K
19 9 Solution Thus the lower quartile is Q1 = $107K the upper quartile is Q3 = $135K
20 9 Solution Thus the lower quartile is Q1 = $107K the upper quartile is Q3 = $135K the lowest value is Min = $75K
21 9 Solution Thus the lower quartile is Q1 = $107K the upper quartile is Q3 = $135K the lowest value is Min = $75K the highest value is Max = $520K
22 9 Solution Thus the lower quartile is Q1 = $107K the upper quartile is Q3 = $135K the lowest value is Min = $75K the highest value is Max = $520K So the five-number summary is: [Min = $75K, Q1 = $107K, Med = $114K, Q3 = $135K, Max = $520K ].
23 10 The five-number summary can be visualized with a boxplot diagram, or box-and-whiskers diagram.
24 Min Q1Med Q3 Max \\
25 12 The box goes from the lower quartile to the upper quartile, with a mark at the median.
26 12 The box goes from the lower quartile to the upper quartile, with a mark at the median. Two whiskers extend from the box to the Min and Max.
27 13 Remarks: the left whisker spans the bottom 25%
28 13 Remarks: the left whisker spans the bottom 25% the box spans the middle 50%
29 13 Remarks: the left whisker spans the bottom 25% the box spans the middle 50% the right whisker spans the top 25%
30 13 Remarks: the left whisker spans the bottom 25% the box spans the middle 50% the right whisker spans the top 25% each half of the box spans 25%
31 14 Example The ages of the police officers in the Clearview Police Department are Age Freq
32 14 Example The ages of the police officers in the Clearview Police Department are Age Freq Find the five-number summary and draw the boxplot.
33 15 Age Freq Cum. Freq
34 16 The size is n = 41, so the median is in location
35 16 The size is n = 41, so the median is in location = 21. 2
36 16 The size is n = 41, so the median is in location = The lower half has size 20, so the lower quartile is the average of the values at locations 10 and 11: Q1 = =
37 17 The upper half also has size 20, so the upper quartile is the average of the values at locations 10 and 11 of the upper half.
38 17 The upper half also has size 20, so the upper quartile is the average of the values at locations 10 and 11 of the upper half. Since the median is at location 21, the third quartile is the average of the values at locations 31 and 32 of the whole data set: Q3 = = 32
39 18 Five-number summary: [Min = 22, Q1 = 26.5, Med = 29, Q3 = 32, Max = 39] Min Q1 Med Q3 Max
40 19 Remark: Outliers can be drawn separated from the rest of the data set.
41 20 Example The appraisals of the 10 houses are: [$75K, $96K, $107K, $110K, $110K, $118K, $130K, $135K, $150K, $520K ]
42 20 Example The appraisals of the 10 houses are: [$75K, $96K, $107K, $110K, $110K, $118K, $130K, $135K, $150K, $520K ] Find the five-number summary with outliers separated.
43 Min Q1Med Q3 Max \\
44 22 Boxplots and five-number summaries are useful when comparing two data sets.
45 23 Example Waiting times at two car washes: Acme Car Wash: [Min = 1, Q1 = 5, Med = 8, Q3 = 9, Max = 12] Kleen Car Wash: [Min = 3, Q1 = 4, Med = 5, Q3 = 8, Max = 20] (Times are in minutes.)
46 24 Example Draw the boxplots together, and compare them.
47 25 Solution Here are the boxplots: Acme Kleen
48 26 Solution The Min and Max tell us:
49 26 Solution The Min and Max tell us: everyone at Kleen has to wait at least 3 minutes, and some people have a very long wait.
50 26 Solution The Min and Max tell us: everyone at Kleen has to wait at least 3 minutes, and some people have a very long wait. at Acme, some have a tiny wait and everyone gets started in 12 minutes.
51 26 Solution The Min and Max tell us: everyone at Kleen has to wait at least 3 minutes, and some people have a very long wait. at Acme, some have a tiny wait and everyone gets started in 12 minutes. Acme seems better.
52 27 Solution But, the Median tells us: half of the customers of Acme wait 8 minutes for service
53 27 Solution But, the Median tells us: half of the customers of Acme wait 8 minutes for service at Kleen half of them start in 5 minutes
54 27 Solution But, the Median tells us: half of the customers of Acme wait 8 minutes for service at Kleen half of them start in 5 minutes Now Kleen seems better.
55 28 Which is better? There s no simple answer
56 28 Which is better? There s no simple answer If you don t mind waiting a little, Acme is better, since there are no long waits.
57 28 Which is better? There s no simple answer If you don t mind waiting a little, Acme is better, since there are no long waits. If you re willing to risk a long wait, in hope of a really short wait, Kleen is better.
58 29 Standard Deviation
59 30 When using the mean to measure the center, we use the standard deviation to measure dispersion.
60 30 When using the mean to measure the center, we use the standard deviation to measure dispersion. Think of standard deviation as measuring how far from the average the data points tend to be.
61 31 (Wrong way:)
62 31 (Wrong way:) 1. take the deviation of each data point from the average
63 31 (Wrong way:) 1. take the deviation of each data point from the average 2. average those deviations
64 31 (Wrong way:) 1. take the deviation of each data point from the average 2. average those deviations The deviation of a point x i from the average x is just x i x
65 32 (Wrong way:)
66 32 (Wrong way:) Example Weekly Sales of Home Town Pharmacy: S M T W R F S $2,548, $1,225, $1,732, $1,871, $975, $2,218, $1,339. Find the average of x i x.
67 32 (Wrong way:) Example Weekly Sales of Home Town Pharmacy: S M T W R F S $2,548, $1,225, $1,732, $1,871, $975, $2,218, $1,339. Find the average of x i x. We have already found the average: x =
68 33 (Wrong way:) Here are deviations x i x: Day x i (sales) x i x (deviation) Sunday 2, Monday 1, Tuesday 1, Wednesday 1, Thursday Friday 2, Saturday 1, Total 11, Average 1,
69 34 (Wrong way:) Deviations are like distances, but with a sign
70 34 (Wrong way:) Deviations are like distances, but with a sign Positive deviation x i is to the right of x
71 34 (Wrong way:) Deviations are like distances, but with a sign Positive deviation x i is to the right of x Negative deviation x i is to the left of x
72 35 (Wrong way:) The average of those deviations: = 0.00
73 35 (Wrong way:) The average of those deviations: This is going to happen with any data set! Average deviation from the mean is a useless measure of dispersion. = 0.00
74 36 (Right way:) However, if we square all deviations, they will turn all positive
75 36 (Right way:) However, if we square all deviations, they will turn all positive We can then average those squared deviations
76 36 (Right way:) However, if we square all deviations, they will turn all positive We can then average those squared deviations that is called the variance
77 37 Definition The variance var(x) of a data set x is the average of the squared deviations from the mean x: var(x) = 1 (xi x) 2 n
78 38 To compensate for the squaring, we take the square root of the variance.
79 38 To compensate for the squaring, we take the square root of the variance. Definition The standard deviation is σ(x) = var(x)
80 39 Example Find the variance and standard deviation for the Home Town Pharmacy daily sales data set.
81 40 Day x (sales) x x (x x) 2 Sunday 2, Monday 1, Tuesday 1, Wednesday 1, Thursday Friday 2, Saturday 1, Total 11, Average 1,
82 41 the variance is var(x) =
83 41 the variance is var(x) = the standard deviation is σ(x) = =
84 42 What if we start with a frequency table or a histogram?
85 43 Example Find the standard deviation for the Math 109 quizzes score freq cum fr
86 44 Solution We computed the average µ = 14.64
87 44 Solution We computed the average µ = For convenience turn the frequency table into a vertical table
88 45 x f x f (x µ) (x µ) 2 (x µ) 2 f Tot Ave
89 46 So the standard deviation is σ = = 3.35.
90 47 To find the Standard Deviation σ 1. Compute the deviations x i µ. 2. Square the deviations (x i µ) Average the squared deviations to the variance (xi µ) 2 var =. n 4. Take the square root of the variance σ = var.
91 48 Question What does standard deviation mean in practice?
92 49 In the previous example: The average is µ = the standard deviation is σ = 3.35
93 50 How many data points are within one standard deviation of the average?
94 50 How many data points are within one standard deviation of the average? µ σ = and µ + σ = 17.99
95 50 How many data points are within one standard deviation of the average? µ σ = and µ + σ = Between these two values there are a total of = 62 data points (out of 95), i.e., about two thirds.
96 51 For nice data sets, about 2 of the 3 data set is located within one standard deviation of the average.
97 51 For nice data sets, about 2 of the 3 data set is located within one standard deviation of the average. if σ is small, the data points are crowded close to µ
98 51 For nice data sets, about 2 of the 3 data set is located within one standard deviation of the average. if σ is small, the data points are crowded close to µ if σ is large, the data points are scattered.
Lecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow
More information1 Describing Distributions with numbers
1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write
More informationHandout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25
Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example
More informationappstats5.notebook September 07, 2016 Chapter 5
Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.
More informationTi 83/84. Descriptive Statistics for a List of Numbers
Ti 83/84 Descriptive Statistics for a List of Numbers Quiz scores in a (fictitious) class were 10.5, 13.5, 8, 12, 11.3, 9, 9.5, 5, 15, 2.5, 10.5, 7, 11.5, 10, and 10.5. It s hard to get much of a sense
More information2 Exploring Univariate Data
2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting
More informationMeasures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence
Measures of Variation Section -5 1 Waiting Times of Bank Customers at Different Banks in minutes Jefferson Valley Bank 6.5 6.6 6.7 6.8 7.1 7.3 7.4 Bank of Providence 4. 5.4 5.8 6. 6.7 8.5 9.3 10.0 Mean
More informationChapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.
-3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data
More informationMath 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment
Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class
More informationSome estimates of the height of the podium
Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40
More informationDescribing Data: One Quantitative Variable
STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive
More informationChapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1
Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and
More informationNumerical Descriptions of Data
Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =
More informationNOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS
NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows
More informationDescriptive Statistics (Devore Chapter One)
Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf
More informationSTA 248 H1S Winter 2008 Assignment 1 Solutions
1. (a) Measures of location: STA 248 H1S Winter 2008 Assignment 1 Solutions i. The mean, 100 1=1 x i/100, can be made arbitrarily large if one of the x i are made arbitrarily large since the sample size
More informationA LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]
1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders
More informationDATA HANDLING Five-Number Summary
DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest
More informationSimple Descriptive Statistics
Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency
More informationLecture 18 Section Mon, Feb 16, 2009
The s the Lecture 18 Section 5.3.4 Hampden-Sydney College Mon, Feb 16, 2009 Outline The s the 1 2 3 The 4 s 5 the 6 The s the Exercise 5.12, page 333. The five-number summary for the distribution of income
More informationLecture 18 Section Mon, Sep 29, 2008
The s the Lecture 18 Section 5.3.4 Hampden-Sydney College Mon, Sep 29, 2008 Outline The s the 1 2 3 The 4 s 5 the 6 The s the Exercise 5.12, page 333. The five-number summary for the distribution of income
More informationData that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.
Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer
More information3.1 Measures of Central Tendency
3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent
More informationChapter 3 Descriptive Statistics: Numerical Measures Part A
Slides Prepared by JOHN S. LOUCKS St. Edward s University Slide 1 Chapter 3 Descriptive Statistics: Numerical Measures Part A Measures of Location Measures of Variability Slide Measures of Location Mean
More informationOverview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution
PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations
More informationPercentiles, STATA, Box Plots, Standardizing, and Other Transformations
Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go
More informationChapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1
Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.
More informationSTAT 113 Variability
STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2
More informationAverages and Variability. Aplia (week 3 Measures of Central Tendency) Measures of central tendency (averages)
Chapter 4 Averages and Variability Aplia (week 3 Measures of Central Tendency) Chapter 5 (omit 5.2, 5.6, 5.8, 5.9) Aplia (week 4 Measures of Variability) Measures of central tendency (averages) Measures
More informationBasic Procedure for Histograms
Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that
More informationApplications of Data Dispersions
1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given
More informationStatistics vs. statistics
Statistics vs. statistics Question: What is Statistics (with a capital S)? Definition: Statistics is the science of collecting, organizing, summarizing and interpreting data. Note: There are 2 main ways
More informationHow Wealthy Are Europeans?
How Wealthy Are Europeans? Grades: 7, 8, 11, 12 (course specific) Description: Organization of data of to examine measures of spread and measures of central tendency in examination of Gross Domestic Product
More informationStandardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis
Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem
More informationPutting Things Together Part 2
Frequency Putting Things Together Part These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for, and are in
More informationSection3-2: Measures of Center
Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number
More informationBoth the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.
Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of
More informationDescription of Data I
Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret
More informationIOP 201-Q (Industrial Psychological Research) Tutorial 5
IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,
More informationMeasures of Central Tendency Lecture 5 22 February 2006 R. Ryznar
Measures of Central Tendency 11.220 Lecture 5 22 February 2006 R. Ryznar Today s Content Wrap-up from yesterday Frequency Distributions The Mean, Median and Mode Levels of Measurement and Measures of Central
More informationDescriptive Statistics
Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations
More informationSummarising Data. Summarising Data. Examples of Types of Data. Types of Data
Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017
More informationThe Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).
We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center
More informationCenter and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.
Chapter 3 Section3-: Measures of Center Section 3-3: Measurers of Variation Section 3-4: Measures of Relative Standing Section 3-5: Exploratory Data Analysis Describing Distributions with Numbers The overall
More informationStandard Deviation. Lecture 18 Section Robb T. Koether. Hampden-Sydney College. Mon, Sep 26, 2011
Standard Deviation Lecture 18 Section 5.3.4 Robb T. Koether Hampden-Sydney College Mon, Sep 26, 2011 Robb T. Koether (Hampden-Sydney College) Standard Deviation Mon, Sep 26, 2011 1 / 42 Outline 1 Variability
More informationMeasures of Variability
Sample I: 30, 35, 40, 45, 50, 55, 60, 65, 70 Sample II: 30, 41, 48, 49, 50, 51, 52, 59, 70 Sample III: 41, 45, 48, 49, 50, 51, 52, 55, 59 Sample I: 30, 35, 40, 45, 50, 55, 60, 65, 70 Sample II: 30, 41,
More informationMath 140 Introductory Statistics. First midterm September
Math 140 Introductory Statistics First midterm September 23 2010 Box Plots Graphical display of 5 number summary Q1, Q2 (median), Q3, max, min Outliers If a value is more than 1.5 times the IQR from the
More informationDATA SUMMARIZATION AND VISUALIZATION
APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296
More informationWk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12)
Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Descriptive statistics: - Measures of centrality (Mean, median, mode, trimmed mean) - Measures of spread (MAD, Standard deviation, variance) -
More informationChapter 5 Normal Probability Distributions
Chapter 5 Normal Probability Distributions Section 5-1 Introduction to Normal Distributions and the Standard Normal Distribution A The normal distribution is the most important of the continuous probability
More informationMidterm Test 1 (Sample) Student Name (PRINT):... Student Signature:... Use pencil, so that you can erase and rewrite if necessary.
MA 180/418 Midterm Test 1 (Sample) Student Name (PRINT):............................................. Student Signature:................................................... Use pencil, so that you can erase
More information3.3-Measures of Variation
3.3-Measures of Variation Variation: Variation is a measure of the spread or dispersion of a set of data from its center. Common methods of measuring variation include: 1. Range. Standard Deviation 3.
More informationQuantitative Analysis and Empirical Methods
3) Descriptive Statistics Sciences Po, Paris, CEE / LIEPP Introduction Data and statistics Introduction to distributions Measures of central tendency Measures of dispersion Skewness Data and Statistics
More informationCategorical. A general name for non-numerical data; the data is separated into categories of some kind.
Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,
More informationMEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION
MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION 1 Day 3 Summer 2017.07.31 DISTRIBUTION Symmetry Modality 单峰, 双峰 Skewness 正偏或负偏 Kurtosis 2 3 CHAPTER 4 Measures of Central Tendency 集中趋势
More informationBasic Sta)s)cs. Describing Data Measures of Spread
Basic Sta)s)cs Describing Data Measures of Spread Describing Data Learning Inten7ons Today we will understand: } Measures of Spread * Calculate the range of a sample * Determine quar7les and interquar7le
More informationDescriptive Statistics
Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs
More informationStat 101 Exam 1 - Embers Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.
More informationLecture Week 4 Inspecting Data: Distributions
Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your
More informationDescriptive Analysis
Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable
More informationCHAPTER 2 Describing Data: Numerical
CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of
More information4. DESCRIPTIVE STATISTICS
4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in
More informationSTA Module 3B Discrete Random Variables
STA 2023 Module 3B Discrete Random Variables Learning Objectives Upon completing this module, you should be able to 1. Determine the probability distribution of a discrete random variable. 2. Construct
More informationUsing the TI-83 Statistical Features
Entering data (working with lists) Consider the following small data sets: Using the TI-83 Statistical Features Data Set 1: {1, 2, 3, 4, 5} Data Set 2: {2, 3, 4, 4, 6} Press STAT to access the statistics
More informationGraphical and Tabular Methods in Descriptive Statistics. Descriptive Statistics
Graphical and Tabular Methods in Descriptive Statistics MATH 3342 Section 1.2 Descriptive Statistics n Graphs and Tables n Numerical Summaries Sections 1.3 and 1.4 1 Why graph data? n The amount of data
More informationA.REPRESENTATION OF DATA
A.REPRESENTATION OF DATA (a) GRAPHS : PART I Q: Why do we need a graph paper? Ans: You need graph paper to draw: (i) Histogram (ii) Cumulative Frequency Curve (iii) Frequency Polygon (iv) Box-and-Whisker
More informationMATH SPEAK - TO BE UNDERSTOOD AND MEMORIZED
FOM 11 T9 STANDARD DEVIATION 1 MATH SPEAK - TO BE UNDERSTOOD AND MEMORIZED 1) STATISTICS = the branch of mathematics used to analyze and interpret data. 2) DATA = information, often in the form of numbers,
More informationVariance, Standard Deviation Counting Techniques
Variance, Standard Deviation Counting Techniques Section 1.3 & 2.1 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston 1 / 52 Outline 1 Quartiles 2 The 1.5IQR Rule 3 Understanding
More informationMAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw
MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment
More informationKey: 18 5 = 1.85 cm. 5 a Stem Leaf. Key: 2 0 = 20 points. b Stem Leaf. Key: 2 0 = 20 cm. 6 a Stem Leaf. Key: 4 3 = 43 cm.
Answers EXERCISE. D D C B Numerical: a, b, c Categorical: c, d, e, f, g Discrete: c Continuous: a, b C C Categorical B A Categorical and ordinal Discrete Ordinal D EXERCISE. Stem Key: = Stem Key: = $ The
More informationNumerical Descriptive Measures. Measures of Center: Mean and Median
Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where
More informationKING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section
KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 11: BUSINESS STATISTICS I Semester 04 Major Exam #1 Sunday March 7, 005 Please circle your instructor
More informationReview: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59.
Review: Chebyshev s Rule Measures of Dispersion II Tom Ilvento STAT 200 Is based on a mathematical theorem for any data At least ¾ of the measurements will fall within ± 2 standard deviations from the
More informationStatistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)
Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x
More informationChapter 4 Variability
Chapter 4 Variability PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter and Larry B. Wallnau Chapter 4 Learning Outcomes 1 2 3 4 5
More informationSTAB22 section 1.3 and Chapter 1 exercises
STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea
More informationFrequency Distribution and Summary Statistics
Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary
More informationVARIABILITY: Range Variance Standard Deviation
VARIABILITY: Range Variance Standard Deviation Measures of Variability Describe the extent to which scores in a distribution differ from each other. Distance Between the Locations of Scores in Three Distributions
More informationMeasures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean
Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values
More informationFound under MATH NUM
While you wait Edit the last line of your z-score program : Disp round(z, 2) Found under MATH NUM Bluman, Chapter 6 1 Sec 6.2 Bluman, Chapter 6 2 Bluman, Chapter 6 3 6.2 Applications of the Normal Distributions
More informationNOTES: Chapter 4 Describing Data
NOTES: Chapter 4 Describing Data Intro to Statistics COLYER Spring 2017 Student Name: Page 2 Section 4.1 ~ What is Average? Objective: In this section you will understand the difference between the three
More informationCSC Advanced Scientific Programming, Spring Descriptive Statistics
CSC 223 - Advanced Scientific Programming, Spring 2018 Descriptive Statistics Overview Statistics is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions.
More informationEmpirical Rule (P148)
Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall
More informationAP Statistics Chapter 6 - Random Variables
AP Statistics Chapter 6 - Random 6.1 Discrete and Continuous Random Objective: Recognize and define discrete random variables, and construct a probability distribution table and a probability histogram
More informationEdexcel past paper questions
Edexcel past paper questions Statistics 1 Chapters 2-4 (Discrete) Statistics 1 Chapters 2-4 (Discrete) Page 1 Stem and leaf diagram Stem-and-leaf diagrams are used to represent data in its original form.
More informationStatistics S1 Advanced/Advanced Subsidiary
Paper Reference(s) 6683/01 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Tuesday 10 June 2014 Morning Time: 1 hour 30 minutes Materials required for examination Mathematical Formulae (Pink) Items
More information(And getting familiar with R) Jan. 8th, School of Information, University of Michigan. SI 544 Descriptive Statistics
(And getting familiar with R) School of Information, University of Michigan Jan. 8th, 2007 last lecture In this course: descriptive simply quantitatively and visually analyze data inferential try to reach
More information2CORE. Summarising numerical data: the median, range, IQR and box plots
C H A P T E R 2CORE Summarising numerical data: the median, range, IQR and box plots How can we describe a distribution with just one or two statistics? What is the median, how is it calculated and what
More informationDavid Tenenbaum GEOG 090 UNC-CH Spring 2005
Simple Descriptive Statistics Review and Examples You will likely make use of all three measures of central tendency (mode, median, and mean), as well as some key measures of dispersion (standard deviation,
More informationSolutions for practice questions: Chapter 9, Statistics
Solutions for practice questions: Chapter 9, Statistics If you find any errors, please let me know at mailto:msfrisbie@pfrisbie.com. 1. We know that µ is the mean of 30 values of y, 30 30 i= 1 2 ( y i
More informationMVE051/MSG Lecture 7
MVE051/MSG810 2017 Lecture 7 Petter Mostad Chalmers November 20, 2017 The purpose of collecting and analyzing data Purpose: To build and select models for parts of the real world (which can be used for
More information5.3 Standard Deviation
Math 2201 Date: 5.3 Standard Deviation Standard Deviation We looked at range as a measure of dispersion, or spread of a data set. The problem with using range is that it is only a measure of how spread
More informationSection Distributions of Random Variables
Section 8.1 - Distributions of Random Variables Definition: A random variable is a rule that assigns a number to each outcome of an experiment. Example 1: Suppose we toss a coin three times. Then we could
More information2 2 In general, to find the median value of distribution, if there are n terms in the distribution the
THE MEDIAN TEMPERATURES MEDIAN AND CUMULATIVE FREQUENCY The median is the third type of statistical average you will use in his course. You met the other two, the mean and the mode in pack MS4. THE MEDIAN
More informationNumerical Measurements
El-Shorouk Academy Acad. Year : 2013 / 2014 Higher Institute for Computer & Information Technology Term : Second Year : Second Department of Computer Science Statistics & Probabilities Section # 3 umerical
More informationDiploma in Financial Management with Public Finance
Diploma in Financial Management with Public Finance Cohort: DFM/09/FT Jan Intake Examinations for 2009 Semester II MODULE: STATISTICS FOR FINANCE MODULE CODE: QUAN 1103 Duration: 2 Hours Reading time:
More information9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives
Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical
More informationFINALS REVIEW BELL RINGER. Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/ /2 4
FINALS REVIEW BELL RINGER Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/3 + 7 + 1/2 4 4) 3 + 4 ( 7) + 3 + 4 ( 2) 1) 36/6 4/6 + 3/6 32/6 + 3/6 35/6
More informationMath 14, Homework 6.2 p. 337 # 3, 4, 9, 10, 15, 18, 19, 21, 22 Name
Name 3. Population in U.S. Jails The average daily jail population in the United States is 706,242. If the distribution is normal and the standard deviation is 52,145, find the probability that on a randomly
More informationCopyright 2005 Pearson Education, Inc. Slide 6-1
Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is
More information