Exploratory Data Analysis

Size: px
Start display at page:

Download "Exploratory Data Analysis"

Transcription

1 Exploratory Data Analysis Stemplots (or Stem-and-leaf plots) Stemplot and Boxplot T -- leading digits are called stems T -- final digits are called leaves STAT 74 Descriptive Statistics 2 Example: (number of hysterectomies performed by 5 male doctors) 27, 5,, 25, 6, 25, 5,, 7, 44, 2, 6, 59, 4, STAT 74 Descriptive Statistics Example: Number of hysterectomies performed by 5 male doctors: 27, 5,, 25, 6, 25, 5,, 7, 44, 2, 6, 59, 4, 2 by female doctors, the numbers are: 5, 7,, 4,, 9, 25, 29,, () () STAT 74 Descriptive Statistics 4 Back-to-back stem-plot () () Example: (Height data with gender) :, 6, 64, 65, 65, 65, 66, 67 : 6, 64, 69, 7, 7, 7, 72, 72, 72, 72, 7, 74, 75 (See data sheet) Back-to-back Split-back-to-back 4 6* 4 * => # 9 # => 5-9 7* # 5 STAT 74 Descriptive Statistics 5 STAT 74 Descriptive Statistics 6

2 Order Statistics (Hogg & Tanis) Examine Distribution Let x, x 2,, x n be a random sample from a distribution of continuous type. Let y be the smallest of x i s, y 2 the next smallest x i s in order of magnitude, and so y n be the largest of x i s, i.e., y < y 2 < <y n. Then y i, for i =, 2,, n, is called the order statistic of x i. STAT 74 Descriptive Statistics 7 STAT 74 Descriptive Statistics Percentile (measure of location) If < p <, the p th sample percentile, denote as has approximately np sample observations less than it and also n( p) sample observations great than it, and it is undefined if p < n or p >. n + n + To find ~ π p :. Compute (n + ) p [position index] 2. If (n + ) p is an integer, then the p th sample percentile ~ π p= y (n+)p a not an integer, and is equal to r + then b ~ a π p = y r + ( y r + yr ) r is an integer b a a a is a fraction = ( ) y r + y r+ b b b STAT 74 Descriptive Statistics 9 STAT 74 Descriptive Statistics Example: [odd number of data],6,6,64,64,65,65,65,66,67,69,7,7,7,72,72,72,72,7,74,75 Q = 64.5 Median = 69 Q = 72 To find ~π.25 = ~q (the first quartile), a (n + ) p = (2 + ).25 = 5.5 => r = 5 & =.5 b = (.5) = 64.5 ~π.25 Quartiles: The first quartile, Q, or 25 th percentile, is the median of the lower half of the list of ordered observations. The third quartile, Q, or 75 th percentile, is the median of the upper half of the list of ordered observations. ( m ~ = 69, ~π. 75 = 72 ) STAT 74 Descriptive Statistics STAT 74 Descriptive Statistics 2 2

3 Example: (data sheet) [odd number of data],6,6,64,64,65,65,65,66,67,69,7,7,7,72,72,72,72,7,74,75 Q = 64.5 Median = 69 Q = 72 [even number of data] 6,,6,6,64,64,65,65,65,66,67,69,7,7,7,72,72,72,72,7,74,75 Q = 64 Median = 6 Q = 72 IQR = = Interquartile range (IQR) = Q Q IQR = = 7.5 STAT 74 Descriptive Statistics STAT 74 Descriptive Statistics 4 Example: (data sheet without outlier 6 ) The five-number summary.minimum value.q.median.q.maximum value Min =, Q = 64.5, Median = 69, Q = 72, Max = STAT 74 Descriptive Statistics 5 STAT 74 Descriptive Statistics 6 Inner and outer fences Mild and Extreme outliers The inner fences are located at a distance of.5 IQR below Q and at a distance of.5 IQR above Q. The outer fences are located at a distance of IQR below Q and at a distance of IQR above Q. T Data values falling between the inner and outer fences are considered mild outliers. T Data values falling outside the outer fences are considered extreme outliers. When outliers exist, the whisker extended to the smallest and largest data values within the inner fence. STAT 74 Descriptive Statistics 7 STAT 74 Descriptive Statistics

4 Outer fence Inner fence IQR Inner fence 4 4 Outer fence STAT 74 Descriptive Statistics 9 STAT 74 Descriptive Statistics 2 Side-by-side Box Plot 4 Quantile For data values y, y 2,,y n such that y y 2 y n in a sample data set, y r is the quantile of order r/(n+) Quantile-quantile plot or q-q plot is a plot that plotting quantiles of one sample against quantiles of another sample or a distribution. sex STAT 74 Descriptive Statistics 2 STAT 74 Descriptive Statistics 22 Quantile-Quantile Plot Examine the distributions of the two samples: Quantile-Quantile Plot Sample 2 quantile Sample :.5,, 7.4,, 9.2,.9 Sample 2: -.92, -., -.2,.2,.62,.4.2 (,.2) Sample quantile STAT 74 Descriptive Statistics 2 STAT 74 Descriptive Statistics 24 4

5 Quantile-Quantile Plot Sample 2 quantile Examine Distribution with Quantile-Quantile Plot.2 (,.2) Sample quantile Two distributions are about the same. Two distributions are not the same. STAT 74 Descriptive Statistics 25 STAT 74 Descriptive Statistics 26 z-percentile Example:,, 7,, 9, n = 6, and the value is the [4/(6+)]th quantile, i.e.,.57 quantile or 57.th percentile of the distribution of the sample. The 57.th percentile of a standard normal distribution is a z-score that has 57.% of the distribution below it. Therefore, use.57 for area (.574 is closest to it) and look for z score corresponding to it, and it is around.. So, z-percentile for this.57 quantile is.. (sample quantile, z-percentile) => (,.) Normal Quantile Plot Z-percentile (,.) 5 Sample quantile STAT 74 Descriptive Statistics 27 STAT 74 Descriptive Statistics 2 Examine Distribution with Normal Quantile Plot Examine Other Distributions Close to Normal Not Normal T Find the quantiles of the data and the corresponding percentiles of desired distribution to be examined. T Plot the ordered pairs of sample quantiles and their corresponding percentiles of desired distribution. T Examine if it has the straight line pattern. STAT 74 Descriptive Statistics 29 STAT 74 Descriptive Statistics 5

6 Sample Quantile Let x () denote the smallest sample observation, x (2) the 2 nd smallest sample observation, and the x (n) the largest sample observation. For, i =, 2,, n, x (i) is the [(i-.5)/n]th sample quantile. (Approximation) Example:,, 7,, 9, Grouping and Displaying Categorical Data n = 6, and the value, x (4), is the [(4-.5)/6]th quantile of the distribution of the sample, i.e..5 sample quantile or 5. percentile. STAT 74 Descriptive Statistics STAT 74 Descriptive Statistics 2 Frequency Table and Charts 7 Class Frequency Relative Frequency 9 9/22 =.49 = 4.9% /22 =.59 = 59.% Total 22 % Bar Chart Pie Chart Examine Bivariate Data 5 4.9% 4 59.% 2 Pe rc ent STAT 74 sex Descriptive Statistics STAT 74 Descriptive Statistics 4 Two Categorical Variables Contingency Table Cancer No cancer Row Total Smoker 2 5 Non- Smoker Column Total Percent 4 2 Cluster bar chart Cancer Odds of smoker to have cancer: 2/ = 6/9 Odds of nonsmoker to have cancer: 5/45 = /9 Odds Ratio = (6/9)/(/9) = 6 STAT 74 Descriptive Statistics 5 Have Cancer Do not have Cancer Smoke Do not Smoke Smoking Status STAT 74 Descriptive Statistics 6 6

7 Two Quantitative variables Data: Temperature Mortality Index STAT 74 Descriptive Statistics 7 Two Quantitative variables Average annual temperature and the mortality index for a type of breast cancer in women in certain region of Europe. Mortality Index STAT 74 Average Temperature Descriptive Statistics 5 A Categorical & A Quantitative Variables Side-by-side Boxplot 4 9 Time Plot Youngstown Homicide Rate by Year Value HRATE YEAR sex STAT 74 Descriptive Statistics 9 STAT 74 Descriptive Statistics 4 Time Plot Time Plot National Homicide Rate By Year National Homicide Rate By Year 2 9 National Homicide Rate National Homicide Rate YEAR YEAR STAT 74 Descriptive Statistics 4 STAT 74 Descriptive Statistics 42 7

2 Exploring Univariate Data

2 Exploring Univariate Data 2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

Empirical Rule (P148)

Empirical Rule (P148) Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25 Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics You can t see this text! Introduction to Computational Finance and Financial Econometrics Descriptive Statistics Eric Zivot Summer 2015 Eric Zivot (Copyright 2015) Descriptive Statistics 1 / 28 Outline

More information

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Categorical. A general name for non-numerical data; the data is separated into categories of some kind. Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,

More information

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows

More information

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.

More information

Descriptive Statistics (Devore Chapter One)

Descriptive Statistics (Devore Chapter One) Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf

More information

Numerical Descriptions of Data

Numerical Descriptions of Data Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

Session 5: Associations

Session 5: Associations Session 5: Associations Li (Sherlly) Xie http://www.nemoursresearch.org/open/statclass/february2013/ Session 5 Flow 1. Bivariate data visualization Cross-Tab Stacked bar plots Box plot Scatterplot 2. Correlation

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

Description of Data I

Description of Data I Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret

More information

Frequency Distribution and Summary Statistics

Frequency Distribution and Summary Statistics Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,

More information

Some estimates of the height of the podium

Some estimates of the height of the podium Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40

More information

STAT 113 Variability

STAT 113 Variability STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2

More information

Descriptive Statistics Bios 662

Descriptive Statistics Bios 662 Descriptive Statistics Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-08-19 08:51 BIOS 662 1 Descriptive Statistics Descriptive Statistics Types of variables

More information

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line. Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,

More information

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar Measures of Central Tendency 11.220 Lecture 5 22 February 2006 R. Ryznar Today s Content Wrap-up from yesterday Frequency Distributions The Mean, Median and Mode Levels of Measurement and Measures of Central

More information

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017

More information

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet.

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. 1 Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact, be aware that the

More information

Exploring Data and Graphics

Exploring Data and Graphics Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data

More information

2CORE. Summarising numerical data: the median, range, IQR and box plots

2CORE. Summarising numerical data: the median, range, IQR and box plots C H A P T E R 2CORE Summarising numerical data: the median, range, IQR and box plots How can we describe a distribution with just one or two statistics? What is the median, how is it calculated and what

More information

How Wealthy Are Europeans?

How Wealthy Are Europeans? How Wealthy Are Europeans? Grades: 7, 8, 11, 12 (course specific) Description: Organization of data of to examine measures of spread and measures of central tendency in examination of Gross Domestic Product

More information

Measures of Variability

Measures of Variability Sample I: 30, 35, 40, 45, 50, 55, 60, 65, 70 Sample II: 30, 41, 48, 49, 50, 51, 52, 59, 70 Sample III: 41, 45, 48, 49, 50, 51, 52, 55, 59 Sample I: 30, 35, 40, 45, 50, 55, 60, 65, 70 Sample II: 30, 41,

More information

Descriptive Statistics

Descriptive Statistics Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Math Take Home Quiz on Chapter 2

Math Take Home Quiz on Chapter 2 Math 116 - Take Home Quiz on Chapter 2 Show the calculations that lead to the answer. Due date: Tuesday June 6th Name Time your class meets Provide an appropriate response. 1) A newspaper surveyed its

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences. STAB22H3 Statistics I Duration: 1 hour and 45 minutes

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences. STAB22H3 Statistics I Duration: 1 hour and 45 minutes UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences STAB22H3 Statistics I Duration: 1 hour and 45 minutes Last Name: First Name: Student number: Aids allowed: - One handwritten

More information

MAT 1371 Midterm. This is a closed book examination. However one sheet is permitted. Only non-programmable and non-graphic calculators are permitted.

MAT 1371 Midterm. This is a closed book examination. However one sheet is permitted. Only non-programmable and non-graphic calculators are permitted. MAT 1371 Midterm Duration: 80 minutes Professor G. Lamothe Student Number: Last Name: First Name: This is a closed book examination. However one sheet is permitted. Only non-programmable and non-graphic

More information

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 11: BUSINESS STATISTICS I Semester 04 Major Exam #1 Sunday March 7, 005 Please circle your instructor

More information

Math 140 Introductory Statistics. First midterm September

Math 140 Introductory Statistics. First midterm September Math 140 Introductory Statistics First midterm September 23 2010 Box Plots Graphical display of 5 number summary Q1, Q2 (median), Q3, max, min Outliers If a value is more than 1.5 times the IQR from the

More information

Edexcel past paper questions

Edexcel past paper questions Edexcel past paper questions Statistics 1 Chapters 2-4 (Discrete) Statistics 1 Chapters 2-4 (Discrete) Page 1 Stem and leaf diagram Stem-and-leaf diagrams are used to represent data in its original form.

More information

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers. Chapter 3 Section3-: Measures of Center Section 3-3: Measurers of Variation Section 3-4: Measures of Relative Standing Section 3-5: Exploratory Data Analysis Describing Distributions with Numbers The overall

More information

1) What is the range of the data shown in the box and whisker plot? 2) True or False: 75% of the data falls between 6 and 12.

1) What is the range of the data shown in the box and whisker plot? 2) True or False: 75% of the data falls between 6 and 12. DO NOW 1) What is the range of the data shown in the box and whisker plot? 2) True or False: 75% of the data falls between 6 and 12. May 21 7:19 AM 1. 3 2. 2 3. 1 4. 4 5. 2 6. Min = 17, Q 1 = 18.5, Med

More information

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data. -3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data

More information

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem

More information

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values

More information

Source: Fall 2015 Biostats 540 Exam I. BIOSTATS 540 Fall 2016 Practice Test for Unit 1 Summarizing Data Page 1 of 6

Source: Fall 2015 Biostats 540 Exam I. BIOSTATS 540 Fall 2016 Practice Test for Unit 1 Summarizing Data Page 1 of 6 BIOSTATS 540 Fall 2016 Practice Test for Unit 1 Summarizing Data Page 1 of 6 Source: Fall 2015 Biostats 540 Exam I. 1. 1a. The U.S. Census Bureau reports the median family income in its summary of census

More information

NCSS Statistical Software. Reference Intervals

NCSS Statistical Software. Reference Intervals Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and

More information

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical

More information

Section3-2: Measures of Center

Section3-2: Measures of Center Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number

More information

Simple Descriptive Statistics

Simple Descriptive Statistics Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information

The Normal Distribution

The Normal Distribution Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

(a) salary of a bank executive (measured in dollars) quantitative. (c) SAT scores of students at Millersville University quantitative

(a) salary of a bank executive (measured in dollars) quantitative. (c) SAT scores of students at Millersville University quantitative Millersville University Name Answer Key Department of Mathematics MATH 130, Elements of Statistics I, Test 1 February 8, 2010, 10:00AM-10:50AM Please answer the following questions. Your answers will be

More information

Lecture Week 4 Inspecting Data: Distributions

Lecture Week 4 Inspecting Data: Distributions Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your

More information

3.1 Measures of Central Tendency

3.1 Measures of Central Tendency 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent

More information

Romero Catholic Academy Gender Pay Reporting Findings

Romero Catholic Academy Gender Pay Reporting Findings Romero Catholic Academy Gender Pay Reporting Findings March 2018 Introduction In light of the recent Government Regulations regarding Mandatory Gender Pay Gap Reporting, Total Reward Group have been tasked

More information

Variance, Standard Deviation Counting Techniques

Variance, Standard Deviation Counting Techniques Variance, Standard Deviation Counting Techniques Section 1.3 & 2.1 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston 1 / 52 Outline 1 Quartiles 2 The 1.5IQR Rule 3 Understanding

More information

Chapter 3. Lecture 3 Sections

Chapter 3. Lecture 3 Sections Chapter 3 Lecture 3 Sections 3.4 3.5 Measure of Position We would like to compare values from different data sets. We will introduce a z score or standard score. This measures how many standard deviation

More information

Section 6-1 : Numerical Summaries

Section 6-1 : Numerical Summaries MAT 2377 (Winter 2012) Section 6-1 : Numerical Summaries With a random experiment comes data. In these notes, we learn techniques to describe the data. Data : We will denote the n observations of the random

More information

MATH 217 Test 2 Version A

MATH 217 Test 2 Version A MATH 217 Test 2 Version A Name: KEY Sec Number: Answer all questions to the best of your ability. Note you should show as much work as is possible. For questions answered using Excel be sure to include

More information

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go

More information

Days Traveling Frequency Relative Frequency Percent Frequency % % 35 and above 1 Total %

Days Traveling Frequency Relative Frequency Percent Frequency % % 35 and above 1 Total % Math 1351 Activity 1(Chapter 10)(Due by end of class Feb. 15) Group # 1. A business magazine was conducting a study into the amount of travel required for managers across the U.S. Seventy-five managers

More information

Diploma in Financial Management with Public Finance

Diploma in Financial Management with Public Finance Diploma in Financial Management with Public Finance Cohort: DFM/09/FT Jan Intake Examinations for 2009 Semester II MODULE: STATISTICS FOR FINANCE MODULE CODE: QUAN 1103 Duration: 2 Hours Reading time:

More information

MSM Course 1 Flashcards. Associative Property. base (in numeration) Commutative Property. Distributive Property. Chapter 1 (p.

MSM Course 1 Flashcards. Associative Property. base (in numeration) Commutative Property. Distributive Property. Chapter 1 (p. 1 Chapter 1 (p. 26, 1-5) Associative Property Associative Property: The property that states that for three or more numbers, their sum or product is always the same, regardless of their grouping. 2 3 8

More information

Chapter 3: Displaying and Describing Quantitative Data Quiz A Name

Chapter 3: Displaying and Describing Quantitative Data Quiz A Name Chapter 3: Displaying and Describing Quantitative Data Quiz A Name 3.1.1 Find summary statistics; create displays; describe distributions; determine 1. Following is a histogram of salaries (in $) for a

More information

Getting to know a data-set (how to approach data) Overview: Descriptives & Graphing

Getting to know a data-set (how to approach data) Overview: Descriptives & Graphing Overview: Descriptives & Graphing 1. Getting to know a data set 2. LOM & types of statistics 3. Descriptive statistics 4. Normal distribution 5. Non-normal distributions 6. Effect of skew on central tendency

More information

Private Motor Insurance Statistics

Private Motor Insurance Statistics 2015 Private Motor Insurance Statistics 1 Contents Contents... 2 Executive Summary... 3 1 Introduction... 4 1.1 Data used in the Report... 4 1.2 Key Factors... 4 2 General Market Overview... 6 2.1 Exposure...

More information

Table of Contents. New to the Second Edition... Chapter 1: Introduction : Social Research...

Table of Contents. New to the Second Edition... Chapter 1: Introduction : Social Research... iii Table of Contents Preface... xiii Purpose... xiii Outline of Chapters... xiv New to the Second Edition... xvii Acknowledgements... xviii Chapter 1: Introduction... 1 1.1: Social Research... 1 Introduction...

More information

Math146 - Chapter 3 Handouts. The Greek Alphabet. Source: Page 1 of 39

Math146 - Chapter 3 Handouts. The Greek Alphabet. Source:   Page 1 of 39 Source: www.mathwords.com The Greek Alphabet Page 1 of 39 Some Miscellaneous Tips on Calculations Examples: Round to the nearest thousandth 0.92431 0.75693 CAUTION! Do not truncate numbers! Example: 1

More information

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations

More information

FINALS REVIEW BELL RINGER. Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/ /2 4

FINALS REVIEW BELL RINGER. Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/ /2 4 FINALS REVIEW BELL RINGER Simplify the following expressions without using your calculator. 1) 6 2/3 + 1/2 2) 2 * 3(1/2 3/5) 3) 5/3 + 7 + 1/2 4 4) 3 + 4 ( 7) + 3 + 4 ( 2) 1) 36/6 4/6 + 3/6 32/6 + 3/6 35/6

More information

Full file at Chapter 2 Descriptive Statistics: Tabular and Graphical Presentations

Full file at   Chapter 2 Descriptive Statistics: Tabular and Graphical Presentations Descriptive Statistics: Tabular and Graphical Presentations Learning Objectives 1. Learn how to construct and interpret summarization procedures for qualitative data such as : frequency and relative frequency

More information

Sampling and Descriptive Statistics

Sampling and Descriptive Statistics Sampling and Descriptive Statistics Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University Reference: 1. W. Navidi. Statistics for Engineering and Scientists.

More information

Putting Things Together Part 2

Putting Things Together Part 2 Frequency Putting Things Together Part These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for, and are in

More information

4. DESCRIPTIVE STATISTICS

4. DESCRIPTIVE STATISTICS 4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in

More information

Monte Carlo Simulation (Random Number Generation)

Monte Carlo Simulation (Random Number Generation) Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...

More information

= P25 = Q1 = = P50 = Q2 = = = P75 = Q3

= P25 = Q1 = = P50 = Q2 = = = P75 = Q3 NOMINAL- No ordering of the items ORDINAL- are categorical data where there is a logical ordering to the categories. (The simplest ordinal scale is a ranking) INTERVAL- 0 is only an arbitrary reference

More information

Mini-Lecture 3.1 Measures of Central Tendency

Mini-Lecture 3.1 Measures of Central Tendency Mini-Lecture 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data 3. Explain what it means for a

More information

Name PID Section # (enrolled)

Name PID Section # (enrolled) STT 200 -Lecture 2 Instructor: Aylin ALIN 02/19/2014 Midterm # 1 A Name PID Section # (enrolled) * The exam is closed book and 80 minutes. * You may use a calculator and the formula sheet that you brought

More information

Review: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59.

Review: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59. Review: Chebyshev s Rule Measures of Dispersion II Tom Ilvento STAT 200 Is based on a mathematical theorem for any data At least ¾ of the measurements will fall within ± 2 standard deviations from the

More information

1. In a statistics class with 136 students, the professor records how much money each

1. In a statistics class with 136 students, the professor records how much money each so shows the data collected. student has in his or her possession during the first class of the semester. The histogram 1. In a statistics class with 136 students, the professor records how much money

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

SAMPLE. HSC formula sheet. Sphere V = 4 πr. Volume. A area of base

SAMPLE. HSC formula sheet. Sphere V = 4 πr. Volume. A area of base Area of an annulus A = π(r 2 r 2 ) R radius of the outer circle r radius of the inner circle HSC formula sheet Area of an ellipse A = πab a length of the semi-major axis b length of the semi-minor axis

More information

Manual for the TI-83, TI-84, and TI-89 Calculators

Manual for the TI-83, TI-84, and TI-89 Calculators Manual for the TI-83, TI-84, and TI-89 Calculators to accompany Mendenhall/Beaver/Beaver s Introduction to Probability and Statistics, 13 th edition James B. Davis Contents Chapter 1 Introduction...4 Chapter

More information

3) Marital status of each member of a randomly selected group of adults is an example of what type of variable?

3) Marital status of each member of a randomly selected group of adults is an example of what type of variable? MATH112 STATISTICS; REVIEW1 CH1,2,&3 Name CH1 Vocabulary 1) A statistics student wants to find some information about all college students who ride a bike. She collected data from other students in her

More information

22.2 Shape, Center, and Spread

22.2 Shape, Center, and Spread Name Class Date 22.2 Shape, Center, and Spread Essential Question: Which measures of center and spread are appropriate for a normal distribution, and which are appropriate for a skewed distribution? Eplore

More information

Ti 83/84. Descriptive Statistics for a List of Numbers

Ti 83/84. Descriptive Statistics for a List of Numbers Ti 83/84 Descriptive Statistics for a List of Numbers Quiz scores in a (fictitious) class were 10.5, 13.5, 8, 12, 11.3, 9, 9.5, 5, 15, 2.5, 10.5, 7, 11.5, 10, and 10.5. It s hard to get much of a sense

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

Test Bank Elementary Statistics 2nd Edition William Navidi

Test Bank Elementary Statistics 2nd Edition William Navidi Test Bank Elementary Statistics 2nd Edition William Navidi Completed downloadable package TEST BANK for Elementary Statistics 2nd Edition by William Navidi, Barry Monk: https://testbankreal.com/download/elementary-statistics-2nd-edition-test-banknavidi-monk/

More information

CHAPTER TWO Descriptive Statistics

CHAPTER TWO Descriptive Statistics 5 CHAPTER TWO Descriptive Statistics 2. Introduction The description of a data set includes, among, other things: Presentation of the data by tables and graphs. Examination of the overall shape of the

More information

Multiple Choice: Identify the choice that best completes the statement or answers the question.

Multiple Choice: Identify the choice that best completes the statement or answers the question. U8: Statistics Review Name: Date: Multiple Choice: Identify the choice that best completes the statement or answers the question. 1. A floral delivery company conducts a study to measure the effect of

More information

Applications of Data Dispersions

Applications of Data Dispersions 1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

Steps with data (how to approach data)

Steps with data (how to approach data) Descriptives & Graphing Lecture 3 Survey Research & Design in Psychology James Neill, 216 Creative Commons Attribution 4. Overview: Descriptives & Graphing 1. Steps with data 2. Level of measurement &

More information