Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1

Similar documents
Copyright 2005 Pearson Education, Inc. Slide 6-1

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

1 Describing Distributions with numbers

Test Bank Elementary Statistics 2nd Edition William Navidi

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

appstats5.notebook September 07, 2016 Chapter 5

Section3-2: Measures of Center

Numerical Descriptions of Data

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25

3.1 Measures of Central Tendency

DATA SUMMARIZATION AND VISUALIZATION

Empirical Rule (P148)

Some estimates of the height of the podium

Math Take Home Quiz on Chapter 2

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.

Lecture 1: Review and Exploratory Data Analysis (EDA)

2 Exploring Univariate Data

Review: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59.

Standard Deviation. Lecture 18 Section Robb T. Koether. Hampden-Sydney College. Mon, Sep 26, 2011

Mini-Lecture 3.1 Measures of Central Tendency

Chapter 3 Descriptive Statistics: Numerical Measures Part A

Lecture 2 Describing Data

Description of Data I

Numerical Measurements

CHAPTER 2 Describing Data: Numerical

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

Simple Descriptive Statistics

Lecture Week 4 Inspecting Data: Distributions

SOLUTIONS TO THE LAB 1 ASSIGNMENT

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Descriptive Statistics

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section

Describing Data: One Quantitative Variable

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Putting Things Together Part 2

Frequency Distribution and Summary Statistics

NOTES: Chapter 4 Describing Data

STATS DOESN T SUCK! ~ CHAPTER 4

Measures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Applications of Data Dispersions

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]

Unit 2 Statistics of One Variable

Exploratory Data Analysis

Refer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet.

4. DESCRIPTIVE STATISTICS

Lecture 18 Section Mon, Feb 16, 2009

Lecture 18 Section Mon, Sep 29, 2008

DATA ANALYSIS EXAM QUESTIONS

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Chapter 3: Displaying and Describing Quantitative Data Quiz A Name

Math146 - Chapter 3 Handouts. The Greek Alphabet. Source: Page 1 of 39

( ) P = = =

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar

Key: 18 5 = 1.85 cm. 5 a Stem Leaf. Key: 2 0 = 20 points. b Stem Leaf. Key: 2 0 = 20 cm. 6 a Stem Leaf. Key: 4 3 = 43 cm.

Math 227 Elementary Statistics. Bluman 5 th edition

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes. Standardizing normal distributions The Standard Normal Curve

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

STAT 113 Variability

Days Traveling Frequency Relative Frequency Percent Frequency % % 35 and above 1 Total %

STA 248 H1S Winter 2008 Assignment 1 Solutions

Measure of Variation

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Section Distributions of Random Variables

Counting Basics. Venn diagrams

Moments and Measures of Skewness and Kurtosis

Basic Procedure for Histograms

Averages and Variability. Aplia (week 3 Measures of Central Tendency) Measures of central tendency (averages)

Some Characteristics of Data

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Statistics I Chapter 2: Analysis of univariate data

IOP 201-Q (Industrial Psychological Research) Tutorial 5

2011 Pearson Education, Inc

Chapter 3. Lecture 3 Sections

Today s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation.

Tutorial Handout Statistics, CM-0128M Descriptive Statistics

Section 6-1 : Numerical Summaries

22.2 Shape, Center, and Spread

ECON 214 Elements of Statistics for Economists

Shifting and rescaling data distributions

Chapter 6: The Normal Distribution

Normal Model (Part 1)

2CORE. Summarising numerical data: the median, range, IQR and box plots

Statistics 114 September 29, 2012

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

The Normal Distribution

(a) salary of a bank executive (measured in dollars) quantitative. (c) SAT scores of students at Millersville University quantitative

Math 140 Introductory Statistics. First midterm September

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

Transcription:

Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1

Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 2

Section 3.1 Measures of Center Where is the sample distribution centered? Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 3

Figure 2.6 (Histogram Review) Cutpoint grouping. Weight of 18- to 24-year old males: (a) frequency histogram; (b) relative-frequency histogram Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 4

High and Low Temperatures in 71 US Cities (one-year) (chapter review problem 33, page 91) Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 5

Example 3.1: Weekly Salaries: Small Consulting Company wanted to compare the salaries of employees in the first half of summer to those placed in the second half of the Summer. Data Set I Data Set II Stem-and-leaf of Salary1 N = 13, Leaf Unit = 10 6 3 000000 (4) 4 0055 3 5 3 6 3 7 3 8 0 2 9 4 1 10 5 Stem-and-leaf of Salary2 N = 10, Leaf Unit = 10 5 3 00000 5 4 005 2 5 2 6 2 7 2 8 2 9 4 1 10 5 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 6

Definition 3.1 Mean of a Data Set The mean of a data set is the sum of the observations divided by the number of observations. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 7

Example 3.1: Weekly Salaries: Small Consulting Company wanted to compare the salaries of employees in the first half of summer to those placed in the second half of the Summer. Data Set I Order Statistics 300 300 300 300 300 300 400 400 450 450 800 904 1050 Stem-and-leaf of SALARY N = 13, Leaf Unit = 10 Stem-and-leaf of SALARY N = 13, Leaf Unit = 10 6 3 000000 (4) 4 5050 3 5 3 6 3 7 3 8 0 2 9 4 1 10 5 Order Leaves 6 3 000000 (4) 4 0055 3 5 3 6 3 7 3 8 0 2 9 4 1 10 5 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 8

Example 3.1: Weekly Salaries: Small Consulting Company wanted to compare the salaries of employees in the first half of summer to those placed in the second half of the Summer. Data Set I Order Statistics 300 300 300 300 300 300 400 400 450 450 800 904 1050 Stem-and-leaf of SALARY N = 13, Leaf Unit = 10 6 3 000000 (4) 4 0055 3 5 3 6 3 7 3 8 0 2 9 4 1 10 5 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 9

Example 3.1: Weekly Salaries: Small Consulting Company wanted to compare the salaries of employees in the first half of summer to those placed in the second half of the Summer. Data Set II Order Statistics 300 300 300 300 300 400 400 450 940 1050 Stem-and-leaf of Salary2 N = 10, Leaf Unit = 10 5 3 00000 5 4 005 2 5 2 6 2 7 2 8 2 9 4 1 10 5 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 10

Definition 3.4 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 11

Example 3.1: Weekly Salaries: Small Consulting Company wanted to compare the salaries of employees in the first half of summer to those placed in the second half of the Summer. Data Set I Data Set II Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 12

Definition 3.2 Median of a Data Set Arrange the data in increasing order. If the number of observations is odd, then the median is the observation exactly in the middle of the ordered list. If the number of observations is even, then the median is the mean of the two middle observations in the ordered list. In both cases, if we let n denote the number of observations, then the median is at position (n + 1) / 2 in the ordered list. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 13

Definition 3.3 Mode of a Data Set Find the frequency of each value in the data set. If no value occurs more than once, then the data set has no mode. Otherwise, any value that occurs with the greatest frequency is a mode of the data set. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 14

Example 3.1: Weekly Salaries: Small Consulting Company wanted to compare the salaries of employees in the first half of summer to those placed in the second half of the Summer. Data Set I Data Set II so, 5 th and 6 th values Set I: 300 300 300 300 300 300 400 400 450 450 800 940 1050 7 th value Set II: 300 300 300 300 300 400 400 450 940 1050 5 th value 6 th value Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 15

Tables 3.1, 3.2 & 3.4 Data Set I Data Set II Means, medians, and modes of salaries in Data Set I and Data Set II Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 16

Figure 3.1 Relative positions of the mean and median for (a) right-skewed, (b) symmetric, and (c) left-skewed distributions Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 17

Section 3.2 Measures of Variation Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 18

Figure 3.2 Five starting players on two basketball teams Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 19

Figure 3.3 Shortest and tallest starting players on the teams Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 20

Definition 3.5 Range of a Data Set The range of a data set is given by the formula Range = Max Min, where Max and Min denote the maximum and minimum observations, respectively. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 21

Definition 3.6 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 22

Key Fact 3.1 Variation and the Standard Deviation The more variation that there is in a data set, the larger is its standard deviation. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 23

Formula 3.1 numerator Σ(x i x) 2 = Σx i 2 (Σx i ) 2 /n Sample variance s 2 = Σ(x i x) 2 n 1 = Σx i 2 (Σx i ) 2 /n n 1 Sample standard deviation s = Σ(x i n 1 x) 2 = Σx i 2 (Σx i ) 2 /n n 1 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 24

Tables 3.10 & 3.11 Data sets that have different variation Means and standard deviations of the data sets in Table 3.10 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 25

Figures 3.5 and 3.6 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 26

Key Fact 3.2 Three-Standard-Deviations Rule Almost all the observations in any data set lie within three standard deviations to either side of the mean. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 27

Section 3.3 Chebyshev s Rule and the Empirical Rule Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 28

Key Fact 3.3 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 29

Key Fact 3.4 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 30

Figure 3.9 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 31

Table 3.12 PCB concentrations, in parts per million, of 60 pelican eggs 12387 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 32

Figure 3.10 Histogram of the PCB-concentration data Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 33

Figure 3.10 The mean and one, two, and three standard deviations to either side of the mean for the PCB-concentration data. Recall: x = 206.45, s = 66.42 43 out of the 60 observations, or 71.7%, lie within one standard deviation to either side of the mean. 57 out of the 60 observations, or 95%, lie within two standard deviations to either side of the mean. 60 out of the 60 observations, or 100%, lie within three standard deviations to either side of the mean. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 34

Do Exercise 3.139 on Page 124 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 35

Section 3.4 The Five-Number Summary; Boxplots Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 36

Figure 3.12 Quartiles for (a) uniform, (b) bell-shaped, (c) right-skewed, and (d) left-skewed distributions Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 37

Definition 3.7 Quartiles First, arrange the data in increasing order. Next, determine the median. Then, divide the (ordered) data set into two halves, a bottom half and a top half; if the number of observations is odd, include the median in both halves. The first quartile (Q 1 ) is the median of the bottom half of the data set. The second quartile (Q 2 ) is the median of the entire data set. The third quartile (Q 3 ) is the median of the top half of the data set. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 38

Procedure 3.1 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 39

Definition 3.8 Interquartile Range The interquartile range, or IQR, is the difference between the first and third quartiles; that is, IQR = Q 3 Q 1. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 40

Definition 3.9 Five-Number Summary The five-number summary of a data set is Min, Q 1, Q 2, Q 3, Max. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 41

Definition 3.10 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 42

Procedure 3.2 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 43

Figure 3.14 Constructing a boxplot for TV viewing times in Table 3.13 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 44

Figure 3.15 Boxplots for the data in Table 3.15 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 45

Figure 3.16 Boxplots for (a) right-skewed, (b) symmetric, and (d) left-skewed distributions Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 46

Section 3.5 Descriptive Measures for Populations; Use of Samples Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 47

Definition 3.11 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 48

Definition 3.12 Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 49

Figure 3.18 Population and sample for bolt diameters Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 50

Definition 3.13 Parameter and Statistic Parameter: A descriptive measure for a population. Statistic: A descriptive measure for a sample. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 51

Definition 3.14 & 3.15 z-score For an observed value of a variable x, the corresponding value of the standardized variable z is called the z-score of the observation. The term standard score is often used instead of z-score. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 52