MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION

Size: px
Start display at page:

Download "MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION"

Transcription

1 MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION 1 Day 3 Summer

2 DISTRIBUTION Symmetry Modality 单峰, 双峰 Skewness 正偏或负偏 Kurtosis 2

3 3 CHAPTER 4 Measures of Central Tendency 集中趋势

4 One major purpose of statistical procedures is to summarize raw data in a meaningful way to make some conclusions. e.g. You wonder how the students in your colleague s class are doing in the final exam this year. There is a number you REALLY want to know: Statistics that describe central tendency are numerical values that describe the center of a distribution of scores for a variable. 4

5 CENTRAL TENDENCY Three common measures of central tendency: Mode 众数 Median 中数 Mean 平均数 5

6 the most common measure of central tendency is the mean; the statistical notation for the mean is: µ when we are dealing with populations X X X = N when we are dealing with samples The median can be an observed number or not. How about the mode? 6

7 MEAN AND MEDIAN

8 8

9 MODE 9

10 10

11 COMPARE AND CONTRAST The more symmetric a distribution is, the closer these three measures of central tendency will be If a distribution is truly normal (symmetric and unimodal), then the mean, median, and mode will be exactly the same Unfortunately, this rarely happens. We must choose a measure that best suits our purposes and data. 11

12 ADVANTAGES AND DISADVANTAGES - MEAN Advantages: The mean can be defined mathematically with a simple equation and can easily be manipulated algebraically. Is the most stable estimate of the central tendency of population than would the sample medians or modes Disadvantages: Influenced by the extreme values. (Very sensitive to outliers.) The sample mean may not be an actual value observed in the data. 12

13 ADVANTAGES AND DISADVANTAGES - MEDIAN Advantages: It is unaffected by extreme scores (outliers) Disadvantages: Depends on the sample of data and is not easily generalized to the greater population Does not enter statistical equations readily and therefore more difficult to work with than the mean. may not be an actual value observed in the data. 13

14 SOME ADVANTAGES AND DISADVANTAGES - MODE Advantages: Any randomly selected observation, X i, is more likely to be the mode than any other score. It is the only measure of central tendency that can be used with nominal data. Is not affected by extreme scores Disadvantages: Depends on the sample of data and may not be representative of the population Can depend on the way the data is grouped Cannot be defined in simple mathematical equation 14

15 15 CHAPTER 5 Measures of Variability 分散趋势 / 变异性

16 VARIABILITY / DISPERSION 变异性 Variability is defined as how the data is distributed around a measure of central tendency (e.g.mean) Measures of variability describe the way and degree to which the data is spread Measures of variability quantify how similar the scores in a sample are to one another. 16

17 CONSIDER THE FOLLOWING: Two classes were assigned to the same teacher. In the first class, kids come with various background; in the second class, all the kids come from a family where at least one parent is a teacher/professor. How similar do you expect the pretest scores within the two groups to be? 17

18 THE RESULTING DATA 18

19 THE DATA FROM A GRAPHICAL PERSPECTIVE Class1- more variability Class 2- less variability 19

20 MEASURES OF VARIABILITY The range 全距 The interquartile range 四分位距 Deviation 离差 Average deviation Mean Absolute deviation Variance 方差 Standard deviation 标准差 20

21 RANGE 全距 The distance between the lowest and highest value. Data from the previous example: The range can be heavily influenced by extreme scores. 21

22 THE INTERQUARTILE RANGE 四分位距 The interquartile range is the range of the middle 50% of the observations. A trimmed statistic: how much from the lower end and the upper end respectively? Calculated by taking the difference between the 75 th percentile and 25 th percentile. The interquartile range has the opposite problem as the range it gets rid of too much of the data Percentile: the percentage of observations that are below a particular score value. 22

23 FINDING THE INTERQUARTILE RANGE: 23

24 DEVIATION 离差 The difference between every data point and the mean The average deviation The mean absolute deviation, m.a.d. Variance Standard deviation / SD 24

25 AVERAGE DEVIATION We could find value. N i ( ) d = X X i for each observed di i= 1 Then use = mean( di ) to look at on average how N far the observations are from the mean. While, the logic is sound, the average deviances for any sample will always be equal to zero --- Why? 25

26 There are two ways to eliminate problems connected with the positive and negative deviances Take the absolute value of the deviances (ignore the sign) or MAD Square each deviance, since the square of a negative number is positive 26

27 MAD Mean absolute deviation MAD = X N X Not convenient for statistical manipulation i 27

28 VARIANCE We start by finding how each observed value differs from the mean: To get rid of the negative deviances, we square each of these values: Then, we sum the squared deviances (often called the sum of squares ) Calculate the average. ( X X) i ( X X) 2 N i ( X ) 2 i X i= 1 28

29 VARIANCE: FINAL EQUATIONS N σ 2 i= 1 x s = = 2 i= 1 x ( X ) i X n N ( X ) i X n

30 STANDARD DEVIATION- SD 标准方差 Because we squared the deviations while calculating the variance, we have altered the original scale. This makes the variance difficult to interpret. To convert this back to the original scale, we take the square root called the standard deviation. σ is the population standard deviation s is the sample standard deviation Think of SD as a measure of how far our data values deviate from the mean, on average 30

31 STANDARD DEVIATION: FINAL EQUATIONS N σ = i= 1 x s x = ( X X) i n ( X X) i i= 1 N n

32 OUR EXAMPLE 32

33 BACK TO OUR EXAMPLE A loose interpretation: Class 1 deviated, either positively or negatively, on average, 24 points from the mean Class 2 deviated, either positively or negatively, on average, 12 points from the mean In general, we can conclude that the values in class 2 tend to be more similar to one another (homogeneous) than that of class 1. Interpretation in terms of our example: Teachers kids all performed very similarly, whereas those from other families were much more variable in the performance. 33

34 CHARACTERISTICS OF SD Basically a measure of the average of the deviations of each score from the mean. Can be used to build confidence intervals to see how many scores fall below or above the mean --- more on this in Chapter6. 34

35 DON T BE SCARED. Definitional Computational N i i n i i X N i N i x X n i n i x X N X s n σ = = = = = = ( ) ( ) N i i x n i i x X X N X X s n σ = = = =

36 THE PERPETUAL QUESTION: WHY DIVIDE BY n-1 FOR SAMPLE STATISTICS? Adjustment to produce an unbiased estimate 无偏估计. 1. Concrete examples in the book. Gravetter & Wallnau P Seeing the statistics Howell p

37 SEEING THE STATISTICS atisticsapplets/applets.html The true mean of the population is 50, and SD is Now we will sample from this population... 37

38 REPRESENTING DISTRIBUTIONS WITH GRAPHICS --- BOXPLOT A boxplot ( or box and whisker plot) includes a measure of central tendency (the median) and a measure of dispersion (the interquartile range) Hinges= 1 st and 3 rd quartiles= 25 th and 75 th quantile H-spread: the range between the two quartiles Whisker: 1.5*H-spread from the top and bottom of the box 38

39 BOXPLOT: HOWELL The whiskers stop at the farthest numbers observable in the data set but no more than 1.5*H. Observed values above that point is marked as outliers. You see the full range of the upper whisker, but a very short lower whisker. Why? 39

40 OUR EXAMPLE IN SPSS At least two routes Graphs Boxplot Analyze Descriptive statistics Explore 40

41 KEY TERMS Describing distribution:4,,,. Measures of central tendency:3,, Measures of variability:2, Displaying distribution:1 41

42 BREAK 42

43 43 THE NORMAL DISTRIBUTION & Z-SCORES Summer

44 OVERVIEW Probability for discrete vs. continuous data The normal distribution Standard Normal Distribution z-transformations and z-scores Using z-scores to find probabilities 44

45 Think of discrete variables with the notion of a probability of a specific outcome We have a known number (100) of purple(10), red(40) & white(50) marbles what is the probability of choosing a red marble? 45

46 FREQUENCY, AREA, AND PROBABILITY FOR DISCRETE VARIABLES 40 % 10 % 50 % The pie chart to the left represents the frequency distribution of red, purple and white marbles in a bag. 46

47 We think of continuous variables with the idea of a probability of obtaining a value that falls within a range With the distribution of IQ scores that I collected for a study, what is probability that somebody will have an IQ score of 90? 47

48 IQ Score Ranges Frequency Proportion Cumulative Total

49 A PROBABLY NON-PROFESSIONAL WAY TO EXPLAIN Like with the pie chart earlier, we can relate area to probability. Think of the area as the interval area for each bar. Your answer? How many potential ranges could we create? What does this mean? 49

50 I AM INTERESTED IN THE SCORE 90: AN INTERVAL OF 20 POINTS / 3 GROUPS :31/50=

51 AN INTERVAL OF 10 POINTS / 6 GROUPS 90-99: 18/50=

52 WITH AN INTERVAL OF 5 POINTS/11 GROUPS 90-94: 6/50=

53 WITH AN INTERVAL OF 2 POINTS/50 GROUPS 90-92: 4/50=

54 A CHANGE OF CONCEPT The probability of exactly any single value is 0, because we can break down the intervals into finer and finer ones until infinity, meaning the bar size will become smaller and smaller until 0. But we want to talk about a specific value. We want to use the same probability to interpret the score we will use probability density function (PDF). An x value will correspond to only one PDF value that is kind of the frequency, and is the height of a point on the normal curve. How does this work? 54

55 PDF 55

56 GRAPHING THE PDF AND RELATE TO AREA Density Graphing probability density function IQ Scores Density Density Graphing probability density function IQ Scores Graphing probability density function IQ Scores 56

57 PROBABILITY DENSITY FUNCTION/ PDF 概率密度函数 f ( X ) where = σ 1 ( e) 2π π = 3.14 ( X µ ) 2 / For every x value, we can plug the value into the function and get a f(x) number, which corresponds to the height of the point on the normal curve corresponding to the X value, we call it density. This is the y value in your z-table. The largest y value is at the center of the normal distribution where z= ( ) / 2*11.33 E.g. f (90) = (2.718) = *3.14 e σ = 2 57

58 PERCENTILES Percentile: the point below which a specified percentage of scores in the distribution fall Percentile rank: the percentage of scores equal to or less than the given score. To get the percentile rank involves integration in calculus. You don t have to calculate for that, someone has already prepared the table for us ( z table). We just need to know how to use it. A percentile is a score, a percentile rank is a percentage. 58

59 NORMAL DISTRIBUTION 正态分布 Normal distribution is important because: Many dependent variables are assumed to be normally distributed in the population The sampling distribution of the mean is normally distributed ( more coming.) Many statistics models are based on an assumption of a normally distributed variable. 59

60 NORMAL DISTRIBUTION Density Graphing probability density function IQ Scores Bell-shaped curve Unimodal Symmetric mean, median and mode are all in the center Not skewed Extends from - to + The total area under the curve is 1 60

61 NORMAL DISTRIBUTION About 68%of the distribution lies within 1 SD of the mean, 95% lies within 2 SD of the mean and 99.7% of the distribution lies within 3 SD of the mean. We can immediately make some inferences. 61

62 STANDARD NORMAL DISTRIBUTION 标准正态分布 The standard normal distribution is just a special case of normal distribution with a mean=0 and SD=1. Any normal distribution can be transformed to be a standardized normal distribution. Why bother transforming, or standardizing a distribution? 62

63 HOW MANY TABLES DO WE NEED? For our IQ data, our mean is 97.74, SD=11.33, one SD below the mean is =86.41, one SD above the mean is = The percentile rank of 84.13% corresponds to a raw score of For SAT score, mean=500, SD=100, one SD below the mean is 400, one SD above the mean is 600. The percentile rank of 84.13% corresponds to a raw score of

64 STANDARDIZED SCORES 标准分 When we transform our variables to the z- distribution (the standard normal distribution), we are standardizing our scores. This essentially means we put all of our values on the same scale and end up with a distribution of mean=0 and SD=1. We call the process the z-transformation The standardized scores that come out of this process are called z-scores. 64

65 Z-SCORE TRANSFORMATION z i = X i σ µ X is our original data µ is the mean of the population σ is the population standard deviation The end result will be a set of standardized scores. All scores that are below the mean will be negative and all scores above the mean will be positive We can interpret the value of the z-score as how many standard deviation above or below the mean A z-score =1.0 is a score that is exactly 1 SD above the mean A z-score of -1.5 is score that is exactly 1.5 SD below the mean 65

66 Z-SCORE EXAMPLE Test score: Mean = 50 Standard deviation = 10 So the z-score if you received a 60 is z = X µ = σ and the z-score if you received a 45 is z = X µ = σ = = 5 10 = 1 =

67 SO? Now we can refer to the z-table to see what percentile a score value of 60 or 45 corresponds to. A full z-score table can be found in Howell p Table E-10. A z-score of 1 corresponds to a percentile of This means 84.13% of scores fall at or below a z-score of 1 or the raw score of 60. A z-score of -.5 corresponds to a percentile rank of This means 30.85% of scores fall at or below a z-score of -.5 or a raw score of

68 SUMMARY PDF is introduced to get to the probability for continuous variable. How to transform any scores within a distribution into a z score ( or to standardize the raw scores)? 68

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

Simple Descriptive Statistics

Simple Descriptive Statistics Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

Numerical Descriptions of Data

Numerical Descriptions of Data Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =

More information

Descriptive Statistics

Descriptive Statistics Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs

More information

Lecture Week 4 Inspecting Data: Distributions

Lecture Week 4 Inspecting Data: Distributions Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

Description of Data I

Description of Data I Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret

More information

STAT 113 Variability

STAT 113 Variability STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

3.1 Measures of Central Tendency

3.1 Measures of Central Tendency 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent

More information

2 Exploring Univariate Data

2 Exploring Univariate Data 2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12)

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Descriptive statistics: - Measures of centrality (Mean, median, mode, trimmed mean) - Measures of spread (MAD, Standard deviation, variance) -

More information

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda, MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, CONTENTS What is measures of dispersion? Why measures of dispersion? How measures of dispersions are calculated? Range Quartile

More information

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

Applications of Data Dispersions

Applications of Data Dispersions 1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given

More information

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Categorical. A general name for non-numerical data; the data is separated into categories of some kind. Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION Subject Paper No and Title Module No and Title Paper No.2: QUANTITATIVE METHODS Module No.7: NORMAL DISTRIBUTION Module Tag PSY_P2_M 7 TABLE OF CONTENTS 1. Learning Outcomes 2. Introduction 3. Properties

More information

4. DESCRIPTIVE STATISTICS

4. DESCRIPTIVE STATISTICS 4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in

More information

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2] 1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders

More information

The Normal Distribution

The Normal Distribution 5.1 Introduction to Normal Distributions and the Standard Normal Distribution Section Learning objectives: 1. How to interpret graphs of normal probability distributions 2. How to find areas under the

More information

STAT 157 HW1 Solutions

STAT 157 HW1 Solutions STAT 157 HW1 Solutions http://www.stat.ucla.edu/~dinov/courses_students.dir/10/spring/stats157.dir/ Problem 1. 1.a: (6 points) Determine the Relative Frequency and the Cumulative Relative Frequency (fill

More information

Descriptive Analysis

Descriptive Analysis Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

The Normal Distribution

The Normal Distribution Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we

More information

IOP 201-Q (Industrial Psychological Research) Tutorial 5

IOP 201-Q (Industrial Psychological Research) Tutorial 5 IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,

More information

Frequency Distribution and Summary Statistics

Frequency Distribution and Summary Statistics Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary

More information

Chapter 4 Variability

Chapter 4 Variability Chapter 4 Variability PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter and Larry B. Wallnau Chapter 4 Learning Outcomes 1 2 3 4 5

More information

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data. -3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data

More information

Section3-2: Measures of Center

Section3-2: Measures of Center Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number

More information

The normal distribution is a theoretical model derived mathematically and not empirically.

The normal distribution is a theoretical model derived mathematically and not empirically. Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.

More information

Chapter 3 Descriptive Statistics: Numerical Measures Part A

Chapter 3 Descriptive Statistics: Numerical Measures Part A Slides Prepared by JOHN S. LOUCKS St. Edward s University Slide 1 Chapter 3 Descriptive Statistics: Numerical Measures Part A Measures of Location Measures of Variability Slide Measures of Location Mean

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

Measures of Dispersion (Range, standard deviation, standard error) Introduction

Measures of Dispersion (Range, standard deviation, standard error) Introduction Measures of Dispersion (Range, standard deviation, standard error) Introduction We have already learnt that frequency distribution table gives a rough idea of the distribution of the variables in a sample

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, 2013 Abstract Review summary statistics and measures of location. Discuss the placement exam as an exercise

More information

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25 Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example

More information

Exploring Data and Graphics

Exploring Data and Graphics Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data

More information

Numerical Descriptive Measures. Measures of Center: Mean and Median

Numerical Descriptive Measures. Measures of Center: Mean and Median Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where

More information

Putting Things Together Part 2

Putting Things Together Part 2 Frequency Putting Things Together Part These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for, and are in

More information

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment

More information

Measures of Central tendency

Measures of Central tendency Elementary Statistics Measures of Central tendency By Prof. Mirza Manzoor Ahmad In statistics, a central tendency (or, more commonly, a measure of central tendency) is a central or typical value for a

More information

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 11: BUSINESS STATISTICS I Semester 04 Major Exam #1 Sunday March 7, 005 Please circle your instructor

More information

Data Distributions and Normality

Data Distributions and Normality Data Distributions and Normality Definition (Non)Parametric Parametric statistics assume that data come from a normal distribution, and make inferences about parameters of that distribution. These statistical

More information

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go

More information

Normal Probability Distributions

Normal Probability Distributions Normal Probability Distributions Properties of Normal Distributions The most important probability distribution in statistics is the normal distribution. Normal curve A normal distribution is a continuous

More information

We will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed.

We will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed. We will discuss the normal distribution in greater detail in our unit on probability. However, as it is often of use to use exploratory data analysis to determine if the sample seems reasonably normally

More information

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values

More information

Some estimates of the height of the podium

Some estimates of the height of the podium Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc COUNSELLING PSYCHOLOGY (2011 Admission Onwards) II Semester Complementary Course PSYCHOLOGICAL STATISTICS QUESTION BANK 1. The process of grouping

More information

Numerical Measurements

Numerical Measurements El-Shorouk Academy Acad. Year : 2013 / 2014 Higher Institute for Computer & Information Technology Term : Second Year : Second Department of Computer Science Statistics & Probabilities Section # 3 umerical

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 3 Presentation of Data: Numerical Summary Measures Part 2 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017

More information

CSC Advanced Scientific Programming, Spring Descriptive Statistics

CSC Advanced Scientific Programming, Spring Descriptive Statistics CSC 223 - Advanced Scientific Programming, Spring 2018 Descriptive Statistics Overview Statistics is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions.

More information

Chapter 6 Simple Correlation and

Chapter 6 Simple Correlation and Contents Chapter 1 Introduction to Statistics Meaning of Statistics... 1 Definition of Statistics... 2 Importance and Scope of Statistics... 2 Application of Statistics... 3 Characteristics of Statistics...

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows

More information

Introduction to Statistics I

Introduction to Statistics I Introduction to Statistics I Keio University, Faculty of Economics Continuous random variables Simon Clinet (Keio University) Intro to Stats November 1, 2018 1 / 18 Definition (Continuous random variable)

More information

Monte Carlo Simulation (Random Number Generation)

Monte Carlo Simulation (Random Number Generation) Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...

More information

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.) Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop

More information

5.1 Mean, Median, & Mode

5.1 Mean, Median, & Mode 5.1 Mean, Median, & Mode definitions Mean: Median: Mode: Example 1 The Blue Jays score these amounts of runs in their last 9 games: 4, 7, 2, 4, 10, 5, 6, 7, 7 Find the mean, median, and mode: Example 2

More information

Fundamentals of Statistics

Fundamentals of Statistics CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct

More information

DESCRIPTIVE STATISTICS

DESCRIPTIVE STATISTICS DESCRIPTIVE STATISTICS INTRODUCTION Numbers and quantification offer us a very special language which enables us to express ourselves in exact terms. This language is called Mathematics. We will now learn

More information

2 DESCRIPTIVE STATISTICS

2 DESCRIPTIVE STATISTICS Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled

More information

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good

More information

DESCRIPTIVE STATISTICS II. Sorana D. Bolboacă

DESCRIPTIVE STATISTICS II. Sorana D. Bolboacă DESCRIPTIVE STATISTICS II Sorana D. Bolboacă OUTLINE Measures of centrality Measures of spread Measures of symmetry Measures of localization Mainly applied on quantitative variables 2 DESCRIPTIVE STATISTICS

More information

Statistics vs. statistics

Statistics vs. statistics Statistics vs. statistics Question: What is Statistics (with a capital S)? Definition: Statistics is the science of collecting, organizing, summarizing and interpreting data. Note: There are 2 main ways

More information

Empirical Rule (P148)

Empirical Rule (P148) Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall

More information

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,

More information

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good

More information

Terms & Characteristics

Terms & Characteristics NORMAL CURVE Knowledge that a variable is distributed normally can be helpful in drawing inferences as to how frequently certain observations are likely to occur. NORMAL CURVE A Normal distribution: Distribution

More information

NOTES: Chapter 4 Describing Data

NOTES: Chapter 4 Describing Data NOTES: Chapter 4 Describing Data Intro to Statistics COLYER Spring 2017 Student Name: Page 2 Section 4.1 ~ What is Average? Objective: In this section you will understand the difference between the three

More information

Continuous Probability Distributions & Normal Distribution

Continuous Probability Distributions & Normal Distribution Mathematical Methods Units 3/4 Student Learning Plan Continuous Probability Distributions & Normal Distribution 7 lessons Notes: Students need practice in recognising whether a problem involves a discrete

More information

Lecture 9. Probability Distributions. Outline. Outline

Lecture 9. Probability Distributions. Outline. Outline Outline Lecture 9 Probability Distributions 6-1 Introduction 6- Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7- Properties of the Normal Distribution

More information

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x

More information

Descriptive Statistics (Devore Chapter One)

Descriptive Statistics (Devore Chapter One) Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf

More information

David Tenenbaum GEOG 090 UNC-CH Spring 2005

David Tenenbaum GEOG 090 UNC-CH Spring 2005 Simple Descriptive Statistics Review and Examples You will likely make use of all three measures of central tendency (mode, median, and mean), as well as some key measures of dispersion (standard deviation,

More information

Measures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms

Measures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms Measures of Central Tendency: Ungrouped Data Measures of central tendency yield information about particular places or locations in a group of numbers. Common Measures of Location Mode Median Percentiles

More information

Lecture 9. Probability Distributions

Lecture 9. Probability Distributions Lecture 9 Probability Distributions Outline 6-1 Introduction 6-2 Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7-2 Properties of the Normal Distribution

More information

Edexcel past paper questions

Edexcel past paper questions Edexcel past paper questions Statistics 1 Chapters 2-4 (Discrete) Statistics 1 Chapters 2-4 (Discrete) Page 1 Stem and leaf diagram Stem-and-leaf diagrams are used to represent data in its original form.

More information

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10 ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10 Fall 2011 Lecture 8 Part 2 (Fall 2011) Probability Distributions Lecture 8 Part 2 1 / 23 Normal Density Function f

More information

DATA HANDLING Five-Number Summary

DATA HANDLING Five-Number Summary DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest

More information

Lecture Data Science

Lecture Data Science Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Statistics Foundations JProf. Dr. Claudia Wagner Learning Goals How to describe sample data? What is mode/median/mean?

More information

Introduction to Descriptive Statistics

Introduction to Descriptive Statistics Introduction to Descriptive Statistics 17.871 Types of Variables ~Nominal (Quantitative) Nominal (Qualitative) categorical Ordinal Interval or ratio Describing data Moment Non-mean based measure Center

More information

Statistics I Chapter 2: Analysis of univariate data

Statistics I Chapter 2: Analysis of univariate data Statistics I Chapter 2: Analysis of univariate data Numerical summary Central tendency Location Spread Form mean quartiles range coeff. asymmetry median percentiles interquartile range coeff. kurtosis

More information

MAS187/AEF258. University of Newcastle upon Tyne

MAS187/AEF258. University of Newcastle upon Tyne MAS187/AEF258 University of Newcastle upon Tyne 2005-6 Contents 1 Collecting and Presenting Data 5 1.1 Introduction...................................... 5 1.1.1 Examples...................................

More information

Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual.

Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual. Chapter 06: The Standard Deviation as a Ruler and the Normal Model This is the worst chapter title ever! This chapter is about the most important random variable distribution of them all the normal distribution.

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information