David Tenenbaum GEOG 090 UNC-CH Spring 2005

Similar documents
Basic Procedure for Histograms

Simple Descriptive Statistics

Some Characteristics of Data

Measures of Central tendency

Fundamentals of Statistics

Measures of Dispersion (Range, standard deviation, standard error) Introduction

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

3.1 Measures of Central Tendency

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Engineering Mathematics III. Moments

Averages and Variability. Aplia (week 3 Measures of Central Tendency) Measures of central tendency (averages)

Descriptive Analysis

Statistics 114 September 29, 2012

Terms & Characteristics

The Mode: An Example. The Mode: An Example. Measure of Central Tendency: The Mode. Measure of Central Tendency: The Median

Frequency Distribution and Summary Statistics

Numerical Descriptions of Data

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form:

Chapter 3 Descriptive Statistics: Numerical Measures Part A

CSC Advanced Scientific Programming, Spring Descriptive Statistics

IOP 201-Q (Industrial Psychological Research) Tutorial 5

MATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1)

Measures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence

2 DESCRIPTIVE STATISTICS

PSYCHOLOGICAL STATISTICS

Moments and Measures of Skewness and Kurtosis

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION

Statistics vs. statistics

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

ECON 214 Elements of Statistics for Economists

Unit 2 Statistics of One Variable

Measures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms

AP Statistics Chapter 6 - Random Variables

DESCRIPTIVE STATISTICS II. Sorana D. Bolboacă

Descriptive Statistics for Educational Data Analyst: A Conceptual Note

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Statistics I Chapter 2: Analysis of univariate data

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Establishing a framework for statistical analysis via the Generalized Linear Model

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

STATS DOESN T SUCK! ~ CHAPTER 4

Refer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.

DESCRIPTIVE STATISTICS

2 Exploring Univariate Data

CHAPTER 2 Describing Data: Numerical

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1

Empirical Rule (P148)

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

UNIT 4 NORMAL DISTRIBUTION: DEFINITION, CHARACTERISTICS AND PROPERTIES

2.1 Properties of PDFs

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar

Numerical summary of data

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Lecture 07: Measures of central tendency

Descriptive Statistics

1/12/2011. Chapter 5: z-scores: Location of Scores and Standardized Distributions. Introduction to z-scores. Introduction to z-scores cont.

Lectures delivered by Prof.K.K.Achary, YRC

Normal Model (Part 1)

Lecture Week 4 Inspecting Data: Distributions

DATA SUMMARIZATION AND VISUALIZATION

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Population Mean GOALS. Characteristics of the Mean. EXAMPLE Population Mean. Parameter Versus Statistics. Describing Data: Numerical Measures

A CLEAR UNDERSTANDING OF THE INDUSTRY

Description of Data I

Data Analysis. BCF106 Fundamentals of Cost Analysis

4. DESCRIPTIVE STATISTICS

Numerical Descriptive Measures. Measures of Center: Mean and Median

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,

Review: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59.

1 Describing Distributions with numbers

Descriptive Statistics in Analysis of Survey Data

Data Distributions and Normality

Lecture Data Science

Monetary Economics Measuring Asset Returns. Gerald P. Dwyer Fall 2015

Chapter 4 Variability

Getting to know a data-set (how to approach data) Overview: Descriptives & Graphing

MA 1125 Lecture 05 - Measures of Spread. Wednesday, September 6, Objectives: Introduce variance, standard deviation, range.

Numerical Measurements

NCSS Statistical Software. Reference Intervals

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment

Copyright 2005 Pearson Education, Inc. Slide 6-1

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Descriptive Statistics

( ) P = = =

Lecture 9. Probability Distributions. Outline. Outline

CABARRUS COUNTY 2008 APPRAISAL MANUAL

Lecture 2 Describing Data

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

The Normal Distribution & Descriptive Statistics. Kin 304W Week 2: Jan 15, 2012

Lecture 9. Probability Distributions

DESCRIBING DATA: MESURES OF LOCATION

Lecture 18 Section Mon, Feb 16, 2009

Chapter 5: Summarizing Data: Measures of Variation

appstats5.notebook September 07, 2016 Chapter 5

The Normal Probability Distribution

Transcription:

Simple Descriptive Statistics Review and Examples You will likely make use of all three measures of central tendency (mode, median, and mean), as well as some key measures of dispersion (standard deviation, z- scores, and the coefficient of variation), along with the statistics that describe the shape of a distribution (skewness and kurtosis) at some point if you work with numeric data sets in an academic or research context In this lecture, we will review the procedures for calculating these statistics, and work through an example for each of the statistics (using a small data set, smaller than those that are typically found in research applications)

Measures of Central Tendency - Review 1. Mode This is the most frequently occurring value in the distribution 2. Median This is the value of a variable such that half of the observations are above and half are below this value i.e. this value divides the distribution into two groups of equal size 3. Mean a.k.a. average, the most commonly used measure of central tendency

Measures of Central Tendency - Review 1. Mode This is the most frequently occurring value in the distribution Procedure for finding the mode of a data set: 1) Sort the data, putting the values in ascending order 2) Count the instances of each value (if this is continuous data with a high degree of precision and many decimal places, this may be quite tedious) 3) Find the value that has the most occurrences this is the mode (if more than one value occurs an equal number of times and these exceed all other counts, we have multiple modes) Use the mode for multi-modal or nominal data sets

Measures of Central Tendency - Review 2. Median - ½ of the values are above & ½ below this value Procedure for finding the median of a data set: 1) Sort the data, putting the values in ascending order 2) Find the value with an equal number of values above and below it (if there are an even number of values, you will need to average two values together): Odd number of observations [(n-1)/2]+1 values from the lowest, e.g. n=19 [(19-1)/2]+1 = 10 th value Even number of observations average the (n/2) and [(n/2)+1] values, e.g. n=20 average the 10 th and 11 th Use the median with assymetric distributions, when you suspect outliers are present, or with ordinal data

Measures of Central Tendency - Review 3. Mean a.k.a. average, the most commonly used measure of central tendency Procedure for finding the mean of a data set: 1) Sum all the values in the data set 2) Divide the sum by the number of values in the data set x = i=n Σ x i i=1 n Use the mean when you have interval or ratio data sets with a large sample size, few (or no?) outliers, and a reasonably symmetric unimodal distribution

Measures of Central Tendency - Review An example data set: Daily low temperatures recorded in Chapel Hill from January 18, 2005 through January 31, 2005 in degrees Fahrenheit: Jan. 18 11 degrees Jan. 25 25 degrees Jan. 19 11 degrees Jan. 26 33 degrees Jan. 20 25 degrees Jan. 27 22 degrees Jan. 21 29 degrees Jan. 28 18 degrees Jan. 22 27 degrees Jan. 29 19 degrees Jan. 23 14 degrees Jan. 30 30 degrees Jan. 24 11 degrees Jan. 31 27 degrees For these 14 values, we will calculate all three measures of central tendency - the mode, median, and mean

Measures of Central Tendency - Review 1. Mode Find the most frequently occurring value 1) Sort the data, putting the values in ascending order: 11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33 2) Count the instances of each value: 11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33 3x 1x 1x 1x 1x 2x 2x 1x 1x 1x 3) Find the value that has the most occurrences: In this case, the mode is 11 degrees Fahrenheit, but is this a good measure of the central tendency of this data? Had there only been two days with a recorded temperature of 11 degrees, what would be the mode?

Measures of Central Tendency - Review 2. Median - ½ of the values are above & ½ below this value 1) Sort the data, putting the values in ascending order: 11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33 2) Find the value with an equal number of values above and below it (if there are an even number of values, you will need to average two values together): Even number of observations average the (n/2) and [(n/2)+1] values Here, n=14 average the (14/2) and [(14/2)+1] values, i.e. the 7 th and 8 th values (22+25)/2 = 23.5 degrees F Here, the median is 23.5 degrees F is this a good measure of central tendency for this data?

Measures of Central Tendency - Review 3. Mean a.k.a. average, the most commonly used measure of central tendency i=n 1) Sum all the values in the data set Σ x i i=1 11 + 11 + 11 + 14 + 18 + 19 + 22 + 25 + 25 + 27 + 27 + 29 + 30 + 33 = 302 2) Divide the sum by the number of values in the data set Here, n=14, so calculate the mean using 302/14 = 21.57 The mean is 21.57 degrees F is this a good measure of central tendency for this data set?

Measures of Dispersion Review 1. Standard Deviation This is the most frequently used measure of dispersion because it has the same units as the values and their mean 2. Z-scores These express the difference from the mean in terms of standard deviations of an individual value, and thus can be compared to z-scores drawn from other data sets or distributions 3. Coefficient of Variation This is an overall measure of dispersion that is normalized with respect to the mean from the same distribution, and thus is comparable to coefficients of variation from other data sets because it is a normalized measure of dispersion

Measures of Dispersion Review 1. Standard Deviation Standard deviation is calculated by taking the square root of variance: σ = i=n Σ (x i µ) 2 i=1 N Population standard deviation S = i=n Σi=1 (x i x) 2 n - 1 Sample standard deviation Why do we prefer standard deviation over variance as a measure of dispersion? Magnitude of values and units match means.

Measures of Dispersion - Review 1. Standard Deviation This is the most frequently used measure of dispersion because it has the same units as the values and their mean (unlike variance) Procedure for finding the standard deviation of a data set: 1) Calculate the mean 2) Calculate the statistical distances (x i x) for each value 3) Square each of the statistical distances (x i x) 2 4) Sum the squared statistical distances, the sum of squares 5) Divide the sum of squares by N for a population or by (n-1) for a sample this gives you the variance 6) Take the square root of the variance to get the standard deviation

Measures of Dispersion - Review 2. Z-scores These express the difference from the mean in terms of standard deviations of an individual value, and thus can be compared to z-scores drawn from other data sets or distributions Procedure for finding the z-score of an observation: 1) Calculate the mean 2) Calculate the statistical distances (x i x) for each value where we wish find the z-score 3) Calculate the standard deviation 4) Calculate the z-score using the formula Z-score = x - x S

Measures of Dispersion - Review 3. Coefficient of Variation This is an overall measure of dispersion that is normalized with respect to the mean from the same distribution, and thus is comparable to coefficients of variation from other data sets because it is a normalized measure of dispersion Procedure for finding the coef. of variation for a data set: 1) Calculate the mean 2) Calculate the standard deviation 3) Calculate the coefficient of variation using the formula S σ Coefficient of variation = or (*100%) x µ

Measures of Dispersion - Review We will use the same example data set: Daily low CH temps. Jan. 18-31, 2005 in degrees F: Jan. 18 11 degrees Jan. 25 25 degrees Jan. 19 11 degrees Jan. 26 33 degrees Jan. 20 25 degrees Jan. 27 22 degrees Jan. 21 29 degrees Jan. 28 18 degrees Jan. 22 27 degrees Jan. 29 19 degrees Jan. 23 14 degrees Jan. 30 30 degrees Jan. 24 11 degrees Jan. 31 27 degrees For these 14 values, we will calculate the three measures of dispersion listed above - the standard deviation, some z-scores and the coefficient of variation for this data set

Measures of Dispersion - Review 1. Standard Deviation This is the most frequently used measure of dispersion because it has the same units as the values and their mean (unlike variance) 1) Calculate the mean We have previously found the mean = 21.57 degrees F 2) Calculate the statistical distances (x i x) for each value Jan. 18 (11 21.57) = -10.57 Jan. 25 (25 21.57) = 3.43 Jan. 19 (11 21.57) = -10.57 Jan. 26 (33 21.57) = 11.43 Jan. 20 (25 21.57) = 3.43 Jan. 27 (22 21.57) = 0.43 Jan. 21 (29 21.57) = 7.43 Jan. 28 (18 21.57) = -3.57 Jan. 22 (27 21.57) = 5.43 Jan. 29 (19 21.57) = -2.57 Jan. 23 (14 21.57) = -7.57 Jan. 30 (30 21.57) = 8.42 Jan. 24 (11 21.57) = -10.57 Jan. 31 (27 21.57) = 5.42 I have rounded the values for display here to 2 decimal places, ideally you want to do as little rounding as possible

Measures of Dispersion - Review 1. Standard Deviation cont. 3) Square each of the statistical distances (x i x) 2 Jan. 18 (-10.57) 2 = 111.76 Jan. 25 (3.43) 2 = 11.76 Jan. 19 (-10.57) 2 = 111.76 Jan. 26 (11.43) 2 = 130.61 Jan. 20 (3.43) 2 = 11.76 Jan. 27 (0.43) 2 = 0.18 Jan. 21 (7.43) 2 = 55.18 Jan. 28 (-3.57) 2 = 12.76 Jan. 22 (5.43) 2 = 29.57 Jan. 29 (-2.57) 2 = 6.61 Jan. 23 (7.57) 2 = 57.33 Jan. 30 (8.43) 2 = 71.04 Jan. 24 (-10.57) 2 = 111.76 Jan. 31 (5.43) 2 = 29.57 4) Sum the squared statistical distances, the sum of squares Sum of Squares = i=n Σ (x i x)2 = 751.43 i=1

Measures of Dispersion - Review 1. Standard Deviation cont. 5) Divide the sum of squares by N for a population or by (n-1) for a sample this gives you the variance Here, our sample n =14, so 751.43/(14-1) = 57.8 6) Take the square root of the variance to calculate the standard deviation Taking the square root of our variance (57.8) gives us the standard deviation for our data set 57.8 = 7.6

Measures of Dispersion - Review 2. Z-scores We will calculate z-scores for the lowest and highest temperatures in our sample (11 and 33 degrees) 1) Calculate the mean We have previously found the mean = 21.57 degrees F 2) Calculate the statistical distances (x i x) for each value where we wish find the z-score We have already calculated these statistical distances: Jan. 18 (11 21.57) = -10.57 Jan. 26 (33 21.57) = 11.43 3) Calculate the standard deviation We have already calculated the standard deviation for our data set and found it to be = 7.6 degrees

Measures of Dispersion - Review 2. Z-scores cont. 4) Calculate the z-score using the formula Z-score = x - x S i.e. divide the statistical distances by the standard deviation Jan. 18-10.57 / 7.6 = -1.39 Jan. 26 11.43 / 7.6 = 1.5 If we had another set of minimum temperatures from a previous January (from 2004, for example), we could calculate the z-scores for values from that data set, and make a reasonable comparison to these values

Measures of Dispersion - Review 3. Coefficient of Variation This is a normalized measure of dispersion for the variation throughout a data set 1) Calculate the mean We have previously found the mean = 21.57 degrees F 2) Calculate the standard deviation We have previously found the std. dev. = 7.6 degrees F 3) Calculate the coefficient of variation using the formula S σ Coefficient of variation = or (*100%) x µ Using the example values: 7.6/21.57 = 0.3524 or 35.24% This value could be compared with that from 2004 etc.

Skewness and Kurtosis - Review 1. Skewness This statistic measures the degree of asymmetry exhibited by the data (i.e. whether there are more observations on one side of the mean than the other) 2. Kurtosis This statistic measures the degree to which the distribution is flat or peaked

Skewness and Kurtosis - Review 1. Skewness This statistic measures the degree of asymmetry exhibited by the data (i.e. whether there are more observations on one side of the mean than the other): Skewness = i=n Σi=1 (x i x) 3 ns 3 Because the exponent in this moment is odd, skewness can be positive or negative; positive skewness has more observations below the mean than above it (negative vice-versa)

Skewness and Kurtosis - Review 1. Skewness This statistic measures the degree of asymmetry exhibited by the data Procedure for finding the skewness of a data set: 1) Calculate the mean 2) Calculate the statistical distances (x i x) for each value 3) Cube each of the statistical distances (x i x) 3 4) Sum the cubed statistical distances, the sum of cubes (i.e. this is the numerator in the skewness formula) 5) Divide the sum of cubes by the sample size multiplied by the standard deviation cubes (i.e. the denominator is n*s 3 in [Σ (x i x) 3 ] / [ n*s 3 ])

Skewness and Kurtosis - Review 2. Kurtosis This statistic measures how flat or peaked the distribution is, and is formulated as: i=n Σi=1 (x i x) 4 Kurtosis = ns 4-3 The 3 is included in this formula because it results in the kurtosis of a normal distribution to have the value 0 (this condition is also termed having a mesokurtic distribution)

Skewness and Kurtosis - Review 2. Kurtosis This statistic measures how flat or peaked the distribution is Procedure for finding the kurtosis of a data set: 1) Calculate the mean 2) Calculate the statistical distances (x i x) for each value 3) Raise each of the statistical distances to the 4 th power, i.e. (x i x) 4 4) Sum the statistical distances to the 4 th power Σ (x i x) 4 5) Divide the sum by the sample size multiplied by the standard deviation raised to the 4 th power (i.e. the denominator is n*s 4 in [Σ (x i x) 4 ] / [ n*s 4 ]) 6) Subtract 3 from [Σ (x i x) 4 ] / [ n*s 4 ]

Skewness & Kurtosis - Review We will use the same example data set: Daily low CH temps. Jan. 18-31, 2005 in degrees F: Jan. 18 11 degrees Jan. 25 25 degrees Jan. 19 11 degrees Jan. 26 33 degrees Jan. 20 25 degrees Jan. 27 22 degrees Jan. 21 29 degrees Jan. 28 18 degrees Jan. 22 27 degrees Jan. 29 19 degrees Jan. 23 14 degrees Jan. 30 30 degrees Jan. 24 11 degrees Jan. 31 27 degrees Using these 14 values, we will calculate the two distribution shape descriptive statistics listed above, the skewness and kurtosis for this data set

Skewness & Kurtosis - Review 1. Skewness This statistic measures the degree of asymmetry exhibited by the data 1) Calculate the mean We have previously found the mean = 21.57 degrees F 2) Calculate the statistical distances (x i x) for each value We have previously calculated the statistical distances: Jan. 18 (11 21.57) = -10.57 Jan. 25 (25 21.57) = 3.43 Jan. 19 (11 21.57) = -10.57 Jan. 26 (33 21.57) = 11.43 Jan. 20 (25 21.57) = 3.43 Jan. 27 (22 21.57) = 0.43 Jan. 21 (29 21.57) = 7.43 Jan. 28 (18 21.57) = -3.57 Jan. 22 (27 21.57) = 5.43 Jan. 29 (19 21.57) = -2.57 Jan. 23 (14 21.57) = -7.57 Jan. 30 (30 21.57) = 8.42 Jan. 24 (11 21.57) = -10.57 Jan. 31 (27 21.57) = 5.42

Skewness & Kurtosis - Review 1. Skewness cont. 3) Cube each of the statistical distances (x i x) 3 Jan. 18 (-10.57) 3 = -1181.41 Jan. 25 (3.43) 3 = 40.3 Jan. 19 (-10.57) 3 = -1181.41 Jan. 26 (11.43) 3 = 1492.71 Jan. 20 (3.43) 3 = 40.3 Jan. 27 (0.43) 3 = 0.08 Jan. 21 (7.43) 3 = 409.94 Jan. 28 (-3.57) 3 = -45.55 Jan. 22 (5.43) 3 = 159.98 Jan. 29 (-2.57) 3 = -17 Jan. 23 (7.57) 3 = -434.04 Jan. 30 (8.43) 3 = 598.77 Jan. 24 (-10.57) 3 = -1181.41 Jan. 31 (5.43) 3 = 159.98 4) Sum the cubed statistical distances, the sum of cubes Sum of cubes = i=n Σ (x i x)3 = -1138.78 i=1

Skewness & Kurtosis - Review 1. Skewness cont. 5) Divide the sum of cubes (-1138.78) by n*s 3 (S=7.6 from above): Σ (x -1138.78 14*(7.6) = -1138.78 i x) 3 = n*s 3 14*438.98 = -1138.78 3 6145.72 = -0.1851 The negative value of skewness indicates that our sample distribution has greater frequencies at the higher values of temperature (although interpreting skewness with a sample this small and a distribution that is not really normally shaped is somewhat of a stretch )

Skewness & Kurtosis - Review 2. Kurtosis This statistic measures the degree to which the distribution is flat or peaked 1) Calculate the mean We have previously found the mean = 21.57 degrees F 2) Calculate the statistical distances (x i x) for each value We have previously calculated the statistical distances: Jan. 18 (11 21.57) = -10.57 Jan. 25 (25 21.57) = 3.43 Jan. 19 (11 21.57) = -10.57 Jan. 26 (33 21.57) = 11.43 Jan. 20 (25 21.57) = 3.43 Jan. 27 (22 21.57) = 0.43 Jan. 21 (29 21.57) = 7.43 Jan. 28 (18 21.57) = -3.57 Jan. 22 (27 21.57) = 5.43 Jan. 29 (19 21.57) = -2.57 Jan. 23 (14 21.57) = -7.57 Jan. 30 (30 21.57) = 8.42 Jan. 24 (11 21.57) = -10.57 Jan. 31 (27 21.57) = 5.42

Skewness & Kurtosis - Review 2. Kurtosis cont. 3) Raise each of the statistical distances to the 4 th power (x i x) 4 Jan. 18 (-10.57) 4 = 12489.2 Jan. 25 (3.43) 4 = 138.18 Jan. 19 (-10.57) 4 = 12489.2 Jan. 26 (11.43) 4 = 17059.56 Jan. 20 (3.43) 4 = 138.18 Jan. 27 (0.43) 4 = 0.03 Jan. 21 (7.43) 4 = 3045.24 Jan. 28 (-3.57) 4 = 162.69 Jan. 22 (5.43) 4 = 868.44 Jan. 29 (-2.57) 4 = 43.72 Jan. 23 (7.57) 4 = 3286.33 Jan. 30 (8.43) 4 = 5046.8 Jan. 24 (-10.57) 4 = 12489.2 Jan. 31 (5.43) 4 = 868.44 4) Sum the statistical distances raised to the 4 th power Sum of 4 th powers = i=n Σ (x i x)4 = 68125.24 i=1

Skewness & Kurtosis - Review 2. Kurtosis cont. 5) Divide the sum of 4 th powers (68125.24) by n*s 4 (S=7.6 from above): Σ (x 68125.24 14*(7.6) = 68125.24 i x) 4 = n*s 4 14*3341.1 = 68125.24 4 46775.32 = 1.4564 6) Subtract 3 from [Σ (x i x) 4 ] / [ n*s 4 ] Using our values, the kurtosis is 1.4564 3 = -1.5436 Because this kurtosis is <0, this sample has a platykurtic distribution meaning the curve is flatter than a normal curve (but caveats to interpretation apply)