Numerical Measurements

Similar documents
Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,

Chapter 3 Descriptive Statistics: Numerical Measures Part A

CHAPTER 2 Describing Data: Numerical

Numerical Descriptions of Data

3.1 Measures of Central Tendency

ECON 214 Elements of Statistics for Economists

Engineering Mathematics III. Moments

Simple Descriptive Statistics

Fundamentals of Statistics

PSYCHOLOGICAL STATISTICS

Descriptive Statistics

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1

Measures of Central tendency

Statistics I Chapter 2: Analysis of univariate data

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

appstats5.notebook September 07, 2016 Chapter 5

Measures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Measure of Variation

2 DESCRIPTIVE STATISTICS

Section3-2: Measures of Center

Description of Data I

DATA SUMMARIZATION AND VISUALIZATION

Measures of Dispersion (Range, standard deviation, standard error) Introduction

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Some estimates of the height of the podium

Averages and Variability. Aplia (week 3 Measures of Central Tendency) Measures of central tendency (averages)

Population Mean GOALS. Characteristics of the Mean. EXAMPLE Population Mean. Parameter Versus Statistics. Describing Data: Numerical Measures

Some Characteristics of Data

Unit 2 Statistics of One Variable

STATS DOESN T SUCK! ~ CHAPTER 4

1 Describing Distributions with numbers

Statistics 114 September 29, 2012

SUMMARY STATISTICS EXAMPLES AND ACTIVITIES

Chapter 5: Summarizing Data: Measures of Variation

Diploma in Business Administration Part 2. Quantitative Methods. Examiner s Suggested Answers

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis

Moments and Measures of Skewness and Kurtosis

DESCRIPTIVE STATISTICS

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

SOLUTIONS TO THE LAB 1 ASSIGNMENT

Chapter 6 Simple Correlation and

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).

STAT 113 Variability

Chapter 4 Variability

Descriptive Analysis

VARIABILITY: Range Variance Standard Deviation

Normal Probability Distributions

Lecture 1: Review and Exploratory Data Analysis (EDA)

STATISTICS STUDY NOTES UNIT I MEASURES OF CENTRAL TENDENCY DISCRETE SERIES. Direct Method. N Short-cut Method. X A f d N Step-Deviation Method

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Descriptive Statistics

Applications of Data Dispersions

David Tenenbaum GEOG 090 UNC-CH Spring 2005

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.

Contents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii)

4. DESCRIPTIVE STATISTICS

Statistics vs. statistics

Measures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section

Modern Methods of Data Analysis - SS 2009

DESCRIPTIVE STATISTICS II. Sorana D. Bolboacă

Numerical Descriptive Measures. Measures of Center: Mean and Median

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Definitions Chapter 4. Section 4-3. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman 1 Monday, March 18, 2013

KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI

6.2 Normal Distribution. Normal Distributions

1) 3 points Which of the following is NOT a measure of central tendency? a) Median b) Mode c) Mean d) Range

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION

CABARRUS COUNTY 2008 APPRAISAL MANUAL

Section-2. Data Analysis

IOP 201-Q (Industrial Psychological Research) Tutorial 5

3.3-Measures of Variation

Mini-Lecture 3.1 Measures of Central Tendency

2 Exploring Univariate Data

Chapter 3-Describing Data: Numerical Measures

Quantitative Analysis and Empirical Methods

1 A Brief History of. Chapter. Risk and Return. Dollar Returns. PercentReturn. Learning Objectives. A Brief History of Risk and Return

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

Misleading Graphs. Examples Compare unlike quantities Truncate the y-axis Improper scaling Chart Junk Impossible to interpret

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

Copyright 2005 Pearson Education, Inc. Slide 6-1

Numerical summary of data

Refer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.

Basic Procedure for Histograms

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form:

2011 Pearson Education, Inc

Lecture 2 Describing Data

8. From FRED, search for Canada unemployment and download the unemployment rate for all persons 15 and over, monthly,

Chapter 4. The Normal Distribution

Continuous Probability Distributions

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar

Frequency Distribution and Summary Statistics

Monte Carlo Simulation (Random Number Generation)

Describing Data: One Quantitative Variable

Statistics I Final Exam, 24 June Degrees in ADE, DER-ADE, ADE-INF, FICO, ECO, ECO-DER.

Transcription:

El-Shorouk Academy Acad. Year : 2013 / 2014 Higher Institute for Computer & Information Technology Term : Second Year : Second Department of Computer Science Statistics & Probabilities Section # 3 umerical Measurements Thirdly: Measures of Dispersion (variation): Measures of variation give information on the spread or variability of the data values. The dispersion of a distribution reveals how the observations are spread out or scattered on each side of the center. To measure the dispersion, scatter, or variation of a distribution is as important as to locate the central tendency. If the dispersion is small, it indicates high uniformity of the observations in the distribution. Absence of dispersion in the data indicates perfect uniformity. This situation arises when all observations in the distribution are identical. If this were the case, description of any single observation would suffice. A measure of dispersion appears to serve two purposes. First, it is one of the most important quantities used to characterize a frequency distribution. Ta.M.3ed Statistics & Probabilities (Section (3)) Page 1 of 9

Second, it affords a basis of comparison between two or more frequency distributions. The study of dispersion bears its importance from the fact that various distributions may have exactly the same averages, but substantial differences in their variability. Absolute Measures of Dispersion 1. Range Simplest measure of variation. Difference between the largest and the smallest observations: Range = X largest X smallest Ignores the way in which data are distributed. Extremely sensitive to extreme values (outliers). Ta.M.3ed Statistics & Probabilities (Section (3)) Page 2 of 9

2. Inter-quartile Range A measure similar to the special range (Q) is the inter-quartile range. Eliminate some high- and low-valued observations and calculate the range from the remaining values. Is not sensitive to extreme values (outliers). Inter-quartile range = 3rd quartile 1st quartile IQR = Q 3 Q 1 3. Quartile deviation (Semi-interquartile Range) The inter-quartile range is frequently reduced to the measure of semi-interquartile range, known as the quartile deviation (QD), by dividing it by 2. QD = (Q3 Q1) / 2 4. Variance and standard deviation The variance is the sum of the squared deviations from the mean divided by the number of cases. Variance has properties making it useful for certain statistical analysis. The standard deviation is the positive square root of the meansquare deviations of the observations from their arithmetic mean. Standard deviation is in same units as variable, more readily interpreted. Standard deviation is measure of absolute deviation. Standard deviation is very sensitive to extreme values (outliers). Ta.M.3ed Statistics & Probabilities (Section (3)) Page 3 of 9

Why square differences between data values and mean? o Gives positive values. o Gives more weight to larger differences. o Has desirable statistical properties. Why n - 1 for sample variance? Ungrouped Data o Dividing by n underestimates population variance. o Dividing by n-1 gives unbiased estimate of population variance. For Population Standard deviation σ = Variance σ 2 = i=1 x i μ 2 i=1 (x i μ ) 2 For Sample Standard deviation S = Variance S 2 = n x i x 2 i=1 n 1 n x i x 2 i=1 n 1 Ta.M.3ed Statistics & Probabilities (Section (3)) Page 4 of 9

5. Mean deviation The mean deviation is an average of absolute deviations of individual observations from the central value of a series. Ungrouped Data MD = i=1 x i μ Relative Measures of Dispersion To compare the extent of variation of different distributions whether having differing or identical units of measurements, it is necessary to consider some other measures that reduce the absolute deviation in some relative form. These measures are usually expressed in the form of coefficients and are pure numbers, independent of the unit of measurements. 1. Coefficient of variation A coefficient of variation is computed as a ratio of the standard deviation of the distribution to the mean of the same distribution. CV = σ μ 100 2. Coefficient of quartile deviation The coefficient of quartile deviation is computed from the first and the third quartiles using the following formula: Ta.M.3ed Statistics & Probabilities (Section (3)) Page 5 of 9

Coefficient of quartile deviation = Q 3 Q 1 Q 3 + Q 1 100 Fourthly: Measures of Distribution: Skewness measures The term skewness refers to the lack of symmetry. The lack of symmetry in a distribution is always determined with reference to a normal or Gaussian distribution. ote that a normal distribution is always symmetrical. The skewness may be either positive or negative. When the skewness of a distribution is positive (negative), the distribution is called a positively (negatively) skewed distribution. Absence of skewness makes a distribution symmetrical. It is important to emphasize that skewness of a distribution cannot be determined simply my inspection. Skewness measures degree of symmetry of distribution. Useful when comparing sample distributions with different shapes. Useful in data analysis. Many distributions are not symmetrical. They may be tail off to right or to the left and as such said to be skewed. The simplest measure of skewness is the Pearson s coefficient of skewness: Pearson' s coefficient of skewness Pearson' s coefficient of Bowley's coefficien t of Mean- Mode Standard deviation 3 1 Ta.M.3ed Statistics & Probabilities (Section (3)) Page 6 of 9 3 Mean- Median skewness Standard deviation Q skewness 3 Q1 2 Q Q M e

ote that: If Mean > Mode, the skewness is positive. If Mean < Mode, the skewness is negative. If Mean = Mode, the skewness is zero. Interpretation of standard deviation (The Empirical Rule): If distribution of data approximately bell shaped, Then About 68% of the data fall within one standard deviation of the mean. Ta.M.3ed Statistics & Probabilities (Section (3)) Page 7 of 9

About 95% of the data fall within two standard deviations of the mean. About 99.7% of the data fall within three standard deviations of the mean. Ta.M.3ed Statistics & Probabilities (Section (3)) Page 8 of 9

Example (1) Here are the EPA city gas mileage figures for two-seat-car models available in the United States in 1992: 10 17 13 32 22 15 8 16 28 17 18 15 21 15 24 9 17 25 13 Find: 1) Range? 2) Inter-quartile Range? 3) Quartile deviation (Semi-interquartile Range)? 4) Variance? 5) Standard deviation? 6) Mean deviation? 7) Coefficient of variation? 8) Coefficient of quartile deviation? 9) Skewness coefficient? Reference These sections based on: - Triola, ELEMETARY STATISTICS, Eighth Edition. Copyright 2001. Addison Wesley Longman. - Akm Saiful Islam, Data Management and Statistical Analysis, Bangladesh University. Ta.M.3ed Statistics & Probabilities (Section (3)) Page 9 of 9