David Tenenbaum GEOG 090 UNC-CH Spring 2005
1 Simple Descriptive Statistics Review and Examples
You will likely make use of all three measures of central tendency (mode, median, and mean), some key measures of dispersion (standard deviation, z-scores, and the coefficient of variation), and the statistics that describe the shape of a distribution (skewness and kurtosis) at some point if you work with numeric data sets in an academic or research context.
In this lecture, we will review the procedures for calculating these statistics and work through an example of each, using a small data set (smaller than those typically found in research applications).
2 Measures of Central Tendency - Review
1. Mode: the most frequently occurring value in the distribution
2. Median: the value of a variable such that half of the observations are above it and half are below it, i.e. the value that divides the distribution into two groups of equal size
3. Mean: a.k.a. the average, the most commonly used measure of central tendency
3 Measures of Central Tendency - Review
1. Mode: the most frequently occurring value in the distribution
Procedure for finding the mode of a data set:
1) Sort the data, putting the values in ascending order
2) Count the instances of each value (for continuous data with a high degree of precision and many decimal places, this may be quite tedious)
3) Find the value that has the most occurrences; this is the mode (if more than one value shares the highest count, there are multiple modes)
Use the mode for multi-modal or nominal data sets
4 Measures of Central Tendency - Review
2. Median: ½ of the values are above and ½ are below this value
Procedure for finding the median of a data set:
1) Sort the data, putting the values in ascending order
2) Find the value with an equal number of values above and below it (if there is an even number of values, you will need to average two values together):
Odd number of observations: take the value [(n-1)/2]+1 places from the lowest, e.g. n=19 gives [(19-1)/2]+1 = the 10th value
Even number of observations: average the (n/2)th and [(n/2)+1]th values, e.g. n=20 means averaging the 10th and 11th values
Use the median with asymmetric distributions, when you suspect outliers are present, or with ordinal data
5 Measures of Central Tendency - Review
3. Mean: a.k.a. the average, the most commonly used measure of central tendency
Procedure for finding the mean of a data set:
1) Sum all the values in the data set
2) Divide the sum by the number of values in the data set
$\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}$
Use the mean when you have interval or ratio data sets with a large sample size, few (or no?) outliers, and a reasonably symmetric unimodal distribution
6 Measures of Central Tendency - Review
An example data set: daily low temperatures recorded in Chapel Hill from January 18, 2005 through January 31, 2005, in degrees Fahrenheit:
Jan. 18   11 degrees    Jan. 25   25 degrees
Jan. 19   11 degrees    Jan. 26   33 degrees
Jan. 20   25 degrees    Jan. 27   22 degrees
Jan. 21   29 degrees    Jan. 28   18 degrees
Jan. 22   27 degrees    Jan. 29   19 degrees
Jan. 23   14 degrees    Jan. 30   30 degrees
Jan. 24   11 degrees    Jan. 31   27 degrees
For these 14 values, we will calculate all three measures of central tendency: the mode, median, and mean
7 Measures of Central Tendency - Review
1. Mode: find the most frequently occurring value
1) Sort the data, putting the values in ascending order:
11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33
2) Count the instances of each value:
11 (3x), 14 (1x), 18 (1x), 19 (1x), 22 (1x), 25 (2x), 27 (2x), 29 (1x), 30 (1x), 33 (1x)
3) Find the value that has the most occurrences:
In this case, the mode is 11 degrees Fahrenheit, but is this a good measure of the central tendency of this data? Had there only been two days with a recorded temperature of 11 degrees, what would be the mode?
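The counting procedure above can be sketched in a few lines of Python. This is only an illustration: the function name `modes` is hypothetical, and the second call simply drops one of the 11s from the sorted list to mimic the "only two days at 11 degrees" question.

```python
from collections import Counter

def modes(values):
    """Return every value tied for the highest count, in ascending order."""
    counts = Counter(values)          # step 2: count the instances of each value
    top = max(counts.values())        # step 3: the highest count
    return sorted(v for v, c in counts.items() if c == top)

# The 14 daily lows from the example, already sorted (step 1).
temps = [11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33]

print(modes(temps))      # [11]
print(modes(temps[1:]))  # [11, 25, 27] once one of the 11s is dropped
```

Returning a list rather than a single number keeps the multi-modal case visible instead of silently picking one of the tied values.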
8 Measures of Central Tendency - Review
2. Median: ½ of the values are above and ½ are below this value
1) Sort the data, putting the values in ascending order:
11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33
2) Find the value with an equal number of values above and below it (if there is an even number of values, you will need to average two values together):
Even number of observations: average the (n/2)th and [(n/2)+1]th values
Here, n=14, so average the (14/2)th and [(14/2)+1]th values, i.e. the 7th and 8th values: (22+25)/2 = 23.5 degrees F
Here, the median is 23.5 degrees F; is this a good measure of central tendency for this data?
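As a minimal sketch of the odd/even rule, assuming a hypothetical helper named `median` (Python's standard library offers `statistics.median` for the same job):

```python
def median(values):
    """Median via the slide's rule: middle value, or the mean of the two middle values."""
    ordered = sorted(values)              # step 1: ascending order
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the [(n-1)/2]+1 th value (1-based) is index n//2 (0-based).
        return ordered[mid]
    # Even n: average the (n/2)th and [(n/2)+1]th values (1-based).
    return (ordered[mid - 1] + ordered[mid]) / 2

temps = [11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33]
print(median(temps))  # 23.5
```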
9 Measures of Central Tendency - Review
3. Mean: a.k.a. the average, the most commonly used measure of central tendency
1) Sum all the values in the data set: $\sum_{i=1}^{n} x_i = 302$
2) Divide the sum by the number of values in the data set
Here, n=14, so calculate the mean using 302/14 = 21.57
The mean is 21.57 degrees F; is this a good measure of central tendency for this data set?
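The two steps translate directly into code. The variable names below are illustrative only:

```python
temps = [11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33]

total = sum(temps)          # step 1: sum all the values
mean = total / len(temps)   # step 2: divide by the number of values

print(total, round(mean, 2))  # 302 21.57
```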
10 Measures of Dispersion - Review
1. Standard Deviation: the most frequently used measure of dispersion because it has the same units as the values and their mean
2. Z-scores: these express the difference of an individual value from the mean in terms of standard deviations, and thus can be compared to z-scores drawn from other data sets or distributions
3. Coefficient of Variation: an overall measure of dispersion that is normalized with respect to the mean of the same distribution, and thus is comparable to coefficients of variation from other data sets
11 Measures of Dispersion - Review
1. Standard Deviation
Standard deviation is calculated by taking the square root of the variance:
Population standard deviation: $\sigma = \sqrt{\frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}}$
Sample standard deviation: $S = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}}$
Why do we prefer standard deviation over variance as a measure of dispersion? Its magnitude and units match those of the values and their mean.
12 Measures of Dispersion - Review
1. Standard Deviation: the most frequently used measure of dispersion because it has the same units as the values and their mean (unlike variance)
Procedure for finding the standard deviation of a data set:
1) Calculate the mean
2) Calculate the statistical distances $(x_i - \bar{x})$ for each value
3) Square each of the statistical distances, $(x_i - \bar{x})^2$
4) Sum the squared statistical distances to get the sum of squares
5) Divide the sum of squares by N for a population or by (n-1) for a sample; this gives you the variance
6) Take the square root of the variance to get the standard deviation
13 Measures of Dispersion - Review
2. Z-scores: these express the difference of an individual value from the mean in terms of standard deviations, and thus can be compared to z-scores drawn from other data sets or distributions
Procedure for finding the z-score of an observation:
1) Calculate the mean
2) Calculate the statistical distance $(x_i - \bar{x})$ for each value for which we wish to find the z-score
3) Calculate the standard deviation
4) Calculate the z-score using the formula
Z-score = $\frac{x_i - \bar{x}}{S}$
14 Measures of Dispersion - Review
3. Coefficient of Variation: an overall measure of dispersion that is normalized with respect to the mean of the same distribution, and thus is comparable to coefficients of variation from other data sets
Procedure for finding the coefficient of variation for a data set:
1) Calculate the mean
2) Calculate the standard deviation
3) Calculate the coefficient of variation using the formula
Coefficient of variation = $\frac{S}{\bar{x}}$ or $\frac{\sigma}{\mu}$ (*100%)
15 Measures of Dispersion - Review
We will use the same example data set: daily low Chapel Hill temperatures, Jan. 18-31, 2005, in degrees F:
Jan. 18   11 degrees    Jan. 25   25 degrees
Jan. 19   11 degrees    Jan. 26   33 degrees
Jan. 20   25 degrees    Jan. 27   22 degrees
Jan. 21   29 degrees    Jan. 28   18 degrees
Jan. 22   27 degrees    Jan. 29   19 degrees
Jan. 23   14 degrees    Jan. 30   30 degrees
Jan. 24   11 degrees    Jan. 31   27 degrees
For these 14 values, we will calculate the three measures of dispersion listed above: the standard deviation, some z-scores, and the coefficient of variation
16 Measures of Dispersion - Review
1. Standard Deviation: the most frequently used measure of dispersion because it has the same units as the values and their mean (unlike variance)
1) Calculate the mean
We have previously found the mean = 21.57 degrees F
2) Calculate the statistical distances $(x_i - \bar{x})$ for each value
Jan. 18 (11 - 21.57) = -10.57    Jan. 25 (25 - 21.57) = 3.43
Jan. 19 (11 - 21.57) = -10.57    Jan. 26 (33 - 21.57) = 11.43
Jan. 20 (25 - 21.57) = 3.43      Jan. 27 (22 - 21.57) = 0.43
Jan. 21 (29 - 21.57) = 7.43      Jan. 28 (18 - 21.57) = -3.57
Jan. 22 (27 - 21.57) = 5.43      Jan. 29 (19 - 21.57) = -2.57
Jan. 23 (14 - 21.57) = -7.57     Jan. 30 (30 - 21.57) = 8.43
Jan. 24 (11 - 21.57) = -10.57    Jan. 31 (27 - 21.57) = 5.43
I have rounded the values for display here to 2 decimal places; ideally you want to do as little rounding as possible
17 Measures of Dispersion - Review
1. Standard Deviation cont.
3) Square each of the statistical distances, $(x_i - \bar{x})^2$
Jan. 18 (-10.57)^2 = 111.76    Jan. 25 (3.43)^2 = 11.76
Jan. 19 (-10.57)^2 = 111.76    Jan. 26 (11.43)^2 = 130.61
Jan. 20 (3.43)^2 = 11.76       Jan. 27 (0.43)^2 = 0.18
Jan. 21 (7.43)^2 = 55.18       Jan. 28 (-3.57)^2 = 12.76
Jan. 22 (5.43)^2 = 29.47       Jan. 29 (-2.57)^2 = 6.61
Jan. 23 (-7.57)^2 = 57.33      Jan. 30 (8.43)^2 = 71.04
Jan. 24 (-10.57)^2 = 111.76    Jan. 31 (5.43)^2 = 29.47
4) Sum the squared statistical distances to get the sum of squares
Sum of squares = $\sum_{i=1}^{n} (x_i - \bar{x})^2$ = 751.43
18 Measures of Dispersion - Review
1. Standard Deviation cont.
5) Divide the sum of squares by N for a population or by (n-1) for a sample; this gives you the variance
Here, our sample n=14, so 751.43/(14-1) = 57.8
6) Take the square root of the variance to calculate the standard deviation
Taking the square root of our variance (57.8) gives us the standard deviation for our data set: $\sqrt{57.8} = 7.6$
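The six steps can be followed one line at a time in code. This is a sketch under stated assumptions: the function name `std_dev` and its `sample` flag are hypothetical, and the function returns the intermediate sum of squares and variance only so they can be compared with the worked numbers.

```python
import math

def std_dev(values, sample=True):
    """Return (standard deviation, variance, sum of squares) following the six steps."""
    n = len(values)
    mean = sum(values) / n                        # 1) calculate the mean
    distances = [x - mean for x in values]        # 2) statistical distances
    squares = [d ** 2 for d in distances]         # 3) square each distance
    sum_of_squares = sum(squares)                 # 4) sum of squares
    divisor = (n - 1) if sample else n            # 5) n-1 for a sample, N for a population
    variance = sum_of_squares / divisor
    return math.sqrt(variance), variance, sum_of_squares   # 6) square root of the variance

temps = [11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33]
s, variance, ss = std_dev(temps)
print(round(ss, 2), round(variance, 1), round(s, 1))  # 751.43 57.8 7.6
```

Because the code never rounds the intermediate distances, it follows the advice to round only for display.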
19 Measures of Dispersion - Review
2. Z-scores: we will calculate z-scores for the lowest and highest temperatures in our sample (11 and 33 degrees)
1) Calculate the mean
We have previously found the mean = 21.57 degrees F
2) Calculate the statistical distance $(x_i - \bar{x})$ for each value for which we wish to find the z-score
We have already calculated these statistical distances:
Jan. 18 (11 - 21.57) = -10.57    Jan. 26 (33 - 21.57) = 11.43
3) Calculate the standard deviation
We have already calculated the standard deviation for our data set and found it to be 7.6 degrees
20 Measures of Dispersion - Review
2. Z-scores cont.
4) Calculate the z-score using the formula Z-score = $\frac{x_i - \bar{x}}{S}$, i.e. divide the statistical distances by the standard deviation
Jan. 18: -10.57 / 7.6 = -1.39
Jan. 26: 11.43 / 7.6 = 1.5
If we had another set of minimum temperatures from a previous January (from 2004, for example), we could calculate the z-scores for values from that data set and make a reasonable comparison to these values
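A short sketch of the same calculation, using the standard library's `statistics.mean` and `statistics.stdev` (the latter is the sample standard deviation, dividing by n-1). The helper name `z_score` is hypothetical.

```python
import statistics

temps = [11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33]
mean = statistics.mean(temps)
s = statistics.stdev(temps)   # sample standard deviation (divides by n - 1)

def z_score(x):
    """Statistical distance from the mean, in units of standard deviations."""
    return (x - mean) / s

z_low = z_score(min(temps))    # 11 degrees
z_high = z_score(max(temps))   # 33 degrees
print(round(z_low, 2), round(z_high, 2))  # -1.39 1.5
```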
21 Measures of Dispersion - Review
3. Coefficient of Variation: a normalized measure of dispersion for the variation throughout a data set
1) Calculate the mean
We have previously found the mean = 21.57 degrees F
2) Calculate the standard deviation
We have previously found the std. dev. = 7.6 degrees F
3) Calculate the coefficient of variation using the formula
Coefficient of variation = $\frac{S}{\bar{x}}$ or $\frac{\sigma}{\mu}$ (*100%)
Using the example values: 7.6/21.57 ≈ 0.3524 or 35.24% (computed from the unrounded mean and standard deviation)
This value could be compared with that from 2004, etc.
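For illustration, the ratio can be computed without any intermediate rounding. The function name `coefficient_of_variation` is an assumption, and the sample standard deviation is used to match the worked example.

```python
import statistics

def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (unitless)."""
    return statistics.stdev(values) / statistics.mean(values)

temps = [11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33]
cv = coefficient_of_variation(temps)
print(f"{cv:.4f} or {cv:.2%}")  # 0.3524 or 35.24%
```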
22 Skewness and Kurtosis - Review
1. Skewness: this statistic measures the degree of asymmetry exhibited by the data (i.e. whether there are more observations on one side of the mean than the other)
2. Kurtosis: this statistic measures the degree to which the distribution is flat or peaked
23 Skewness and Kurtosis - Review
1. Skewness: this statistic measures the degree of asymmetry exhibited by the data (i.e. whether there are more observations on one side of the mean than the other):
Skewness = $\frac{\sum_{i=1}^{n} (x_i - \bar{x})^3}{n s^3}$
Because the exponent in this moment is odd, skewness can be positive or negative; positive skewness has more observations below the mean than above it (and vice versa for negative skewness)
24 Skewness and Kurtosis - Review
1. Skewness: this statistic measures the degree of asymmetry exhibited by the data
Procedure for finding the skewness of a data set:
1) Calculate the mean
2) Calculate the statistical distances $(x_i - \bar{x})$ for each value
3) Cube each of the statistical distances, $(x_i - \bar{x})^3$
4) Sum the cubed statistical distances to get the sum of cubes (this is the numerator in the skewness formula)
5) Divide the sum of cubes by the sample size multiplied by the standard deviation cubed (i.e. the denominator is $n s^3$ in $\frac{\sum (x_i - \bar{x})^3}{n s^3}$)
25 Skewness and Kurtosis - Review
2. Kurtosis: this statistic measures how flat or peaked the distribution is, and is formulated as:
Kurtosis = $\frac{\sum_{i=1}^{n} (x_i - \bar{x})^4}{n s^4} - 3$
The 3 is subtracted so that a normal distribution has a kurtosis of 0 (such a distribution is termed mesokurtic)
26 Skewness and Kurtosis - Review
2. Kurtosis: this statistic measures how flat or peaked the distribution is
Procedure for finding the kurtosis of a data set:
1) Calculate the mean
2) Calculate the statistical distances $(x_i - \bar{x})$ for each value
3) Raise each of the statistical distances to the 4th power, i.e. $(x_i - \bar{x})^4$
4) Sum the statistical distances raised to the 4th power, $\sum (x_i - \bar{x})^4$
5) Divide the sum by the sample size multiplied by the standard deviation raised to the 4th power (i.e. the denominator is $n s^4$ in $\frac{\sum (x_i - \bar{x})^4}{n s^4}$)
6) Subtract 3 from $\frac{\sum (x_i - \bar{x})^4}{n s^4}$
27 Skewness & Kurtosis - Review
We will use the same example data set: daily low Chapel Hill temperatures, Jan. 18-31, 2005, in degrees F:
Jan. 18   11 degrees    Jan. 25   25 degrees
Jan. 19   11 degrees    Jan. 26   33 degrees
Jan. 20   25 degrees    Jan. 27   22 degrees
Jan. 21   29 degrees    Jan. 28   18 degrees
Jan. 22   27 degrees    Jan. 29   19 degrees
Jan. 23   14 degrees    Jan. 30   30 degrees
Jan. 24   11 degrees    Jan. 31   27 degrees
Using these 14 values, we will calculate the two descriptive statistics for distribution shape listed above: the skewness and the kurtosis
28 Skewness & Kurtosis - Review
1. Skewness: this statistic measures the degree of asymmetry exhibited by the data
1) Calculate the mean
We have previously found the mean = 21.57 degrees F
2) Calculate the statistical distances $(x_i - \bar{x})$ for each value
We have previously calculated the statistical distances:
Jan. 18 (11 - 21.57) = -10.57    Jan. 25 (25 - 21.57) = 3.43
Jan. 19 (11 - 21.57) = -10.57    Jan. 26 (33 - 21.57) = 11.43
Jan. 20 (25 - 21.57) = 3.43      Jan. 27 (22 - 21.57) = 0.43
Jan. 21 (29 - 21.57) = 7.43      Jan. 28 (18 - 21.57) = -3.57
Jan. 22 (27 - 21.57) = 5.43      Jan. 29 (19 - 21.57) = -2.57
Jan. 23 (14 - 21.57) = -7.57     Jan. 30 (30 - 21.57) = 8.43
Jan. 24 (11 - 21.57) = -10.57    Jan. 31 (27 - 21.57) = 5.43
29 Skewness & Kurtosis - Review
1. Skewness cont.
3) Cube each of the statistical distances, $(x_i - \bar{x})^3$
Jan. 18 (-10.57)^3 = -1181.41    Jan. 25 (3.43)^3 = 40.3
Jan. 19 (-10.57)^3 = -1181.41    Jan. 26 (11.43)^3 = 1492.71
Jan. 20 (3.43)^3 = 40.3          Jan. 27 (0.43)^3 = 0.08
Jan. 21 (7.43)^3 = 409.94        Jan. 28 (-3.57)^3 = -45.55
Jan. 22 (5.43)^3 = 159.98        Jan. 29 (-2.57)^3 = -17
Jan. 23 (-7.57)^3 = -434.04      Jan. 30 (8.43)^3 = 598.77
Jan. 24 (-10.57)^3 = -1181.41    Jan. 31 (5.43)^3 = 159.98
4) Sum the cubed statistical distances to get the sum of cubes
Sum of cubes = $\sum_{i=1}^{n} (x_i - \bar{x})^3$ = -1138.78
30 Skewness & Kurtosis - Review
1. Skewness cont.
5) Divide the sum of cubes (-1138.78) by $n s^3$ (S=7.6 from above):
$\frac{\sum (x_i - \bar{x})^3}{n s^3} = \frac{-1138.78}{14 \times (7.6)^3} = \frac{-1138.78}{14 \times 438.98} = \frac{-1138.78}{6145.66} \approx -0.185$
The negative value of skewness indicates that our sample distribution has greater frequencies at the higher values of temperature (although interpreting skewness with a sample this small and a distribution that is not really normally shaped is somewhat of a stretch)
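The five steps can be sketched as follows. The function name `skewness` is hypothetical, and the code deliberately follows the formula as given here (sample standard deviation in the denominator, multiplied by n); statistical packages often normalize the third moment slightly differently, so small discrepancies with library output are to be expected.

```python
import statistics

def skewness(values):
    """Sum of cubed distances divided by n * s^3, with s the sample standard deviation."""
    n = len(values)
    mean = statistics.mean(values)                         # 1) mean
    s = statistics.stdev(values)                           # sample std. dev. (n - 1)
    sum_of_cubes = sum((x - mean) ** 3 for x in values)    # 2)-4) distances, cubes, sum
    return sum_of_cubes / (n * s ** 3)                     # 5) divide by n * s^3

temps = [11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33]
print(round(skewness(temps), 3))  # -0.185
```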
31 Skewness & Kurtosis - Review
2. Kurtosis: this statistic measures the degree to which the distribution is flat or peaked
1) Calculate the mean
We have previously found the mean = 21.57 degrees F
2) Calculate the statistical distances $(x_i - \bar{x})$ for each value
We have previously calculated the statistical distances:
Jan. 18 (11 - 21.57) = -10.57    Jan. 25 (25 - 21.57) = 3.43
Jan. 19 (11 - 21.57) = -10.57    Jan. 26 (33 - 21.57) = 11.43
Jan. 20 (25 - 21.57) = 3.43      Jan. 27 (22 - 21.57) = 0.43
Jan. 21 (29 - 21.57) = 7.43      Jan. 28 (18 - 21.57) = -3.57
Jan. 22 (27 - 21.57) = 5.43      Jan. 29 (19 - 21.57) = -2.57
Jan. 23 (14 - 21.57) = -7.57     Jan. 30 (30 - 21.57) = 8.43
Jan. 24 (11 - 21.57) = -10.57    Jan. 31 (27 - 21.57) = 5.43
32 Skewness & Kurtosis - Review
2. Kurtosis cont.
3) Raise each of the statistical distances to the 4th power, $(x_i - \bar{x})^4$
Jan. 18 (-10.57)^4 = 12489.2    Jan. 25 (3.43)^4 = 138.18
Jan. 19 (-10.57)^4 = 12489.2    Jan. 26 (11.43)^4 = 17059.56
Jan. 20 (3.43)^4 = 138.18       Jan. 27 (0.43)^4 = 0.03
Jan. 21 (7.43)^4 = 3045.24      Jan. 28 (-3.57)^4 = 162.69
Jan. 22 (5.43)^4 = 868.44       Jan. 29 (-2.57)^4 = 43.72
Jan. 23 (-7.57)^4 = 3286.33     Jan. 30 (8.43)^4 = 5046.8
Jan. 24 (-10.57)^4 = 12489.2    Jan. 31 (5.43)^4 = 868.44
4) Sum the statistical distances raised to the 4th power
Sum of 4th powers = $\sum_{i=1}^{n} (x_i - \bar{x})^4$ = 68125.24
33 Skewness & Kurtosis - Review
2. Kurtosis cont.
5) Divide the sum of 4th powers (68125.24) by $n s^4$ (S=7.6 from above):
$\frac{\sum (x_i - \bar{x})^4}{n s^4} = \frac{68125.24}{14 \times (7.6)^4} = \frac{68125.24}{14 \times 3336.22} = \frac{68125.24}{46707.05} \approx 1.46$
6) Subtract 3 from $\frac{\sum (x_i - \bar{x})^4}{n s^4}$
Using our values, the kurtosis is 1.46 - 3 = -1.54
Because this kurtosis is <0, this sample has a platykurtic distribution, meaning the curve is flatter than a normal curve (but caveats to interpretation apply)
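A matching sketch for the six kurtosis steps, again assuming a hypothetical function name and following the formula used here (sample standard deviation, n in the denominator, then subtract 3). Library implementations may use a different normalization and so may report a slightly different value.

```python
import statistics

def kurtosis(values):
    """Sum of 4th-power distances divided by n * s^4, minus 3."""
    n = len(values)
    mean = statistics.mean(values)                           # 1) mean
    s = statistics.stdev(values)                             # sample std. dev. (n - 1)
    sum_of_4th = sum((x - mean) ** 4 for x in values)        # 2)-4) distances, 4th powers, sum
    return sum_of_4th / (n * s ** 4) - 3                     # 5)-6) divide, then subtract 3

temps = [11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33]
k = kurtosis(temps)
print(round(k, 2))                                 # -1.54
print("platykurtic" if k < 0 else "not platykurtic")
```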
Basic Procedure for Histograms
Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that
More informationSimple Descriptive Statistics
Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency
More informationSome Characteristics of Data
Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key
More informationMeasures of Central tendency
Elementary Statistics Measures of Central tendency By Prof. Mirza Manzoor Ahmad In statistics, a central tendency (or, more commonly, a measure of central tendency) is a central or typical value for a
More informationFundamentals of Statistics
CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct
More informationMeasures of Dispersion (Range, standard deviation, standard error) Introduction
Measures of Dispersion (Range, standard deviation, standard error) Introduction We have already learnt that frequency distribution table gives a rough idea of the distribution of the variables in a sample
More informationChapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1
Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and
More information3.1 Measures of Central Tendency
3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent
More informationOverview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution
PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations
More informationEngineering Mathematics III. Moments
Moments Mean and median Mean value (centre of gravity) f(x) x f (x) x dx Median value (50th percentile) F(x med ) 1 2 P(x x med ) P(x x med ) 1 0 F(x) x med 1/2 x x Variance and standard deviation
More informationAverages and Variability. Aplia (week 3 Measures of Central Tendency) Measures of central tendency (averages)
Chapter 4 Averages and Variability Aplia (week 3 Measures of Central Tendency) Chapter 5 (omit 5.2, 5.6, 5.8, 5.9) Aplia (week 4 Measures of Variability) Measures of central tendency (averages) Measures
More informationDescriptive Analysis
Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable
More informationStatistics 114 September 29, 2012
Statistics 114 September 29, 2012 Third Long Examination TGCapistrano I. TRUE OR FALSE. Write True if the statement is always true; otherwise, write False. 1. The fifth decile is equal to the 50 th percentile.
More informationTerms & Characteristics
NORMAL CURVE Knowledge that a variable is distributed normally can be helpful in drawing inferences as to how frequently certain observations are likely to occur. NORMAL CURVE A Normal distribution: Distribution
More informationThe Mode: An Example. The Mode: An Example. Measure of Central Tendency: The Mode. Measure of Central Tendency: The Median
Chapter 4: What is a measure of Central Tendency? Numbers that describe what is typical of the distribution You can think of this value as where the middle of a distribution lies (the median). or The value
More informationFrequency Distribution and Summary Statistics
Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary
More informationNumerical Descriptions of Data
Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =
More information1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form:
1 Exercise One Note that the data is not grouped! 1.1 Calculate the mean ROI Below you find the raw data in tabular form: Obs Data 1 18.5 2 18.6 3 17.4 4 12.2 5 19.7 6 5.6 7 7.7 8 9.8 9 19.9 10 9.9 11
More informationChapter 3 Descriptive Statistics: Numerical Measures Part A
Slides Prepared by JOHN S. LOUCKS St. Edward s University Slide 1 Chapter 3 Descriptive Statistics: Numerical Measures Part A Measures of Location Measures of Variability Slide Measures of Location Mean
More informationCSC Advanced Scientific Programming, Spring Descriptive Statistics
CSC 223 - Advanced Scientific Programming, Spring 2018 Descriptive Statistics Overview Statistics is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions.
More informationIOP 201-Q (Industrial Psychological Research) Tutorial 5
IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,
More informationMATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1)
LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) Descriptive statistics are ways of summarizing large sets of quantitative (numerical) information. The best way to reduce a set of
More informationMeasures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence
Measures of Variation Section -5 1 Waiting Times of Bank Customers at Different Banks in minutes Jefferson Valley Bank 6.5 6.6 6.7 6.8 7.1 7.3 7.4 Bank of Providence 4. 5.4 5.8 6. 6.7 8.5 9.3 10.0 Mean
More information2 DESCRIPTIVE STATISTICS
Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled
More informationPSYCHOLOGICAL STATISTICS
UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc COUNSELLING PSYCHOLOGY (2011 Admission Onwards) II Semester Complementary Course PSYCHOLOGICAL STATISTICS QUESTION BANK 1. The process of grouping
More informationMoments and Measures of Skewness and Kurtosis
Moments and Measures of Skewness and Kurtosis Moments The term moment has been taken from physics. The term moment in statistical use is analogous to moments of forces in physics. In statistics the values
More information9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives
Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical
More informationModule Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION
Subject Paper No and Title Module No and Title Paper No.2: QUANTITATIVE METHODS Module No.7: NORMAL DISTRIBUTION Module Tag PSY_P2_M 7 TABLE OF CONTENTS 1. Learning Outcomes 2. Introduction 3. Properties
More informationStatistics vs. statistics
Statistics vs. statistics Question: What is Statistics (with a capital S)? Definition: Statistics is the science of collecting, organizing, summarizing and interpreting data. Note: There are 2 main ways
More informationChapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)
Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop
More informationECON 214 Elements of Statistics for Economists
ECON 214 Elements of Statistics for Economists Session 3 Presentation of Data: Numerical Summary Measures Part 2 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh
More informationUnit 2 Statistics of One Variable
Unit 2 Statistics of One Variable Day 6 Summarizing Quantitative Data Summarizing Quantitative Data We have discussed how to display quantitative data in a histogram It is useful to be able to describe
More informationMeasures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms
Measures of Central Tendency: Ungrouped Data Measures of central tendency yield information about particular places or locations in a group of numbers. Common Measures of Location Mode Median Percentiles
More informationAP Statistics Chapter 6 - Random Variables
AP Statistics Chapter 6 - Random 6.1 Discrete and Continuous Random Objective: Recognize and define discrete random variables, and construct a probability distribution table and a probability histogram
More informationDESCRIPTIVE STATISTICS II. Sorana D. Bolboacă
DESCRIPTIVE STATISTICS II Sorana D. Bolboacă OUTLINE Measures of centrality Measures of spread Measures of symmetry Measures of localization Mainly applied on quantitative variables 2 DESCRIPTIVE STATISTICS
More informationDescriptive Statistics for Educational Data Analyst: A Conceptual Note
Recommended Citation: Behera, N.P., & Balan, R. T. (2016). Descriptive statistics for educational data analyst: a conceptual note. Pedagogy of Learning, 2 (3), 25-30. Descriptive Statistics for Educational
More informationMath 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment
Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class
More informationChapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.
-3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data
More informationStatistics I Chapter 2: Analysis of univariate data
Statistics I Chapter 2: Analysis of univariate data Numerical summary Central tendency Location Spread Form mean quartiles range coeff. asymmetry median percentiles interquartile range coeff. kurtosis
More informationMeasures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean
Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values
More informationEstablishing a framework for statistical analysis via the Generalized Linear Model
PSY349: Lecture 1: INTRO & CORRELATION Establishing a framework for statistical analysis via the Generalized Linear Model GLM provides a unified framework that incorporates a number of statistical methods
More informationBoth the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.
Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of
More informationSTATS DOESN T SUCK! ~ CHAPTER 4
CHAPTER 4 QUESTION 1 The Geometric Mean Suppose you make a 2-year investment of $5,000 and it grows by 100% to $10,000 during the first year. During the second year, however, the investment suffers a 50%
More informationRefer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.
Refer to Ex 3-18 on page 123-124 Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B. Test on Chapter 3 Friday Sept 27 th. You are expected to provide
More informationDESCRIPTIVE STATISTICS
DESCRIPTIVE STATISTICS INTRODUCTION Numbers and quantification offer us a very special language which enables us to express ourselves in exact terms. This language is called Mathematics. We will now learn
More information2 Exploring Univariate Data
2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting
More informationCHAPTER 2 Describing Data: Numerical
CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of