Descriptive Statistics

Size: px
Start display at page:

Download "Descriptive Statistics"

Transcription

1 Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations about the shape and spread of the data, a more complete understanding of the data can be attained by summarizing them using statistics. This chapter presents such statistical measures, including measures of central tendency, measures of variability, and measures of shape. The computation of these measures is different for ungrouped and grouped data. 3.1 Measures of Central Tendency: Ungrouped Data One type of measure that is used to describe a set of data is the measure of central tendency. Measures of central tendency yield information about the centre, or middle part, of a group of numbers. Measures of central tendency do not focus on the span of the data set or how far values are from the middle numbers. The measures of central tendency presented here for ungrouped data are the mode, the median, the mean, percentiles, and quartiles. Mode, Median, and Mean The mode is the most frequently occurring value in a set of data. The median is the middle value in an ordered array of numbers. For an array with an odd number of terms, the median is the middle number. For an array with an even number of terms, the median is the average of the two middle numbers. The mean is the average of a group of numbers and is computed by summing all numbers and dividing by the number of numbers. Demonstration Problem 3.1 Shown below is a list of the 11 largest motor vehicle producers in the world and the number of vehicles produced by each in Auto Manufacturer Production (millions) Toyota Motor Corp. 7.2 General Motors 6.5 Volkswagen Group 6.1 Ford Motor Co. 4.7 Hyundai 4.6 PSA Peugeot Citroën 3.0 Honda 3.0 Nissan 2.7 Fiat 2.5 Suzuki 2.4 Renault Input the data into Excel. Save as Demo_

2 36 Descriptive Statistics 2. Click on the Data tab and Data Analysis (if you don t see this option, refer to Chapter 2 to install the Analysis TookPak). 3. Select Descriptive Statistics and then select into the Input Range box. Select a cell to the right of the data for the Output Range and select Summary statistics. If you input and selected the label, select Labels in first row (make sure it is in the cell directly above the data). 4. Widen the column with the text to see all of the text (click and drag or double-click the line between the column letters). 5. The mean uses all the data, and each data item influences the mean. It is also a disadvantage because extremely large or small values can cause the mean to be pulled toward the extreme value.

3 Chapter 3 37 Remarks The mean uses all the data, and each data item influences the mean. It is also a disadvantage because extremely large or small values can cause the mean to be pulled toward the extreme value. In this data set, the mean value is 4.09 and the median is 3 showing that the large values pull the mean to a higher value whereas a typical value would be more like 3. The mode or most common value is also 3. Percentiles Percentiles are measures of central tendency that divide a group of data into 100 parts. There are 99 percentiles because it takes 99 dividers to separate a group of data into 100 parts. Let s use our data set to find specific percentiles using an Excel function. Demonstration Problem Input the following data into Excel in a column: 14, 12, 19, 23, 5, 13, 28, Click in a cell to the right of the data and input the function =PERCENTILE(select range of data, 0.3). The 30th percentile is represented by The answer is 13.1 and the whole number would be 13. A percentile may or may not be one of the data values. Note: The Rank and Percentile feature of the Data Analysis tool of Excel has the capability of ordering the data, assigning ranks to the data, and yielding the percentiles of the data. To access this command, click on Data Analysis and select Rank and Percentile from the menu. In the Rank and Percentile dialogue box, enter the location of the data to be analysed in Input Range. For this data set, the output looks like the output on the right:

4 38 Descriptive Statistics Quartiles Quartiles are measures of central tendency that divide a group of data into four subgroups or parts. If the observations are ordered from smallest to largest, each quartile represents 25% of the observations. The first quartile (Q 1 ) represents the median of the observations ordered from the minimum to the overall median M. The second quartile is the overall median (M) and represents 50% of all observations. The third quartile represents the median of the upper 50% of the observations. A five-number summary gives a complete description of the distribution, including the minimum number, Q 1, M (median), Q 3, and the maximum number. A box plot is a graph of the five-number summary. Side-by-side box plots are useful for comparing several distributions. Demonstration Problem Open the Demo_3-3 file from the folder titled Demonstration Problem Data Sets on the student companion site located at 2. Below the data, input the following labels and formulas according to the instructions given earlier in Chapter 1 using Function Wizard or by inputting the formulas manually. You could also use the Quartile function for all of the values by inserting 0,1,2,3,4 for the second argument. For example, Maximum would be =QUARTILE(C2:C17,4). 3. To view the functions used on a worksheet, there is an option in Excel to display all equations. Select File Options Advanced. Scroll down to Display options for this worksheet and select Show formulas in cells instead of their calculated results. Click OK and you will be able to view all of the formulas used on the current worksheet. Reverse the selection when you want to see only the results. You can always click on a cell and see the formula that was input in the Formula Bar.

5 Chapter The resulting values calculated are shown as follows: Note: These values are not quite the same as the values calculated in the textbook. That is because the quartiles are calculated by a different algorithm in different software programs. If you use the Min, Max, and Median functions, those values will be the same. The values that differ are Q1 and Q Measures of Variability: Ungrouped Data Business researchers can use another group of analytic tools, measures of variability, to describe the spread or the dispersion of a set of data. Using measures of variability in conjunction with measures of central tendency makes possible a more complete numerical description of the data. Methods of computing measures of variability differ for ungrouped data and grouped data. This section focuses on seven measures of variability for ungrouped data: range, interquartile range, mean absolute deviation, variance, standard deviation, z scores, and coefficient of variation. Range The range is the difference between the largest value and the smallest value of a data set. Although it is usually a single numeric value, some business researchers define the range of data as the ordered pair of smallest and largest numbers (smallest, largest). It is a crude measure of variability, describing the distance to the outer bounds of the data set. An advantage of the range is its ease of computation. A disadvantage of the range is that, because it is computed with the values that are on the extremes of the data, it is affected by extreme values, and its application as a measure of variability is limited. Interquartile Range Another measure of variability is the interquartile range. The interquartile range is the range of values between the first and third quartile. Essentially, it is the range of the middle 50% of the data and is determined by computing the value of Q 3 minus Q 1. The interquartile range is especially useful in situations where data users are more interested in values towards the middle and less interested in extremes. In addition, the interquartile range is used in the construction of box-and-whisker plots. The interquartile range value can differ slightly when using different software programs due to the underlying algorithms defining the quartiles.

6 40 Descriptive Statistics Demonstration Problem 3.3 (continued) 1. Open the Demo_3-3_Results file from the folder titled Demonstration Problem Data Sets on the student companion site located at or use the results calculated in the previous exercise on quartiles. 2. To calculate the Range in Excel, use a simple subtraction formula. Click on a cell below the quartile calculations and input = and then click on the computed maximum value, type a - and click on the computed minimum value. For our example, it should look like this: The result: 3. To calculate the interquartile range, use a simple subtraction formula with Q1 and Q3. Click on a cell below the range calculation and input = and then click on the computed Q3 value, type a - and click on the computed Q1 value. For our example, it should look like this: The result: Mean Absolute Deviation, Variance, and Standard Deviation Three other measures of variability are the variance, the standard deviation, and the mean absolute deviation. The variance and standard deviation are widely used in statistics. Although the standard deviation has some stand-alone potential, the importance of variance and standard deviation lies mainly in their role as tools used in conjunction with other statistical devices. Mean Absolute Deviation The mean absolute deviation (MAD) is the average of the absolute values of the deviations around the mean for a set of numbers. There is no function of this in Excel but you can set up a table and use formulas to calculate: MAD MAD Problem x = i N µ 1. A small company started a production line to build computers. During the first five weeks of production, the output is 5, 9, 16, 17, and 18 computers, respectively. Calculate the mean absolute deviation for the computer production data. 2. Input the data vertically in one column. In a cell below the data, calculate the mean of the data set by inputting the function =AVERAGE, selecting the data to find the mean, and pressing Enter. 3. In the next column, input a formula to find the absolute value of the mean subtracted from the value in the first column. Remember that the reference to the average in the formula has to be absolute by using the function key F4. The formula should look like the one below. Copy the

7 Chapter 3 41 formula down. Insert a formula below to sum the values using the function =SUM(select the data). The resulting values should look like: 4. Below those values, input a function to find the absolute value of the sum of those values for (x μ) as =SUM(select the data). The worksheet should look like the one below. Please note that the sum of (x μ) should be The MAD calculation can be input in a cell as the sum of the absolute value of (x μ) divided by the sample size N, which is 5. The result is 4.8. Variance Because absolute values are not conducive to easy manipulation, mathematicians developed an alternative mechanism for overcoming the zero-sum property of deviations from the mean. This approach utilizes the square of the deviations from the mean. The result is the variance, an important measure of variability. The variance is the average of the squared deviations about the arithmetic mean for a set of numbers. The population variance is denoted by σ 2. 2 ( x µ) 2 σ = i N Population Variance and Standard Deviation Problem 1. Open the MAD_results file or use the results calculated in the previous exercise on MAD. 2. The most straightforward way to calculate a population variance is to use the Variance function in Excel. For this example, choose the population variance (VAR.P). The result is 26.0.

8 42 Descriptive Statistics 3. The square root of this value is the standard deviation. In Excel, you type the function =SQRT(click on the number or type the number to find the square root). The standard deviation value is 5.1. Or, use the population Standard Deviation function in Excel (STDDEV.P). Standard Deviation The population standard deviation is a measure of the spread of a distribution. A symmetric distribution is completely described by its centre at the mean and its spread defined by multiples of its standard deviation. The standard deviation σ is the square root of the variance σ 2. 2 ( xi x) σ = N Population versus Sample Variance and Standard Deviation The sample variance is denoted by s 2 and the sample standard deviation by s. The main use for sample variances and standard deviations is as estimators of population variances and standard deviations. Because of this, computation of the sample variance and standard deviation differs slightly from computation of the population variance and standard deviation. Both the sample variance and sample standard deviation use n 1 in the denominator.

9 Chapter 3 43 s 2 x i x) = n 1 2 ( s 2 ( x i x) n 1 ]should be s = in second equation[ Sample Variance and Standard Deviation Problem 1. Input the following data representing a sample of six of the largest accounting firms in the United States and the number of partners associated with each firm. Firm Number of Partners Deloitte & Touche 2654 Ernst & Young 2108 PricewaterhouseCoopers 2069 KPMG 1664 RSM McGladrey 720 Grant Thornton The most straightforward way to calculate a sample variance is to use the Sample Variance function in Excel. Choose the function VAR.S. The result is 806, In a similar way, the sample standard deviation can be calculated by using the sample Standard Deviation function in Excel. For this example, choose the function STDEV.S. The result is

10 44 Descriptive Statistics z Scores Normal distributions are defined as perfectly symmetric, bell-shaped curves. The normal distribution has certain qualities that allow calculations based on standardized values of the standard deviation (as described in the text). A z score represents the number of standard deviations a value (x) is above or below the mean of a set of numbers when the data are normally distributed. Using z scores allows translation of a value s raw distance from the mean into units of standard deviations. If a variable x has a normal distribution N(µ,σ) with a mean µ and a population standard deviation σ, then the standardized variable = x µ xi x z also has a standard normal distribution. For samples, z =. σ s Normal distribution calculations determine the probability that a distribution would have a given value at least that high or higher, or at least that low or lower. The calculations can also determine a specific value for a given probability or proportion. These methods are described in the text. Demonstration Problem 3.5: A Normal Distribution Variation In the computing industry, the average age of professional employees tends to be younger than in many other business professions. Suppose the average age of a professional employed by a particular computer firm is 28 with a standard deviation of 6 years. A histogram of professional employee ages with this firm reveals that the data are normally distributed. Determine the range of ages within which at least 80% of the workers ages would fall. What percentage of employees are younger than 22? Older than 45? Determine the range of ages within which at least 80% of the workers ages would fall. This is a variation of Demonstration Problem 3.5 in the textbook, which addresses the non-normal distribution. Excel functions can calculate normal distribution values. The following method demonstrates calculating probabilities with and without using a z-score calculation: 1. Open an empty worksheet in Excel. 2. Type in the given values and their headings. We are interested in the proportion or probability of having an age as young as or younger than 22. Because of how the normal distribution table is constructed, the area to the left of a specific value is always calculated. If an area to the right of a specific value is desired, we will have to subtract the value from Click in the cell to the right of the P < 22 to input the function to calculate the standardized z- value of a specific x-value. Input =STANDARDIZE(22,28,6) and press Enter. The arguments of the function in order are the x-value, the population mean, and the population standard

11 Chapter 3 45 deviation. Instead of typing in the specific values, you can click on the cells containing those values. The result is Click in the next cell to the right and input the function to calculate the probability of an x-value being as low or lower than 22. Input =NORM.S.DIST(-1,1). Instead of typing -1, you can click on the z-score of -1. A second argument of True or 1 is entered to calculate the percentage to the left of the given value and press Enter. The result is That is, 15.87% of the ages are less than or equal to 22 years. 5. If a z-value is not required, you can input the function =NORM.DIST(22,28,6,1) and press Enter. The arguments of the function are the x-value, the population mean, the population standard deviation, and 1 for True for a cumulative probability and 0 for False for a cumulative probability. For this example, we are interested in the cumulative probability to the left of the x- value of 22 so this value is True, or 1. The result is That is, 15.87% of the employees are less than or equal to 22 years of age. 6. Click in the cell to the right of the P > 45 to input =NORM.DIST(45,28,6,0) and press Enter. The arguments of the function in order are the x-value, the population mean, the population standard deviation, and 0 for False to calculate the percentage to the right of the given value and press Enter. The result is That is, 0.12% of the employees are more than or equal to 45 years old. 7. To calculate what percentage of values would lie between 2 ages, you could subtract the probability to the left of the smaller value from the probability to the left of the larger value. The

12 46 Descriptive Statistics in-between percentage would be = You can also input the formula =NORM.DIST(45,28,6,1)-NORM.DIST(22,28,6,1). 8. The final calculation is to determine the range within which at least 80% of the workers ages would fall. In an empty cell, calculate the z-value by inputting the formula =NORM.INV(0.8) into an empty cell and press Enter. This function calculates the z-value for a proportion of 0.8 or 80%. In this problem we wanted to find the lower 80% of values. The result is 33 years. 3.3 Measures of Central Tendency Three measures of central tendency are presented in the textbook for grouped data: the mean, the median, and the mode. There are no functions in Excel that help calculate these for values except by modeling the manual calculations shown in the text. 3.4 Measures of Shape Measures of shape are tools that can be used to describe the shape of a distribution of data. In this section, the text examines two measures of shape: skewness and kurtosis. It also looks at box-and-whisker plots. Skewness is addressed in the following section, Descriptive Statistics. Box-and-Whisker Plots and Five-Number Summary Another way to describe a distribution of data is by using a box-and-whisker plot. A box-and-whisker plot, sometimes called a box plot, is a diagram that utilizes the upper and lower quartiles along with the median and the two most extreme values to depict a distribution graphically. There is no way to construct a box and whisker plot in Excel. There are some Excel templates or macros that can be downloaded that provide this capability. 3.5 Descriptive Statistics on the Computer Excel s descriptive statistics output for the same computer production data is displayed below. The Excel output contains the mean, the median, the mode, the sample standard deviation, the sample variance, and the range. The descriptive statistics feature on either of these computer packages yields a lot of useful information about a data set. 1. Input the computer production data into one column of Excel: 5, 9, 16, 17, Click on the Data tab and Data Analysis. 3. Select Descriptive Statistics and then select into the Input Range box. Select a cell to the right of the data for the Output Range and select Summary statistics. If you input and selected the label, select Labels in first row (make sure it is in the cell directly above the data).

13 Chapter Widen the column with the text to see all of the text (click and drag or double-click the line between the column letters). 5. The mean uses all the data, and each data item influences the mean. It is also a disadvantage because extremely large or small values can cause the mean to be pulled towards the extreme value.

14 48 Descriptive Statistics Kurtosis Kurtosis describes the amount of peakedness of a distribution. Kurtosis near 0 means a data set exhibits peakedness close to the normal distribution see examples in text. Skewness Skewness is when a distribution is asymmetrical or lacks symmetry. A distribution can be positively skewed or negatively skewed see examples in text. A skew greater than +1 indicates a high degree of positive skew, 1 indicates a high degree of negative skew, and between indicates a relatively symmetric data set. SUMMARY OF EXCEL COMMANDS USED IN CHAPTER 3 Excel has the capability of producing many of the statistics in this chapter piecemeal and there is one Excel feature, Descriptive Statistics, that produces many of these statistics in one output. Descriptive Statistics Select the Data tab on the Excel worksheet. From the Analysis panel at the right top of the Data tab worksheet, click on Data Analysis. If your Excel worksheet does not show the Data Analysis option, then you can load it as an add-in following directions given in Chapter 2. From the Data Analysis pulldown menu, select Descriptive Statistics. In the Descriptive Statistics dialogue box, click in the Input Range and select the data to be analysed. Check Labels in fir st row if your data contain a label in the first row (cell). Check the box beside Summary statistics. The Summary statistics feature computes a wide variety of descriptive statistics. The output includes the mean, the median, the mode, the standard deviation, the sample variance, a measure of kurtosis, a measure of skewness, the range, the mini mum, the maximum, the sum and the count. Rank and Percentiles The Rank and Percentile feature of the Data Analysis tool of Excel has the capability of ordering the data, assigning ranks to the data, and yielding the percentiles of the data. To access this command, click on Data Analysis (see above) and select Rank and Percentile from the menu. In the Rank and Per centile dialog box, enter the location of the data to be analysed in Input Range. Check Labels in fir st Row if your data contain a label in the first row (cell). Using Functions Many of the individual statistics presented in this chapter can be computed using the Insert Function (fx) feature of Excel. To access Insert Function, go to the For mulas tab on an Excel worksheet (top centre tab) and Insert Function is on the far left of the menu bar. In the Insert Function dialogue box at the top, there is a pulldown menu where it says Or select a categor y. From the pulldown menu associated with this command, select Statistical. There are 83 different statistics that can be computed using one of these commands. Select the one that you want to compute and enter the location of the data. Some of the more useful commands in this menu are AVERAGE, MEDIAN, MODE.SNGL, SK EW, STDEV.S, and VAR.S.

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

3.1 Measures of Central Tendency

3.1 Measures of Central Tendency 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent

More information

Numerical Descriptions of Data

Numerical Descriptions of Data Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =

More information

Descriptive Statistics

Descriptive Statistics Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs

More information

LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL

LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL There is a wide range of probability distributions (both discrete and continuous) available in Excel. They can be accessed through the Insert Function

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

SUMMARY STATISTICS EXAMPLES AND ACTIVITIES

SUMMARY STATISTICS EXAMPLES AND ACTIVITIES Session 6 SUMMARY STATISTICS EXAMPLES AD ACTIVITIES Example 1.1 Expand the following: 1. X 2. 2 6 5 X 3. X 2 4 3 4 4. X 4 2 Solution 1. 2 3 2 X X X... X 2. 6 4 X X X X 4 5 6 5 3. X 2 X 3 2 X 4 2 X 5 2

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

Simple Descriptive Statistics

Simple Descriptive Statistics Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency

More information

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line. Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,

More information

Description of Data I

Description of Data I Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret

More information

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda, MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, CONTENTS What is measures of dispersion? Why measures of dispersion? How measures of dispersions are calculated? Range Quartile

More information

MATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1)

MATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) Descriptive statistics are ways of summarizing large sets of quantitative (numerical) information. The best way to reduce a set of

More information

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values

More information

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment MBEJ 1023 Planning Analytical Methods Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment Contents What is statistics? Population and Sample Descriptive Statistics Inferential

More information

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

Monte Carlo Simulation (Random Number Generation)

Monte Carlo Simulation (Random Number Generation) Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...

More information

Measures of Dispersion (Range, standard deviation, standard error) Introduction

Measures of Dispersion (Range, standard deviation, standard error) Introduction Measures of Dispersion (Range, standard deviation, standard error) Introduction We have already learnt that frequency distribution table gives a rough idea of the distribution of the variables in a sample

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

DATA HANDLING Five-Number Summary

DATA HANDLING Five-Number Summary DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest

More information

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Categorical. A general name for non-numerical data; the data is separated into categories of some kind. Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,

More information

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25 Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example

More information

1.2 Describing Distributions with Numbers, Continued

1.2 Describing Distributions with Numbers, Continued 1.2 Describing Distributions with Numbers, Continued Ulrich Hoensch Thursday, September 6, 2012 Interquartile Range and 1.5 IQR Rule for Outliers The interquartile range IQR is the distance between the

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information

Frequency Distribution and Summary Statistics

Frequency Distribution and Summary Statistics Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary

More information

Fundamentals of Statistics

Fundamentals of Statistics CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc COUNSELLING PSYCHOLOGY (2011 Admission Onwards) II Semester Complementary Course PSYCHOLOGICAL STATISTICS QUESTION BANK 1. The process of grouping

More information

Measures of Central tendency

Measures of Central tendency Elementary Statistics Measures of Central tendency By Prof. Mirza Manzoor Ahmad In statistics, a central tendency (or, more commonly, a measure of central tendency) is a central or typical value for a

More information

DESCRIPTIVE STATISTICS

DESCRIPTIVE STATISTICS DESCRIPTIVE STATISTICS INTRODUCTION Numbers and quantification offer us a very special language which enables us to express ourselves in exact terms. This language is called Mathematics. We will now learn

More information

What s Normal? Chapter 8. Hitting the Curve. In This Chapter

What s Normal? Chapter 8. Hitting the Curve. In This Chapter Chapter 8 What s Normal? In This Chapter Meet the normal distribution Standard deviations and the normal distribution Excel s normal distribution-related functions A main job of statisticians is to estimate

More information

DECISION SUPPORT Risk handout. Simulating Spreadsheet models

DECISION SUPPORT Risk handout. Simulating Spreadsheet models DECISION SUPPORT MODELS @ Risk handout Simulating Spreadsheet models using @RISK 1. Step 1 1.1. Open Excel and @RISK enabling any macros if prompted 1.2. There are four on-line help options available.

More information

starting on 5/1/1953 up until 2/1/2017.

starting on 5/1/1953 up until 2/1/2017. An Actuary s Guide to Financial Applications: Examples with EViews By William Bourgeois An actuary is a business professional who uses statistics to determine and analyze risks for companies. In this guide,

More information

Summary of Statistical Analysis Tools EDAD 5630

Summary of Statistical Analysis Tools EDAD 5630 Summary of Statistical Analysis Tools EDAD 5630 Test Name Program Used Purpose Steps Main Uses/Applications in Schools Principal Component Analysis SPSS Measure Underlying Constructs Reliability SPSS Measure

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 3 Presentation of Data: Numerical Summary Measures Part 2 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

Numerical Descriptive Measures. Measures of Center: Mean and Median

Numerical Descriptive Measures. Measures of Center: Mean and Median Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where

More information

Lecture Week 4 Inspecting Data: Distributions

Lecture Week 4 Inspecting Data: Distributions Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your

More information

Statistics (This summary is for chapters 18, 29 and section H of chapter 19)

Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x n =

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

Ti 83/84. Descriptive Statistics for a List of Numbers

Ti 83/84. Descriptive Statistics for a List of Numbers Ti 83/84 Descriptive Statistics for a List of Numbers Quiz scores in a (fictitious) class were 10.5, 13.5, 8, 12, 11.3, 9, 9.5, 5, 15, 2.5, 10.5, 7, 11.5, 10, and 10.5. It s hard to get much of a sense

More information

Descriptive Analysis

Descriptive Analysis Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

DazStat. Introduction. Installation. DazStat is an Excel add-in for Excel 2003 and Excel 2007.

DazStat. Introduction. Installation. DazStat is an Excel add-in for Excel 2003 and Excel 2007. DazStat Introduction DazStat is an Excel add-in for Excel 2003 and Excel 2007. DazStat is one of a series of Daz add-ins that are planned to provide increasingly sophisticated analytical functions particularly

More information

Prepared By. Handaru Jati, Ph.D. Universitas Negeri Yogyakarta.

Prepared By. Handaru Jati, Ph.D. Universitas Negeri Yogyakarta. Prepared By Handaru Jati, Ph.D Universitas Negeri Yogyakarta handaru@uny.ac.id Chapter 7 Statistical Analysis with Excel Chapter Overview 7.1 Introduction 7.2 Understanding Data 7.2.1 Descriptive Statistics

More information

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form:

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form: 1 Exercise One Note that the data is not grouped! 1.1 Calculate the mean ROI Below you find the raw data in tabular form: Obs Data 1 18.5 2 18.6 3 17.4 4 12.2 5 19.7 6 5.6 7 7.7 8 9.8 9 19.9 10 9.9 11

More information

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.) Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop

More information

Monte Carlo Simulation (General Simulation Models)

Monte Carlo Simulation (General Simulation Models) Monte Carlo Simulation (General Simulation Models) Revised: 10/11/2017 Summary... 1 Example #1... 1 Example #2... 10 Summary Monte Carlo simulation is used to estimate the distribution of variables when

More information

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION 1 Day 3 Summer 2017.07.31 DISTRIBUTION Symmetry Modality 单峰, 双峰 Skewness 正偏或负偏 Kurtosis 2 3 CHAPTER 4 Measures of Central Tendency 集中趋势

More information

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

2 Exploring Univariate Data

2 Exploring Univariate Data 2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting

More information

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations

More information

STAT 113 Variability

STAT 113 Variability STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2

More information

Handout 5: Summarizing Numerical Data STAT 100 Spring 2016

Handout 5: Summarizing Numerical Data STAT 100 Spring 2016 In this handout, we will consider methods that are appropriate for summarizing a single set of numerical measurements. Definition Numerical Data: A set of measurements that are recorded on a naturally

More information

Measures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms

Measures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms Measures of Central Tendency: Ungrouped Data Measures of central tendency yield information about particular places or locations in a group of numbers. Common Measures of Location Mode Median Percentiles

More information

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) = Solutions to End-of-Section and Chapter Review Problems 225 CHAPTER 6 6.1 (a) P(Z < 1.20) = 0.88493 P(Z > 1.25) = 1 0.89435 = 0.10565 P(1.25 < Z < 1.70) = 0.95543 0.89435 = 0.06108 (d) P(Z < 1.25) or Z

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

IOP 201-Q (Industrial Psychological Research) Tutorial 5

IOP 201-Q (Industrial Psychological Research) Tutorial 5 IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,

More information

You should already have a worksheet with the Basic Plus Plan details in it as well as another plan you have chosen from ehealthinsurance.com.

You should already have a worksheet with the Basic Plus Plan details in it as well as another plan you have chosen from ehealthinsurance.com. In earlier technology assignments, you identified several details of a health plan and created a table of total cost. In this technology assignment, you ll create a worksheet which calculates the total

More information

Written by N.Nilgün Çokça. Advance Excel. Part One. Using Excel for Data Analysis

Written by N.Nilgün Çokça. Advance Excel. Part One. Using Excel for Data Analysis Written by N.Nilgün Çokça Advance Excel Part One Using Excel for Data Analysis March, 2018 P a g e 1 Using Excel for Calculations Arithmetic operations Arithmetic operators: To perform basic mathematical

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows

More information

The Normal Distribution & Descriptive Statistics. Kin 304W Week 2: Jan 15, 2012

The Normal Distribution & Descriptive Statistics. Kin 304W Week 2: Jan 15, 2012 The Normal Distribution & Descriptive Statistics Kin 304W Week 2: Jan 15, 2012 1 Questionnaire Results I received 71 completed questionnaires. Thank you! Are you nervous about scientific writing? You re

More information

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, 2013 Abstract Review summary statistics and measures of location. Discuss the placement exam as an exercise

More information

Section3-2: Measures of Center

Section3-2: Measures of Center Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

SOLUTIONS TO THE LAB 1 ASSIGNMENT

SOLUTIONS TO THE LAB 1 ASSIGNMENT SOLUTIONS TO THE LAB 1 ASSIGNMENT Question 1 Excel produces the following histogram of pull strengths for the 100 resistors: 2 20 Histogram of Pull Strengths (lb) Frequency 1 10 0 9 61 63 6 67 69 71 73

More information

2 DESCRIPTIVE STATISTICS

2 DESCRIPTIVE STATISTICS Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

Chapter 4 Variability

Chapter 4 Variability Chapter 4 Variability PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter and Larry B. Wallnau Chapter 4 Learning Outcomes 1 2 3 4 5

More information

Applications of Data Dispersions

Applications of Data Dispersions 1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given

More information

GETTING STARTED. To OPEN MINITAB: Click Start>Programs>Minitab14>Minitab14 or Click Minitab 14 on your Desktop

GETTING STARTED. To OPEN MINITAB: Click Start>Programs>Minitab14>Minitab14 or Click Minitab 14 on your Desktop Minitab 14 1 GETTING STARTED To OPEN MINITAB: Click Start>Programs>Minitab14>Minitab14 or Click Minitab 14 on your Desktop The Minitab session will come up like this 2 To SAVE FILE 1. Click File>Save Project

More information

4. DESCRIPTIVE STATISTICS

4. DESCRIPTIVE STATISTICS 4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in

More information

Putting Things Together Part 2

Putting Things Together Part 2 Frequency Putting Things Together Part These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for, and are in

More information

Web Extension: Continuous Distributions and Estimating Beta with a Calculator

Web Extension: Continuous Distributions and Estimating Beta with a Calculator 19878_02W_p001-008.qxd 3/10/06 9:51 AM Page 1 C H A P T E R 2 Web Extension: Continuous Distributions and Estimating Beta with a Calculator This extension explains continuous probability distributions

More information

Statistics vs. statistics

Statistics vs. statistics Statistics vs. statistics Question: What is Statistics (with a capital S)? Definition: Statistics is the science of collecting, organizing, summarizing and interpreting data. Note: There are 2 main ways

More information

Engineering Mathematics III. Moments

Engineering Mathematics III. Moments Moments Mean and median Mean value (centre of gravity) f(x) x f (x) x dx Median value (50th percentile) F(x med ) 1 2 P(x x med ) P(x x med ) 1 0 F(x) x med 1/2 x x Variance and standard deviation

More information

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2] 1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders

More information

Chapter 3 Descriptive Statistics: Numerical Measures Part A

Chapter 3 Descriptive Statistics: Numerical Measures Part A Slides Prepared by JOHN S. LOUCKS St. Edward s University Slide 1 Chapter 3 Descriptive Statistics: Numerical Measures Part A Measures of Location Measures of Variability Slide Measures of Location Mean

More information

Describing Data: Displaying and Exploring Data

Describing Data: Displaying and Exploring Data Describing Data: Displaying and Exploring Data Chapter 4 McGraw-Hill/Irwin Copyright 2011 by the McGraw-Hill Companies, Inc. All rights reserved. LEARNING OBJECTIVES LO1. Develop and interpret a dot plot.

More information

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION Subject Paper No and Title Module No and Title Paper No.2: QUANTITATIVE METHODS Module No.7: NORMAL DISTRIBUTION Module Tag PSY_P2_M 7 TABLE OF CONTENTS 1. Learning Outcomes 2. Introduction 3. Properties

More information

Introduction to Basic Excel Functions and Formulae Note: Basic Functions Note: Function Key(s)/Input Description 1. Sum 2. Product

Introduction to Basic Excel Functions and Formulae Note: Basic Functions Note: Function Key(s)/Input Description 1. Sum 2. Product Introduction to Basic Excel Functions and Formulae Excel has some very useful functions that you can use when working with formulae. This worksheet has been designed using Excel 2010 however the basic

More information

Numerical Measurements

Numerical Measurements El-Shorouk Academy Acad. Year : 2013 / 2014 Higher Institute for Computer & Information Technology Term : Second Year : Second Department of Computer Science Statistics & Probabilities Section # 3 umerical

More information

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data. -3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data

More information

MgtOp S 215 Chapter 8 Dr. Ahn

MgtOp S 215 Chapter 8 Dr. Ahn MgtOp S 215 Chapter 8 Dr. Ahn An estimator of a population parameter is a rule that tells us how to use the sample values,,, to estimate the parameter, and is a statistic. An estimate is the value obtained

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,

More information

Review: Types of Summary Statistics

Review: Types of Summary Statistics Review: Types of Summary Statistics We re often interested in describing the following characteristics of the distribution of a data series: Central tendency - where is the middle of the distribution?

More information

David Tenenbaum GEOG 090 UNC-CH Spring 2005

David Tenenbaum GEOG 090 UNC-CH Spring 2005 Simple Descriptive Statistics Review and Examples You will likely make use of all three measures of central tendency (mode, median, and mean), as well as some key measures of dispersion (standard deviation,

More information

Some estimates of the height of the podium

Some estimates of the height of the podium Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40

More information

NCSS Statistical Software. Reference Intervals

NCSS Statistical Software. Reference Intervals Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and

More information

The Normal Probability Distribution

The Normal Probability Distribution 1 The Normal Probability Distribution Key Definitions Probability Density Function: An equation used to compute probabilities for continuous random variables where the output value is greater than zero

More information

Chapter 5: Summarizing Data: Measures of Variation

Chapter 5: Summarizing Data: Measures of Variation Chapter 5: Introduction One aspect of most sets of data is that the values are not all alike; indeed, the extent to which they are unalike, or vary among themselves, is of basic importance in statistics.

More information

Moments and Measures of Skewness and Kurtosis

Moments and Measures of Skewness and Kurtosis Moments and Measures of Skewness and Kurtosis Moments The term moment has been taken from physics. The term moment in statistical use is analogous to moments of forces in physics. In statistics the values

More information

Normal Model (Part 1)

Normal Model (Part 1) Normal Model (Part 1) Formulas New Vocabulary The Standard Deviation as a Ruler The trick in comparing very different-looking values is to use standard deviations as our rulers. The standard deviation

More information