Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis
|
|
- Clinton Piers Dalton
- 5 years ago
- Views:
Transcription
1 Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem Developed by mathematicians Jules Bienaymé ( ) and Pafnuty Chebyshev ( ). For any population with mean mand standard deviation s,, the percentage of observations that lie within k standard deviations of the mean must be at least 100[1 1/k 2 ]. 4B-2 1
2 Chebyshev s Theorem For k = 2 standard deviations, 100[1 1/2 2 ] = 75% So, at least 75.0% will lie within m+ 2s For k = 3 standard deviations, 100[1 1/3 2 ] = 88.9% So, at least 88.9% will lie within m+ 3s Although applicable to any data set, these limits tend to be too wide to be useful. 4B-3 4B-4 The Empirical Rule The normal or Gaussian distribution was named for Karl Gauss ( ). The normal distribution is symmetric and is also known as the bell-shaped curve. The Empirical Rule states that for data from a normal distribution, we expect that for k = 1 about 68.26% will lie within m+ 1s k = 2 about 95.44% will lie within m+ 2s k = 3 about 99.73% will lie within m+ 3s 2
3 The Empirical Rule Distance from the mean is measured in terms of the number of standard deviations. Note: no upper bound is given. Data values outside µ + 3σ are rare. 4B-5 Example: Exam Scores If 80 students take an exam, how many will score within 2 standard deviations of the mean? Assuming exam scores follow a normal distribution, the empirical rule states about 95.44% will lie within m+ 2s so 95.44% x students will score + 2sfrom m. How many students will score more than 2 standard deviations from the mean? 4B-6 3
4 Unusual Observations Unusual observations are those that lie beyond m+ 2s. Outliers are observations that lie beyond m+ 3s. 4B-7 Unusual Observations For example, the P/E ratio data contains several large data values. Are they unusual or outliers? B-8 4
5 The Empirical Rule If the sample came from a normal distribution, then the Empirical rule states x 1s = ± 1(14.08) = (8.6, 38.8) x 2s = ± 2(14.08) = (-5.4, 50.9) x 3s = ± 3(14.08) = (-19.5, 65.0) 4B-9 The Empirical Rule Are there any unusual values or outliers? Unusual Unusual Outliers Outliers B-10 5
6 Defining a Standardized Variable A standardized variable (Z) redefines each observation in terms the number of standard deviations from the mean. Standardization formula for a population: z i x i Standardization formula for a sample: z i x i s x 4B-11 Defining a Standardized Variable z i tells how far away the observation is from the mean. For example, for the P/E data, the first value x 1 = 7. The associated z value is z i x s x i = = B-12 6
7 Defining a Standardized Variable A negative z value means the observation is below the mean. Positive z means the observation is above the mean. For x 68 = 91, z x s x i i = = B-13 Defining a Standardized Variable Here are the standardized z values for the P/E data: What do you conclude for these three values? 4B-14 7
8 Defining a Standardized Variable MegaStat calculates standardized values as well as checks for outliers. In Excel, use =STANDARDIZE(Array, Mean, STDev) to calculate a standardized z value. 4B-15 Outliers 4B-16 What do we do with outliers in a data set? If due to erroneous data, then discard. An outrageous observation (one completely outside of an expected range) is certainly invalid. Recognize unusual data points and outliers and their potential impact on your study. Research books and articles on how to handle outliers. 8
9 Estimating Sigma 4B-17 For a normal distribution, the range of values is 6s(from m 3sto m+ 3s). If you know the range R (high low), you can estimate the standard deviation as s= R/6. Useful for approximating the standard deviation when only R is known. This estimate depends on the assumption of normality. Percentiles and Quartiles Percentiles Percentiles are data that have been divided into 100 groups. For example, you score in the 83 rd percentile on a standardized test. That means that 83% of the test- takers scored below you. Deciles are data that have been divided into 10 groups. Quintiles are data that have been divided into 5 groups. Quartiles are data that have been divided into 4 groups. 4B-18 9
10 Percentiles and Quartiles Percentiles Percentiles are used to establish benchmarks for comparison purposes (e.g., health care, manufacturing and banking industries use 5, 25, 50, 75 and 90 percentiles). Quartiles (25, 50, and 75 percent) are commonly used to assess financial performance and stock portfolios. Percentiles are used in employee merit evaluation and salary benchmarking. 4B-19 Percentiles and Quartiles Quartiles Quartiles are scale points that divide the sorted data into four groups of approximately equal size. Q 1 Q 2 Q 3 Lower 25% Second 25% Third 25% Upper 25% 4B-20 The three values that separate the four groups are called Q 1, Q 2, and Q 3, respectively. 10
11 Percentiles and Quartiles 4B-21 Quartiles 4B-21 The second quartile Q 2 is the median, an important indicator of central tendency. Q 2 Lower 50% Upper 50% Q 1 and Q 3 measure dispersion since the interquartile range Q 3 Q 1 measures the degree of spread in the middle 50 percent of data values. Q 1 Q 3 Lower 25% Middle 50% Upper 25% Percentiles and Quartiles Quartiles The first quartile Q 1 is the median of the data values below Q 2, and the third quartile Q 3 is the median of the data values above Q 2. Q 1 Q 2 Q 3 Lower 25% Second 25% Third 25% Upper 25% For first half of data, 50% above, 50% below Q 1. For second half of data, 50% above, 50% below Q 3. 4B-22 11
12 Percentiles and Quartiles Quartiles Depending on n,, the quartiles Q 1,Q 2, and Q 3 may be members of the data set or may lie between two of the sorted data values. 4B-23 4B-24 Percentiles and Quartiles Method of Medians For small data sets, find quartiles using method of medians: Step 1. Sort the observations. Step 2. Find the median Q 2. Step 3. Find the median of the data values that lie below Q 2. Step 4. Find the median of the data values that lie above Q 2. 12
13 Percentiles and Quartiles Excel Quartiles 4B-25 Use Excel function =QUARTILE(Array, k) to return the kth quartile. Excel treats quartiles as a special case of percentiles. For example, to calculate Q 3 =QUARTILE(Array, 3) =PERCENTILE(Array, 75) Excel calculates the quartile positions as: Position of Q n Position of Q n Position of Q n Percentiles and Quartiles Example: P/E Ratios and Quartiles Consider the following P/E ratios for 68 stocks in a portfolio Use quartiles to define benchmarks for stocks that are low-priced (bottom quartile) or high- priced (top quartile). 4B-26 13
14 Percentiles and Quartiles Example: P/E Ratios and Quartiles Using Excel s method of interpolation, the quartile positions are: Quartile Position Formula Interpolate Between Q 1 = 0.25(68) = X 17 + X 18 Q 2 = 0.50(68) = X 34 + X 35 Q 3 = 0.75(68) = X 51 + X 52 4B-27 Percentiles and Quartiles Example: P/E Ratios and Quartiles The quartiles are: Quartile Formula First (Q 1 ) Q 1 = X (X 18 -X 17 ) = (14-14) = 14 Second (Q 2 ) Q 2 = X (X 35 -X 34 ) = (19-19) = 19 Third ( Third (Q 3 ) Q 3 = X (X 52 -X 51 ) = (26-26) = 26 4B-28 14
15 Percentiles and Quartiles Example: P/E Ratios and Quartiles So, to summarize: Q 1 Q 2 Q 3 Lower 25% 14 Second 25% 19 Third 25% 26 Upper 25% of P/E Ratios of P/E Ratios of P/E Ratios of P/E Ratios 4B-29 These quartiles express central tendency and dispersion. What is the interquartile range? Because of clustering of identical data values, these quartiles do not provide clean cut points between groups of observations. Percentiles and Quartiles Tip Whether you use the method of medians or Excel, your quartiles will be about the same. Small differences in calculation techniques typically do not lead to different conclusions in business applications. 4B-30 15
16 Percentiles and Quartiles Caution 4B-31 Quartiles generally resist outliers. However, quartiles do not provide clean cut points in the sorted data, especially in small samples with repeating data values. Data set A: 1, 2, 4, 4, 8, 8, 8, 8 Q = 3, Q 1 2 = 6, Q 3 = 8 Data set B: 0, 3, 3, 6, 6, 6, 10, 15 Q = 3, Q 1 2 = 6, Q 3 = 8 Although they have identical quartiles, these two data sets are not similar. The quartiles do not represent either data set well. 4B-32 Box Plots A useful tool of exploratory data analysis (EDA). Also called a box-and-whisker plot. Based on a five-number summary: X min, Q 1, Q 2, Q 3, X max Consider the five-number summary for the 68 P/E ratios: X min, Q 1, Q 2, Q 3, X max
17 Box Plots The box plot is displayed visually, like this. A box plot shows central tendancy, dispersion, and shape. 4B-33 Box Plots Fences and Unusual Data Values Use quartiles to detect unusual data points. These points are called fences and can be found using the following formulas: Inner fences Outer fences: Lower fence Q (Q 3 Q 1 ) Q (Q 3 Q 1 ) Upper fence Q (Q 3 Q 1 ) Q (Q 3 Q 1 ) Values outside the inner fences are unusual while those outside the outer fences are outliers. 4B-34 17
18 Box Plots Fences and Unusual Data Values For example, consider the P/E ratio data: Inner fences Lower fence: (26 14) = 4 Outer fences: (26 14) = 22 Upper fence: (26 14) = (26 14) = +62 Ignore the lower fence since it is negative and P/E ratios are only positive. 4B-35 Box Plots Fences and Unusual Data Values Truncate the whisker at the fences and display unusual values and outliers Inner Outer Fence Fence as dots. Unusual Outliers 4B-36 4B-36 Based on these fences, there are three unusual P/E values and two outliers. 18
19 Percentiles and Quartiles Midhinge The average of the first and third quartiles. Q Midhinge = 1 Q 3 2 4B-37 The name midhinge derives from the idea that, if the box were folded in half, it would resemble a hinge.. Box Plots Whiskers Box Center of Box is Midhinge Q 1 Q 3 Minimum Median (Q 2 ) Right-skewed Maximum 4B-38 19
20 Correlation Correlation Coefficient The sample correlation coefficient is a statistic that describes the degree of linearity between paired observations on two quantitative variables X and Y. 4B-39 Correlation Correlation Coefficient Its range is -1 r +1. Excel s formula =CORREL(Xdata, Ydata) 4B-40 20
21 Correlation Correlation Coefficient Illustration of Correlation Coefficients 4B-41 Correlation What is the nature of the relationship between square feet of shopping area and sales that is implied by the following correlation? 4B-42 21
22 Grouped Data Nature of Grouped Data Although some information is lost, grouped data are easier to display than raw data. When bin limits are given, the mean and standard deviation can be estimated. Accuracy of grouped estimates depend on - the number of bins - distribution of data within bins - bin frequencies 4B-43 Grouped Data Mean and Standard Deviation Consider the frequency distribution for prices of Lipitor for three cities: 4B-44 Where m j = class midpoint f j = class frequency k = number of classes n = sample size 22
23 Grouped Data Nature of Grouped Data Estimate the mean and standard deviation by k f jm j x n 47 j 1 j 1 2 k f j () m j x s n Note: don t round off too soon. 4B-45 4B-46 Grouped Data Nature of Grouped Data Now estimate the coefficient of variation CV = 100 (s / x ) = 100 ( / ) = 9.2% Accuracy Issues How accurate are grouped estimates compared to ungrouped estimates? For the previous example, we can compare the grouped data statistics to the ungrouped data statistics. 23
24 Grouped Data Accuracy Issues Accuracy tends to improve as the number of bins increases. If the first or last class is open-ended, there will be no class midpoint (no mean can be estimated). Assume a lower limit of zero for the first class when the data are nonnegative. You may be able to assume an upper limit for some variables (e.g., age). Median and quartiles may be estimated even with open-ended classes. 4B-47 Skewness Skewness and Kurtosis Generally, skewness may be indicated by looking at the sample histogram or by comparing the mean and median. 4B-48 This visual indicator is imprecise and does not take into consideration sample size n. 24
25 Skewness and Kurtosis Skewness Skewness is a unit-free statistic. The coefficient compares two samples measured in different units or one sample with a known reference distribution (e.g., symmetric normal distribution). Calculate the sample s skewness coefficient as: 3 n n xi x Skewness = ( n 1)( n 2) i 1 s 4B-49 Skewness and Kurtosis Skewness In Excel, go to Tools Data Analysis Descriptive Statistics or use the function =SKEW(array) 4B-50 25
26 Skewness and Kurtosis Skewness Consider the following table showing the 90% range for the sample skewness coefficient. 4B-51 Skewness Skewness and Kurtosis Coefficients within the 90% range may be attributed to random variation. 4B-52 26
27 Skewness and Kurtosis Skewness (Figure 4.36) Coefficients outside the range suggest the sample came from a nonnormal population. 4B-53 Skewness Skewness and Kurtosis As n increases, the range of chance variation narrows. 4B-54 27
28 Skewness and Kurtosis Kurtosis Kurtosis is the relative length of the tails and the degree of concentration in the center. Consider three kurtosis prototype shapes. Heavier tails 4B-55 Kurtosis Skewness and Kurtosis A histogram is an unreliable guide to kurtosis since scale and axis proportions may differ. Excel and MINITAB calculate kurtosis as: Kurtosis = 4 2 n( n 1) n xi x 3( n 1) ( n 1)( n 2)( n 3)( i 1 2)( s 3) n n 4B-56 28
29 Skewness and Kurtosis Kurtosis Consider the following table of expected 90% range for sample kurtosis coefficient. 4B-57 Kurtosis Skewness and Kurtosis A sample coefficient within the ranges may be attributed to chance variation. 4B-58 29
30 Skewness and Kurtosis Kurtosis Coefficients outside the range would suggest the sample differs from a normal population. 4B-59 Kurtosis Skewness and Kurtosis As sample size increases, the chance range narrows. 4B-60 Inferences about kurtosis are risky for n <
31 Applied Statistics in Business & Economics End of Chapter 4B 4B-61 31
Numerical Descriptions of Data
Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =
More informationSimple Descriptive Statistics
Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency
More informationChapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.
-3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data
More information3.1 Measures of Central Tendency
3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent
More informationApplications of Data Dispersions
1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given
More informationDescribing Data: One Quantitative Variable
STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive
More informationChapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1
Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and
More informationappstats5.notebook September 07, 2016 Chapter 5
Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.
More informationHandout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25
Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example
More informationDot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.
Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,
More informationDATA SUMMARIZATION AND VISUALIZATION
APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296
More information1 Describing Distributions with numbers
1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write
More informationMEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,
MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, CONTENTS What is measures of dispersion? Why measures of dispersion? How measures of dispersions are calculated? Range Quartile
More informationBasic Procedure for Histograms
Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that
More informationMeasures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean
Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values
More informationCHAPTER 2 Describing Data: Numerical
CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow
More informationMath 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment
Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class
More informationEmpirical Rule (P148)
Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall
More informationSection3-2: Measures of Center
Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number
More informationDescriptive Statistics
Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations
More informationSome Characteristics of Data
Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key
More informationUnit 2 Statistics of One Variable
Unit 2 Statistics of One Variable Day 6 Summarizing Quantitative Data Summarizing Quantitative Data We have discussed how to display quantitative data in a histogram It is useful to be able to describe
More informationECON 214 Elements of Statistics for Economists
ECON 214 Elements of Statistics for Economists Session 3 Presentation of Data: Numerical Summary Measures Part 2 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh
More informationNOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS
NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows
More informationA LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]
1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders
More informationCopyright 2005 Pearson Education, Inc. Slide 6-1
Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is
More informationOverview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution
PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations
More informationWe will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed.
We will discuss the normal distribution in greater detail in our unit on probability. However, as it is often of use to use exploratory data analysis to determine if the sample seems reasonably normally
More informationDescription of Data I
Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret
More information4. DESCRIPTIVE STATISTICS
4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in
More information2 Exploring Univariate Data
2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting
More informationFrequency Distribution and Summary Statistics
Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary
More informationPercentiles, STATA, Box Plots, Standardizing, and Other Transformations
Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go
More informationWeek 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.
Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.
More informationSTAT 113 Variability
STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2
More information9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives
Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical
More informationStat 101 Exam 1 - Embers Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.
More informationChapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1
Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.
More informationCHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =
Solutions to End-of-Section and Chapter Review Problems 225 CHAPTER 6 6.1 (a) P(Z < 1.20) = 0.88493 P(Z > 1.25) = 1 0.89435 = 0.10565 P(1.25 < Z < 1.70) = 0.95543 0.89435 = 0.06108 (d) P(Z < 1.25) or Z
More informationMonte Carlo Simulation (Random Number Generation)
Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...
More informationMath146 - Chapter 3 Handouts. The Greek Alphabet. Source: Page 1 of 39
Source: www.mathwords.com The Greek Alphabet Page 1 of 39 Some Miscellaneous Tips on Calculations Examples: Round to the nearest thousandth 0.92431 0.75693 CAUTION! Do not truncate numbers! Example: 1
More informationGOALS. Describing Data: Displaying and Exploring Data. Dot Plots - Examples. Dot Plots. Dot Plot Minitab Example. Stem-and-Leaf.
Describing Data: Displaying and Exploring Data Chapter 4 GOALS 1. Develop and interpret a dot plot.. Develop and interpret a stem-and-leaf display. 3. Compute and understand quartiles, deciles, and percentiles.
More informationQuantitative Methods for Economics, Finance and Management (A86050 F86050)
Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge
More informationProbability & Statistics Modular Learning Exercises
Probability & Statistics Modular Learning Exercises About The Actuarial Foundation The Actuarial Foundation, a 501(c)(3) nonprofit organization, develops, funds and executes education, scholarship and
More informationMeasures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms
Measures of Central Tendency: Ungrouped Data Measures of central tendency yield information about particular places or locations in a group of numbers. Common Measures of Location Mode Median Percentiles
More informationSTAT 157 HW1 Solutions
STAT 157 HW1 Solutions http://www.stat.ucla.edu/~dinov/courses_students.dir/10/spring/stats157.dir/ Problem 1. 1.a: (6 points) Determine the Relative Frequency and the Cumulative Relative Frequency (fill
More informationNumerical Measurements
El-Shorouk Academy Acad. Year : 2013 / 2014 Higher Institute for Computer & Information Technology Term : Second Year : Second Department of Computer Science Statistics & Probabilities Section # 3 umerical
More informationDescriptive Statistics
Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs
More informationMini-Lecture 3.1 Measures of Central Tendency
Mini-Lecture 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data 3. Explain what it means for a
More informationAP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE
AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,
More informationChapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)
Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop
More informationBoth the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.
Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of
More informationExploring Data and Graphics
Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data
More informationTerms & Characteristics
NORMAL CURVE Knowledge that a variable is distributed normally can be helpful in drawing inferences as to how frequently certain observations are likely to occur. NORMAL CURVE A Normal distribution: Distribution
More information3.3-Measures of Variation
3.3-Measures of Variation Variation: Variation is a measure of the spread or dispersion of a set of data from its center. Common methods of measuring variation include: 1. Range. Standard Deviation 3.
More informationBiostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras
Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions
More informationSOLUTIONS TO THE LAB 1 ASSIGNMENT
SOLUTIONS TO THE LAB 1 ASSIGNMENT Question 1 Excel produces the following histogram of pull strengths for the 100 resistors: 2 20 Histogram of Pull Strengths (lb) Frequency 1 10 0 9 61 63 6 67 69 71 73
More informationPopulation Mean GOALS. Characteristics of the Mean. EXAMPLE Population Mean. Parameter Versus Statistics. Describing Data: Numerical Measures
GOALS Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright 010 by The McGraw-Hill Companies, Inc. All rights reserved. 3-1. Calculate the arithmetic mean, weighted mean, median, mode,
More informationLecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution
More informationNCSS Statistical Software. Reference Intervals
Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and
More informationDescriptive Analysis
Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable
More informationThe Normal Distribution
Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we
More informationEstimation and Confidence Intervals
Estimation and Confidence Intervals Chapter 9-1/2 McGraw-Hill/Irwin Copyright 2011 by the McGraw-Hill Companies, Inc. All rights reserved. LEARNING OBJECTIVES LO1. Define a point estimate. LO2. Define
More informationMeasures of Central Tendency Lecture 5 22 February 2006 R. Ryznar
Measures of Central Tendency 11.220 Lecture 5 22 February 2006 R. Ryznar Today s Content Wrap-up from yesterday Frequency Distributions The Mean, Median and Mode Levels of Measurement and Measures of Central
More informationE.D.A. Exploratory Data Analysis E.D.A. Steps for E.D.A. Greg C Elvers, Ph.D.
E.D.A. Greg C Elvers, Ph.D. 1 Exploratory Data Analysis One of the most important steps in analyzing data is to look at the raw data This allows you to: find observations that may be incorrect quickly
More informationMATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1)
LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) Descriptive statistics are ways of summarizing large sets of quantitative (numerical) information. The best way to reduce a set of
More information2 DESCRIPTIVE STATISTICS
Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled
More informationModule Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION
Subject Paper No and Title Module No and Title Paper No.2: QUANTITATIVE METHODS Module No.7: NORMAL DISTRIBUTION Module Tag PSY_P2_M 7 TABLE OF CONTENTS 1. Learning Outcomes 2. Introduction 3. Properties
More informationDescribing Data: Displaying and Exploring Data
Describing Data: Displaying and Exploring Data Chapter 4 McGraw-Hill/Irwin Copyright 2011 by the McGraw-Hill Companies, Inc. All rights reserved. LEARNING OBJECTIVES LO1. Develop and interpret a dot plot.
More information1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form:
1 Exercise One Note that the data is not grouped! 1.1 Calculate the mean ROI Below you find the raw data in tabular form: Obs Data 1 18.5 2 18.6 3 17.4 4 12.2 5 19.7 6 5.6 7 7.7 8 9.8 9 19.9 10 9.9 11
More informationThe normal distribution is a theoretical model derived mathematically and not empirically.
Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.
More informationMEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION
MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION 1 Day 3 Summer 2017.07.31 DISTRIBUTION Symmetry Modality 单峰, 双峰 Skewness 正偏或负偏 Kurtosis 2 3 CHAPTER 4 Measures of Central Tendency 集中趋势
More informationData that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.
Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer
More informationChapter 4-Describing Data: Displaying and Exploring Data
Chapter 4-Describing Data: Displaying and Exploring Data Jie Zhang, Ph.D. Student Account and Information Systems Department College of Business Administration The University of Texas at El Paso jzhang6@utep.edu
More informationDescriptive Statistics (Devore Chapter One)
Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf
More informationMeasures of Dispersion (Range, standard deviation, standard error) Introduction
Measures of Dispersion (Range, standard deviation, standard error) Introduction We have already learnt that frequency distribution table gives a rough idea of the distribution of the variables in a sample
More informationSTAB22 section 1.3 and Chapter 1 exercises
STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea
More informationSUMMARY STATISTICS EXAMPLES AND ACTIVITIES
Session 6 SUMMARY STATISTICS EXAMPLES AD ACTIVITIES Example 1.1 Expand the following: 1. X 2. 2 6 5 X 3. X 2 4 3 4 4. X 4 2 Solution 1. 2 3 2 X X X... X 2. 6 4 X X X X 4 5 6 5 3. X 2 X 3 2 X 4 2 X 5 2
More informationFundamentals of Statistics
CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct
More informationSome estimates of the height of the podium
Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40
More informationBasic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract
Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, 2013 Abstract Review summary statistics and measures of location. Discuss the placement exam as an exercise
More informationReview. What is the probability of throwing two 6s in a row with a fair die? a) b) c) d) 0.333
Review In most card games cards are dealt without replacement. What is the probability of being dealt an ace and then a 3? Choose the closest answer. a) 0.0045 b) 0.0059 c) 0.0060 d) 0.1553 Review What
More informationChapter 4-Describing Data: Displaying and Exploring Data
Chapter 4-Describing Data: Displaying and Exploring Data Jie Zhang, Ph.D. Student Account and Information Systems Department College of Business Administration The University of Texas at El Paso jzhang6@utep.edu
More informationCenter and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.
Chapter 3 Section3-: Measures of Center Section 3-3: Measurers of Variation Section 3-4: Measures of Relative Standing Section 3-5: Exploratory Data Analysis Describing Distributions with Numbers The overall
More informationDESCRIPTIVE STATISTICS
DESCRIPTIVE STATISTICS INTRODUCTION Numbers and quantification offer us a very special language which enables us to express ourselves in exact terms. This language is called Mathematics. We will now learn
More informationSTATISTICAL DISTRIBUTIONS AND THE CALCULATOR
STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either
More informationKey Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions
SGSB Workshop: Using Statistical Data to Make Decisions Module 2: The Logic of Statistical Inference Dr. Tom Ilvento January 2006 Dr. Mugdim Pašić Key Objectives Understand the logic of statistical inference
More informationLecture 2 Describing Data
Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms
More informationThe Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).
We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center
More informationUNIT 4 NORMAL DISTRIBUTION: DEFINITION, CHARACTERISTICS AND PROPERTIES
f UNIT 4 NORMAL DISTRIBUTION: DEFINITION, CHARACTERISTICS AND PROPERTIES Normal Distribution: Definition, Characteristics and Properties Structure 4.1 Introduction 4.2 Objectives 4.3 Definitions of Probability
More informationToday s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation.
1 Today s plan: Section 4.1.4: Dispersion: Five-Number summary and Standard Deviation. 2 Once we know the central location of a data set, we want to know how close things are to the center. 2 Once we know
More informationstarting on 5/1/1953 up until 2/1/2017.
An Actuary s Guide to Financial Applications: Examples with EViews By William Bourgeois An actuary is a business professional who uses statistics to determine and analyze risks for companies. In this guide,
More informationKING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section
KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 11: BUSINESS STATISTICS I Semester 04 Major Exam #1 Sunday March 7, 005 Please circle your instructor
More information12/1/2017. Chapter. Copyright 2009 by The McGraw-Hill Companies, Inc. 8B-2
Sampling Distributions and Estimation (Part ) 8 Chapter Proportion C.I. for the Difference of Two s, m 1 -m C.I. for the Difference of Two Proportions, p 1 -p Population Variance, s McGraw-Hill/Irwin Copyright
More informationDavid Tenenbaum GEOG 090 UNC-CH Spring 2005
Simple Descriptive Statistics Review and Examples You will likely make use of all three measures of central tendency (mode, median, and mean), as well as some key measures of dispersion (standard deviation,
More informationEngineering Mathematics III. Moments
Moments Mean and median Mean value (centre of gravity) f(x) x f (x) x dx Median value (50th percentile) F(x med ) 1 2 P(x x med ) P(x x med ) 1 0 F(x) x med 1/2 x x Variance and standard deviation
More informationMeasures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence
Measures of Variation Section -5 1 Waiting Times of Bank Customers at Different Banks in minutes Jefferson Valley Bank 6.5 6.6 6.7 6.8 7.1 7.3 7.4 Bank of Providence 4. 5.4 5.8 6. 6.7 8.5 9.3 10.0 Mean
More informationKARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI
88 P a g e B S ( B B A ) S y l l a b u s KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI Course Title : STATISTICS Course Number : BA(BS) 532 Credit Hours : 03 Course 1. Statistical
More informationDATA HANDLING Five-Number Summary
DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest
More information