Chapter 3 Student Lecture Notes 3-1

Similar documents
Mode is the value which occurs most frequency. The mode may not exist, and even if it does, it may not be unique.

Chapter 3 Descriptive Statistics: Numerical Measures Part B

UNIVERSITY OF VICTORIA Midterm June 6, 2018 Solutions

Measures of Spread IQR and Deviation. For exam X, calculate the mean, median and mode. For exam Y, calculate the mean, median and mode.

Chapter 5 Student Lecture Notes 5-1

Tests for Two Correlations

The Institute of Chartered Accountants of Sri Lanka

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers

MgtOp 215 Chapter 13 Dr. Ahn

02_EBA2eSolutionsChapter2.pdf 02_EBA2e Case Soln Chapter2.pdf

Tests for Two Ordered Categorical Variables

PhysicsAndMathsTutor.com

OCR Statistics 1 Working with data. Section 2: Measures of location

Random Variables. b 2.

ISyE 512 Chapter 9. CUSUM and EWMA Control Charts. Instructor: Prof. Kaibo Liu. Department of Industrial and Systems Engineering UW-Madison

Elton, Gruber, Brown and Goetzmann. Modern Portfolio Theory and Investment Analysis, 7th Edition. Solutions to Text Problems: Chapter 4

Capability Analysis. Chapter 255. Introduction. Capability Analysis

Evaluating Performance

Elton, Gruber, Brown, and Goetzmann. Modern Portfolio Theory and Investment Analysis, 7th Edition. Solutions to Text Problems: Chapter 9

Midterm Exam. Use the end of month price data for the S&P 500 index in the table below to answer the following questions.

Which of the following provides the most reasonable approximation to the least squares regression line? (a) y=50+10x (b) Y=50+x (d) Y=1+50x

Linear Combinations of Random Variables and Sampling (100 points)

Analysis of Variance and Design of Experiments-II

= 1. UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Parameters and Statistics. Measures of Centrality

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session STS041) p The Max-CUSUM Chart

4. Greek Letters, Value-at-Risk

Numerical Descriptions of Data

Elements of Economic Analysis II Lecture VI: Industry Supply

Standardization. Stan Becker, PhD Bloomberg School of Public Health

Alternatives to Shewhart Charts

OPERATIONS RESEARCH. Game Theory

3: Central Limit Theorem, Systematic Errors

Midterm Version 2 Solutions

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

Appendix - Normally Distributed Admissible Choices are Optimal

Random Variables. 8.1 What is a Random Variable? Announcements: Chapter 8

Probability Distributions. Statistics and Quantitative Analysis U4320. Probability Distributions(cont.) Probability

occurrence of a larger storm than our culvert or bridge is barely capable of handling? (what is The main question is: What is the possibility of

- Inferential: methods using sample results to infer conclusions about a larger pop n.

PASS Sample Size Software. :log

TCOM501 Networking: Theory & Fundamentals Final Examination Professor Yannis A. Korilis April 26, 2002

Scribe: Chris Berlind Date: Feb 1, 2010

Physics 4A. Error Analysis or Experimental Uncertainty. Error

Principles of Finance

Using Conditional Heteroskedastic

Data Mining Linear and Logistic Regression

AS MATHEMATICS HOMEWORK S1

CHAPTER 9 FUNCTIONAL FORMS OF REGRESSION MODELS

15-451/651: Design & Analysis of Algorithms January 22, 2019 Lecture #3: Amortized Analysis last changed: January 18, 2019

The Integration of the Israel Labour Force Survey with the National Insurance File

Financial Risk Management in Portfolio Optimization with Lower Partial Moment

A Bootstrap Confidence Limit for Process Capability Indices

/ Computational Genomics. Normalization

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY

Creating a zero coupon curve by bootstrapping with cubic splines.

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 10 Making Choices: The Method, MARR, and Multiple Attributes

EDC Introduction

An Application of Alternative Weighting Matrix Collapsing Approaches for Improving Sample Estimates

Comparisons of Gene Expression Indexes for Oligonucleotide Arrays

3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. Truncated standard normal distribution for a = 0.5, 0, and 0.5. CDS Mphil Econometrics

Multifactor Term Structure Models

Notes are not permitted in this examination. Do not turn over until you are told to do so by the Invigilator.

Survey of Math Test #3 Practice Questions Page 1 of 5

Simple Regression Theory II 2010 Samuel L. Baker

Solution of periodic review inventory model with general constrains

Mutual Funds and Management Styles. Active Portfolio Management

Notes on experimental uncertainties and their propagation

Spatial Variations in Covariates on Marriage and Marital Fertility: Geographically Weighted Regression Analyses in Japan

Lecture 7. We now use Brouwer s fixed point theorem to prove Nash s theorem.

Homework 9: due Monday, 27 October, 2008

AMS Financial Derivatives I

Introduction. Why One-Pass Statistics?

Risk and Return: The Security Markets Line

Understanding Annuities. Some Algebraic Terminology.

Sampling Distributions of OLS Estimators of β 0 and β 1. Monte Carlo Simulations

ECE 586GT: Problem Set 2: Problems and Solutions Uniqueness of Nash equilibria, zero sum games, evolutionary dynamics

Real Exchange Rate Fluctuations, Wage Stickiness and Markup Adjustments

FM303. CHAPTERS COVERED : CHAPTERS 5, 8 and 9. LEARNER GUIDE : UNITS 1, 2 and 3.1 to 3.3. DUE DATE : 3:00 p.m. 19 MARCH 2013

Fourth report on the consistency of risk weighted assets

CrimeStat Version 3.3 Update Notes:

σ may be counterbalanced by a larger

A Comparison of Risk Return Relationship in the Portfolio Selection Models

Available online: 20 Dec 2011

Graphical Methods for Survival Distribution Fitting

REGIONAL INCOMES STRUCTURE ANALYSIS IN SLOVAK REPUBLIC ON THE BASIS OF EU-SILC DATA

THIRD MIDTERM EXAM EC26102: MONEY, BANKING AND FINANCIAL MARKETS MARCH 24, 2004

The Effects of Industrial Structure Change on Economic Growth in China Based on LMDI Decomposition Approach

The Mack-Method and Analysis of Variability. Erasmus Gerigk

Prospect Theory and Asset Prices

Weights in CPI/HICP and in seasonally adjusted series

Parallel Prefix addition

Numerical Analysis ECIV 3306 Chapter 6

Introduction. Chapter 7 - An Introduction to Portfolio Management

A Test of Normality. Textbook Reference: Chapter 14.2 (eighth edition, pages 591 3; seventh edition, pages 624 6).

Lecture 6 Foundations of Finance. Lecture 6: The Intertemporal CAPM (ICAPM): A Multifactor Model and Empirical Evidence

Information Hiding in Image using Combined Approach of Pixel Mapping Method and Pixel Value Differencing Method

Likelihood Fits. Craig Blocker Brandeis August 23, 2004

ISyE 2030 Summer Semester 2004 June 30, 2004

Final Exam. 7. (10 points) Please state whether each of the following statements is true or false. No explanation needed.

Transcription:

Chapter 3 Student Lecture otes 3-1 Busness Statstcs: A Decson-Makng Approach 6 th Edton Chapter 3 Descrbng Data Usng umercal Measures 005 Prentce-Hall, Inc. Chap 3-1 Chapter Goals After completng ths chapter, you should be able to: Compute and nterpret the mean, medan, and mode for a set of data Compute the range, varance, and standard devaton and know what these values mean Construct and nterpret a box and whskers plot Compute and explan the coeffcent of varaton and z scores Use numercal measures along wth graphs, charts, and tables to descrbe data 005 Prentce-Hall, Inc. Chap 3- Chapter Topcs Measures of Center and Locaton Mean, medan, mode, geometrc mean, mdrange Other measures of Locaton Weghted mean, percentles, quartles Measures of Varaton Range, nterquartle range, varance and standard devaton, coeffcent of varaton 005 Prentce-Hall, Inc. Chap 3-3 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3- Summary Measures Descrbng Data umercally Center and Locaton Mean Medan Mode Weghted Mean Other Measures of Locaton Percentles Quartles Varaton Range Interquartle Range Varance Standard Devaton Coeffcent of Varaton 005 Prentce-Hall, Inc. Chap 3-4 Measures of Center and Locaton Overvew Center and Locaton x = µ = Mean Medan Mode Weghted Mean n n x = 1 x = 1 005 Prentce-Hall, Inc. Chap 3-5 X µ W W = = w x w w w x Mean (Arthmetc Average) The Mean s the arthmetc average of data values Sample mean n = Sample Sze n x = 1 x1 + x + L + xn x = = n n Populaton mean µ = = Populaton Sze x x1 + x + = = 1 L + x 005 Prentce-Hall, Inc. Chap 3-6 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-3 Mean (Arthmetc Average) The most common measure of central tendency (contnued) Mean = sum of values dvded by the number of values Affected by extreme values (outlers) 0 1 3 4 5 6 7 8 9 10 0 1 3 4 5 6 7 8 9 10 Mean = 3 1+ + 3 + 4 + 5 15 = = 3 5 5 Mean = 4 1+ + 3 + 4 + 10 0 = = 4 5 5 005 Prentce-Hall, Inc. Chap 3-7 Medan ot affected by extreme values 0 1 3 4 5 6 7 8 9 10 0 1 3 4 5 6 7 8 9 10 Medan = 3 Medan = 3 In an ordered array, the medan s the mddle number If n or s odd, the medan s the mddle number If n or s even, the medan s the average of the two mddle numbers 005 Prentce-Hall, Inc. Chap 3-8 Mode A measure of central tendency Value that occurs most often ot affected by extreme values Used for ether numercal or categorcal data There may may be no mode There may be several modes 0 1 3 4 5 6 7 8 9 10 11 1 13 14 Mode = 5 0 1 3 4 5 6 o Mode 005 Prentce-Hall, Inc. Chap 3-9 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-4 Weghted Mean Used when values are grouped by frequency or relatve mportance Example: Sample of 6 Repar Projects Days to Frequency Complete 5 4 6 1 7 8 8 Weghted Mean Days to Complete: wx (4 5) + (1 6) + (8 7) + ( 8) XW = = w 4 + 1 + 8 + 164 = = 6 6.31 days 005 Prentce-Hall, Inc. Chap 3-10 Revew Example Fve houses on a hll by the beach House Prces: $,000 K $,000,000 500,000 300,000 100,000 100,000 $300 K $500 K $100 K $100 K 005 Prentce-Hall, Inc. Chap 3-11 Summary Statstcs House Prces: $,000,000 500,000 300,000 100,000 100,000 Sum 3,000,000 Mean: ($3,000,000/5) = $600,000 Medan: mddle value of ranked data = $300,000 Mode: most frequent value = $100,000 005 Prentce-Hall, Inc. Chap 3-1 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-5 Whch measure of locaton s the best? Mean s generally used, unless extreme values (outlers) exst Then medan s often used, snce the medan s not senstve to extreme values. Example: Medan home prces may be reported for a regon less senstve to outlers 005 Prentce-Hall, Inc. Chap 3-13 Shape of a Dstrbuton Descrbes how data s dstrbuted Symmetrc or skewed Left-Skewed Symmetrc Rght-Skewed Mean < Medan < Mode (Longer tal extends to left) Mean = Medan = Mode Mode < Medan < Mean (Longer tal extends to rght) 005 Prentce-Hall, Inc. Chap 3-14 Other Locaton Measures Other Measures of Locaton Percentles Quartles The p th percentle n a data array: p% are less than or equal to ths value (100 p)% are greater than or equal to ths value (where 0 p 100) 1 st quartle = 5 th percentle nd quartle = 50 th percentle = medan 3 rd quartle = 75 th percentle 005 Prentce-Hall, Inc. Chap 3-15 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-6 Percentles The p th percentle n an ordered array of n values s the value n th poston, where p = (n + 1) 100 Example: The 60 th percentle n an ordered array of 19 values s the value n 1 th poston: p 60 = (n + 1) = (19 + 1) = 1 100 100 005 Prentce-Hall, Inc. Chap 3-16 Quartles Quartles splt the ranked data nto 4 equal groups 5% 5% 5% 5% Q1 Q Q3 Example: Fnd the frst quartle Sample Data n Ordered Array: 11 1 13 16 16 17 18 1 (n = 9) Q1 = 5 th percentle, so fnd the 5 (9+1) =.5 poston 100 so use the value half way between the nd and 3 rd values, so Q1 = 1.5 005 Prentce-Hall, Inc. Chap 3-17 Box and Whsker Plot A Graphcal dsplay of data usng 5-number summary: Mnmum -- Q1 -- Medan -- Q3 -- Maxmum Example: 5% 5% 5% 5% Mnmum 1st Medan 3rd Maxmum Mnmum Quartle 1st Medan 3rd Quartle Maxmum Quartle Quartle 005 Prentce-Hall, Inc. Chap 3-18 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-7 Shape of Box and Whsker Plots The Box and central lne are centered between the endponts f data s symmetrc around the medan A Box and Whsker plot can be shown n ether vertcal or horzontal format 005 Prentce-Hall, Inc. Chap 3-19 Dstrbuton Shape and Box and Whsker Plot Left-Skewed Symmetrc Rght-Skewed Q1 Q Q3 Q1 Q Q3 Q1 Q Q3 005 Prentce-Hall, Inc. Chap 3-0 Box-and-Whsker Plot Example Below s a Box-and-Whsker plot for the followng data: Mn Q1 Q Q3 Max 0 3 3 4 5 5 10 7 0 3 5 7 Ths data s very rght skewed, as the plot depcts 005 Prentce-Hall, Inc. Chap 3-1 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-8 Measures of Varaton Varaton Range Interquartle Range Varance Standard Devaton Coeffcent of Varaton Populaton Varance Populaton Standard Devaton Sample Varance Sample Standard Devaton 005 Prentce-Hall, Inc. Chap 3- Varaton Measures of varaton gve nformaton on the spread or varablty of the data values. Same center, dfferent varaton 005 Prentce-Hall, Inc. Chap 3-3 Range Smplest measure of varaton Dfference between the largest and the smallest observatons: Range = x maxmum x mnmum Example: 0 1 3 4 5 6 7 8 9 10 11 1 13 14 Range = 14-1 = 13 005 Prentce-Hall, Inc. Chap 3-4 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-9 Dsadvantages of the Range Ignores the way n whch data are dstrbuted 7 8 9 10 11 1 Range = 1-7 = 5 7 8 9 10 11 1 Range = 1-7 = 5 Senstve to outlers 1,1,1,1,1,1,1,1,1,1,1,,,,,,,,,3,3,3,3,4,5 Range = 5-1 = 4 1,1,1,1,1,1,1,1,1,1,1,,,,,,,,,3,3,3,3,4,10 Range = 10-1 = 119 005 Prentce-Hall, Inc. Chap 3-5 Interquartle Range Can elmnate some outler problems by usng the nterquartle range Elmnate some hgh-and low-valued observatons and calculate the range from the remanng values. Interquartle range = 3 rd quartle 1 st quartle 005 Prentce-Hall, Inc. Chap 3-6 Interquartle Range Example: X mnmum Q1 Medan (Q) Q3 X maxmum 5% 5% 5% 5% 1 30 45 57 70 Interquartle range = 57 30 = 7 005 Prentce-Hall, Inc. Chap 3-7 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-10 Varance Average of squared devatons of values from the mean Sample varance: n (x x) = 1 s = n -1 Populaton varance: (x µ) 005 Prentce-Hall, Inc. Chap 3-8 σ = = 1 Standard Devaton Most commonly used measure of varaton Shows varaton about the mean Has the same unts as the orgnal data Sample standard devaton: s = n = 1 (x x) n -1 Populaton standard devaton: σ = (x µ) 005 Prentce-Hall, Inc. Chap 3-9 = 1 Calculaton Example: Sample Standard Devaton Sample Data (X ) : 10 1 14 15 17 18 18 4 n = 8 Mean = x = 16 s = = (10 x ) + (1 x ) + (14 x ) + L + (4 x ) n 1 (10 16) + (1 16) + (14 16) 8 1 + L + (4 16) = 16 7 = 4.46 005 Prentce-Hall, Inc. Chap 3-30 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-11 Comparng Standard Devatons Data A 11 1 13 14 15 16 17 18 19 0 1 Data B 11 1 13 14 15 16 17 18 19 0 1 Data C 11 1 13 14 15 16 17 18 19 0 1 Mean = 15.5 s = 3.338 Mean = 15.5 s =.958 Mean = 15.5 s = 4.57 005 Prentce-Hall, Inc. Chap 3-31 Coeffcent of Varaton Measures relatve varaton Always n percentage (%) Shows varaton relatve to mean Is used to compare two or more sets of data measured n dfferent unts CV Populaton = σ µ 100% CV Sample s = 100% x 005 Prentce-Hall, Inc. Chap 3-3 Stock A: Comparng Coeffcent of Varaton Average prce last year = $50 Standard devaton = $5 s $5 CV A = 100% = 100% = 10% x $50 Stock B: Average prce last year = $100 Standard devaton = $5 s $5 CV B = 100% = 100% = 5% x $100 Both stocks have the same standard devaton, but stock B s less varable relatve to ts prce 005 Prentce-Hall, Inc. Chap 3-33 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-1 The Emprcal Rule If the data dstrbuton s bell-shaped, then the nterval: µ ± 1σ contans about 68% of the values n the populaton or the sample X 68% µ µ ±1σ 005 Prentce-Hall, Inc. Chap 3-34 The Emprcal Rule µ ± σ contans about 95% of the values n the populaton or the sample µ ± 3σ contans about 99.7% of the values n the populaton or the sample 95% µ ± σ 99.7% µ ± 3σ 005 Prentce-Hall, Inc. Chap 3-35 Tchebysheff s Theorem Regardless of how the data are dstrbuted, at least (1-1/k ) of the values wll fall wthn k standard devatons of the mean Examples: At least wthn (1-1/1 ) = 0%... k=1 (µ ± 1σ) (1-1/ ) = 75%... k= (µ ± σ) (1-1/3 ) = 89%. k=3 (µ ± 3σ) 005 Prentce-Hall, Inc. Chap 3-36 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-13 Standardzed Data Values A standardzed data value refers to the number of standard devatons a value s from the mean Standardzed data values are sometmes referred to as z-scores 005 Prentce-Hall, Inc. Chap 3-37 Standardzed Populaton Values where: x µ z = σ x = orgnal data value µ = populaton mean σ = populaton standard devaton z = standard score (number of standard devatons x s from µ) 005 Prentce-Hall, Inc. Chap 3-38 Standardzed Sample Values where: x x z = s x = orgnal data value x = sample mean s = sample standard devaton z = standard score (number of standard devatons x s from µ) 005 Prentce-Hall, Inc. Chap 3-39 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-14 Usng Mcrosoft Excel Descrptve Statstcs are easy to obtan from Mcrosoft Excel Use menu choce: tools / data analyss / descrptve statstcs Enter detals n dalog box 005 Prentce-Hall, Inc. Chap 3-40 Usng Excel Use menu choce: tools / data analyss / descrptve statstcs 005 Prentce-Hall, Inc. Chap 3-41 Usng Excel (contnued) Enter dalog box detals Check box for summary statstcs Clck OK 005 Prentce-Hall, Inc. Chap 3-4 005 Prentce-Hall, Inc.

Chapter 3 Student Lecture otes 3-15 Excel output Mcrosoft Excel descrptve statstcs output, usng the house prce data: House Prces: $,000,000 500,000 300,000 100,000 100,000 005 Prentce-Hall, Inc. Chap 3-43 Chapter Summary Descrbed measures of center and locaton Mean, medan, mode, geometrc mean, mdrange Dscussed percentles and quartles Descrbed measure of varaton Range, nterquartle range, varance, standard devaton, coeffcent of varaton Created Box and Whsker Plots 005 Prentce-Hall, Inc. Chap 3-44 Chapter Summary (contnued) Illustrated dstrbuton shapes Symmetrc, skewed Dscussed Tchebysheff s Theorem Calculated standardzed data values 005 Prentce-Hall, Inc. Chap 3-45 005 Prentce-Hall, Inc.