Statistics for Economics & Business

Similar documents
Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

Sampling Distributions and Estimation

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

A point estimate is the value of a statistic that estimates the value of a parameter.

Chapter 8 Interval Estimation. Estimation Concepts. General Form of a Confidence Interval

Chapter 8: Estimation of Mean & Proportion. Introduction

CHAPTER 8 Estimating with Confidence

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

ii. Interval estimation:

Confidence Intervals Introduction

Estimating Proportions with Confidence

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty,

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

Topic-7. Large Sample Estimation

Statistics for Business and Economics

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

Math 124: Lecture for Week 10 of 17

Introduction to Probability and Statistics Chapter 7

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

Sampling Distributions and Estimation

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

Lecture 4: Probability (continued)

Lecture 5: Sampling Distribution

Chapter 10 - Lecture 2 The independent two sample t-test and. confidence interval

B = A x z

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

CHAPTER 8 CONFIDENCE INTERVALS

. (The calculated sample mean is symbolized by x.)

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ.

BASIC STATISTICS ECOE 1323

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval

The Idea of a Confidence Interval

1 Random Variables and Key Statistics

Lecture 5 Point Es/mator and Sampling Distribu/on

5. Best Unbiased Estimators

NOTES ON ESTIMATION AND CONFIDENCE INTERVALS. 1. Estimation

14.30 Introduction to Statistical Methods in Economics Spring 2009


Sampling Distributions & Estimators

Chapter 10 Statistical Inference About Means and Proportions With Two Populations. Learning objectives

18.S096 Problem Set 5 Fall 2013 Volatility Modeling Due Date: 10/29/2013

APPLICATION OF GEOMETRIC SEQUENCES AND SERIES: COMPOUND INTEREST AND ANNUITIES

5 Statistical Inference

Chpt 5. Discrete Probability Distributions. 5-3 Mean, Variance, Standard Deviation, and Expectation

Calculation of the Annual Equivalent Rate (AER)

Exam 2. Instructor: Cynthia Rudin TA: Dimitrios Bisias. October 25, 2011

STRAND: FINANCE. Unit 3 Loans and Mortgages TEXT. Contents. Section. 3.1 Annual Percentage Rate (APR) 3.2 APR for Repayment of Loans

ST 305: Exam 2 Fall 2014

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i

These characteristics are expressed in terms of statistical properties which are estimated from the sample data.

Outline. Populations. Defs: A (finite) population is a (finite) set P of elements e. A variable is a function v : P IR. Population and Characteristics

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions

Research Article The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean

Maximum Empirical Likelihood Estimation (MELE)

4.5 Generalized likelihood ratio test

SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME

Exam 1 Spring 2015 Statistics for Applications 3/5/2015

1. Find the area under the standard normal curve between z = 0 and z = 3. (a) (b) (c) (d)

AY Term 2 Mock Examination

Parametric Density Estimation: Maximum Likelihood Estimation

CAPITAL PROJECT SCREENING AND SELECTION

Monetary Economics: Problem Set #5 Solutions

1 Estimating the uncertainty attached to a sample mean: s 2 vs.

Unbiased estimators Estimators

Elementary Statistics and Inference. Elementary Statistics and Inference. Chapter 20 Chance Errors in Sampling (cont.) 22S:025 or 7P:025.

Models of Asset Pricing

Models of Asset Pricing

Models of Asset Pricing

The material in this chapter is motivated by Experiment 9.

Control Charts for Mean under Shrinkage Technique

We learned: $100 cash today is preferred over $100 a year from now


Subject CT1 Financial Mathematics Core Technical Syllabus

Online appendices from Counterparty Risk and Credit Value Adjustment a continuing challenge for global financial markets by Jon Gregory

CAPITAL ASSET PRICING MODEL

Variance and Standard Deviation (Tables) Lecture 10

A Bayesian perspective on estimating mean, variance, and standard-deviation from data

When you click on Unit V in your course, you will see a TO DO LIST to assist you in starting your course.

Section 3.3 Exercises Part A Simplify the following. 1. (3m 2 ) 5 2. x 7 x 11

ISBN Copyright 2015 The Continental Press, Inc.

APPLIED STATISTICS Complementary Course of BSc Mathematics - IV Semester CUCBCSS Admn onwards Question Bank

1 Estimating sensitivities

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices?

Introduction to Statistical Inference

Parameter Uncertainty in Loss Ratio Distributions and its Implications

1 The Power of Compounding

DESCRIPTION OF MATHEMATICAL MODELS USED IN RATING ACTIVITIES

Forecasting bad debt losses using clustering algorithms and Markov chains

Data Analysis and Statistical Methods Statistics 651

Optimizing of the Investment Structure of the Telecommunication Sector Company

Appendix 1 to Chapter 5

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge

Overlapping Generations

Review Procedures and Reporting by Peer Reviewer

Institute of Actuaries of India Subject CT5 General Insurance, Life and Health Contingencies

Limits of sequences. Contents 1. Introduction 2 2. Some notation for sequences The behaviour of infinite sequences 3

Transcription:

Statistics for Ecoomics & Busiess Cofidece Iterval Estimatio Learig Objectives I this chapter, you lear: To costruct ad iterpret cofidece iterval estimates for the mea ad the proportio How to determie the sample size ecessary to develop a cofidece iterval estimate for the mea or proportio How to use cofidece iterval estimates i auditig

Chapter Outlie Cofidece Itervals for the Populatio Mea, µ whe Populatio Stadard Deviatio σ is Kow whe Populatio Stadard Deviatio σ is Ukow Cofidece Itervals for the Populatio Proportio, π Determiig the Required Sample Size Poit ad Iterval Estimates A poit estimate is a sigle umber, a cofidece iterval provides additioal iformatio about the variability of the estimate Lower Cofidece Limit Poit Estimate Width of cofidece iterval Upper Cofidece Limit

Poit Estimates We ca estimate a Populatio Parameter Mea Proportio µ π with a Sample Statistic (a Poit Estimate) X p Cofidece Itervals How much ucertaity is associated with a poit estimate of a populatio parameter? A iterval estimate provides more iformatio about a populatio characteristic tha does a poit estimate. Such iterval estimates are called cofidece itervals.

Cofidece Iterval Estimate A iterval gives a rage of values: Takes ito cosideratio variatio i sample statistics from sample to sample Based o observatios from 1 sample Gives iformatio about closeess to ukow populatio parameters Stated i terms of level of cofidece e.g. 95% cofidet, 99% cofidet Ca ever be 100% cofidet Cofidece Iterval Example Cereal fill example Populatio has µ = 368 ad σ = 15. If you take a sample of size = 25 you kow 368 ± 1.96 * 15 / 25 = (362.12, 373.88) cotais 95% of the sample meas Whe you do t kow µ, you use X to estimate µ If X = 362.3 the iterval is 362.3 ± 1.96 * 15 / 25 = (356.42, 368.18) Sice 356.42 µ 368.18 the iterval based o this sample makes a correct statemet about µ. But what about the itervals from other possible samples of size 25?

Cofidece Iterval Example (cotiued) Sample # X Lower Limit Upper Limit Cotai µ? 1 362.30 356.42 368.18 Yes 2 369.50 363.62 375.38 Yes 3 360.00 354.12 365.88 No 4 362.12 356.24 368.00 Yes 5 373.88 368.00 379.76 Yes Cofidece Iterval Example (cotiued) I practice you oly take oe sample of size I practice you do ot kow µ so you do ot kow if the iterval actually cotais µ However, you do kow that 95% of the itervals formed i this maer will cotai µ Thus, based o the oe sample you actually selected, you ca be 95% cofidet your iterval will cotai µ (this is a 95% cofidece iterval) Note: 95% cofidece is based o the fact that we used Z = 1.96.

Estimatio Process Populatio (mea, µ, is ukow) Radom Sample Mea X = 50 I am 95% cofidet that µ is betwee 40 & 60. Sample Geeral Formula The geeral formula for all cofidece itervals is: Poit Estimate ± (Critical Value)(Stadard Error) Where: Poit Estimate is the sample statistic estimatig the populatio parameter of iterest Critical Value is a table value based o the samplig distributio of the poit estimate ad the desired cofidece level Stadard Error is the stadard deviatio of the poit estimate

Cofidece Level Cofidece Level Cofidece the iterval will cotai the ukow populatio parameter A percetage (less tha 100%) Cofidece Level, (1-α) Suppose cofidece level = 95% Also writte (1 - α) = 0.95, (so α = 0.05) A relative frequecy iterpretatio: 95% of all the cofidece itervals that ca be costructed will cotai the ukow true parameter A specific iterval either will cotai or will ot cotai the true parameter No probability ivolved i a specific iterval (cotiued)

Cofidece Itervals Cofidece Itervals Populatio Mea Populatio Proportio σ Kow σ Ukow Cofidece Iterval for µ (σ Kow) Assumptios Populatio stadard deviatio σ is kow Populatio is ormally distributed If populatio is ot ormal, use large sample Cofidece iterval estimate: X ± Zα/2 σ where X Z α/2 σ/ is the poit estimate is the ormal distributio critical value for a probability of α/2 i each tail is the stadard error

Fidig the Critical Value, Z α/2 Cosider a 95% cofidece iterval: 1 α = 0.95 so α = 0.05 Z α/2 = ± 1.96 α 2 = 0.025 α 2 = 0.025 Z uits: X uits: Z α/2 = -1.96 0 Z α/2 = 1.96 Lower Cofidece Limit Poit Estimate Upper Cofidece Limit Commo Levels of Cofidece Commoly used cofidece levels are 90%, 95%, ad 99% Cofidece Level 80% 90% 95% 98% 99% 99.8% 99.9% Cofidece Coefficiet, 1 α 0.80 0.90 0.95 0.98 0.99 0.998 0.999 Z α/2 value 1.28 1.645 1.96 2.33 2.58 3.08 3.27

Itervals ad Level of Cofidece Samplig Distributio of the Mea α /2 1 α α/2 Itervals exted from X Z α / 2 to X + Z α / 2 σ σ µ x = µ x 1 x 2 Cofidece Itervals x (1-α)x100% of itervals costructed cotai µ; (α)x100% do ot. Example A sample of 11 circuits from a large ormal populatio has a mea resistace of 2.20 ohms. We kow from past testig that the populatio stadard deviatio is 0.35 ohms. Determie a 95% cofidece iterval for the true mea resistace of the populatio.

Example (cotiued) A sample of 11 circuits from a large ormal populatio has a mea resistace of 2.20 ohms. We kow from past testig that the populatio stadard deviatio is 0.35 ohms. Solutio: 1.9932 σ X ± Zα/2 = 2.20 ± 1.96 (0.35/ = 2.20 ± 0.2068 µ 2.4068 11) Iterpretatio We are 95% cofidet that the true mea resistace is betwee 1.9932 ad 2.4068 ohms Although the true mea may or may ot be i this iterval, 95% of itervals formed i this maer will cotai the true mea

Cofidece Itervals Cofidece Itervals Populatio Mea Populatio Proportio σ Kow σ Ukow Do You Ever Truly Kow σ? Probably ot! I virtually all real world busiess situatios, σ is ot kow. If there is a situatio where σ is kow, the µ is also kow (sice to calculate σ you eed to kow µ.) If you truly kow µ there would be o eed to gather a sample to estimate it.

Cofidece Iterval for µ (σ Ukow) If the populatio stadard deviatio σ is ukow, we ca substitute the sample stadard deviatio, S This itroduces extra ucertaity, sice S is variable from sample to sample So we use the t distributio istead of the ormal distributio Assumptios Cofidece Iterval for µ (σ Ukow) Populatio stadard deviatio is ukow Populatio is ormally distributed If populatio is ot ormal, use large sample Use Studet s t Distributio Cofidece Iterval Estimate: X ± tα / 2 (where t α/2 is the critical value of the t distributio with -1 degrees of freedom ad a area of α/2 i each tail) S (cotiued)

Studet s t Distributio The t is a family of distributios The t α/2 value depeds o degrees of freedom (d.f.) Number of observatios that are free to vary after sample mea has bee calculated d.f. = - 1 Degrees of Freedom (df) Idea: Number of observatios that are free to vary after sample mea has bee calculated Example: Suppose the mea of 3 umbers is 8.0 Let X 1 = 7 Let X 2 = 8 What is X 3? If the mea of these three values is 8.0, the X 3 must be 9 (i.e., X 3 is ot free to vary) Here, = 3, so degrees of freedom = 1 = 3 1 = 2 (2 values ca be ay umbers, but the third is ot free to vary for a give mea)

Studet s t Distributio Note: t Z as icreases Stadard Normal (t with df = ) t-distributios are bellshaped ad symmetric, but have fatter tails tha the ormal t (df = 13) t (df = 5) 0 t Studet s t Table Upper Tail Area df.10.05.025 1 3.078 6.314 12.706 Let: = 3 df = - 1 = 2 α = 0.10 α/2 = 0.05 2 1.886 2.920 4.303 3 1.638 2.353 3.182 α/2 = 0.05 The body of the table cotais t values, ot probabilities 0 2.920 t

Selected t distributio values With compariso to the Z value Cofidece t t t Z Level (10 d.f.) (20 d.f.) (30 d.f.) ( d.f.) 0.80 1.372 1.325 1.310 1.28 0.90 1.812 1.725 1.697 1.645 0.95 2.228 2.086 2.042 1.96 0.99 3.169 2.845 2.750 2.58 Note: t Z as icreases Example of t distributio cofidece iterval A radom sample of = 25 has X = 50 ad S = 8. Form a 95% cofidece iterval for µ d.f. = 1 = 24, so t α /2 = t 0.025 = 2.0639 The cofidece iterval is S X ± t α /2 = 50 ± (2.0639) 8 25 46.698 µ 53.302

Example of t distributio cofidece iterval Iterpretig this iterval requires the assumptio that the populatio you are samplig from is approximately a ormal distributio (especially sice is oly 25). This coditio ca be checked by creatig a: Normal probability plot or Boxplot (cotiued) Cofidece Itervals Cofidece Itervals Populatio Mea Populatio Proportio σ Kow σ Ukow

Cofidece Itervals for the Populatio Proportio, π A iterval estimate for the populatio proportio ( π ) ca be calculated by addig a allowace for ucertaity to the sample proportio ( p ) Cofidece Itervals for the Populatio Proportio, π Recall that the distributio of the sample proportio is approximately ormal if the sample size is large, with stadard deviatio σ p = π(1 π ) We will estimate this with sample data (cotiued) p(1 p)

Cofidece Iterval Edpoits Upper ad lower cofidece limits for the populatio proportio are calculated with the formula where p ± Z α /2 p(1 p) Z α/2 is the stadard ormal value for the level of cofidece desired p is the sample proportio is the sample size Note: must have X > 5 ad X > 5 Example A radom sample of 100 people shows that 25 are left-haded. Form a 95% cofidece iterval for the true proportio of left-haders

Example A radom sample of 100 people shows that 25 are left-haded. Form a 95% cofidece iterval for the true proportio of left-haders. (cotiued) p = 100 * 0.25 = 25 > 5 & (1-p) = 100 * 0.75 = 75 > 5 Make sure the sample is big eough p ± Z α /2 p(1 p)/ = 25/100 ± 1.96 0.25(0.75)/100 = 0.25 ± 1.96 (0.0433) 0.1651 π 0.3349 Iterpretatio We are 95% cofidet that the true percetage of left-haders i the populatio is betwee 16.51% ad 33.49%. Although the iterval from 0.1651 to 0.3349 may or may ot cotai the true proportio, 95% of itervals formed from samples of size 100 i this maer will cotai the true proportio.

Determiig Sample Size Determiig Sample Size For the Mea For the Proportio Samplig Error The required sample size ca be foud to obtai a desired margi of error (e) with a specified level of cofidece (1 - α) The margi of error is also called samplig error the amout of imprecisio i the estimate of the populatio parameter the amout added ad subtracted to the poit estimate to form the cofidece iterval

Determiig Sample Size Determiig Sample Size For the Mea X ± Z α / 2 σ Samplig error (margi of error) e = Z α / 2 σ Determiig Sample Size Determiig Sample Size (cotiued) For the Mea e = Z α / 2 σ Now solve for to get = Z 2 2 α / 2 σ 2 e

Determiig Sample Size (cotiued) To determie the required sample size for the mea, you must kow: The desired level of cofidece (1 - α), which determies the critical value, Z α/2 The acceptable samplig error, e The stadard deviatio, σ Required Sample Size Example If σ = 45, what sample size is eeded to estimate the mea withi ± 5 with 90% cofidece? 2 Z σ 2 e (1.645) = 5 2 2 = 2 (45) 2 = 219.19 So the required sample size is = 220 (Always roud up)

If σ is ukow If ukow, σ ca be estimated whe determiig the required sample size Use a value for σ that is expected to be at least as large as the true σ Select a pilot sample ad estimate σ with the sample stadard deviatio, S Determiig Sample Size Determiig Sample Size (cotiued) For the Proportio e = Z π(1 π) Now solve for to get = Z 2 π (1 π) 2 e

Determiig Sample Size (cotiued) To determie the required sample size for the proportio, you must kow: The desired level of cofidece (1 - α), which determies the critical value, Z α/2 The acceptable samplig error, e The true proportio of evets of iterest, π π ca be estimated with a pilot sample if ecessary (or coservatively use 0.5 as a estimate of π) Required Sample Size Example How large a sample would be ecessary to estimate the true proportio defective i a large populatio withi ±3%, with 95% cofidece? (Assume a pilot sample yields p = 0.12)

Required Sample Size Example Solutio: For 95% cofidece, use Z α/2 = 1.96 e = 0.03 p = 0.12, so use this to estimate π 2 Zα = /2 = 2 π (1 π) = 2 e 2 (1.96) (0.12)(1 0.12) (0.03) (cotiued) 450.74 So use = 451 Applicatios i Auditig Six advatages of statistical samplig i auditig Samplig is less time cosumig ad less costly Samplig provides a objective way to calculate the sample size i advace Samplig provides results that are objective ad defesible Because the sample size is based o demostrable statistical priciples, the audit is defesible before oe s superiors ad i a court of law

Applicatios i Auditig (cotiued) Samplig provides a estimate of the samplig error Allows auditors to geeralize their fidigs to the populatio with a kow samplig error Ca provide more accurate coclusios about the populatio Samplig is ofte more accurate for drawig coclusios about large populatios Examiig every item i a large populatio is subject to sigificat o-samplig error Samplig allows auditors to combie, ad the evaluate collectively, samples collected by differet idividuals Cofidece Iterval for Populatio Total Amout Poit estimate for a populatio of size N: Populatio total = NX Cofidece iterval estimate: N X ± N ( tα / 2 ) S N N 1 (This is samplig without replacemet, so use the fiite populatio correctio factor i the cofidece iterval formula)

Cofidece Iterval for Populatio Total: Example A firm has a populatio of 1,000 accouts ad wishes to estimate the total populatio value. A sample of 80 accouts is selected with average balace of $87.6 ad stadard deviatio of $22.3. Costruct the 95% cofidece iterval estimate of the total balace. Example Solutio N = 1000, = 80, X = 87.6, S = 22.3 S N NX ± N( t ) α /2 N 1 22.3 1, 000 80 = (1, 000)(87.6) ± (1, 000)(1.9905) 80 1, 000 1 = 87, 600 ± 4, 762.48 The 95% cofidece iterval for the populatio total balace is $82,837.52 to $92,362.48

Cofidece Iterval for Total Differece Poit estimate for a populatio of size N: Total Differece = ND Where the average differece, D, is: D = i= 1 D where D i i = audited value - origial value Cofidece Iterval for Total Differece (cotiued) Cofidece iterval estimate: N D ± N SD ( tα / 2) N N 1 where S D = i= 1 ( D i 1 D) 2

Oe-Sided Cofidece Itervals Applicatio: fid the upper boud for the proportio of items that do ot coform with iteral cotrols Upper boud = p + Zα p(1 p) N N 1 where Z α is the stadard ormal value for the level of cofidece desired p is the sample proportio of items that do ot coform is the sample size N is the populatio size Ethical Issues A cofidece iterval estimate (reflectig samplig error) should always be icluded whe reportig a poit estimate The level of cofidece should always be reported The sample size should be reported A iterpretatio of the cofidece iterval estimate should also be provided

Summary Itroduced the cocept of cofidece itervals Discussed poit estimates Developed cofidece iterval estimates Created cofidece iterval estimates for the mea (σ kow) Determied cofidece iterval estimates for the mea (σ ukow) Created cofidece iterval estimates for the proportio Determied required sample size for mea ad proportio cofidece iterval estimates with a desired margi of error Summary (cotiued) Developed applicatios of cofidece iterval estimatio i auditig Cofidece iterval estimatio for populatio total Cofidece iterval estimatio for total differece i the populatio Oe-sided cofidece itervals for the proportio ocoformig Addressed cofidece iterval estimatio ad ethical issues

Statistics for Ecoomics & Busiess Additioal Topic (optioal) Estimatio & Sample Size Determiatio For Fiite Populatios Topic Learig Objectives I this topic, you lear: Whe to use a fiite populatio correctio factor i calculatig a cofidece iterval for either µ or π How to use a fiite populatio correctio factor i calculatig a cofidece iterval for either µ or π How to use a fiite populatio correctio factor i calculatig a sample size for a cofidece iterval for either µ or π

Use A fpc Whe Samplig More Tha 5% Of The Populatio (/N > 0.05) Cofidece Iterval For µ with a fpc X ± t α/2 S N N 1 Cofidece Iterval For π with a fpc p ± z α / 2 p(1 p) N N 1 A fpc simply reduces the stadard error of either the sample mea or the sample proportio Cofidece Iterval for µ with a fpc Suppose N = 1000, = 100, X = 50, s = 10 s N 95% CI for µ : X ± tα / 2 N 1 10 1000 100 50 ± 1.984 = 100 1000 1 50 ± 1.88 = (48.12, 51.88) =

Determiig Sample Size with a fpc Calculate the sample size ( 0 ) without a fpc For µ: For π: Z σ 2 2 α /2 = 0 2 e Z π (1 π ) α e 2 /2 = 0 2 Apply the fpc utilizig the followig formula to arrive at the fial sample size (). = 0 N / ( 0 + (N-1)) Topic Summary Described whe to use a fiite populatio correctio i calculatig a cofidece iterval for either µ or π Examied the formulas for calculatig a cofidece iterval for either µ or π utilizig a fiite populatio correctio Examied the formulas for calculatig a sample size i a cofidece iterval for either µ or π