Sampling Distributions & Estimators

Similar documents
. (The calculated sample mean is symbolized by x.)

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

Introduction to Probability and Statistics Chapter 7

Topic-7. Large Sample Estimation

Lecture 5: Sampling Distribution

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

Lecture 4: Probability (continued)

CHAPTER 8 Estimating with Confidence

5. Best Unbiased Estimators

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty,

5 Statistical Inference

A point estimate is the value of a statistic that estimates the value of a parameter.

Estimating Proportions with Confidence

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

Lecture 5 Point Es/mator and Sampling Distribu/on

Sampling Distributions and Estimation

Sampling Distributions and Estimation

Chapter 10 - Lecture 2 The independent two sample t-test and. confidence interval

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

Exam 1 Spring 2015 Statistics for Applications 3/5/2015

Math 124: Lecture for Week 10 of 17

Statistics for Economics & Business

Chapter 8: Estimation of Mean & Proportion. Introduction

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

14.30 Introduction to Statistical Methods in Economics Spring 2009

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

NOTES ON ESTIMATION AND CONFIDENCE INTERVALS. 1. Estimation

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ.

BASIC STATISTICS ECOE 1323

1 Random Variables and Key Statistics

Exam 2. Instructor: Cynthia Rudin TA: Dimitrios Bisias. October 25, 2011

ST 305: Exam 2 Fall 2014

Statistics for Business and Economics

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Online appendices from Counterparty Risk and Credit Value Adjustment a continuing challenge for global financial markets by Jon Gregory

Chapter 8 Interval Estimation. Estimation Concepts. General Form of a Confidence Interval

CHAPTER 8 CONFIDENCE INTERVALS

The Idea of a Confidence Interval

18.S096 Problem Set 5 Fall 2013 Volatility Modeling Due Date: 10/29/2013

Outline. Populations. Defs: A (finite) population is a (finite) set P of elements e. A variable is a function v : P IR. Population and Characteristics

AY Term 2 Mock Examination

1 Estimating the uncertainty attached to a sample mean: s 2 vs.

1. Find the area under the standard normal curve between z = 0 and z = 3. (a) (b) (c) (d)

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

Confidence Intervals Introduction

A Bayesian perspective on estimating mean, variance, and standard-deviation from data

SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME

Simulation Efficiency and an Introduction to Variance Reduction Methods

Unbiased estimators Estimators

0.1 Valuation Formula:

These characteristics are expressed in terms of statistical properties which are estimated from the sample data.

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval

ii. Interval estimation:

B = A x z

1 Estimating sensitivities

Parametric Density Estimation: Maximum Likelihood Estimation

Chapter 17 Sampling Distribution Models

Quantitative Analysis

Bayes Estimator for Coefficient of Variation and Inverse Coefficient of Variation for the Normal Distribution

Confidence Intervals based on Absolute Deviation for Population Mean of a Positively Skewed Distribution

Variance and Standard Deviation (Tables) Lecture 10

Point Estimation by MLE Lesson 5

Chpt 5. Discrete Probability Distributions. 5-3 Mean, Variance, Standard Deviation, and Expectation

Point Estimation by MLE Lesson 5

Control Charts for Mean under Shrinkage Technique

Dr. Maddah ENMG 624 Financial Eng g I 03/22/06. Chapter 6 Mean-Variance Portfolio Theory

STAT 135 Solutions to Homework 3: 30 points

x satisfying all regularity conditions. Then


Just Lucky? A Statistical Test for Option Backdating

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i

Models of Asset Pricing

Elementary Statistics and Inference. Elementary Statistics and Inference. Chapter 20 Chance Errors in Sampling (cont.) 22S:025 or 7P:025.

Systematic and Complex Sampling!

1 Basic Growth Models

Models of Asset Pricing

Models of Asset Pricing

Monetary Economics: Problem Set #5 Solutions

SOLUTION QUANTITATIVE TOOLS IN BUSINESS NOV 2011

Generative Models, Maximum Likelihood, Soft Clustering, and Expectation Maximization

Maximum Empirical Likelihood Estimation (MELE)

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010

Lecture 9: The law of large numbers and central limit theorem

The Time Value of Money

Appendix 1 to Chapter 5

Research Article The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean

= α e ; x 0. Such a random variable is said to have an exponential distribution, with parameter α. [Here, view X as time-to-failure.

of Asset Pricing R e = expected return

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions

of Asset Pricing APPENDIX 1 TO CHAPTER EXPECTED RETURN APPLICATION Expected Return

Parameter Uncertainty in Loss Ratio Distributions and its Implications

Probability and statistics

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices?

Chapter 4 - Consumer. Household Demand and Supply. Solving the max-utility problem. Working out consumer responses. The response function

Topic 14: Maximum Likelihood Estimation

Introduction to Statistical Inference

Quantitative Analysis

5 Decision Theory: Basic Concepts

Transcription:

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 Samplig Distributios & Estimators I. Estimators The Importace of Samplig Radomly Three Properties of Estimators 1. Ubiased 2. Cosistet 3. Efficiet I am grateful to previous TFs for may of these materials.

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 From the Estimators Module Quiz: Suppose you are iterested i estimatig the mea household icome of a populatio ad collect data o a radom sample of households. Cosider the followig estimator: Estimator = X 1, the value of the first icome listed i the sample (i.e. the icome for the household we happe to have surveyed first) More formally, if sample icomes are {x l, x 2, x 3,, x }, the estimate is x 1 If you are usig this estimator to estimate the populatio mea, is this estimator: Ubiased? Cosistet? Practice: For all of the followig questios, assume the populatio parameter is µ. 1. Show that X is ubiased. 2. Derive the variace of X. 2

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 3. Fill out the followig table: Estimator Ubiased? Cosistet? Most Efficiet? Y 1 +Y 25 +Y 99 3 (Y i ) + 5 i=1 i=1 (Y i ) 3

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 Target Parameter (θ) Sample Size (s) Poit Estimator (ˆθ) E(ˆθ) Stadard Deviatio of Samplig Distributio (σˆθ) µ Y µ σ p ˆp p p(1 p) µ 1 µ 2 1, 2 Y 1 Y 2 µ 1 µ 2 σ 1 2 2 1 + σ 2 2 p 1 p 2 1, 2 ˆp 1 ˆp 2 p 1 p 2 p 1 (1 p 1 ) 1 + p 2 (1 p 2 ) 2 Notes: 1. The expected values ad stadard errors show i the table are valid regardless of the form of the populatio probability desity fuctio. 2. All four estimators possess probability distributios that are approximately ormal for large samples. 3. For the last two rows, the two samples are assumed to be idepedet. 4. For all rows, the samples are assumed to be simple radom samples. 4

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 II. The Cetral Limit Theorem If X 1, X 2,, X costitute a simple radom sample from a populatio with mea µ ad variace σ 2, the the sample mea X has a approximately ormal distributio with mea µ ad stadard error σ /, assumig the sample size is large eough (usually > 30 is sufficiet). The approximatio improves as icreases. This result holds regardless of the form of the populatio distributio. See this applet to visualize the CLT: http://oliestatbook.com/stat_sim/samplig_dist/ Experimet with differet sample sizes ad populatio distributios to see how the CLT is ivoked. Keep i mid this distictio from lecture: Distributio i the Populatio Distributio i the Sample "Samplig" Distributio Desity 0.02.04.06.08.1.12.14.16 Relative Frequecy Histogram of Age (Populatio N=1049) 20 30 40 50 60 70 Age Desity 0.02.04.06.08.1.12.14.16 Relative Frequecy Histogram of Age (Sample =50) 20 30 40 50 60 70 Age y 0.1.2.3.4.5 20 25 30 35 40 Age 5

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 Example: Survey of Registered Voters A telephoe survey of 2374 adult Americas fids that 1912 of the respodets are registered voters. Costruct a 90% Cofidece Iterval for the proportio of adults i the Uited States who are registered voters. 6

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 III. Brigig it All Together Meas Estimator: X Parameter: µ Why is this ormally distributed? Why is the mea of the distributio µ? Sice X is a ubiased estimator for µ (E( X) =µ), it will be cetered aroud µ. The Cetral Limit Theorem shows that the samplig distributio of some estimators (icludig X) is ormal (assumig a large eough sample size). µ ( ) X µ 1.96 X µ +1.96 X Why this width? The size of our iterval depeds o the: Stadard error of our estimator ( X = / p for X) Stadard deviatio of the populatio distributio ( ) Sample size Our multiplier (1.96 i this case) Distributio of our samplig distributio (Normal i this case) Size of our chose cofidece level (95% i this case) Why this particular estimate of X? No reaso! It s just the estimate that we happeed to get from a sigle sample of the populatio. I real life, this value (ad its stadard deviatio) is all we have to work with. 7

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 Proportios Estimator: ˆp Parameter: p Why is this ormally distributed? Why is the mea of the distributio p? Sice ˆp is a ubiased estimator for p (E(ˆp) =p), it will be cetered aroud p. The Cetral Limit Theorem shows that the samplig distributio of some estimators (icludig ˆp) is ormal (assumig a large eough sample size). p 1.96 ˆp ˆp p ( ) p 1.96 ˆp Why this width? The size of our iterval depeds o the: Stadard error of our estimator ( ˆp = p p(1 p)/ for ˆp) Stadard deviatio of the populatio distributio ( ) Sample size Our multiplier (1.96 i this case) Distributio of our samplig distributio (Normal i this case) Size of our chose cofidece level (95% i this case) Why this particular estimate of ˆp? No reaso! It s just the estimate that we happeed to get from a sigle sample of the populatio. I real life, this value (ad its stadard deviatio) is all we have to work with. 8

API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 IV. Cofidece Itervals Thik about the various factors that cotribute to a cofidece iterval of the mea age of a populatio. What would happe to the samplig distributio (e.g. would it chage shape? ceter?) ad cofidece iterval (e.g. would it get larger? smaller?) as the followig icreased? 1. Sample size 2. Stadard deviatio of age i the populatio 3. Cofidece level 9