Outline. Populations. Defs: A (finite) population is a (finite) set P of elements e. A variable is a function v : P IR. Population and Characteristics

Similar documents
Sampling Distributions and Estimation

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

. (The calculated sample mean is symbolized by x.)

Statistics for Economics & Business

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

Sampling Distributions & Estimators

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

Introduction to Probability and Statistics Chapter 7

5. Best Unbiased Estimators

Lecture 5: Sampling Distribution

A point estimate is the value of a statistic that estimates the value of a parameter.

Chpt 5. Discrete Probability Distributions. 5-3 Mean, Variance, Standard Deviation, and Expectation

Topic-7. Large Sample Estimation

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty,

Statistics for Business and Economics

Confidence Intervals Introduction

BASIC STATISTICS ECOE 1323

Lecture 5 Point Es/mator and Sampling Distribu/on

Chapter 10 - Lecture 2 The independent two sample t-test and. confidence interval

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

NOTES ON ESTIMATION AND CONFIDENCE INTERVALS. 1. Estimation

Chapter 8 Interval Estimation. Estimation Concepts. General Form of a Confidence Interval

4.5 Generalized likelihood ratio test

Parametric Density Estimation: Maximum Likelihood Estimation

These characteristics are expressed in terms of statistical properties which are estimated from the sample data.

1 Random Variables and Key Statistics

Monetary Economics: Problem Set #5 Solutions

Unbiased estimators Estimators

Sampling Distributions and Estimation

14.30 Introduction to Statistical Methods in Economics Spring 2009

B = A x z

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ.

Exam 1 Spring 2015 Statistics for Applications 3/5/2015

Estimating Proportions with Confidence

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

Lecture 4: Probability (continued)

Elementary Statistics and Inference. Elementary Statistics and Inference. Chapter 20 Chance Errors in Sampling (cont.) 22S:025 or 7P:025.

x satisfying all regularity conditions. Then

ECON 5350 Class Notes Maximum Likelihood Estimation

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval

CHAPTER 8 Estimating with Confidence

STAT 135 Solutions to Homework 3: 30 points

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010

Quantitative Analysis

Chapter 8: Estimation of Mean & Proportion. Introduction

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i

Online appendices from Counterparty Risk and Credit Value Adjustment a continuing challenge for global financial markets by Jon Gregory

An Improved Estimator of Population Variance using known Coefficient of Variation

1 Basic Growth Models

SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME

5 Statistical Inference

Lecture 9: The law of large numbers and central limit theorem

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

18.S096 Problem Set 5 Fall 2013 Volatility Modeling Due Date: 10/29/2013

ST 305: Exam 2 Fall 2014

Summary. Recap. Last Lecture. .1 If you know MLE of θ, can you also know MLE of τ(θ) for any function τ?

AY Term 2 Mock Examination

ii. Interval estimation:

Math 124: Lecture for Week 10 of 17

Topic 14: Maximum Likelihood Estimation

Estimation of Population Variance Utilizing Auxiliary Information

A New Constructive Proof of Graham's Theorem and More New Classes of Functionally Complete Functions

Confidence Intervals based on Absolute Deviation for Population Mean of a Positively Skewed Distribution

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

Section Mathematical Induction and Section Strong Induction and Well-Ordering

Problem Set 1a - Oligopoly

Exam 2. Instructor: Cynthia Rudin TA: Dimitrios Bisias. October 25, 2011

1 Estimating sensitivities

APPLIED STATISTICS Complementary Course of BSc Mathematics - IV Semester CUCBCSS Admn onwards Question Bank

1. Find the area under the standard normal curve between z = 0 and z = 3. (a) (b) (c) (d)


Dr. Maddah ENMG 624 Financial Eng g I 03/22/06. Chapter 6 Mean-Variance Portfolio Theory

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 2

Notes on Expected Revenue from Auctions

Control Charts for Mean under Shrinkage Technique

Section 3.3 Exercises Part A Simplify the following. 1. (3m 2 ) 5 2. x 7 x 11

Quantitative Analysis

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge

Chapter 15. Bayesian Statistics and Decision Analysis. COMPLETE BUSINESS STATISTICS by AMIR D. ACZEL & JAYAVEL SOUNDERPANDIAN

SOLUTION QUANTITATIVE TOOLS IN BUSINESS NOV 2011

1 Estimating the uncertainty attached to a sample mean: s 2 vs.

Parameter Uncertainty in Loss Ratio Distributions and its Implications

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices?

= α e ; x 0. Such a random variable is said to have an exponential distribution, with parameter α. [Here, view X as time-to-failure.

CAPITAL ASSET PRICING MODEL

The pps Package. R topics documented: November 22, Version Date Title Functions for PPS sampling

Sequences and Series

Models of Asset Pricing

Models of Asset Pricing

Bayes Estimator for Coefficient of Variation and Inverse Coefficient of Variation for the Normal Distribution

0.1 Valuation Formula:

MATH 205 HOMEWORK #1 OFFICIAL SOLUTION

The Idea of a Confidence Interval

Rafa l Kulik and Marc Raimondo. University of Ottawa and University of Sydney. Supplementary material

Introduction to Statistical Inference

I. Measures of Central Tendency: -Allow us to summarize an entire data set with a single value (the midpoint).

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions

Transcription:

Outlie Populatio Characteristics Types of Samples Sample Characterstics Sample Aalogue Estimatio Populatios Defs: A (fiite) populatio is a (fiite) set P of elemets e. A variable is a fuctio v : P IR. Examples a). P cosists of all Michiga household, v is household icome. b). P cosists of all GRE examiees v is verbal score. Populatio Characteristics The Populatio Size. Let deote the populatio size #P. The Populatio Mea Variace. The populatio mea variace are µ 1 v(e) e P σ 2 1 [v(e) µ] 2. e P Populatio Characteristics Cotiued The Populatio Distributio Fuctio: The distributio fuctio of X is #{e : v(e) x} F (x), F is called the populatio distributio fuctio (for v). Characteristics of F, like percetiles, are called populatio characteristics. ote: If a elemet e is selected rom (equally likely), the X v(e) is a rom variable for which X F, E(X) µ σ 2 X σ2.

F Ifiite Populatios: A Coveiet Fictio For large populatios, it may be useful to preted that the populatio distributio fuctio has a give form, usually ormal. Example: GRE Score: Approximately 1.2 millo studets took the GRE bewtee 2001 2004. Cosider verbal scores. The populatio mea variace are µ 469 σ 2 120. Let F be the populatio DF. The ( ) ( ) x µ x 469 F (x) Φ Φ σ 120 0.0 0.2 0.4 0.6 0.8 1.0 GRE Scores 200 300 400 500 600 700 800 x Samplig Def: A sample is a subset of the populatio, usually selected at rom. (More o this later). Deote the sample size by the sample values by X 1,, X, Sample Characteristics: Computed by the same (or slightly modified) formulas used for populatio characteristics. For example, X X 1 + + X, is called the sample mea. The Sample Mea Sample Characteristics AKA Sample Aalogue Estimatio X X 1 + + X, The Sample Variace ( ) 1 S 2 (X I 1 X) 2, The Sample Distributio Fuctio F # (z) #{i : X i z}.

Types of Samples Simple Rom Samples: Selected elemets from the populatio at rom (equally likely). The Other Types of Samples Stratified Samples: Suppose P has some structure, say P [X i z] #{e P : v(e) z} the populatio distributio fuctio. With replacemet: X 1,, X id F. F (z), P P 1 P M, where the P i are mutually exclusive. The, we could take simple rom samples X i,1,, X i,i Without replaemet: X 1,, X are depedet. However, if the populatio size is large, the there is is ot much depedece, this is frequetly igored. from each of the P i. Example: Geographical regios. More o Simple Rom Samples Two Stage Samples: With P P 1 P M, as above: : Select {i 1,, i m } from {1,, M}. : Take a simple rom sample from each P j. Systematic Samples Samplig Proportioal to Size If X 1,, X is a simple rom sample, the X i F, the populatio distributio fuctio. E(X i ) µ, (the populatio mea), σ 2 X i σ 2, (the populatio variace), sice these deped oly o the distributio of X i. E( X) µ.

Properties of X S 2 Example: GRE Scores 10,000 Samples of Size 100 For a Simple Rom Sample With Replacemet Histogram of x E( X) µ, (1) σ2 σ 2 X, (2) E(S 2 ) σ 2, (3) where µ σ 2 are the populatio mea variace. otes 1: (1) (3) are called ubiasedess; right aswer i repeated samples. Frequecy 0 500 1000 1500 420 440 460 480 500 x 2: Kow (1) (2); (3) soo. ote: Average X 469.0234 Average S 2 (119.9971) 2. sice LHS A Idetity (X i µ) 2 ( 1)S 2 + ( X µ) 2, [(X i X) + ( X µ)] 2 (X i X) 2 + 2( X µ) (X i X) + ( X µ)] 2 ( 1)S 2 + 0 + ( X µ) 2. Recall: Also Derivatio of (3): E(S 2 ) σ 2. (X i µ) 2 ( 1)S 2 + ( X µ) 2. ( 1)E(S 2 ) + E[( X µ) 2 ] E[(X i µ)] 2 σ 2. E[( X µ) 2 ] σ 2 X σ2 σ2. ( 1)E(S 2 ) + σ 2 σ 2, or E(S 2 ) σ 2.

The Derivatio Samplig Without Replacemet: For samplig without replacemet, ( ) σ 2 X σ2. Correctio Factor: ( 1 ) ote: The ratio of the w.o.r. stard deviatio to the w.r. is. Has little effect if < /10. ote: Accuracy is primarily determied by, ot /. For samplig without replacemet, Here E(X 1 X 2 ) j1 1 () 1 () j1,j i j1 v(e i )v(e j ) v(e i )v(e j ) v(e i ) 2 [ ] 2 v(e i )v(e j ) v(e i ) 2 µ 2 v(e i ) 2 E(Xi 2 ). σ X1,X 2 E(X 1 X 2 ) µ 2 1 [ 2 µ 2 E(Xi 2 ) ] µ 2 () ( ) 1 [µ 2 E(Xi 2 ) ] σ2 σ 2 Sum σx 2 i + j1,j i σ 2 σ 2 ( 1) ( 1 1 ) σ 2. σ Xi,X j Recall: σ 2 Sum ( 1 1 ) σ 2 σ 2 X σ2 Sum 2 ( ) σ 2. ( ) σ 2.

More o Stratified Populatios For a stratified populatio P P 1 P m, let i, µ i, σ 2 i deote the size, mea, variace of the i th sub-populatio P i. Further, let, µ, σ 2 deote the size, mea, variace of the etire populatio P. The where σ 2 1 + + m, (1) µ p i µ i, (2) p i σi 2 + p i (µ i µ) 2, (3) p i i. ote: σ 2 ca be bigger tha ay of the σ 2 i. Derivatios That 1 + + m is clear. For (2), let e ij be the j th elemet of P i. The For (3), write µ 1 m i v(e ij ) 1 j1 σ 2 1 m i µ i m i [v(e ij ) µ] 2 j1 m p i µ i. [v(e ij ) µ] 2 [v(e ij ) µ i ] 2 + [µ i µ][v(e ij ) µ i ] + (µ i µ) 2 proceed as i the idetity. More o Stratified Samplig ow suppose that idepedet simple rom samples of sizes 1,, m are drawm fro the sub-populatios P 1,, P m. Let suppose that The as above, 1 + + m r i i, r i p i i. X r 1 X1 + + r m Xm p 1 X1 + + p m Xm, E( X) p 1 µ 1 + + p m µ m µ More o Stratified Samplig The variace of X p1 X1 + + p m Xm for statified samplig is m p 2 i σ 2 i i 1 p i σi 2. For simple rom samplig, the variace would have bee [ σ 2 1 ] p i σi 2 + p i (µ i µ) 2, which could be substatially larger. so that X is ubiased.