STAT 135 Solutions to Homework 3: 30 points

Similar documents
5. Best Unbiased Estimators

ECON 5350 Class Notes Maximum Likelihood Estimation

Parametric Density Estimation: Maximum Likelihood Estimation

14.30 Introduction to Statistical Methods in Economics Spring 2009

Summary. Recap. Last Lecture. .1 If you know MLE of θ, can you also know MLE of τ(θ) for any function τ?

Exam 1 Spring 2015 Statistics for Applications 3/5/2015

Lecture 5 Point Es/mator and Sampling Distribu/on

x satisfying all regularity conditions. Then

Topic 14: Maximum Likelihood Estimation

Introduction to Probability and Statistics Chapter 7

Maximum Empirical Likelihood Estimation (MELE)

5 Statistical Inference

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010

NOTES ON ESTIMATION AND CONFIDENCE INTERVALS. 1. Estimation

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

18.S096 Problem Set 5 Fall 2013 Volatility Modeling Due Date: 10/29/2013

Monetary Economics: Problem Set #5 Solutions

Confidence Intervals Introduction

1 Random Variables and Key Statistics

Point Estimation by MLE Lesson 5

Unbiased estimators Estimators

Lecture 9: The law of large numbers and central limit theorem

Asymptotics: Consistency and Delta Method

Exam 2. Instructor: Cynthia Rudin TA: Dimitrios Bisias. October 25, 2011

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

0.1 Valuation Formula:

4.5 Generalized likelihood ratio test

Sampling Distributions and Estimation

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

Topic-7. Large Sample Estimation

Lecture 5: Sampling Distribution

Point Estimation by MLE Lesson 5

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

CHAPTER 8 Estimating with Confidence

Problem Set 1a - Oligopoly

Sampling Distributions & Estimators

Statistics for Economics & Business

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices?

Math 312, Intro. to Real Analysis: Homework #4 Solutions

Chapter 10 - Lecture 2 The independent two sample t-test and. confidence interval

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

Estimating Proportions with Confidence

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty,

BASIC STATISTICS ECOE 1323

Bayes Estimator for Coefficient of Variation and Inverse Coefficient of Variation for the Normal Distribution

A Bayesian perspective on estimating mean, variance, and standard-deviation from data

. (The calculated sample mean is symbolized by x.)

INTERVAL GAMES. and player 2 selects 1, then player 2 would give player 1 a payoff of, 1) = 0.

Sampling Distributions and Estimation

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i

ii. Interval estimation:

A point estimate is the value of a statistic that estimates the value of a parameter.

Outline. Populations. Defs: A (finite) population is a (finite) set P of elements e. A variable is a function v : P IR. Population and Characteristics

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ.

Sequences and Series

Rafa l Kulik and Marc Raimondo. University of Ottawa and University of Sydney. Supplementary material

Overlapping Generations

5 Decision Theory: Basic Concepts

Lecture 4: Probability (continued)

Chapter 4: Asymptotic Properties of MLE (Part 3)

1 Basic Growth Models

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

Simulation Efficiency and an Introduction to Variance Reduction Methods

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval

1 Estimating sensitivities

Estimation of Parameters of Three Parameter Esscher Transformed Laplace Distribution

Insurance and Production Function Xingze Wang, Ying Hsuan Lin, and Frederick Jao (2007)

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

Online appendices from Counterparty Risk and Credit Value Adjustment a continuing challenge for global financial markets by Jon Gregory

2.6 Rational Functions and Their Graphs

Chpt 5. Discrete Probability Distributions. 5-3 Mean, Variance, Standard Deviation, and Expectation

AY Term 2 Mock Examination

EVEN NUMBERED EXERCISES IN CHAPTER 4

Solutions to Problem Sheet 1

These characteristics are expressed in terms of statistical properties which are estimated from the sample data.

Generative Models, Maximum Likelihood, Soft Clustering, and Expectation Maximization

Chapter 8: Estimation of Mean & Proportion. Introduction

Math 124: Lecture for Week 10 of 17

Estimation of Population Variance Utilizing Auxiliary Information

Solution to Tutorial 6

Chapter 8 Interval Estimation. Estimation Concepts. General Form of a Confidence Interval

11.7 (TAYLOR SERIES) NAME: SOLUTIONS 31 July 2018

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

Exercise. Show the corrected sample variance is an unbiased estimator of population variance. S 2 = n i=1 (X i X ) 2 n 1. Exercise Estimation

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 2

Statistics for Business and Economics

FOUNDATION ACTED COURSE (FAC)

A New Constructive Proof of Graham's Theorem and More New Classes of Functionally Complete Functions

MATH : EXAM 2 REVIEW. A = P 1 + AP R ) ny

The material in this chapter is motivated by Experiment 9.

ISBN Copyright 2015 The Continental Press, Inc.

SELECTING THE NUMBER OF CHANGE-POINTS IN SEGMENTED LINE REGRESSION

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge

Annual compounding, revisited

ASYMPTOTIC MEAN SQUARE ERRORS OF VARIANCE ESTIMATORS FOR U-STATISTICS AND THEIR EDGEWORTH EXPANSIONS

= α e ; x 0. Such a random variable is said to have an exponential distribution, with parameter α. [Here, view X as time-to-failure.

Further Pure 1 Revision Topic 5: Sums of Series

An Improved Estimator of Population Variance using known Coefficient of Variation

Transcription:

STAT 35 Solutios to Homework 3: 30 poits Sprig 205 The objective of this Problem Set is to study the Stei Pheomeo 955. Suppose that θ θ, θ 2,..., θ cosists of ukow parameters, with 3. We wish to estimate these parameters with measuremets X,..., X, oe for each parameter θ i, ad we suppose that the measuremets are idepedet, Normally distributed, with X i N θ i, σ 2 where σ 2 is kow. If v is a -dimesioal vector, we deote v 2 v2 i. Part I: 5 poits. Show that the Maximum-Likelihood stimator for θ is ˆθ i,ml X i, i,..., Sice we have X i Nθ i, σ 2, the likelihood fuctio is give by likθ i f θi X i 2πσ 2 e 2σ 2 X i θ i 2 We wat to fid the value of θ i that maximizes the likelihood fuctio or equivaletly the log likelihood fuctio. Takig the log of the likelihood fuctio, we have lθ i loglikθ i 2 log2πσ2 2σ 2 X i θ i 2 To idetify the value of θ i which maximizes the log-likelihood fuctio, we set the derivative with respect to θ i to zero, ad solve for θ i. Thus lθ i θ i σ 2 X i θ i implyig that so σ 2 X i ˆθ i,ml 0 ˆθ i,ml X i as required.

2. Show that the Method of Momets stimator for θ is ˆθ i,mm X i, i,..., Sice X i Nθ i, σ 2, it follows immediately that X i θ i so that ˆθ i,mm X i sice we have oly oe observatio X i Nθ i, σ 2. 3. Recall that the risk of a estimator ˆθ is defied as Rˆθ ˆθ θ 2 ˆθ i θ i 2 Show that the risk of both ˆθ ML ad ˆθ MM is Rˆθ ML Rˆθ MM σ 2 Rˆθ ML Rˆθ MM X i θ i 2 Xi 2 2X i θ i + θi 2 [ X 2 i 2θi X i + θi 2 ] [ σ 2 + θi 2 2θi 2 + θi 2 ] σ 2 σ 2 Note that X 2 i V arx i + X i 2 2

Part II: 0 poits We will ow see that it is possible to fid a estimator for θ with smaller risk. We defie as the Stei estimator of θ. ˆθ i,s X i 2σ2 X i, i,...,. Show that Rˆθ S X θ 2 + 2 2σ 2 θ + 2 2 σ 4 R ˆθiS ˆθ ii,s θ i 2 X i 2σ2 X i θ i 2 X 2σ2 i θ i X i X i θ i 2 2σ2 2σ 2 2 2X i θ i X i + X i X i θ i 2 2 2σ 2 X i θ i X 2 i + 2 2 σ 4 Xi X i θ i 2 + 2 2σ 2 θ X + 2 2 σ 4 2 X 4 X i θ i 2 + 2 2σ 2 θ + 2 2 σ 4 X θ 2 + 2 2σ 2 θ + 2 2 σ 4 2 2. Stei s Lemma says that, whe X N µ, σ 2 ad h is a smooth fuctio, the Use Stei s Lemma to show that X µhx σ 2 h X Rˆθ S σ 2 2 2 σ 4 3

Hit : For every i,...,, cosider hx i Hit 2: Recall that X + Y X + Y. X i X i X 2 i + j i X j 2. Recall that R ˆθiS X θ 2 + 2 2σ 2 θ + 2 2 σ 4 ad from Part I, we kow that X θ 2 σ 2. Thus to show this result, we oly eed to show that 2 2σ 2 θ 2 2 2 σ 4 which we ca do as follows θ θ Xi θ i X i X i X i θ i X 2 i + j i X2 j σ 2 X i X i Xi 2 + j i X2 j σ 2 X 2 i + 2Xi 2 j i X2 j Xi 2 + j i X2 j σ 2 X2 i + j X2 j 2X2 i Xi 2 + 2 j i X2 j X σ 2 2 2X 2 i X 4 [ ] X σ 2 2 2 i X 4 ] σ [ 2 2 X2 i X ] 4 σ [ 2 2 X 2 σ 2 2 Thus we have show that R ˆθiS σ 2 2 2 σ 4 2 Hit 4

3. Show that Rˆθ S < Rˆθ ML if 3 Recall that Rˆθ ML σ 2, so sice 2 2 σ 4 > 0 for 3. Rˆθ S σ 2 2 2 σ 4 < σ 2 Rˆθ ML 4. The Cramer-Rao boud ca be exteded to the multivariate case. I particular, the risk of a ubiased estimator ˆθ satisfies Rˆθ i varˆθ i i Iθ i Justify that ˆθ S is a biased estimator. Note: You do ot eed to compute the bias. Recall that the Fisher iformatio for θ is give by [ 2 ] Iθ 0 2 θ log fx; θ θθ0 We first compute the Fisher iformatio for θ i : fx; θ i 2πσ 2 e 2σ 2 x θ i 2 takig the log yields log fx; θ i 2 log2πσ2 2σ 2 x θ i 2 Differetiatig with respect the θ i, we get Ad differetiatig agai, we get So takig the egative of the expectatio implies that θ i log fx; θ i σ 2 x θ i 2 2 θ i log fx; θ i σ 2 Iθ i σ 2 Thus sice a ubiased estimator, ˆθ, of θ i must have risk satisfyig Rˆθ i σ 2 σ 2 It follows that ˆθ S is biased, sice we showed i previous parts that Rˆθ S < Rˆθ ML σ 2 5

Part III: 5 poits Now, suppose that we have m iid measuremets G,..., G m of the proportio of water i a grapefruit, m iid measuremets P,..., P m of the proportio of water i a potato, ad m iid measuremets S,..., S m of the proportio of water i a sea bass. For simplicity, assume that the measuremets all have the same precisio i.e. the same variace, which is kow i advace.. If you oly care about fruits ad vegetables, how would you estimate the water cotet of both grapefruit ad potato? Note that the Stei estimator simultaeously calculates a estimate of proportio of water i each of grapefruit, potatoes ad sea bass, despite the fact that these quatities are urelated. The ML ad MOM estimates, however, are applied separately to estimate the proportio of water i grapefruit, potatoes ad sea bass respectively. We showed above that the Stei estimator is superior to the ML ad MOM estimators whe 3, so sice i this example we are estimatig oly 2 parameters, we would use ML or MOM to estimate the proportio of water i grapefruit ad potato separately. 2. If you also care about fish, how would you adjust the estimator? Why? As argued i the previous questio, the Stei estimator is superior to the ML ad MOM estimators whe 3. I this example, we have that 3, ad thus we should estimate the proportio of water i grapefruit, potatoes ad sea bass simultaeously usig the Stei estimator. 6

Part IV: 5 poits bous The Stei estimator is also kow as a shrikage estimator. For large, let us see that this is ideed the case. Assume that the parameters θ are modeled as iid samples from the Normal distributio θ i N 0, c 2. We ca rewrite ˆθ S as ˆθ i,s ρx i, with ρ σ 2. 2 X 2. By writig X i θ i + X i θ i, ad usig the Law of Large Numbers, justify that 2 X 2 2 σ2 + 2 c2,. Hit : Observe that Z i θ i X i θ i are iid radom variables, sice θ i N 0, c 2 ad X i θ i N 0, σ 2. Hit 2: Use the fact that if A A ad B B ad A ad B are costats, the A + B A + B. Sice 2 X 2 2 2 2 2 2 X 2 i X 2 i θ i + X i θ i 2 [ θ 2 i + 2θ i X i θ i + X i θ i 2] θi 2 + 2 2 θ i X i θ i + 2 X i θ i 2 LLN 2 θ2 + 2 2 θ X θ + 2 X θ 2 2 c2 + 2 σ2 θ 2 V arθ + θ 2 c 2 + 0 c 2 θ X θ θ X θ 0 X θ 2 V arx θ + X θ 2 σ 2 + 0 σ 2 2. Deduce that 0 < p < i the case. Therefore, the Stei estimator shriks each coordiate towards zero i order to reduce the total expected risk Sice ρ σ 2 2 X 2 7

it follows from oticig that σ 2 > 0, 2 > 0 ad > 0, for 3, that ρ < Next, the previous questio provided a limit that, by oticig 2, yields So ρ σ2 c 2 + σ 2 > σ2 σ 2 0 ρ > 0 8