A Note About Maximum Likelihood Estimator in Hypergeometric Distribution

Similar documents
CHAPTER 8 Estimating with Confidence

Estimating Proportions with Confidence

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

Math 312, Intro. to Real Analysis: Homework #4 Solutions

Statistics for Economics & Business

A New Approach to Obtain an Optimal Solution for the Assignment Problem

A New Constructive Proof of Graham's Theorem and More New Classes of Functionally Complete Functions

Chapter 8: Estimation of Mean & Proportion. Introduction

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

5. Best Unbiased Estimators

Sampling Distributions and Estimation

Solutions to Problem Sheet 1

Topic-7. Large Sample Estimation

NORMALIZATION OF BEURLING GENERALIZED PRIMES WITH RIEMANN HYPOTHESIS

Monetary Economics: Problem Set #5 Solutions

Unbiased estimators Estimators

Notes on Expected Revenue from Auctions

ECON 5350 Class Notes Maximum Likelihood Estimation

Confidence Intervals Introduction

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

The Limit of a Sequence (Brief Summary) 1

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions

Maximum Empirical Likelihood Estimation (MELE)

. (The calculated sample mean is symbolized by x.)

Parametric Density Estimation: Maximum Likelihood Estimation

14.30 Introduction to Statistical Methods in Economics Spring 2009

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices?

STAT 135 Solutions to Homework 3: 30 points

5 Statistical Inference

Sequences and Series

Lecture 5: Sampling Distribution

Elementary Statistics and Inference. Elementary Statistics and Inference. Chapter 20 Chance Errors in Sampling (cont.) 22S:025 or 7P:025.

4.5 Generalized likelihood ratio test

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

Lecture 4: Probability (continued)

Bayes Estimator for Coefficient of Variation and Inverse Coefficient of Variation for the Normal Distribution

ii. Interval estimation:

The material in this chapter is motivated by Experiment 9.

NOTES ON ESTIMATION AND CONFIDENCE INTERVALS. 1. Estimation

Introduction to Probability and Statistics Chapter 7

Overlapping Generations

Lecture 5 Point Es/mator and Sampling Distribu/on

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions


SUPPLEMENTAL MATERIAL

Sampling Distributions and Estimation

FOUNDATION ACTED COURSE (FAC)

Estimating the Parameters of the Three-Parameter Lognormal Distribution

Chapter 10 - Lecture 2 The independent two sample t-test and. confidence interval

Solution to Tutorial 6

An Improved Estimator of Population Variance using known Coefficient of Variation

INTERVAL GAMES. and player 2 selects 1, then player 2 would give player 1 a payoff of, 1) = 0.

Outline. Plotting discrete-time signals. Sampling Process. Discrete-Time Signal Representations Important D-T Signals Digital Signals

Optimal Risk Classification and Underwriting Risk for Substandard Annuities

Department of Mathematics, S.R.K.R. Engineering College, Bhimavaram, A.P., India 2

ad covexity Defie Macaulay duratio D Mod = r 1 = ( CF i i k (1 + r k) i ) (1.) (1 + r k) C = ( r ) = 1 ( CF i i(i + 1) (1 + r k) i+ k ) ( ( i k ) CF i

Section Mathematical Induction and Section Strong Induction and Well-Ordering

STRAND: FINANCE. Unit 3 Loans and Mortgages TEXT. Contents. Section. 3.1 Annual Percentage Rate (APR) 3.2 APR for Repayment of Loans

Models of Asset Pricing

Models of Asset Pricing

ON DIFFERENTIATION AND HARMONIC NUMBERS

Simulation Efficiency and an Introduction to Variance Reduction Methods

Anomaly Correction by Optimal Trading Frequency

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010

Models of Asset Pricing

B = A x z

1 Estimating sensitivities

A point estimate is the value of a statistic that estimates the value of a parameter.

5 Decision Theory: Basic Concepts

ST 305: Exam 2 Fall 2014

1 The Power of Compounding

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

Insurance and Production Function Xingze Wang, Ying Hsuan Lin, and Frederick Jao (2007)

EXERCISE - BINOMIAL THEOREM

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

Limits of sequences. Contents 1. Introduction 2 2. Some notation for sequences The behaviour of infinite sequences 3

1 Estimating the uncertainty attached to a sample mean: s 2 vs.

Dr. Maddah ENMG 400 Engineering Economy 06/24/09. Chapter 2 Factors: How time and interest affect money

Annual compounding, revisited

Outline. Populations. Defs: A (finite) population is a (finite) set P of elements e. A variable is a function v : P IR. Population and Characteristics

When you click on Unit V in your course, you will see a TO DO LIST to assist you in starting your course.

Subject CT5 Contingencies Core Technical. Syllabus. for the 2011 Examinations. The Faculty of Actuaries and Institute of Actuaries.

AUTOMATIC GENERATION OF FUZZY PAYOFF MATRIX IN GAME THEORY

Optimizing of the Investment Structure of the Telecommunication Sector Company

We learned: $100 cash today is preferred over $100 a year from now

x satisfying all regularity conditions. Then

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India July 2012

CHANGE POINT TREND ANALYSIS OF GNI PER CAPITA IN SELECTED EUROPEAN COUNTRIES AND ISRAEL

2.6 Rational Functions and Their Graphs

Single-Payment Factors (P/F, F/P) Single-Payment Factors (P/F, F/P) Single-Payment Factors (P/F, F/P)

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i

Monopoly vs. Competition in Light of Extraction Norms. Abstract

Estimation of generalized Pareto distribution

Random Sequences Using the Divisor Pairs Function

Point Estimation by MLE Lesson 5

Further Pure 1 Revision Topic 5: Sums of Series

Online appendices from Counterparty Risk and Credit Value Adjustment a continuing challenge for global financial markets by Jon Gregory

Transcription:

Comuicacioes e Estadística Juio 2009, Vol. 2, No. 1 A Note About Maximum Likelihood Estimator i Hypergeometric Distributio Ua ota sobre los estimadores de máxima verosimilitud e la distribució hipergeométrica Hawe Zhag a hawezhag@usta.edu.co Abstract The method of maximum likelihood estimatio is oe of the most importat statistical techiques, ad it is widely used by statistical scietists. However for the hypergeometric distributio Hg(, R, N, the maximum likelihood estimators of N ad R are ot clear i most of the statistical texts. I this paper, rigorous procedures i order to fid the maximum likelihood estimator of N ad R i a hypergeometric distributio are preseted. Key words: Hypergeometric distributio, maximum likelihood estimatio.. Resume El método de la estimació de máxima verosimilitud es ua de las técicas más importates de la estadística y es utilizado ampliamete por los profesioales estadísticos. Si embargo, para la distribució hipergeométrica Hg(, R, N, los estimadores de máxima verosimilitud de N y R o so claros e la mayoría de los textos estadísticos. E este artículo, se preseta los procedimietos rigurosos para ecotrar estos estimadores de máxima verosimilitud. Palabras clave: Distribució hipergeométrica, estimació de máxima verosimilitud.. 1 Itroductio The method of maximum likelihood estimatio is oe of the most importat statistical techique, ad is oe of the cocepts that every studet of statistics should a Docete. CIEES, Uiversidad Sato Tomás 1

2 Hawe Zhag be familiarized with. The maximum likelihood estimatio of parameters i most of the probabilistic distributio is easy to hadel, but the problem of estimatig the populatio size i the capture-recapture problem (see Ardilly & Tillé (2006, that is, the problem of estimatig N i a hypergeometric distributio Hg(, R, N, is ot very clear i statistical iferece texts, sice this problem is ot cosidered i most of them, for example, Mood & Boes (1974, Bickel & Doksum (2001 ad Casella & Berger (2002. Although the estimator of maximum likelihood of N is give by Shao (2003, the procedure is ot quite clear. Also we may eed to estimate the parameter R, that is the size of a subpopulatio, ad the same situatio occurs with this problem of estimatio. The goal of this paper is to provide the rigorous procedures i order to fid the maximum likelihood of the parameters N ad R. 2 Estimatig N Suppose that X Hg(, R, N, ad we eed to estimate the poblatio size N, the likelihood fuctio is: ( R N R L(N = L(x, N = x( x ( N, (1 if max( N + R, 0 x mi(r,. The fuctio L(N is a discreet fuctio, so the way to fid the maximum is ot takig derivatives with respect to N, but fidig out where L(N is a creasig fuctio of N, ad where is decreasig. I this way, we compute the ratio D(N = L(N/L(N 1, ad we fid out where D(N > 1 ad where D(N < 1. With a simple algebraic reasoig, we have: which is equivalet to D(N = D(N = ( N R ( N 1 x ( N ( N R 1 x > 1, (N R!(N 1!(N!(N R 1 + x! (N R 1!N!(N 1!(N R + x! > 1. Cacelig all posible factorials, it ca be show that D(N > 1 if ad oly if (N R(N > N(N R + x, so the values of N where D(N > 1 are defied by N < R/x. Similarly, D(N < 1 for N > R/x. Based o the former argumet, some teachers mistakely establish that the maximum likelihood estimator of N is R/X. The previous statemet is ot correct, sice accordig to the defiitio of the maximum likelihood estimator, the realizatio of the estimator must fall i the parameter space. Ad clearly the realizatio of the statistic R/X may be fractioal, ad it is immediately disqualified to be the maximum likelihood estimator of N. Comuicacioes e Estadística, juio 2009, Vol. 2, No. 1

A Note About Maximum Likelihood Estimator i Hypergeometric Distributio 3 Likelihood 0.0 0.1 0.2 0.3 0.4 0.5 0.6 5 7 9 11 13 15 17 19 21 23 25 N Figure 1: Likelihood fuctio with R = 5, = 4 ad x = 3. Shao (2003, pg.277 gives the correct maximum likelihood estimator of N, that is, ˆN MV = [R/X] where [a] deote the iteger part of a. Note that the creasig ad decreasig property of L(N leads two cadidates to be the maximum likelihood estimator of N: [R/X] ad [R/X] + 1. However Shao draw the coclusio without a further commet. To verify that the maximum likelihood estimator N is, ideed, [R/X], we recall that D(N < 1 if N > R/x. It is clear that [R/X] + 1 > R/x, so D([R/X] + 1 < 1 that is L([R/X] + 1/L([R/X] < 1, so we coclude that L([R/X] + 1 < L([R/X]. (2 Further more we ca coclude that [R/X] is the uique maximum likelihood estimator sice i (2 we ca ever have the equality. Suppose that R = 5, = 4 ad x = 3, i this case, [R/x] = [6.67] = 6 is the maximum likelihood estimate of N. Figure 1 illustrates the likelihood fuctio L(N where clearly the maximum situates at 6. 3 Estimatig R Suppose that the parameter of iterest is R, the umber of idividuals with some specific characteristic i a populatio of size N, ad i a sample of size, x idividuals with this characteristic are selected. So we cosider the likelihood fuctio as a fuctio of R, that is ( R N R L(R = x( x ( N, (3 Comuicacioes e Estadística, juio 2009, Vol. 2, No. 1

4 Hawe Zhag ad accordig to the discussed i the previous sectio, we compute the ratio D(R = L(R/L(R + 1, ad we fid out for what values of R, D(R > 1 ad for what values, D(R < 1. We have is equivalet to D(R = D(R = ( R N R x( > 1 (4 ( R+1 x x ( N R 1 x (R + 1 x(n R > 1, (5 (R + 1(N R + x which is equivalet to (R + 1 x(n R > (R + 1(N R + x. Ad this x(n + 1 leads to D(R > 1 if ad oly if R >, that is, L(R is decreasig for itegers larger tha x(n + 1 ad icreasig for itegers smaller tha x(n + 1. Ad the ituitio leads that ˆR MV = x(n + 1 true whe is a iteger, ad i this case D(R = 1, so ( ( x(n + 1 x(n + 1 L = L + 1, ad the maximum likelihood estimator may be either of 1, ad is ot uique. x(n + 1, but this is x(n + 1 ad x(n + 1 + x(n + 1 x(n + 1 If is ot iteger, the value of R that maximizes L(R is [ ] x(n + 1 or [ ] + 1 cosiderig that the estimator of R must take values i the set of the itegers. However, recallig the previous argumet we ca state that ( L [ ( x(n + 1 ] < L [ x(n + 1 ] + 1 sice [ x(n + 1 ] < x(n + 1, ad i this case, ˆR x(n + 1 x(n + 1 MV = [ ] + 1 = [ ]. Summarizig, we have that x(n + 1 x(n + 1 x(n + 1 1 or if is a iteger ˆR MV = x(n + 1 x(n + 1 [ ] if is ot a iteger (6 Comuicacioes e Estadística, juio 2009, Vol. 2, No. 1

A Note About Maximum Likelihood Estimator i Hypergeometric Distributio 5 Likelihood 0.0 0.1 0.2 0.3 0.4 2 4 6 8 10 12 14 16 R Figure 2: Likelihood fuctio with N = 19, = 5 ad x = 2. x(n + 1 Suppose that N = 19, = 5 ad x = 2, i this case, = 8 is a iteger, so the maximum likelihood estimate of R may be 7 or 8. Figure 2 illustrates the likelihood fuctio L(R where clearly, the maximum of the fuctio situates at 7 ad 8. x(n + 1 Now suppose that the N = 20, = 5 ad x = 2, the = 8.4, so the maximum likelihood estimate of R is [8.4] = 8. Figure 3 illustrates the likelihood fuctio L(R where the maximum of the fuctio situates at 8. 4 Coclusio I this paper, the procedures of maximum likelihood estimatio i hypergeometric distributio are preseted, ad we poit out that because the parameters N ad R are itegers, the maximizatio of the likelihood fuctio eeds to be hadled carefully with more detailed aalysis. Refereces Ardilly, P. & Tillé, Y. (2006, Samplig Methods: Spriger. Exercises ad Solutios., Bickel, P. & Doksum, K. (2001, Mathematical Statistics. Basic Ideas ad Selected Tipics., Vol. I, secod ed, Pretice-Hall. Casella, G. & Berger, R. (2002, Statistical Iferece., secod ed, Duxbury Pres. Comuicacioes e Estadística, juio 2009, Vol. 2, No. 1

6 Hawe Zhag Likelihood 0.0 0.1 0.2 0.3 0.4 2 4 6 8 10 12 14 16 R Figure 3: Likelihood fuctio with N = 20, = 5 ad x = 2. Mood, A.M, G. F. & Boes, D. (1974, Itroductio to the theory of statistics., Iteratioal editio. McGraw Hill. Shao, J. (2003, Mathematical statistics., secod ed, Spriger. Comuicacioes e Estadística, juio 2009, Vol. 2, No. 1