5 Decision Theory: Basic Concepts

Size: px
Start display at page:

Download "5 Decision Theory: Basic Concepts"

Transcription

1 5 Decisio Theory: Basic Cocepts Poit estimatio of a ukow parameter is geerally cosidered the most basic iferece problem. Speakig geerically, if θ is some ukow parameter takig values i a suitable parameter space Θ, the a poit estimate is a educated guess at the true value of θ. Now, of course, we do ot just guess the true value of θ without some iformatio. Iformatio comes from data; some iformatio may also come from expert opiio separate from data. I ay case, a poit estimate is a fuctio of the available sample data. To start with, we allow ay fuctio of the data as a possible poit estimate. Theory of iferece is used to separate the good estimates from the ot so good or bad estimates. As usual, we will start with a example to help us uderstad the geeral defiitios. Example 5.1. Suppose X 1,,X Poi(λ), λ > 0. Suppose we wat to estimate the parameter λ. Now, of course, λ = E λ (X 1 ), i.e., λ is the populatio mea. So, just istictively, we may wat to estimate λ by the sample mea X = X 1+ +X ; ad, ideed, X is a possible poit estimator of λ. While λ takes values i Θ = (0, ), the estimator X takes values i A = [0, ); X ca be equal to zero! So, the set of possible values of the parameter ad the set of possible values of a estimator eed ot be idetical. We must allow them to be differet sets, i geeral. Now, X is certaily ot the oly possible estimator of λ. We ca use essetially ay fuctio of the sample observatios X 1,,X to estimate the parameter λ. For example, just X 1, or X 1 + X 2 X 3, or eve seemigly poor estimators like X1 4. Ay estimator is allowed to begi with; theory will separate the good oes from the bad oes. Next, suppose X 1,,X N(µ, σ 2 ), where µ, σ are ukow parameters. So, ow, we have got a two dimesioal parameter vector, θ = (µ, σ). Suppose we wat to estimate µ. Oce agai, a possible estimator is X; a few other possible estimators are the sample media, M = media{x 1,,X }, or X 2+X 4 2, or eve seemigly poor estimators, like 100 X. Suppose, it was kow to us that µ must be oegative. The, the set of possible values of µ is Θ = [0, ). However, the istictive poit estimator X ca take ay real value. It takes values i the set A = (, ). You would otice agai that A is ot the same as Θ i this case. I geeral, A ad Θ ca be differet sets. If we wat istead to estimate σ 2, which is the populatio variace, a first thought is to use the sample variace s 2 = 1 1 (X i X) 2 (dividig by 1 rather tha seems a little odd at first glace, but has a mathematical reaso, which will be clear soo). I this example, the parameter σ 2 ad the estimator s 2 both take values i (0, ). But, if we kew that σ 2 100, say, the oce agai, Θ ad A would be differet. Ad, as always, there are may other possible estimators of σ 2, for example, 1 1 (X i M ) 2, where M is the sample media. Here is a formal defiitio of a poit estimator. 267

2 Defiitio 5.1. Let the vector of sample observatios X () = (X 1,,X ) have a joit distributio P = P ad let θ = h(p), takig values i the parameter space Θ R p, be a parameter of the distributio P. Let T(X 1,,X ) takig values i a specified set A R p be a geeral statistic, The, ay such T(X 1,,X ) is called a poit estimator of θ. The set Θ is called the parameter space, ad the set A is called the statisticia s actio space. If specific observed sample data X 1 = x 1,,X = x are available, the the particular value T(x 1,,x ) is called a estimate of θ. Thus, the word estimator applies to the geeral fuctio T(X 1,,X ), ad the word estimate applies to the value T(x 1,,x ) for specific data. I this text, we use estimator ad estimate syoymously. A stadard geeral otatio for a geeric estimate of a parameter θ is ˆθ = ˆθ(X 1,,X ) Evaluatig a Estimator ad MSE Except i rare cases, the estimate ˆθ would ot be exactly equal to the true value of the ukow parameter θ. It seems reasoable that we like a estimate ˆθ which geerally comes quite close to the true value of θ, ad dislike a estimate ˆθ which geerally misses the true value of θ by a large amout. How, are we goig to make this precise ad quatifiable? The geeral approach to this questio ivolves the specificatio of a loss ad a risk fuctio, which we will itroduce i a later sectio. For ow, we describe a very commo ad eve hugely popular criterio for evaluatig a poit estimator, the mea squared error. Defiitio 5.2. Let θ be a real valued parameter, ad ˆθ a estimate of θ. The mea squared error (MSE) of ˆθ is defied as MSE = MSE(θ, ˆθ) = E θ [(ˆθ θ) 2 ], θ Θ. If the parameter θ is p-dimesioal, θ = (θ 1,,θ p ), ad ˆθ = ˆθ 1,, ˆθ p ), the the mea squared error f ˆθ is defied as MSE = MSE(θ, ˆθ) = E θ [ ˆθ θ 2 ] = p E θ [(ˆθ i θ i ) 2 ], θ Θ Bias ad Variace It turs out that the mea squared error of a estimator has oe compoet to do with systematic error of the estimator ad a secod compoet to do with radom error of the estimator. If a estimator ˆθ routiely overestimated the parameter θ, the usually we will have ˆθ θ > 0; we thik of this as the estimator makig a systematic error. Systematic 268

3 errors ca also be made by routiely uderestimatig θ, We quatify the systematic error of a estimator by lookig at E θ (ˆθ θ) = E θ (ˆθ) θ; this is called the bias of the estimator; we deote it as b(θ). If the bias of a estimator ˆθ is always zero, b(θ) = 0 for all θ, the the estimator ˆθ is called ubiased. O the other had, the estimator ˆθ may ot make much systematic error, but still ca just be ureliable because from oe dataset to aother, its accuracy may differ wildly. This is called radom or fluctuatio error, ad we ofte quatify the radom error by lookig at the variace of the estimator, Var θ (ˆθ). A pleasat property of the MSE of a estimator is that always the MSE eatly breaks ito two compoets, oe ivolvig the bias, ad the other ivolvig the variace. You have to try to keep both of them small; large biases ad large variaces are both red sigals. Here is a bias-variace decompositio result. Theorem 5.1. Let θ be a real valued parameter ad ˆθ a estimator with a fiite variace uder all θ. The, MSE(θ, ˆθ) = Var θ (ˆθ) + b 2 (θ), θ Θ. Proof; To prove this simple theorem, we recall the elemetary probability fact that for ay radom variable U with a fiitte variace, E(U 2 ) = Var(U)+[E(U)] 2. Idetifyig U with ˆθ θ, MSE(θ, ˆθ) = Var θ (ˆθ θ) + [E(ˆθ θ)] 2 = Var θ (ˆθ) + b 2 (θ) Computig ad Graphig MSE We will ow see oe itroductory example. Example 5.2. (Estimatig a Normal Mea ad Variace) Suppose we have sample observatios X 1,,X N(µ, σ 2 ), where < µ <, σ > 0 are ukoow parameters. The parameter is two dimesioal, θ = (µ, σ). First cosider estimatio of µ, ad as a example, cosider these two estimates: X ad X +1. We will calculate the MSE of each estimate, ad make some commets. Sice E( X) = 1 sum E(X i) = 1 (µ) = µ for ay µ ad σ, the bias of X for estimatig µ is zero; E θ ( X) µ = µ µ = 0. I other words, X is a ubiased estimate of µ. Therefore, the MSE of X is just its variace, E[( X µ) 2 ] = Var( X) = Var(X 1) = σ2. Notice that the MSE of X does ot deped o µ; it oly depeds o σ 2. Now, we will fid the MSE of the other estimate X +1. For this, by our Theorem 3.1, the MSE of X +1 is E[( X + 1 µ)2 ] = Var( X X ) + [E( µ)]2 269

4 Compariso of MSE of Estimates i Example mu = ( + 1 )2 Var( X) + [ + 1 µ µ]2 = ( + 1 )2σ2 + ( )2 µ 2 = µ 2 ( + 1) 2 + σ2 ( + 1) 2. Notice that the MSE of this estimate does deped o both µ ad σ 2 µ ; 2 is the cotributio of the bias compoet i the MSE, ad σ2 is the cotributio of the variace (+1) 2 (+1) 2 compoet. For purposes of compariso, we plot the MSE of both estimates, takig = 10 ad σ = 1; the MSE of X is costat i µ, ad the MSE of X +1 is a quadratic i µ. For µ ear zero, X +1 has a smaller MSE, but otherwise, X has a smaller MSE. The graphs of the two mea squared errors cross. This is quite ofte the case i poit estimatio. Next, we cosider estimatio of σ 2. Now, we will use as our example the estimates s 2 = 1 1 (X i X) 2 ad 1 (X i X) 2. First, we will prove that s 2, i.e., the estimator that divides by 1 is a ubiased estimator of σ 2. Here is the proof. First, ote the algebraic idetity (X i X) 2 = = (Xi 2 2 XX i + X 2 ) = Xi 2 2 X 2 + X 2 = Xi 2 2 X Xi 2 X 2. X i + X 2 Therefore, E[ (X i X) 2 ] = E(X1) 2 E[ X 2 ] = (σ 2 + µ 2 ) ( σ2 + µ2 ) = ( 1)σ 2, 270

5 which gives E(s 2 ) = 1 1 ( 1)σ2 = σ 2. Therefore, the MSE of s 2, by Theorem 3,1, is the same as its variace, E[(s 2 σ 2 ) 2 ] = Var(s 2 1 ) = Var( 1 (X i X) 2 ) = 1 (X i X) 2 ( 1) 2Var(σ2 σ 2 ) = = σ 4 2σ4 ( 1) 22( 1) = 1, by usig the fact that if X 1,,X are N(µ, σ 2 ), the σ 4 ( 1) 2Var( (X i X) 2 σ 2 ) P (X i X) 2 that the variace of a χ 2 distributio is twice its degrees of freedom. σ 2 χ 2 ( 1) ad Next, we will f the MSE of the secod estimate 1 (X i X) 2. This estimate does have a bias! Its bias is Also, its variace is b(θ) = E[ 1 (X i X) 2 ] σ 2 = 1 σ2 σ 2 = σ2. Var( 1 (X i X) 2 ) = Var( 1 1 (X i X) 2 ) = ( 1)2 2 2σ 4 1 = 2σ4 ( 1) 2. Therefore, by Theorem 3.1, the MSE of our secod estimate 1 (X i X) 2 is E[( 1 (X i X) 2 σ 2 ) 2 ] = 2σ4 ( 1) 2 + σ4 (2 1)σ4 = 2 2. You may verify that this is always smaller tha 2σ4 1, the MSE of s2. So, we have the sceario that i this example, the biased estimator 1 (X i X) 2 always has a smaller MSE tha the ubiased estimator s 2. This is a special example; do ot expect this to automatically hold i aother example Loss ad Risk Fuctios Use of MSE is a special case of a more geeral formulatio of estimatio as a decisio theory problem. I this formulatio, we view the poit estimatio problem as a two perso game, player I who chooses the true value of θ from Θ, ad player II who chooses a poit estimate, possibly after observig some sample data. Player I is called ature, ad player II the statisticia. I accordace with our previous otatio, we allow the statisticia to choose the value of his estimate from a well defied set A, called the actio space. There is 271

6 aother compoet ecessary to complete the formulatio. After player I chooses θ from Θ, ad player II chooses a specific estimate for θ, say ˆθ(x 1,,x ) = a, player II has to pay player I a pealty for makig a possibly icorrect guess o the true value of θ. This is called the loss fuctio, ad is deoted as L(θ, a), θ Θ, a A. I priciple, the loss fuctio ca bbe almost ay fuctio with some atural costraits; but, i our practice of statistics, we ted to use just oe or two most of the times, because it is easier to work with them. We ote that the realized value of a estimate ˆθ(x 1,,x ) depeds o the particular data x 1,,x actually obtaied. I oe samplig experimet, our data may be such that the estimate ˆθ(x 1,,x ) comes quite close to the true value of θ, while i aother idepedet samplig experimet, the data may be such that the estimate is ot so good. We try to look at average performace of ˆθ(X 1,,X ) as a estimatig procedure, or decisio rule. This motivates the defiitio of a risk. Oe commet is ow i order. Sice may other iferece problems besides poit estimatio ca be put uder the geeral formulatio of decisio theory, we prefer to use a sigle otatio for the statisticia s decisio procedure. To be cosistet with much of the statistical literature, we use the otatio δ(x 1,,X ). I the poit estimatio cotext, δ(x 1,,X ) will mea a estimator; i aother cotext, it may mea somethig else, like a test. Defiitio 5.3. Let Θ deote ature s parameter space, A the statisticia s actio space, ad L(θ, a) a specific loss fuctio. The, the risk fuctio of a decisio procedure δ(x 1,,X ) is the average loss icurred by δ: R(θ, δ) = E θ [L(θ, δ(x 1,,X ))]. I the above, the expectatio E θ meas expectatio with respect to the joit distributio of (X 1,,X ) uder θ. Example 5.3. (Some Possible Loss Fuctios) We usually impose the coditio that L(θ, a) = 0 if a = θ (o pealty for perfect work). We also ofte take L(θ, a) to be a mootoe odecreasig fuctio of the physical distace betwee θ ad a. So, if θ ad a are real valued, we ofte let L(θ, a) = W( theta a ) for some fuctio W with W(0) = 0, ad W(x) mootoe odecreasig o [0, ). Some commo choices are: Squared error loss L(θ, a) = (a θ) 2 ; Absolute error loss L(θ, a) = a θ ; Zero-K loss L(θ, a) = KI a θ >c ; 272

7 Asymmetric loss L(θ, a) = K 1 if a < θ,= 0 if a = θ, K 2 if a > θ; Power loss L(θ, a) = a θ α, α > 0; Weighted squared error loss L(θ, a) = w(θ)(a θ) 2. I a practical problem, writig dow a loss fuctio that correctly reflects cosequeces of various decisios uder every possible circumstace is difficult ad eve impossible to do. You should treat decisio theory as a frequetly useful geeral guide as to the choice of your procedure. By far, i poit estimatio, squared error loss is the most commo, because it is the easiest to work with, ad gives reasoable geeral guide as to which procedures are good ad which are bad. Note the importat fact that Risk fuctio uder squared error loss is the same as MSE Optimality ad Priciple of Low Risk It seems oly atural that oce we have specified a loss fuctio, we should prefer decisio procedures δ that have low risk. Lower risk is cosidered such a good property, that procedures whose risk fuctios ca always be beate by some other alterative procedure are called iadmissible. I fact, it would be the best if we could fid oe decisio procedure δ 0 which has the smallest risk amog all possible decisio procedures at every possible value of θ. Ufortuately, rarely, this is possible. There are o uiformly best decisio procedures. Risk fuctios of decet decisio procedures ted to cross; sometimes, oe is better ad some other times, the other procedure is better. I the ext few sectios, we will give a elemetary itroductio to popular methods for choosig amog decisio procedures whose risk fuctios cross. But, right ow, let us see oe more cocrete risk calculatio. Example 5.4. (Estimatio of Biomial p). Suppose observatios X 1,,X are take from a Beroulli distributio with a ukow parameter p, 0 < p < 1. Suppose the statisticia s actio space is the closed iterval [0, 1], ad that we use the squared error loss fuctio (a p) 2. Let T = X i be the total umber of successes. We cosider three decisio procedures (estimators) δ 1 (X 1,,X ) = T ; δ 2 (X 1,,X ) = T ; δ 3 (X 1,,X )

8 Risk Fuctios of Three Estimators i Example p Note that all three estimators are of the form a+bt for suitable a, b. The bias of a geeral estimator of this form is b(p) = a + bp p = a + (b 1)p, ad so, the risk fuctio of a geeral estimator of this form is R(p, a + bt) = E[(a + bt p) 2 ] = b 2 (p) + Var(a + bt) = (a+(b 1)p) 2 +b 2 p(1 p) = a 2 +(b 2 +2ab 2a)p+( 2 b 2 b 2 2b+1)p 2. (1) δ 1 correspods to a = 0, b = 1, δ 2 to a = 1 +2, b = 1 +2, ad δ 3 to a = 1 2, b = 0. Substitutio ito (3) yields: p(1 p) R(p, δ 1 ) = ; 1 + ( 4)p(1 p) R(p, δ 2 ) = ( + 2) 2 ; R(p, δ 3 ) = 1 p(1 p). 4 These three risk fuctios are plotted above. Evidetly, oe of them is uiformly the best amog the three estimators. Near p = 0, 1, δ 1 is the best, ear p = 1 2, δ 3 is the best, ad i two subitervals away from 0, 1 ad away from 1 2, δ 2 is the best. We must somehow collapse each of these risk fuctios ito a sigle represetative umber, so that we ca compare umbers rather tha fuctios. We preset some traditioal ideas o how to collapse risk fuctios to a sigle represetative umber i our ext two sectios. 274

9 5.0.6 Prior Distributios ad Bayes Procedures: First Examples A ituitively attractive method to collapse a risk fuctio ito a sigle umber is to take a average of the risk fuctio with respect to a weight fuctio, say π(θ). For example, i our above biomial example, the parameter is p, ad we average the risk fuctio of ay procedure δ by calculatig 1 0 R(p, δ)π(p)dp, for a specific weight fuctio π(p). Usually, we choose π(p) to be a probability desity fuctio o [0, 1], the parameter space for p. Such weight fuctios are called prior distributios for the parameter, ad the average risk r(π, δ) = 1 0 R(p, δ)π(p)dp is called the Bayes risk of the procedure δ with respect to the prior π. The idea is to prefer a procedure δ 2 to δ 1 if r(π, δ 2 ) < r(π, δ 1 ). Sice we are o loger comparig two fuctios, but oly comparig two umbers, it is usually the case that oe of the two umbers will be strictly smaller tha the other oe, ad that will tell us which procedure is to be preferred, provided that we are sold o the choice of the specific prior distributio π that we did choose. This approach is kow as Bayesia decisio theory ad is a very importat compoet of statistical iferece. The Bayes approach will ofte give you useful ideas about which procedures may be reasoable i a give problem. We ow give the formal defiitios. Defiitio 5.4. Let G be a specific probability distributio o atures s parameter space Θ; G is called a prior distributio or simply a prior for the parameter θ. Defiitio 5.5. Let R(θ, δ) deote the risk fuctio of a geeral decisio procedure δ uder a specific loss fuctio L(θ, a). The Bayes risk of δ with respect to the prior G is defied as r(g, δ) = E G [R(θ, δ)], where the otatio E G meas a expectatio by treatig the parameter θ as a radom variable with the distributio G. If θ is a cotiuous parameter ad the prior G has a desity fuctio π, we have r(π, δ) = Θ R(θ, δ)π(θ)dθ. Remark: Although we do ot look kidly upo such procedures δ, the Bayes risk r(g, δ) ca be ifiite. Sice lower risks are preferred i geeral, we would also prefer lower Bayes risk. This raises a immediate atural questio; which particular δ makes r(g, δ) the smallest possible amog all decisio procedures? This motivates a importat defiitio. Defiitio 5.6. Give a loss fuctio L ad a prior distributio G, ay procedure δ G 275

10 such that r(g, δ G ) = if δ is called a Bayes procedure uder L ad G. r(g, δ) Remark: A Bayes procedure eed ot always exist, ad whe it exists it eed ot i geeral be uique. But, uder commoly used loss fuctios, a Bayes procedure does exist, ad it is eve uique. This is detailed later. Let us ow retur to the biomial example to illustrate these various ew cocepts. Example 5.5. (Estimatio of Biomial p). As a example, take the specific prior distributio G = U[0, 1] with π(p) = 1I 0 p 1. The, the Bayes risks of the three procedures δ 1, δ 2, δ 3 are r(π, δ 2 ) = r(π, δ 1 ) = 1 0 r(π, δ 3 ) = 1 0 p(1 p) dp = 1 6 ; 1 + ( 4)p(1 p) ( + 2) 2 dp = 1 0 [ 1 4 p(1 p)]dp = ( + 2) ; We ow see o easy algebra that if > 2, the a orderig amog the three procedures δ 1, δ 2, δ 3 has emerged: r(π, δ 2 ) < r(π, δ 1 ) < r(π, δ 3 ). Therefore, amog these three procedures, we should prefer δ 2 over the other two if > 2 ad if we use G = U[0, 1] as the prior. Compare this eat coclusio with the difficulty of reachig a coclusio by comparig their risk fuctios, which cross. Remark: It may be show that δ 2 is i fact the uique Bayes procedure i the Biomial p example, if loss is squared error ad G = U[0, 1]. It will ot be proved i this chapter. However, if you chage the choice of the prior G to some other distributio o [0, 1], the the Bayes procedure will o loger be δ 2. It will be some other procedure, depedig o exactly what the prior distributio G is; Bayes procedures are to be treated i detail i Chapter Maximum Risk ad Miimaxity A secod commo method to collapse a risk fuctio R(θ, δ) to a sigle umber is to look at the maximum value of the risk fuctio over all possible values of the parameter θ. Precisely, cosider the umber R(δ) = max R(θ, δ). θ Θ I this approach, you will prefer δ 2 to δ 1 if R(δ2 ) < R(δ 1 ). Accordig to the priciple of miimaxity, we should ultimately use that procedure δ 0 that results i the miimum 276

11 possible value of R(δ) over all possible procedures δ. That is, you miimize over δ the maximum value of R(θ, δ); hece the ame miimax. Cosider a layma s example to uderstad the cocept of miimaxity. Suppose you have to travel from a city A to a small tow B which is about 500 miles away. You ca either fly or drive. However, oly sigle egie small plaes fly to tow B, ad you are worried about their safety. O the other had, drivig to B will take you 10 hours, ad it will cost you valuable time, may be some icome, opportuities for doig other pleasurable thigs, ad it would be rather tirig. But you do ot thik that drivig to B ca cause you death. The worst possible risk of drivig to B is lower tha the worst possible risk of flyig to B, ad accordig to the miimaxity priciple, you should drive. If we preset miimaxity i such a light, it comes across as the philosophy of the timid. But, i may problems of statistical iferece, the priciple of miimaxity has ultimately resulted i a procedure that is reasoable, or that you would have probably used ayway because of other reasos. It ca also serve as a idealistic bechmark agaist which you ca evaluate other procedures. Moreover, although it is ot obvious, the approach of miimaxity ad the Bayes approach are sometimes coected. A miimax procedure is sometimes a Bayes procedure with respect to a suitable prior G, ad this sort of a coectio gives some additioal credibility to miimax procedures. Brow (1994, 2000) are two very lucidly writte articles that explai the role of miimaxity i the evolutio of statistical iferece. Let us ow revisit our biomial p example for illustratio. Example 5.6. (Estimatio of Biomial p). The plot of the risk fuctios of the three procedures δ 1, δ 2, δ 3 reveals what their maximum risks are. The maximum risk of δ 1 ad δ 2 is attaied at p = 1 2, ad the maximum risk of δ 3 is at the boudary poits p = 0, 1 (strictly speakig, the maximum is a supremum, ad the supremum is ot attaied at ay p i the ope iterval (0, 1)). Precisely, R(δ 1 ) = R( 1 2, δ 1) = 1 4 ; R(δ 2 ) = R( 1 2, δ 2) = 4( + 2) 2; We have the orderig R(δ 3 ) = lim p 0,1 R(p, δ 3) = 1 4. R(δ 2 ) < R(δ 1 ) < R(δ 3 ), ad therefore, amog the three procedures δ 1, δ 2, δ 3, accordig to the priciple of miimizig the maximum risk, we should prefer δ

12 Remark: It may be show that the overall miimax procedure amog all possible procedures is a rather obscure oe; it is ot δ 2. You will see it i Chapter 7. Note that the priciple of miimaxity i its formulatio does ot call for specificatio of a prior. Some statisticias cosider this a plus. However, as we remarked earlier, at the ed, a miimax procedure may tur out to be a Bayes procedure with respect to some prior. Very ofte, this prior is difficult or impossible to guess. Miimax procedures ca be extremely difficult to fid. It is also importat to remember that although miimaxity does ot ivolve the assumptio of a specific prior, absolutely does require the assumptio of a specific loss fuctio. Just as a example, if i our biomial p problem, we use absolute error loss p a istead of squared error loss (p = a) 2, the miimax procedure will chage, ad eve worse, o oe has ever worked it out! Assumptios Matter Statistical iferece always requires some assumptios. You have to assume somethig. How much we assume depeds o the problem, o the approach, ad perhaps o some other thigs, like how much data do we have to test our assumptios. For example, for fidig a miimax procedure, we have to make assumptios about the model, meaig the distributio of the uderlyig radom variable (ormal or Cauchy, for istace), ad we have to make a assumptio of a specific loss fuctio. I the Bayes approach, too, we will eed to make both oof these assumptios ad a extra assumptio of a specific prior G. It is importat that we at least kow what assumptios we have made, ad whe possible, make some attempt to validate those assumptios by usig the available data. Model validatio is ot a easy task, especially whe we have made assumptios which are hard to verify without a lot of data. Statisticias do ot agree o whether prior distributios o parameters ca or eve should be validated from oe s data. There are very few formal ad widely accepted methods for verifyig a chose loss fuctio. The origial idea was to elicit a problem specific loss fuctio by havig a persoal meetig with the cliet. Such efforts have geerally bee regarded as impractical, or have ot resulted i clear success. I additio, if loss fuctios are always problem specific, the we elimiate the premise of a structured theory that will apply simultaeously to may problems. So, whe it comes to the assumptio of a loss fuctio, we have geerally sided with coveiece; it is the primary reaso that squared error loss ad MSE are so dear to us as a professio. We recommed Dawid (1982), Seidefeld (1985), ad Kadae ad Wolfso (1998) for thoughtful expositio of these issues. subsubsectiorobustess as a Theme I statistical iferece, the phrase robust applies to the somewhat ebulous property of isesitivity of a particular procedure to the assump- 278

13 tios that were made. The idea of robust ifereces seems to have bee first metioed i those words i Box (1953). Here is a simple example. You will see i Chapters 6 ad 8 that for estimatig the locatio parameter µ of a uivariate ormal distributio, the sample mea X is almost uiversally regarded as the estimator that oe should use. Now, we have made some assumptios here, amely, that the sequece of our sample observatios X 1,,X are observatios from some ormal distributio, N(µ, σ 2 ). uppose ow that we did our modellig a little too carelessly. Suppose the observatios are, but from a distributio with a much heavier tail tha the ormal; e.g., suppose a much better model would have bee that X 1,,X C(µ, σ) for some µ, σ. The, as it turs out, the sample mea X is a extremely poor estimate of µ. You ca therefore argue that X is ot robust to the assumptio of ormality, whe it comes to our assumptio about the tail. We ca similarly ask ad ivestigate if X is robust to the assumptio of idepedece of our sample observatios X 1,,X. Ca we do aythig about it? The aswer depeds o our ambitio. No oe ca build a procedure which is agreably robust to every assumptio that oe has made. However, if we isolate a few assumptios, perhaps oly those that we are most usure about, the it may be possible to costruct procedures which are robust agaist those specific assumptios. For example, if we assume symmetry, but wat to be robust agaist the tail of our distributio, the we ca do somethig about it. As a illustratio, if all we wish is to be robust agaist the tail, the to estimate the locatio parameter µ of a desity o the real lie, we would be much better off usig the sample media as a estimator of µ, istead of the sample mea. But, if i additio to the tail, we wat to be robust agaist the assumptio that our desity is symmetric aroud the parameter µ, the we do ot have ay reasoable robust solutios to that problem. We just caot be simultaeously robust agaist all our assumptios. It is also importat to uderstad that robustess is a desirable goal, but ot ecessarily the first goal. I settig robustess as our first goal, we may ed up selectig a procedure which is so focused o cautio that it does oly a mediocre job i all situatios, ad ot a excellet job i ay situatio. Robustess is iheretly a murky ad elusive cocept. It is sometimes obtaiable to a limited extet. But, still, we must always pay extremely close attetio to all of our assumptios. I doig hoest iferece, it is importat that we kow ad uderstad all the assumptios we have made. For example, i a poit estimatio problem, we should ask ourselves: 1. Have we assumed that our sample observatios are from some F? 2. Have we assumed that F belogs to some specific parametric family, such as Poisso or ormal? 3. What is our assumed loss fuctio? 279

14 4. What is our assumed prior distributio? 5. Which of these assumptios are seriously questioable? 6. Do we kow how to examie if our proposed estimator is reasoably robust to those assumptios? 7. If we coclude that it is ot reasoably robust, do we kow what to do ext? We will treat robust estimatio to some extet i Chapter 7. Some additioal refereces o robust iferece are Huber (1981), Hampel et. al (1986), Portoy ad He (2000), ad Stigler (2010). 5.1 Exercises Exercise 5.1. (Skills). Suppose X 1,,X 5 are five observatios from a Beroulli distributio with parameter p. Let T = 5 X i. (a) Calculate i closed form the MSE of the followig three estimators: δ 1 (X 1,,X 5 ) = T 5 ; δ 2 (X 1,,X 5 ) = T + 2 ; 9 δ 3 (X 1,,X 5 ) = 1 2, if T = 2, 3 = T 5, if T = 0, 1, 4, 5. (b) Plot the MSE of each estimator o a sigle plot, ad commet. Exercise 5.2. (Partly Coceptual). Suppose X, Y, Z are three observatios from a ormal distributio with mea µ ad kow variace σ 2. Let U, V, W deote the smallest, the media, ad the largest amog the three observatios X, Y, Z. (a) Which of the followig three estimators of µ are ubiased, i.e., the bias is zero for all µ: V ; W; U + W ;.2U +.6V +.2W. 2 (b) Cosider the followig strage soudig estimator. Toss a fair coi. If it lads o heads, estimate µ as V ; if it lads o tails, estimate µ as ubiased? (c) Do you kow how to calculate the MSE of the estimator i part (b)? X+Y +Z 3. Is this estimator Exercise 5.3. Suppose that we have two observatios o a ukow parameter µ. The first oe is X N(µ,1), ad the secod oe is Y U[µ 1, µ + 1], where X, Y are idepedet. 280

15 (a) Fid the MSE of a estimator of µ of the geeral form ax + by. (b) Whe is such a estimator ubiased? (c) Betwee the three estimators X, Y, X+Y 2, do you have a preferece for oe of them? Justify your preferece. Exercise 5.4. (Thresholdig Estimate). Suppose X N(µ, 1). Cosider the thresholdig estimator of µ that equals zero if X 1 ad equals X if X > 1. (a) Fid the expectatio ad the secod momet of this estimator. (b) Hece fid the bias ad the variace of this estimator. (c) Plot the MSE of this estimator ad the MSE of the estimator X. Do they cross? (d) Ituitively, whe would you prefer to use the thresholdig estimate over X? Exercise 5.5. (Absolute Error Loss). Suppose X 1,,X N(µ, σ 2 ). If the loss fuctio is the absolute error loss L(µ, a) = µ a, fid the risk fuctio of X. Exercise 5.6. (Absolute Error Loss). Suppose a sigle observatio X P oi(λ). If the loss fuctio is the absolute error loss L(λ, a) = λ a, (a) Fid the risk of the estimator X whe λ = 1. (b) Fid the risk of the estimator X whe λ = 1.4. Exercise 5.7. Suppose a sigle observatio X Poi(λ). Suggest a estimator of λ 2 ad the criticize your ow choice. Exercise 5.8. (Skills). Suppose X 1,,X N(µ, σ 2 ), where σ 2 is cosidered kow. Suppose loss is squared error, ad cosider estimators of µ of the geeral form a + b X. (a) For what choices of a, b is the risk fuctio uiformly bouded as a fuctio of µ? (b) For such choices of a, b, fid the maximum risk. (c) Idetify the particular choice of a, b that gives the smallest value for the maximum risk. What is the correspodig estimator? Exercise 5.9. (Cost of Samplig). Suppose a certai Beroulli parameter is defiitely kow to be betwee.46 ad.54. It costs some c dollars to take oe Beroulli observatio. Would you care to sample, or estimate p to be.5 without botherig to take ay data? Exercise (Sigal plus Backgroud). Suppose a latet variable Y Poi(λ), λ > 0; Y caot be directly measured. Istead we measure X = Y + B, where B Poi(2), ad Y, B are idepedet. I a physical experimet, your observed value of X tured out to be zero. What would be your estimate of λ? Exercise (Bayes Risks). Suppose X 1,,X Poi(λ), ad suppose λ has a stadard expoetial prior distributio. Take a squared error loss fuctio. (a) Calculate the Bayes risk of each of the followig estimators of λ: The costat estimate 1; X; X i

16 (b) Which of the three estimators has the smallest Bayes risk? The largest Bayes risk? Exercise (Bayes Risks). Suppose X N(µ,1) ad Y U[µ 1, µ + 1], ad that X, Y are idepedet. Suppose µ has a stadard ormal prior distributio. Take a squared error loss fuctio. (a) Calculate the Bayes risk of each of the followig estimators of µ: X, Y, X + Y 2 (b) Which estimator has a smaller Bayes risk? Exercise (Bayes Risks). Suppose X 1,,X N(µ, σ 2 ), where σ 2 is cosidered kow. Suppose loss is squared error. Calculate the Bayes risk of the estimator +1 X for each of the followig prior distributios G: G = U[ 3, 3];N(0, 1); C(0, 1). Exercise (Uiform Distributios). Suppose X 1,,X U[0, θ], θ > 0. Take a squared error loss fuctio. (a) Fid the risk fuctio of each of the followig three estimators of θ: where X () is the sample maximum. 2 X; X () ; + 1 X (), (b) Calculate the Bayes risk of each of these estimators if θ has a geeral gamma prior distributio, θ G(α, λ). (c) Which estimator has the smallest Bayes risk? Does this deped o the values of α, λ? Exercise (Maximum Risk). Suppose X N(µ,1), ad it is kow that 1 µ 1. Suppose loss is squared error. (a) Calculate the risk fuctio of each of the followig estimators: X; X ; the costat estimator zero. 2 (b) Fid the maximum risk of each estimator. Commet. (c) Cosider estimators of the geeral form a+bx, < a <, b 0. For which choice of a, b is the maximum risk miimized? Exercise (Maximum Risk). Suppose X 1,,X 16 are sixtee observatios from a Beroulli distributio with parameter p, 0 < p < 1. Let T = 16 X i. (a) Calculate the risk fuctio of each of the followig estimators: T 16 ; T (b) Fid the maximum risk of each estimator. Commet.. 282

17 5.2 Refereces Berger, J. (2010). Statistical Decisio Theory ad Bayesia Aalysis, Spriger, New York. Bickel, P. ad Doksum, K. (2001). Mathematical Statistics, Basic Ideas ad Selected Topics, Vol.I, Pretice Hall, NJ Box, G.E.P. (1953). Noormality ad tests of variace, Biometrika, 40, Brow, L. (1994). Miimaxity, more or less, I Stat. Dec. Theory ad Rel. Topics, S.S. Gupta ad J. Berger Eds., Spriger, New York. Brow, L. (2000). A essay o statistical decisio theory, JASA, 95, DasGupta, A. (2008). Asymptotic Theory of Statistics ad Probability, Spriger, New York. Dawid, A.P. (1982). The well calibrated Bayesia, with discussio, JASA, 77, Hampel, F., Rochetti, E., Rousseeuw, P., ad Stahel, W. (1986). Robust Statistics: The Approach Based o Ifluece Fuctios, Wiley, New York. Huber, P. (1981). Robust Statistics, Wiley, New York. Johstoe, I. (2012). Fuctio Estimatio, Gaussia Sequece Models, ad Wavelet Shrikage, Cambridge Uiv. Press, Cambridge, Forthcomiig. Kadae, J. ad Wolfso, L. (1998). Experieces i elicitatio, The Statisticia, 47, 1, Lehma, E. ad Casella, G. Theory of Poit Estimatio, Spriger, New York. Portoy, S. ad He, X. (2000). A robust jourey to the ew milleium, JASA, 95, Seidefeld, T. (1985). Calibratio, coherece, ad scorig rules, Phil. Sci., 52, Youg, A. ad Smith, R. (2010). Essetials of Statistical Iferece, Cambridge Uiv. Press, Cambridge. Stigler, S. (2010). The chagig history of robustess, Amer. Statist., 64,

5. Best Unbiased Estimators

5. Best Unbiased Estimators Best Ubiased Estimators http://www.math.uah.edu/stat/poit/ubiased.xhtml 1 of 7 7/16/2009 6:13 AM Virtual Laboratories > 7. Poit Estimatio > 1 2 3 4 5 6 5. Best Ubiased Estimators Basic Theory Cosider agai

More information

Parametric Density Estimation: Maximum Likelihood Estimation

Parametric Density Estimation: Maximum Likelihood Estimation Parametric Desity stimatio: Maimum Likelihood stimatio C6 Today Itroductio to desity estimatio Maimum Likelihood stimatio Itroducto Bayesia Decisio Theory i previous lectures tells us how to desig a optimal

More information

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS Lecture 4: Parameter Estimatio ad Cofidece Itervals GENOME 560 Doug Fowler, GS (dfowler@uw.edu) 1 Review: Probability Distributios Discrete: Biomial distributio Hypergeometric distributio Poisso distributio

More information

5 Statistical Inference

5 Statistical Inference 5 Statistical Iferece 5.1 Trasitio from Probability Theory to Statistical Iferece 1. We have ow more or less fiished the probability sectio of the course - we ow tur attetio to statistical iferece. I statistical

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpeCourseWare http://ocwmitedu 430 Itroductio to Statistical Methods i Ecoomics Sprig 009 For iformatio about citig these materials or our Terms of Use, visit: http://ocwmitedu/terms 430 Itroductio

More information

Estimating Proportions with Confidence

Estimating Proportions with Confidence Aoucemets: Discussio today is review for midterm, o credit. You may atted more tha oe discussio sectio. Brig sheets of otes ad calculator to midterm. We will provide Scatro form. Homework: (Due Wed Chapter

More information

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

point estimator a random variable (like P or X) whose values are used to estimate a population parameter Estimatio We have oted that the pollig problem which attempts to estimate the proportio p of Successes i some populatio ad the measuremet problem which attempts to estimate the mea value µ of some quatity

More information

Sampling Distributions and Estimation

Sampling Distributions and Estimation Cotets 40 Samplig Distributios ad Estimatio 40.1 Samplig Distributios 40. Iterval Estimatio for the Variace 13 Learig outcomes You will lear about the distributios which are created whe a populatio is

More information

Unbiased estimators Estimators

Unbiased estimators Estimators 19 Ubiased estimators I Chapter 17 we saw that a dataset ca be modeled as a realizatio of a radom sample from a probability distributio ad that quatities of iterest correspod to features of the model distributio.

More information

x satisfying all regularity conditions. Then

x satisfying all regularity conditions. Then AMS570.01 Practice Midterm Exam Sprig, 018 Name: ID: Sigature: Istructio: This is a close book exam. You are allowed oe-page 8x11 formula sheet (-sided). No cellphoe or calculator or computer is allowed.

More information

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

A random variable is a variable whose value is a numerical outcome of a random phenomenon. The Practice of Statistics, d ed ates, Moore, ad Stares Itroductio We are ofte more iterested i the umber of times a give outcome ca occur tha i the possible outcomes themselves For example, if we toss

More information

Lecture 4: Probability (continued)

Lecture 4: Probability (continued) Lecture 4: Probability (cotiued) Desity Curves We ve defied probabilities for discrete variables (such as coi tossig). Probabilities for cotiuous or measuremet variables also are evaluated usig relative

More information

Statistics for Economics & Business

Statistics for Economics & Business Statistics for Ecoomics & Busiess Cofidece Iterval Estimatio Learig Objectives I this chapter, you lear: To costruct ad iterpret cofidece iterval estimates for the mea ad the proportio How to determie

More information

. (The calculated sample mean is symbolized by x.)

. (The calculated sample mean is symbolized by x.) Stat 40, sectio 5.4 The Cetral Limit Theorem otes by Tim Pilachowski If you have t doe it yet, go to the Stat 40 page ad dowload the hadout 5.4 supplemet Cetral Limit Theorem. The homework (both practice

More information

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010 Combiig imperfect data, ad a itroductio to data assimilatio Ross Baister, NCEO, September 00 rbaister@readigacuk The probability desity fuctio (PDF prob that x lies betwee x ad x + dx p (x restrictio o

More information

The material in this chapter is motivated by Experiment 9.

The material in this chapter is motivated by Experiment 9. Chapter 5 Optimal Auctios The material i this chapter is motivated by Experimet 9. We wish to aalyze the decisio of a seller who sets a reserve price whe auctioig off a item to a group of bidders. We begi

More information

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1 Chapter 8 Cofidece Iterval Estimatio Copyright 2015, 2012, 2009 Pearso Educatio, Ic. Chapter 8, Slide 1 Learig Objectives I this chapter, you lear: To costruct ad iterpret cofidece iterval estimates for

More information

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3) Today: Fiish Chapter 9 (Sectios 9.6 to 9.8 ad 9.9 Lesso 3) ANNOUNCEMENTS: Quiz #7 begis after class today, eds Moday at 3pm. Quiz #8 will begi ext Friday ad ed at 10am Moday (day of fial). There will be

More information

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices?

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices? FINM6900 Fiace Theory How Is Asymmetric Iformatio Reflected i Asset Prices? February 3, 2012 Referece S. Grossma, O the Efficiecy of Competitive Stock Markets where Traders Have Diverse iformatio, Joural

More information

Notes on Expected Revenue from Auctions

Notes on Expected Revenue from Auctions Notes o Epected Reveue from Auctios Professor Bergstrom These otes spell out some of the mathematical details about first ad secod price sealed bid auctios that were discussed i Thursday s lecture You

More information

Lecture 5 Point Es/mator and Sampling Distribu/on

Lecture 5 Point Es/mator and Sampling Distribu/on Lecture 5 Poit Es/mator ad Samplig Distribu/o Fall 03 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech Road map Poit Es/ma/o Cofidece Iterval

More information

A Bayesian perspective on estimating mean, variance, and standard-deviation from data

A Bayesian perspective on estimating mean, variance, and standard-deviation from data Brigham Youg Uiversity BYU ScholarsArchive All Faculty Publicatios 006--05 A Bayesia perspective o estimatig mea, variace, ad stadard-deviatio from data Travis E. Oliphat Follow this ad additioal works

More information

Maximum Empirical Likelihood Estimation (MELE)

Maximum Empirical Likelihood Estimation (MELE) Maximum Empirical Likelihood Estimatio (MELE Natha Smooha Abstract Estimatio of Stadard Liear Model - Maximum Empirical Likelihood Estimator: Combiatio of the idea of imum likelihood method of momets,

More information

Anomaly Correction by Optimal Trading Frequency

Anomaly Correction by Optimal Trading Frequency Aomaly Correctio by Optimal Tradig Frequecy Yiqiao Yi Columbia Uiversity September 9, 206 Abstract Uder the assumptio that security prices follow radom walk, we look at price versus differet movig averages.

More information

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

Standard Deviations for Normal Sampling Distributions are: For proportions For means _ Sectio 9.2 Cofidece Itervals for Proportios We will lear to use a sample to say somethig about the world at large. This process (statistical iferece) is based o our uderstadig of samplig models, ad will

More information

NOTES ON ESTIMATION AND CONFIDENCE INTERVALS. 1. Estimation

NOTES ON ESTIMATION AND CONFIDENCE INTERVALS. 1. Estimation NOTES ON ESTIMATION AND CONFIDENCE INTERVALS MICHAEL N. KATEHAKIS 1. Estimatio Estimatio is a brach of statistics that deals with estimatig the values of parameters of a uderlyig distributio based o observed/empirical

More information

ECON 5350 Class Notes Maximum Likelihood Estimation

ECON 5350 Class Notes Maximum Likelihood Estimation ECON 5350 Class Notes Maximum Likelihood Estimatio 1 Maximum Likelihood Estimatio Example #1. Cosider the radom sample {X 1 = 0.5, X 2 = 2.0, X 3 = 10.0, X 4 = 1.5, X 5 = 7.0} geerated from a expoetial

More information

Bayes Estimator for Coefficient of Variation and Inverse Coefficient of Variation for the Normal Distribution

Bayes Estimator for Coefficient of Variation and Inverse Coefficient of Variation for the Normal Distribution Iteratioal Joural of Statistics ad Systems ISSN 0973-675 Volume, Number 4 (07, pp. 7-73 Research Idia Publicatios http://www.ripublicatio.com Bayes Estimator for Coefficiet of Variatio ad Iverse Coefficiet

More information

Monetary Economics: Problem Set #5 Solutions

Monetary Economics: Problem Set #5 Solutions Moetary Ecoomics oblem Set #5 Moetary Ecoomics: oblem Set #5 Solutios This problem set is marked out of 1 poits. The weight give to each part is idicated below. Please cotact me asap if you have ay questios.

More information

18.S096 Problem Set 5 Fall 2013 Volatility Modeling Due Date: 10/29/2013

18.S096 Problem Set 5 Fall 2013 Volatility Modeling Due Date: 10/29/2013 18.S096 Problem Set 5 Fall 2013 Volatility Modelig Due Date: 10/29/2013 1. Sample Estimators of Diffusio Process Volatility ad Drift Let {X t } be the price of a fiacial security that follows a geometric

More information

Overlapping Generations

Overlapping Generations Eco. 53a all 996 C. Sims. troductio Overlappig Geeratios We wat to study how asset markets allow idividuals, motivated by the eed to provide icome for their retiremet years, to fiace capital accumulatio

More information

CHAPTER 8 Estimating with Confidence

CHAPTER 8 Estimating with Confidence CHAPTER 8 Estimatig with Cofidece 8.2 Estimatig a Populatio Proportio The Practice of Statistics, 5th Editio Stares, Tabor, Yates, Moore Bedford Freema Worth Publishers Estimatig a Populatio Proportio

More information

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty,

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty, Iferetial Statistics ad Probability a Holistic Approach Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike 4.0

More information

Models of Asset Pricing

Models of Asset Pricing APPENDIX 1 TO CHAPTER 4 Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

Models of Asset Pricing

Models of Asset Pricing APPENDIX 1 TO CHAPTER4 Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Meas ad Proportios Itroductio: I this chapter we wat to fid out the value of a parameter for a populatio. We do t kow the value of this parameter for the etire

More information

STAT 135 Solutions to Homework 3: 30 points

STAT 135 Solutions to Homework 3: 30 points STAT 35 Solutios to Homework 3: 30 poits Sprig 205 The objective of this Problem Set is to study the Stei Pheomeo 955. Suppose that θ θ, θ 2,..., θ cosists of ukow parameters, with 3. We wish to estimate

More information

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion Basic formula for the Chi-square test (Observed - Expected ) Expected Basic formula for cofidece itervals sˆ x ± Z ' Sample size adjustmet for fiite populatio (N * ) (N + - 1) Formulas for estimatig populatio

More information

0.1 Valuation Formula:

0.1 Valuation Formula: 0. Valuatio Formula: 0.. Case of Geeral Trees: q = er S S S 3 S q = er S S 4 S 5 S 4 q 3 = er S 3 S 6 S 7 S 6 Therefore, f (3) = e r [q 3 f (7) + ( q 3 ) f (6)] f () = e r [q f (5) + ( q ) f (4)] = f ()

More information

Topic-7. Large Sample Estimation

Topic-7. Large Sample Estimation Topic-7 Large Sample Estimatio TYPES OF INFERENCE Ò Estimatio: É Estimatig or predictig the value of the parameter É What is (are) the most likely values of m or p? Ò Hypothesis Testig: É Decidig about

More information

Introduction to Probability and Statistics Chapter 7

Introduction to Probability and Statistics Chapter 7 Itroductio to Probability ad Statistics Chapter 7 Ammar M. Sarha, asarha@mathstat.dal.ca Departmet of Mathematics ad Statistics, Dalhousie Uiversity Fall Semester 008 Chapter 7 Statistical Itervals Based

More information

Appendix 1 to Chapter 5

Appendix 1 to Chapter 5 Appedix 1 to Chapter 5 Models of Asset Pricig I Chapter 4, we saw that the retur o a asset (such as a bod) measures how much we gai from holdig that asset. Whe we make a decisio to buy a asset, we are

More information

1 Random Variables and Key Statistics

1 Random Variables and Key Statistics Review of Statistics 1 Radom Variables ad Key Statistics Radom Variable: A radom variable is a variable that takes o differet umerical values from a sample space determied by chace (probability distributio,

More information

INTERVAL GAMES. and player 2 selects 1, then player 2 would give player 1 a payoff of, 1) = 0.

INTERVAL GAMES. and player 2 selects 1, then player 2 would give player 1 a payoff of, 1) = 0. INTERVAL GAMES ANTHONY MENDES Let I ad I 2 be itervals of real umbers. A iterval game is played i this way: player secretly selects x I ad player 2 secretly ad idepedetly selects y I 2. After x ad y are

More information

AY Term 2 Mock Examination

AY Term 2 Mock Examination AY 206-7 Term 2 Mock Examiatio Date / Start Time Course Group Istructor 24 March 207 / 2 PM to 3:00 PM QF302 Ivestmet ad Fiacial Data Aalysis G Christopher Tig INSTRUCTIONS TO STUDENTS. This mock examiatio

More information

Exam 1 Spring 2015 Statistics for Applications 3/5/2015

Exam 1 Spring 2015 Statistics for Applications 3/5/2015 8.443 Exam Sprig 05 Statistics for Applicatios 3/5/05. Log Normal Distributio: A radom variable X follows a Logormal(θ, σ ) distributio if l(x) follows a Normal(θ, σ ) distributio. For the ormal radom

More information

Models of Asset Pricing

Models of Asset Pricing 4 Appedix 1 to Chapter Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

of Asset Pricing R e = expected return

of Asset Pricing R e = expected return Appedix 1 to Chapter 5 Models of Asset Pricig EXPECTED RETURN I Chapter 4, we saw that the retur o a asset (such as a bod) measures how much we gai from holdig that asset. Whe we make a decisio to buy

More information

Limits of sequences. Contents 1. Introduction 2 2. Some notation for sequences The behaviour of infinite sequences 3

Limits of sequences. Contents 1. Introduction 2 2. Some notation for sequences The behaviour of infinite sequences 3 Limits of sequeces I this uit, we recall what is meat by a simple sequece, ad itroduce ifiite sequeces. We explai what it meas for two sequeces to be the same, ad what is meat by the -th term of a sequece.

More information

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval BIOSTATS 540 Fall 015 6. Estimatio Page 1 of 7 Uit 6. Estimatio Use at least twelve observatios i costructig a cofidece iterval - Gerald va Belle What is the mea of the blood pressures of all the studets

More information

1 Estimating the uncertainty attached to a sample mean: s 2 vs.

1 Estimating the uncertainty attached to a sample mean: s 2 vs. Political Sciece 100a/200a Fall 2001 Cofidece itervals ad hypothesis testig, Part I 1 1 Estimatig the ucertaity attached to a sample mea: s 2 vs. σ 2 Recall the problem of descriptive iferece: We wat to

More information

A New Constructive Proof of Graham's Theorem and More New Classes of Functionally Complete Functions

A New Constructive Proof of Graham's Theorem and More New Classes of Functionally Complete Functions A New Costructive Proof of Graham's Theorem ad More New Classes of Fuctioally Complete Fuctios Azhou Yag, Ph.D. Zhu-qi Lu, Ph.D. Abstract A -valued two-variable truth fuctio is called fuctioally complete,

More information

Math 312, Intro. to Real Analysis: Homework #4 Solutions

Math 312, Intro. to Real Analysis: Homework #4 Solutions Math 3, Itro. to Real Aalysis: Homework #4 Solutios Stephe G. Simpso Moday, March, 009 The assigmet cosists of Exercises 0.6, 0.8, 0.0,.,.3,.6,.0,.,. i the Ross textbook. Each problem couts 0 poits. 0.6.

More information

Sequences and Series

Sequences and Series Sequeces ad Series Matt Rosezweig Cotets Sequeces ad Series. Sequeces.................................................. Series....................................................3 Rudi Chapter 3 Exercises........................................

More information

of Asset Pricing APPENDIX 1 TO CHAPTER EXPECTED RETURN APPLICATION Expected Return

of Asset Pricing APPENDIX 1 TO CHAPTER EXPECTED RETURN APPLICATION Expected Return APPENDIX 1 TO CHAPTER 5 Models of Asset Pricig I Chapter 4, we saw that the retur o a asset (such as a bod) measures how much we gai from holdig that asset. Whe we make a decisio to buy a asset, we are

More information

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ.

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ. Chapter 9 Exercises Suppose X is a variable that follows the ormal distributio with kow stadard deviatio σ = 03 but ukow mea µ (a) Costruct a 95% cofidece iterval for µ if a radom sample of = 6 observatios

More information

ST 305: Exam 2 Fall 2014

ST 305: Exam 2 Fall 2014 ST 305: Exam Fall 014 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad

More information

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Meas ad Proportios Itroductio: We wat to kow the value of a parameter for a populatio. We do t kow the value of this parameter for the etire populatio because

More information

Rafa l Kulik and Marc Raimondo. University of Ottawa and University of Sydney. Supplementary material

Rafa l Kulik and Marc Raimondo. University of Ottawa and University of Sydney. Supplementary material Statistica Siica 009: Supplemet 1 L p -WAVELET REGRESSION WITH CORRELATED ERRORS AND INVERSE PROBLEMS Rafa l Kulik ad Marc Raimodo Uiversity of Ottawa ad Uiversity of Sydey Supplemetary material This ote

More information

Chapter 8: Estimation of Mean & Proportion. Introduction

Chapter 8: Estimation of Mean & Proportion. Introduction Chapter 8: Estimatio of Mea & Proportio 8.1 Estimatio, Poit Estimate, ad Iterval Estimate 8.2 Estimatio of a Populatio Mea: σ Kow 8.3 Estimatio of a Populatio Mea: σ Not Kow 8.4 Estimatio of a Populatio

More information

Topic 14: Maximum Likelihood Estimation

Topic 14: Maximum Likelihood Estimation Toic 4: November, 009 As before, we begi with a samle X = (X,, X of radom variables chose accordig to oe of a family of robabilities P θ I additio, f(x θ, x = (x,, x will be used to deote the desity fuctio

More information

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i The iformatio required by the mea-variace approach is substatial whe the umber of assets is large; there are mea values, variaces, ad )/2 covariaces - a total of 2 + )/2 parameters. Sigle-factor model:

More information

Section Mathematical Induction and Section Strong Induction and Well-Ordering

Section Mathematical Induction and Section Strong Induction and Well-Ordering Sectio 4.1 - Mathematical Iductio ad Sectio 4. - Strog Iductio ad Well-Orderig A very special rule of iferece! Defiitio: A set S is well ordered if every subset has a least elemet. Note: [0, 1] is ot well

More information

CAPITAL PROJECT SCREENING AND SELECTION

CAPITAL PROJECT SCREENING AND SELECTION CAPITAL PROJECT SCREEIG AD SELECTIO Before studyig the three measures of ivestmet attractiveess, we will review a simple method that is commoly used to scree capital ivestmets. Oe of the primary cocers

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge Biomial Model Stock Price Dyamics The value of a optio at maturity depeds o the price of the uderlyig stock at maturity. The value of the optio today depeds o the expected value of the optio at maturity

More information

Lecture 9: The law of large numbers and central limit theorem

Lecture 9: The law of large numbers and central limit theorem Lecture 9: The law of large umbers ad cetral limit theorem Theorem.4 Let X,X 2,... be idepedet radom variables with fiite expectatios. (i) (The SLLN). If there is a costat p [,2] such that E X i p i i=

More information

These characteristics are expressed in terms of statistical properties which are estimated from the sample data.

These characteristics are expressed in terms of statistical properties which are estimated from the sample data. 0. Key Statistical Measures of Data Four pricipal features which characterize a set of observatios o a radom variable are: (i) the cetral tedecy or the value aroud which all other values are buched, (ii)

More information

Problem Set 1a - Oligopoly

Problem Set 1a - Oligopoly Advaced Idustrial Ecoomics Sprig 2014 Joha Steek 6 may 2014 Problem Set 1a - Oligopoly 1 Table of Cotets 2 Price Competitio... 3 2.1 Courot Oligopoly with Homogeous Goods ad Differet Costs... 3 2.2 Bertrad

More information

Math 124: Lecture for Week 10 of 17

Math 124: Lecture for Week 10 of 17 What we will do toight 1 Lecture for of 17 David Meredith Departmet of Mathematics Sa Fracisco State Uiversity 2 3 4 April 8, 2008 5 6 II Take the midterm. At the ed aswer the followig questio: To be revealed

More information

Exam 2. Instructor: Cynthia Rudin TA: Dimitrios Bisias. October 25, 2011

Exam 2. Instructor: Cynthia Rudin TA: Dimitrios Bisias. October 25, 2011 15.075 Exam 2 Istructor: Cythia Rudi TA: Dimitrios Bisias October 25, 2011 Gradig is based o demostratio of coceptual uderstadig, so you eed to show all of your work. Problem 1 You are i charge of a study

More information

1 Estimating sensitivities

1 Estimating sensitivities Copyright c 27 by Karl Sigma 1 Estimatig sesitivities Whe estimatig the Greeks, such as the, the geeral problem ivolves a radom variable Y = Y (α) (such as a discouted payoff) that depeds o a parameter

More information

A point estimate is the value of a statistic that estimates the value of a parameter.

A point estimate is the value of a statistic that estimates the value of a parameter. Chapter 9 Estimatig the Value of a Parameter Chapter 9.1 Estimatig a Populatio Proportio Objective A : Poit Estimate A poit estimate is the value of a statistic that estimates the value of a parameter.

More information

Solution to Tutorial 6

Solution to Tutorial 6 Solutio to Tutorial 6 2012/2013 Semester I MA4264 Game Theory Tutor: Xiag Su October 12, 2012 1 Review Static game of icomplete iformatio The ormal-form represetatio of a -player static Bayesia game: {A

More information

SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME

SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME All Right Reserved No. of Pages - 10 No of Questios - 08 SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME YEAR I SEMESTER I (Group B) END SEMESTER EXAMINATION

More information

Using Math to Understand Our World Project 5 Building Up Savings And Debt

Using Math to Understand Our World Project 5 Building Up Savings And Debt Usig Math to Uderstad Our World Project 5 Buildig Up Savigs Ad Debt Note: You will have to had i aswers to all umbered questios i the Project Descriptio See the What to Had I sheet for additioal materials

More information

APPLICATION OF GEOMETRIC SEQUENCES AND SERIES: COMPOUND INTEREST AND ANNUITIES

APPLICATION OF GEOMETRIC SEQUENCES AND SERIES: COMPOUND INTEREST AND ANNUITIES APPLICATION OF GEOMETRIC SEQUENCES AND SERIES: COMPOUND INTEREST AND ANNUITIES Example: Brado s Problem Brado, who is ow sixtee, would like to be a poker champio some day. At the age of twety-oe, he would

More information

Estimation of Population Variance Utilizing Auxiliary Information

Estimation of Population Variance Utilizing Auxiliary Information Iteratioal Joural of Statistics ad Systems ISSN 0973-675 Volume 1, Number (017), pp. 303-309 Research Idia Publicatios http://www.ripublicatio.com Estimatio of Populatio Variace Utilizig Auxiliary Iformatio

More information

Summary. Recap. Last Lecture. .1 If you know MLE of θ, can you also know MLE of τ(θ) for any function τ?

Summary. Recap. Last Lecture. .1 If you know MLE of θ, can you also know MLE of τ(θ) for any function τ? Last Lecture Biostatistics 60 - Statistical Iferece Lecture Cramer-Rao Theorem Hyu Mi Kag February 9th, 03 If you kow MLE of, ca you also kow MLE of τ() for ay fuctio τ? What are plausible ways to compare

More information

Sampling Distributions & Estimators

Sampling Distributions & Estimators API-209 TF Sessio 2 Teddy Svoroos September 18, 2015 Samplig Distributios & Estimators I. Estimators The Importace of Samplig Radomly Three Properties of Estimators 1. Ubiased 2. Cosistet 3. Efficiet I

More information

Control Charts for Mean under Shrinkage Technique

Control Charts for Mean under Shrinkage Technique Helderma Verlag Ecoomic Quality Cotrol ISSN 0940-5151 Vol 24 (2009), No. 2, 255 261 Cotrol Charts for Mea uder Shrikage Techique J. R. Sigh ad Mujahida Sayyed Abstract: I this paper a attempt is made to

More information

Subject CT1 Financial Mathematics Core Technical Syllabus

Subject CT1 Financial Mathematics Core Technical Syllabus Subject CT1 Fiacial Mathematics Core Techical Syllabus for the 2018 exams 1 Jue 2017 Subject CT1 Fiacial Mathematics Core Techical Aim The aim of the Fiacial Mathematics subject is to provide a groudig

More information

Chpt 5. Discrete Probability Distributions. 5-3 Mean, Variance, Standard Deviation, and Expectation

Chpt 5. Discrete Probability Distributions. 5-3 Mean, Variance, Standard Deviation, and Expectation Chpt 5 Discrete Probability Distributios 5-3 Mea, Variace, Stadard Deviatio, ad Expectatio 1/23 Homework p252 Applyig the Cocepts Exercises p253 1-19 2/23 Objective Fid the mea, variace, stadard deviatio,

More information

Generative Models, Maximum Likelihood, Soft Clustering, and Expectation Maximization

Generative Models, Maximum Likelihood, Soft Clustering, and Expectation Maximization Geerative Models Maximum Likelihood Soft Clusterig ad Expectatio Maximizatio Aris Aagostopoulos We will see why we desig models for data how to lear their parameters ad how by cosiderig a mixture model

More information

Chapter 10 - Lecture 2 The independent two sample t-test and. confidence interval

Chapter 10 - Lecture 2 The independent two sample t-test and. confidence interval Assumptios Idepedet Samples - ukow σ 1, σ - 30 or m 30 - Upooled case Idepedet Samples - ukow σ 1, σ - 30 or m 30 - Pooled case Idepedet samples - Pooled variace - Large samples Chapter 10 - Lecture The

More information

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India July 2012

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India July 2012 Game Theory Lecture Notes By Y. Narahari Departmet of Computer Sciece ad Automatio Idia Istitute of Sciece Bagalore, Idia July 01 Chapter 4: Domiat Strategy Equilibria Note: This is a oly a draft versio,

More information

The Limit of a Sequence (Brief Summary) 1

The Limit of a Sequence (Brief Summary) 1 The Limit of a Sequece (Brief Summary). Defiitio. A real umber L is a it of a sequece of real umbers if every ope iterval cotaiig L cotais all but a fiite umber of terms of the sequece. 2. Claim. A sequece

More information

EXERCISE - BINOMIAL THEOREM

EXERCISE - BINOMIAL THEOREM BINOMIAL THOEREM / EXERCISE - BINOMIAL THEOREM LEVEL I SUBJECTIVE QUESTIONS. Expad the followig expressios ad fid the umber of term i the expasio of the expressios. (a) (x + y) 99 (b) ( + a) 9 + ( a) 9

More information

Research Article The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean

Research Article The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 70806, 8 pages doi:0.540/0/70806 Research Article The Probability That a Measuremet Falls withi a Rage of Stadard Deviatios

More information

A Note About Maximum Likelihood Estimator in Hypergeometric Distribution

A Note About Maximum Likelihood Estimator in Hypergeometric Distribution Comuicacioes e Estadística Juio 2009, Vol. 2, No. 1 A Note About Maximum Likelihood Estimator i Hypergeometric Distributio Ua ota sobre los estimadores de máxima verosimilitud e la distribució hipergeométrica

More information

Section 3.3 Exercises Part A Simplify the following. 1. (3m 2 ) 5 2. x 7 x 11

Section 3.3 Exercises Part A Simplify the following. 1. (3m 2 ) 5 2. x 7 x 11 123 Sectio 3.3 Exercises Part A Simplify the followig. 1. (3m 2 ) 5 2. x 7 x 11 3. f 12 4. t 8 t 5 f 5 5. 3-4 6. 3x 7 4x 7. 3z 5 12z 3 8. 17 0 9. (g 8 ) -2 10. 14d 3 21d 7 11. (2m 2 5 g 8 ) 7 12. 5x 2

More information

ii. Interval estimation:

ii. Interval estimation: 1 Types of estimatio: i. Poit estimatio: Example (1) Cosider the sample observatios 17,3,5,1,18,6,16,10 X 8 X i i1 8 17 3 5 118 6 16 10 8 116 8 14.5 14.5 is a poit estimate for usig the estimator X ad

More information

First determine the payments under the payment system

First determine the payments under the payment system Corporate Fiace February 5, 2008 Problem Set # -- ANSWERS Klick. You wi a judgmet agaist a defedat worth $20,000,000. Uder state law, the defedat has the right to pay such a judgmet out over a 20 year

More information

Confidence Intervals Introduction

Confidence Intervals Introduction Cofidece Itervals Itroductio A poit estimate provides o iformatio about the precisio ad reliability of estimatio. For example, the sample mea X is a poit estimate of the populatio mea μ but because of

More information

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions A Empirical Study of the Behaviour of the Sample Kurtosis i Samples from Symmetric Stable Distributios J. Marti va Zyl Departmet of Actuarial Sciece ad Mathematical Statistics, Uiversity of the Free State,

More information

4.5 Generalized likelihood ratio test

4.5 Generalized likelihood ratio test 4.5 Geeralized likelihood ratio test A assumptio that is used i the Athlete Biological Passport is that haemoglobi varies equally i all athletes. We wish to test this assumptio o a sample of k athletes.

More information

STRAND: FINANCE. Unit 3 Loans and Mortgages TEXT. Contents. Section. 3.1 Annual Percentage Rate (APR) 3.2 APR for Repayment of Loans

STRAND: FINANCE. Unit 3 Loans and Mortgages TEXT. Contents. Section. 3.1 Annual Percentage Rate (APR) 3.2 APR for Repayment of Loans CMM Subject Support Strad: FINANCE Uit 3 Loas ad Mortgages: Text m e p STRAND: FINANCE Uit 3 Loas ad Mortgages TEXT Cotets Sectio 3.1 Aual Percetage Rate (APR) 3.2 APR for Repaymet of Loas 3.3 Credit Purchases

More information

Chapter 4 - Consumer. Household Demand and Supply. Solving the max-utility problem. Working out consumer responses. The response function

Chapter 4 - Consumer. Household Demand and Supply. Solving the max-utility problem. Working out consumer responses. The response function Almost essetial Cosumer: Optimisatio Chapter 4 - Cosumer Osa 2: Household ad supply Cosumer: Welfare Useful, but optioal Firm: Optimisatio Household Demad ad Supply MICROECONOMICS Priciples ad Aalysis

More information

The Idea of a Confidence Interval

The Idea of a Confidence Interval AP Statistics Ch. 8 Notes Estimatig with Cofidece I the last chapter, we aswered questios about what samples should look like assumig that we kew the true values of populatio parameters (like μ, σ, ad

More information

Non-Inferiority Logrank Tests

Non-Inferiority Logrank Tests Chapter 706 No-Iferiority Lograk Tests Itroductio This module computes the sample size ad power for o-iferiority tests uder the assumptio of proportioal hazards. Accrual time ad follow-up time are icluded

More information