Reinforcement Learning

Size: px
Start display at page:

Download "Reinforcement Learning"

Transcription

1 Reiforcemet Learig Ala Fer * Based i part o slides by Daiel Weld

2 So far. Give a MDP model we kow how to fid optimal policies (for moderately-sized MDPs) Value Iteratio or Policy Iteratio Give just a simulator of a MDP we kow how to select actios Mote-Carlo Plaig What if we do t have a model or simulator? Like whe we were babies... Like i may real-world applicatios All we ca do is wader aroud the world observig what happe gettig rewarded ad puished Eters reiforcemet learig 2

3 Reiforcemet Learig No kowledge of eviromet Ca oly act i the world ad observe states ad reward May factors make RL difficult: Actios have o-determiistic effects Which are iitially ukow Rewards / puishmets are ifrequet Ofte at the ed of log sequeces of actios How do we determie what actio(s) were really resposible for reward or puishmet? (credit assigmet) World is large ad complex Nevertheless learer must decide what actios to take We will assume the world behaves as a MDP 3

4 Pure Reiforcemet Learig vs. Mote-Carlo Plaig I pure reiforcemet learig: the aget begis with o kowledge waders aroud the world observig outcomes I Mote-Carlo plaig the aget begis with o declarative kowledge of the world has a iterface to a world simulator that allows observig the outcome of takig ay actio i ay state The simulator gives the aget the ability to teleport to ay state, at ay time, ad the apply ay actio A pure RL aget does ot have the ability to teleport Ca oly observe the outcomes that it happes to reach 4

5 Pure Reiforcemet Learig vs. Mote-Carlo Plaig MC plaig is sometimes called RL with a strog simulator I.e. a simulator where we ca set the curret state to ay state at ay momet Pure RL is sometimes called RL with a weak simulator I.e. a simulator where we caot set the state A strog simulator ca emulate a weak simulator So pure RL ca be used i the MC plaig framework But ot vice versa 5

6 Passive vs. Active learig Passive learig The aget has a fixed policy ad tries to lear the utilities of states by observig the world go by Aalogous to policy evaluatio Ofte serves as a compoet of active learig algorithms Ofte ispires active learig algorithms Active learig The aget attempts to fid a optimal (or at least good) policy by actig i the world Aalogous to solvig the uderlyig MDP, but without first beig give the MDP model 6

7 Model-Based vs. Model-Free RL Model based approach to RL: lear the MDP model, or a approximatio of it use it for policy evaluatio or to fid the optimal policy Model free approach to RL: derive the optimal policy without explicitly learig the model useful whe model is difficult to represet ad/or lear We will cosider both types of approaches 7

8 Small vs. Huge MDPs We will first cover RL methods for small MDPs MDPs where the umber of states ad actios is reasoably small These algorithms will ispire more advaced methods Later we will cover algorithms for huge MDPs Fuctio Approximatio Methods Policy Gradiet Methods Least-Squares Policy Iteratio 8

9 Example: Passive RL Suppose give a statioary policy (show by arrows) Actios ca stochastically lead to uiteded grid cell Wat to determie how good it is 9

10 Objective: Value Fuctio 0

11 Passive RL Estimate V (s) Not give trasitio matrix, or reward fuctio! Follow the policy for may epochs givig traiig sequeces. (,)(,2)(,3)(,2)(,3)(2,3)(3,3) (3,4) + (,)(,2)(,3)(2,3)(3,3)(3,2)(3,3)(3,4) + (,)(2,)(3,)(3,2)(4,2) - Assume that after eterig + or - state the aget eters zero reward termial state So we do t bother showig those trasitios

12 Approach : Direct Estimatio Direct estimatio (also called Mote Carlo) Estimate V (s) as average total reward of epochs cotaiig s (calculatig from s to ed of epoch) Reward to go of a state s the sum of the (discouted) rewards from that state util a termial state is reached Key: use observed reward to go of the state as the direct evidece of the actual expected utility of that state Averagig the reward-to-go samples will coverge to true value at state 2

13 Direct Estimatio Coverge very slowly to correct utilities values (requires a lot of sequeces) Does t exploit Bellma costraits o policy values V ( s) R( s) T( a, s') V ( s') It is happy to cosider value fuctio estimates that violate this property badly. s' How ca we icorporate the Bellma costraits? 3

14 Approach 2: Adaptive Dyamic Programmig (ADP) ADP is a model based approach V Follow the policy for awhile Estimate trasitio model based o observatios Lear reward fuctio Use estimated model to compute utility of policy ( s) R( s) s' T( a, s') V ( s') leared How ca we estimate trasitio model T(a,s )? Simply the fractio of times we see s after takig a i state s. NOTE: Ca boud error with Cheroff bouds if we wat 4

15 ADP learig curves (4,3) (3,3) (2,3) (,) (3,) (4,) (4,2) 5

16 Approach 3: Temporal Differece Learig (TD) V Ca we avoid the computatioal expese of full DP policy evaluatio? Temporal Differece Learig (model free) Do local updates of utility/value fuctio o a per-actio basis Do t try to estimate etire trasitio fuctio! For each trasitio from s to s, we perform the followig update: ( s) V ( s) ( R( s) V ( s') V ( s)) updated estimate Ituitively moves us closer to satisfyig Bellma costrait Why? V learig rate ( s) R( s) discout factor s' T( a, s') V ( s') 6

17 7 Aside: Olie Mea Estimatio Suppose that we wat to icremetally compute the mea of a sequece of umbers (x, x 2, x 3,. ) E.g. to estimate the expected value of a radom variable from a sequece of samples. Give a ew sample x +, the ew mea is the old estimate (for samples) plus the weighted differece betwee the ew sample ad old estimate i i i i i i X x X x x x x X ˆ ˆ ˆ average of + samples

18 8 Aside: Olie Mea Estimatio Suppose that we wat to icremetally compute the mea of a sequece of umbers (x, x 2, x 3,. ) E.g. to estimate the expected value of a radom variable from a sequece of samples. i i i i i i X x X x x x x X ˆ ˆ ˆ average of + samples

19 9 Aside: Olie Mea Estimatio Suppose that we wat to icremetally compute the mea of a sequece of umbers (x, x 2, x 3,. ) E.g. to estimate the expected value of a radom variable from a sequece of samples. Give a ew sample x +, the ew mea is the old estimate (for samples) plus the weighted differece betwee the ew sample ad old estimate i i i i i i X x X x x x x X ˆ ˆ ˆ average of + samples sample + learig rate

20 Approach 3: Temporal Differece Learig (TD) V TD update for trasitio from s to s : ( s) V ( s) ( R( s) V ( s') V ( s)) updated estimate learig rate (oisy) sample of value at s based o ext state s So the update is maitaiig a mea of the (oisy) value samples If the learig rate decreases appropriately with the umber of samples (e.g. /) the the value estimates will coverge to true values! (o-trivial) V ( s) R( s) s' T( a, s') V ( s') 20

21 Approach 3: Temporal Differece Learig (TD) V TD update for trasitio from s to s : ( s) V ( s) ( R( s) V ( s') V ( s)) learig rate (oisy) sample of utility based o ext state Ituitio about covergece V Whe V satisfies Bellma costraits the expected update is 0. ( s) R( s) s' T( a, s') V ( s') Ca use results from stochastic optimizatio theory to prove covergece i the limit 2

22 The TD learig curve Tradeoff: requires more traiig experiece (epochs) tha ADP but much less computatio per epoch Choice depeds o relative cost of experiece vs. computatio 22

23 Passive RL: Comparisos Mote-Carlo Direct Estimatio (model free) Simple to implemet Each update is fast Does ot exploit Bellma costraits Coverges slowly Adaptive Dyamic Programmig (model based) Harder to implemet Each update is a full policy evaluatio (expesive) Fully exploits Bellma costraits Fast covergece (i terms of updates) Temporal Differece Learig (model free) Update speed ad implemetatio similiar to direct estimatio Partially exploits Bellma costraits---adjusts state to agree with observed successor Not all possible successors as i ADP Covergece i betwee direct estimatio ad ADP 23

24 Betwee ADP ad TD Movig TD toward ADP At each step perform TD updates based o observed trasitio ad imagied trasitios Imagied trasitio are geerated usig estimated model The more imagied trasitios used, the more like ADP Makig estimate more cosistet with ext state distributio Coverges i the limit of ifiite imagied trasitios to ADP 24

25 Active Reiforcemet Learig So far, we ve assumed aget has a policy We just leared how good it is Now, suppose aget must lear a good policy (ideally optimal) While actig i ucertai world 25

26 Naïve Approach. Act Radomly for a (log) time 2. Lear Or systematically explore all possible actios Trasitio fuctio Reward fuctio 3. Use value iteratio, policy iteratio, 4. Follow resultig policy thereafter. Will this work? Ay problems? Yes (if we do step log eough ad there are o dead-eds ) We will act radomly for a log time before exploitig what we kow. 26

27 Revisio of Naïve Approach. Start with iitial (uiformed) model 2. Solve for optimal policy give curret model (usig value or policy iteratio) 3. Execute actio suggested by policy i curret state 4. Update estimated model based o observed trasitio 5. Goto 2 This is just ADP but we follow the greedy policy suggested by curret value estimate Will this work? No. Ca get stuck i local miima. What ca be doe? 27

28 Exploratio versus Exploitatio Two reasos to take a actio i RL Exploitatio: To try to get reward. We exploit our curret kowledge to get a payoff. Exploratio: Get more iformatio about the world. How do we kow if there is ot a pot of gold aroud the corer. To explore we typically eed to take actios that do ot seem best accordig to our curret model. Maagig the trade-off betwee exploratio ad exploitatio is a critical issue i RL Basic ituitio behid most approaches: Explore more whe kowledge is weak Exploit more as we gai kowledge 28

29 ADP-based RL. Start with iitial model 2. Solve for optimal policy give curret model (usig value or policy iteratio) 3. Take actio accordig to a explore/exploit policy (explores more early o ad gradually uses policy from 2) 4. Update estimated model based o observed trasitio 5. Goto 2 This is just ADP but we follow the explore/exploit policy Will this work? Depeds o the explore/exploit policy. Ay ideas? 29

30 Explore/Exploit Policies Greedy actio is actio maximizig estimated Q-value Q( a) R( s) s' T( a, s') V ( s') where V is curret optimal value fuctio estimate (based o curret model), ad R, T are curret estimates of model Q(a) is the expected value of takig actio a i state s ad the gettig the estimated value V(s ) of the ext state s Wat a exploratio policy that is greedy i the limit of ifiite exploratio (GLIE) Guaratees covergece GLIE Policy O time step t select radom actio with probability p(t) ad greedy actio with probability -p(t) p(t) = /t will lead to covergece, but is slow 30

31 Explore/Exploit Policies Greedy actio is actio maximizig estimated Q-value Q( a) R( s) s' T( a, s') V where V is curret value fuctio estimate, ad R, T are curret estimates of model GLIE Policy 2: Boltzma Exploratio Select actio a with probability, Pr( a s) a' A ( s') T is the temperature. Large T meas that each actio has about the same probability. Small T leads to more greedy behavior. Typically start with large T ad decrease with time exp Q( a) / T exp Q( a') / T 3

32 The Impact of Temperature Pr( a s) a' A exp Q( a) / T exp Q( a') / T Suppose we have two actios ad that Q(a) =, Q(a2) = 2 T=0 gives Pr(a s) = 0.48, Pr(a2 s) = 0.52 Almost equal probability, so will explore T= gives Pr(a s) = 0.27, Pr(a2 s) = 0.73 Probabilities more skewed, so explore a less T = 0.25 gives Pr(a s) = 0.02, Pr(a2 s) = 0.98 Almost always exploit a2 32

33 Alterative Approach: Optimistic Exploratio. Start with iitial model 2. Solve for optimistic policy (uses optimistic variat of value iteratio) (iflates value of actios leadig to uexplored regios) 3. Take greedy actio accordig to optimistic policy 4. Update estimated model 5. Goto 2 Basically act as if all uexplored state-actio pairs are maximally rewardig. 33

34 Optimistic Exploratio Recall that value iteratio iteratively performs the followig update at all states: V ( s) Optimistic variat adjusts update to make actios that lead to uexplored regios look good Implemet variat of VI that assigs the highest possible value V max to ay state-actio pair that has ot bee explored eough Maximum value is whe we get maximum reward forever V R( s) max max max t max R R t 0 a T( a, s') V ( s') What do we mea by explored eough? N(a) > N e, where N(a) is umber of times actio a has bee tried i state s ad N e is a user selected parameter s' 34

35 Optimistic Exploratio V ( s) R( s) max a T( a, s') V ( s') Optimistic value iteratio computes a optimistic value fuctio V + usig updates V ( s) R( s) max a V s' s' T( a, s') V The aget will behave iitially as if there were woderful rewards scattered all over aroud optimistic. But after actios are tried eough times we will perform stadard o-optimistic value iteratio max Some recet theoretical results show how to set N e is to arrive at provably optimal learig (with high probability) i polyomial time (the RMAX algorithm), ( s'), N( a) N( a) N e N e 35

36 Aother View o Optimistic Exploratio: The Rmax Algorithm. Start with a optimistic model (assig largest possible reward to uexplored states ) (actios from uexplored states oly self trasitio) 2. Solve for optimal policy i optimistic model 3. Take greedy actio accordig to policy 4. Update optimistic estimated model (if a state becomes kow the use its true statistics) 5. Goto 2 Aget always acts greedily accordig to a model that assumes all uexplored states are maximally rewardig 36

37 Rmax: Optimistic Model Keep track of umber of times a state-actio pair is tried If N(a) < N e the T(a,s)= ad R(s) = Rmax otherwise T(a,s ) ad R(s) are based o estimates obtaied from the N e experieces (the estimate of true model) For large eough N e these will be accurate estimates A optimal policy for this optimistic model will try to reach uexplored states (those with uexplored actios) sice it ca stay at those states ad accumulate maximum reward Never explicitly explores. Is always greedy, but with respect to a optimistic outlook. Theoretically efficiet algorithm. 37

38 TD-based Active RL. Start with iitial value fuctio 2. Take actio from explore/exploit policy givig ew state s (should coverge to greedy policy, i.e. GLIE) 3. Update estimated model 4. Perform TD update V( s) V( s) ( R( s) V( s') V( s)) V(s) is ew estimate of optimal value fuctio at state s. 5. Goto 2 Just like TD for passive RL, but we follow explore/exploit policy Give the usual assumptios about learig rate ad GLIE, TD will coverge to a optimal value fuctio! 38

39 TD-based Active RL. Start with iitial value fuctio 2. Take actio from explore/exploit policy givig ew state s (should coverge to greedy policy, i.e. GLIE) 3. Update estimated model 4. Perform TD update V( s) V( s) ( R( s) V( s') V( s)) V(s) is ew estimate of optimal value fuctio at state s. 5. Goto 2 Requires a estimated model. Why? To compute Q(a) for greedy policy executio Ca we costruct a model-free variat? 39

40 Q-Learig: Model-Free RL Istead of learig the optimal value fuctio V, directly lear the optimal Q fuctio. Recall Q(a) is the expected value of takig actio a i state s ad the followig the optimal policy thereafter The optimal Q-fuctio satisfies which gives: Q( a) R( s) R( s) T( a, s') V ( s') T( a, s') max Q( s', a') Give the Q fuctio we ca act optimally by selectig actio greedily accordig to Q(a) without a model s' s' V ( s) max Q( a') How ca we lear the Q-fuctio directly? a' a' 40

41 Q-Learig: Model-Free RL Bellma costraits o optimal Q-fuctio: Q( a) R( s) s' T( a, s')max Q( a') We ca perform updates after each actio just like i TD. After takig actio a i state s ad reachig state s do: (ote that we directly observe reward R(s)) a' Q( a) Q( a) ( R( s) max a' Q( s', a') Q( a)) (oisy) sample of Q-value based o ext state 4

42 Q-Learig. Start with iitial Q-fuctio (e.g. all zeros) 2. Take actio from explore/exploit policy givig ew state s (should coverge to greedy policy, i.e. GLIE) 3. Perform TD update Q( a) Q( a) ( R( s) max Q( s', a') Q( a)) Q(a) is curret estimate of optimal Q-fuctio. 4. Goto 2 Does ot require model sice we lear Q directly! Uses explicit S x A table to represet Q Explore/exploit policy directly uses Q-values E.g. use Boltzma exploratio. Book uses exploratio fuctio for exploratio (Figure 2.8) a' 42

43 Q-Learig: Speedup for Goal-Based Problems Goal-Based Problem: receive big reward i goal state ad the trasitio to termial state Mii-project 2 is goal based Cosider iitializig Q(a) to zeros ad the observig the followig sequece of (state, reward, actio) triples (s0,0,a0) (0,a) (s2,0,a2) (termial,0) The sequece of Q-value updates would result i: Q(s0,a0) = 0, Q(a) =0, Q(s2,a2)=0 So othig was leared at s0 ad s Next time this trajectory is observed we will get o-zero for Q(a) but still Q(s0,a0)=0 43

44 Q-Learig: Speedup for Goal-Based Problems From the example we see that it ca take may learig trials for the fial reward to back propagate to early state-actio pairs Two approaches for addressig this problem:. Trajectory replay: store each trajectory ad do several iteratios of Q-updates o each oe 2. Reverse updates: store trajectory ad do Q-updates i reverse order I our example (with learig rate ad discout factor equal to for ease of illustratio) reverse updates would give Q(s2,a2) = 0, Q(a) = 0, Q(s0,a0)=0 44

45 Q-Learig: Suggestios for Mii Project 2 A very simple exploratio strategy is -greedy exploratio (geerally called epsilo greedy ) Select a small value for e (perhaps 0.) O each step: With probability select a radom actio, ad with probability - select a greedy actio But it might be iterestig to play with exploratio a bit (e.g. compare to a decreasig exploratio rate) You ca use a discout factor of oe. 45

46 Active Reiforcemet Learig Summary Methods ADP Temporal Differece Learig Q-learig All coverge to optimal policy assumig a GLIE exploratio strategy Optimistic exploratio with ADP ca be show to coverge i polyomial time with high probability All methods assume the world is ot too dagerous (o cliffs to fall off durig exploratio) So far we have assumed small state spaces 46

47 ADP vs. TD vs. Q Differet opiios. (my opiio) Whe state space is small the this is ot such a importat issue. Computatio Time ADP-based methods use more computatio time per step Memory Usage ADP-based methods uses O(m 2 ) memory Active TD-learig uses O(m 2 ) memory (must store model) Q-learig uses O(m) memory for Q-table Learig efficiecy (performace per uit experiece) ADP-based methods make more efficiet use of experiece by storig a model that summarizes the history ad the reasoig about the model (e.g. via value iteratio or policy iteratio) 47

48 What about large state spaces? Oe approach is to map the origial state space S to a much smaller state space S via some hashig fuctio. Ideally similar states i S are mapped to the same state i S The do learig over S istead of S. Note that the world may ot look Markovia whe viewed through the les of S, so covergece results may ot apply But, still the approach ca work if a good eough S is egieered (requires careful desig), e.g. Empirical Evaluatio of a Reiforcemet Learig Spoke Dialogue System. With S. Sigh, D. Litma, M. Walker. Proceedigs of the 7th Natioal Coferece o Artificial Itelligece, 2000 We will ow study three other approaches for dealig with large state-spaces Value fuctio approximatio Policy gradiet methods Least Squared Policy Iteratio 48

Parametric Density Estimation: Maximum Likelihood Estimation

Parametric Density Estimation: Maximum Likelihood Estimation Parametric Desity stimatio: Maimum Likelihood stimatio C6 Today Itroductio to desity estimatio Maimum Likelihood stimatio Itroducto Bayesia Decisio Theory i previous lectures tells us how to desig a optimal

More information

Statistics for Economics & Business

Statistics for Economics & Business Statistics for Ecoomics & Busiess Cofidece Iterval Estimatio Learig Objectives I this chapter, you lear: To costruct ad iterpret cofidece iterval estimates for the mea ad the proportio How to determie

More information

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

point estimator a random variable (like P or X) whose values are used to estimate a population parameter Estimatio We have oted that the pollig problem which attempts to estimate the proportio p of Successes i some populatio ad the measuremet problem which attempts to estimate the mea value µ of some quatity

More information

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

Standard Deviations for Normal Sampling Distributions are: For proportions For means _ Sectio 9.2 Cofidece Itervals for Proportios We will lear to use a sample to say somethig about the world at large. This process (statistical iferece) is based o our uderstadig of samplig models, ad will

More information

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3) Today: Fiish Chapter 9 (Sectios 9.6 to 9.8 ad 9.9 Lesso 3) ANNOUNCEMENTS: Quiz #7 begis after class today, eds Moday at 3pm. Quiz #8 will begi ext Friday ad ed at 10am Moday (day of fial). There will be

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpeCourseWare http://ocwmitedu 430 Itroductio to Statistical Methods i Ecoomics Sprig 009 For iformatio about citig these materials or our Terms of Use, visit: http://ocwmitedu/terms 430 Itroductio

More information

Overlapping Generations

Overlapping Generations Eco. 53a all 996 C. Sims. troductio Overlappig Geeratios We wat to study how asset markets allow idividuals, motivated by the eed to provide icome for their retiremet years, to fiace capital accumulatio

More information

Chapter 8: Estimation of Mean & Proportion. Introduction

Chapter 8: Estimation of Mean & Proportion. Introduction Chapter 8: Estimatio of Mea & Proportio 8.1 Estimatio, Poit Estimate, ad Iterval Estimate 8.2 Estimatio of a Populatio Mea: σ Kow 8.3 Estimatio of a Populatio Mea: σ Not Kow 8.4 Estimatio of a Populatio

More information

Maximum Empirical Likelihood Estimation (MELE)

Maximum Empirical Likelihood Estimation (MELE) Maximum Empirical Likelihood Estimatio (MELE Natha Smooha Abstract Estimatio of Stadard Liear Model - Maximum Empirical Likelihood Estimator: Combiatio of the idea of imum likelihood method of momets,

More information

5. Best Unbiased Estimators

5. Best Unbiased Estimators Best Ubiased Estimators http://www.math.uah.edu/stat/poit/ubiased.xhtml 1 of 7 7/16/2009 6:13 AM Virtual Laboratories > 7. Poit Estimatio > 1 2 3 4 5 6 5. Best Ubiased Estimators Basic Theory Cosider agai

More information

. (The calculated sample mean is symbolized by x.)

. (The calculated sample mean is symbolized by x.) Stat 40, sectio 5.4 The Cetral Limit Theorem otes by Tim Pilachowski If you have t doe it yet, go to the Stat 40 page ad dowload the hadout 5.4 supplemet Cetral Limit Theorem. The homework (both practice

More information

Estimating Proportions with Confidence

Estimating Proportions with Confidence Aoucemets: Discussio today is review for midterm, o credit. You may atted more tha oe discussio sectio. Brig sheets of otes ad calculator to midterm. We will provide Scatro form. Homework: (Due Wed Chapter

More information

CHAPTER 8 Estimating with Confidence

CHAPTER 8 Estimating with Confidence CHAPTER 8 Estimatig with Cofidece 8.2 Estimatig a Populatio Proportio The Practice of Statistics, 5th Editio Stares, Tabor, Yates, Moore Bedford Freema Worth Publishers Estimatig a Populatio Proportio

More information

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge Biomial Model Stock Price Dyamics The value of a optio at maturity depeds o the price of the uderlyig stock at maturity. The value of the optio today depeds o the expected value of the optio at maturity

More information

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty,

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty, Iferetial Statistics ad Probability a Holistic Approach Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike 4.0

More information

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010 Combiig imperfect data, ad a itroductio to data assimilatio Ross Baister, NCEO, September 00 rbaister@readigacuk The probability desity fuctio (PDF prob that x lies betwee x ad x + dx p (x restrictio o

More information

Section 3.3 Exercises Part A Simplify the following. 1. (3m 2 ) 5 2. x 7 x 11

Section 3.3 Exercises Part A Simplify the following. 1. (3m 2 ) 5 2. x 7 x 11 123 Sectio 3.3 Exercises Part A Simplify the followig. 1. (3m 2 ) 5 2. x 7 x 11 3. f 12 4. t 8 t 5 f 5 5. 3-4 6. 3x 7 4x 7. 3z 5 12z 3 8. 17 0 9. (g 8 ) -2 10. 14d 3 21d 7 11. (2m 2 5 g 8 ) 7 12. 5x 2

More information

Notes on Expected Revenue from Auctions

Notes on Expected Revenue from Auctions Notes o Epected Reveue from Auctios Professor Bergstrom These otes spell out some of the mathematical details about first ad secod price sealed bid auctios that were discussed i Thursday s lecture You

More information

CAPITAL PROJECT SCREENING AND SELECTION

CAPITAL PROJECT SCREENING AND SELECTION CAPITAL PROJECT SCREEIG AD SELECTIO Before studyig the three measures of ivestmet attractiveess, we will review a simple method that is commoly used to scree capital ivestmets. Oe of the primary cocers

More information

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ.

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ. Chapter 9 Exercises Suppose X is a variable that follows the ormal distributio with kow stadard deviatio σ = 03 but ukow mea µ (a) Costruct a 95% cofidece iterval for µ if a radom sample of = 6 observatios

More information

APPLICATION OF GEOMETRIC SEQUENCES AND SERIES: COMPOUND INTEREST AND ANNUITIES

APPLICATION OF GEOMETRIC SEQUENCES AND SERIES: COMPOUND INTEREST AND ANNUITIES APPLICATION OF GEOMETRIC SEQUENCES AND SERIES: COMPOUND INTEREST AND ANNUITIES Example: Brado s Problem Brado, who is ow sixtee, would like to be a poker champio some day. At the age of twety-oe, he would

More information

Math 124: Lecture for Week 10 of 17

Math 124: Lecture for Week 10 of 17 What we will do toight 1 Lecture for of 17 David Meredith Departmet of Mathematics Sa Fracisco State Uiversity 2 3 4 April 8, 2008 5 6 II Take the midterm. At the ed aswer the followig questio: To be revealed

More information

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices?

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices? FINM6900 Fiace Theory How Is Asymmetric Iformatio Reflected i Asset Prices? February 3, 2012 Referece S. Grossma, O the Efficiecy of Competitive Stock Markets where Traders Have Diverse iformatio, Joural

More information

Topic-7. Large Sample Estimation

Topic-7. Large Sample Estimation Topic-7 Large Sample Estimatio TYPES OF INFERENCE Ò Estimatio: É Estimatig or predictig the value of the parameter É What is (are) the most likely values of m or p? Ò Hypothesis Testig: É Decidig about

More information

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1 Chapter 8 Cofidece Iterval Estimatio Copyright 2015, 2012, 2009 Pearso Educatio, Ic. Chapter 8, Slide 1 Learig Objectives I this chapter, you lear: To costruct ad iterpret cofidece iterval estimates for

More information

Using Math to Understand Our World Project 5 Building Up Savings And Debt

Using Math to Understand Our World Project 5 Building Up Savings And Debt Usig Math to Uderstad Our World Project 5 Buildig Up Savigs Ad Debt Note: You will have to had i aswers to all umbered questios i the Project Descriptio See the What to Had I sheet for additioal materials

More information

5 Statistical Inference

5 Statistical Inference 5 Statistical Iferece 5.1 Trasitio from Probability Theory to Statistical Iferece 1. We have ow more or less fiished the probability sectio of the course - we ow tur attetio to statistical iferece. I statistical

More information

Forecasting bad debt losses using clustering algorithms and Markov chains

Forecasting bad debt losses using clustering algorithms and Markov chains Forecastig bad debt losses usig clusterig algorithms ad Markov chais Robert J. Till Experia Ltd Lambert House Talbot Street Nottigham NG1 5HF {Robert.Till@uk.experia.com} Abstract Beig able to make accurate

More information

Sampling Distributions and Estimation

Sampling Distributions and Estimation Cotets 40 Samplig Distributios ad Estimatio 40.1 Samplig Distributios 40. Iterval Estimatio for the Variace 13 Learig outcomes You will lear about the distributios which are created whe a populatio is

More information

ii. Interval estimation:

ii. Interval estimation: 1 Types of estimatio: i. Poit estimatio: Example (1) Cosider the sample observatios 17,3,5,1,18,6,16,10 X 8 X i i1 8 17 3 5 118 6 16 10 8 116 8 14.5 14.5 is a poit estimate for usig the estimator X ad

More information

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Meas ad Proportios Itroductio: I this chapter we wat to fid out the value of a parameter for a populatio. We do t kow the value of this parameter for the etire

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL A SULEMENTAL MATERIAL Theorem (Expert pseudo-regret upper boud. Let us cosider a istace of the I-SG problem ad apply the FL algorithm, where each possible profile A is a expert ad receives, at roud, a

More information

0.1 Valuation Formula:

0.1 Valuation Formula: 0. Valuatio Formula: 0.. Case of Geeral Trees: q = er S S S 3 S q = er S S 4 S 5 S 4 q 3 = er S 3 S 6 S 7 S 6 Therefore, f (3) = e r [q 3 f (7) + ( q 3 ) f (6)] f () = e r [q f (5) + ( q ) f (4)] = f ()

More information

SETTING GATES IN THE STOCHASTIC PROJECT SCHEDULING PROBLEM USING CROSS ENTROPY

SETTING GATES IN THE STOCHASTIC PROJECT SCHEDULING PROBLEM USING CROSS ENTROPY 19 th Iteratioal Coferece o Productio Research SETTING GATES IN THE STOCHASTIC PROJECT SCHEDULING PROBLEM USING CROSS ENTROPY I. Bedavid, B. Golay Faculty of Idustrial Egieerig ad Maagemet, Techio Israel

More information

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

A random variable is a variable whose value is a numerical outcome of a random phenomenon. The Practice of Statistics, d ed ates, Moore, ad Stares Itroductio We are ofte more iterested i the umber of times a give outcome ca occur tha i the possible outcomes themselves For example, if we toss

More information

Lecture 5: Sampling Distribution

Lecture 5: Sampling Distribution Lecture 5: Samplig Distributio Readigs: Sectios 5.5, 5.6 Itroductio Parameter: describes populatio Statistic: describes the sample; samplig variability Samplig distributio of a statistic: A probability

More information

Introduction to Probability and Statistics Chapter 7

Introduction to Probability and Statistics Chapter 7 Itroductio to Probability ad Statistics Chapter 7 Ammar M. Sarha, asarha@mathstat.dal.ca Departmet of Mathematics ad Statistics, Dalhousie Uiversity Fall Semester 008 Chapter 7 Statistical Itervals Based

More information

Confidence Intervals Introduction

Confidence Intervals Introduction Cofidece Itervals Itroductio A poit estimate provides o iformatio about the precisio ad reliability of estimatio. For example, the sample mea X is a poit estimate of the populatio mea μ but because of

More information

1 Estimating sensitivities

1 Estimating sensitivities Copyright c 27 by Karl Sigma 1 Estimatig sesitivities Whe estimatig the Greeks, such as the, the geeral problem ivolves a radom variable Y = Y (α) (such as a discouted payoff) that depeds o a parameter

More information

We learned: $100 cash today is preferred over $100 a year from now

We learned: $100 cash today is preferred over $100 a year from now Recap from Last Week Time Value of Moey We leared: $ cash today is preferred over $ a year from ow there is time value of moey i the form of willigess of baks, busiesses, ad people to pay iterest for its

More information

Institute of Actuaries of India Subject CT5 General Insurance, Life and Health Contingencies

Institute of Actuaries of India Subject CT5 General Insurance, Life and Health Contingencies Istitute of Actuaries of Idia Subject CT5 Geeral Isurace, Life ad Health Cotigecies For 2017 Examiatios Aim The aim of the Cotigecies subject is to provide a groudig i the mathematical techiques which

More information

INTERVAL GAMES. and player 2 selects 1, then player 2 would give player 1 a payoff of, 1) = 0.

INTERVAL GAMES. and player 2 selects 1, then player 2 would give player 1 a payoff of, 1) = 0. INTERVAL GAMES ANTHONY MENDES Let I ad I 2 be itervals of real umbers. A iterval game is played i this way: player secretly selects x I ad player 2 secretly ad idepedetly selects y I 2. After x ad y are

More information

Solution to Tutorial 6

Solution to Tutorial 6 Solutio to Tutorial 6 2012/2013 Semester I MA4264 Game Theory Tutor: Xiag Su October 12, 2012 1 Review Static game of icomplete iformatio The ormal-form represetatio of a -player static Bayesia game: {A

More information

Anomaly Correction by Optimal Trading Frequency

Anomaly Correction by Optimal Trading Frequency Aomaly Correctio by Optimal Tradig Frequecy Yiqiao Yi Columbia Uiversity September 9, 206 Abstract Uder the assumptio that security prices follow radom walk, we look at price versus differet movig averages.

More information

The Time Value of Money in Financial Management

The Time Value of Money in Financial Management The Time Value of Moey i Fiacial Maagemet Muteau Irea Ovidius Uiversity of Costata irea.muteau@yahoo.com Bacula Mariaa Traia Theoretical High School, Costata baculamariaa@yahoo.com Abstract The Time Value

More information

CAPITAL ASSET PRICING MODEL

CAPITAL ASSET PRICING MODEL CAPITAL ASSET PRICING MODEL RETURN. Retur i respect of a observatio is give by the followig formula R = (P P 0 ) + D P 0 Where R = Retur from the ivestmet durig this period P 0 = Curret market price P

More information

Models of Asset Pricing

Models of Asset Pricing APPENDIX 1 TO CHAPTER 4 Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

Models of Asset Pricing

Models of Asset Pricing APPENDIX 1 TO CHAPTER4 Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

ECON 5350 Class Notes Maximum Likelihood Estimation

ECON 5350 Class Notes Maximum Likelihood Estimation ECON 5350 Class Notes Maximum Likelihood Estimatio 1 Maximum Likelihood Estimatio Example #1. Cosider the radom sample {X 1 = 0.5, X 2 = 2.0, X 3 = 10.0, X 4 = 1.5, X 5 = 7.0} geerated from a expoetial

More information

1 Random Variables and Key Statistics

1 Random Variables and Key Statistics Review of Statistics 1 Radom Variables ad Key Statistics Radom Variable: A radom variable is a variable that takes o differet umerical values from a sample space determied by chace (probability distributio,

More information

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Meas ad Proportios Itroductio: We wat to kow the value of a parameter for a populatio. We do t kow the value of this parameter for the etire populatio because

More information

Lecture 5 Point Es/mator and Sampling Distribu/on

Lecture 5 Point Es/mator and Sampling Distribu/on Lecture 5 Poit Es/mator ad Samplig Distribu/o Fall 03 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech Road map Poit Es/ma/o Cofidece Iterval

More information

The material in this chapter is motivated by Experiment 9.

The material in this chapter is motivated by Experiment 9. Chapter 5 Optimal Auctios The material i this chapter is motivated by Experimet 9. We wish to aalyze the decisio of a seller who sets a reserve price whe auctioig off a item to a group of bidders. We begi

More information

Linear Programming for Portfolio Selection Based on Fuzzy Decision-Making Theory

Linear Programming for Portfolio Selection Based on Fuzzy Decision-Making Theory The Teth Iteratioal Symposium o Operatios Research ad Its Applicatios (ISORA 2011 Duhuag, Chia, August 28 31, 2011 Copyright 2011 ORSC & APORC, pp. 195 202 Liear Programmig for Portfolio Selectio Based

More information

4.5 Generalized likelihood ratio test

4.5 Generalized likelihood ratio test 4.5 Geeralized likelihood ratio test A assumptio that is used i the Athlete Biological Passport is that haemoglobi varies equally i all athletes. We wish to test this assumptio o a sample of k athletes.

More information

Limits of sequences. Contents 1. Introduction 2 2. Some notation for sequences The behaviour of infinite sequences 3

Limits of sequences. Contents 1. Introduction 2 2. Some notation for sequences The behaviour of infinite sequences 3 Limits of sequeces I this uit, we recall what is meat by a simple sequece, ad itroduce ifiite sequeces. We explai what it meas for two sequeces to be the same, ad what is meat by the -th term of a sequece.

More information

A DOUBLE INCREMENTAL AGGREGATED GRADIENT METHOD WITH LINEAR CONVERGENCE RATE FOR LARGE-SCALE OPTIMIZATION

A DOUBLE INCREMENTAL AGGREGATED GRADIENT METHOD WITH LINEAR CONVERGENCE RATE FOR LARGE-SCALE OPTIMIZATION A DOUBLE INCREMENTAL AGGREGATED GRADIENT METHOD WITH LINEAR CONVERGENCE RATE FOR LARGE-SCALE OPTIMIZATION Arya Mokhtari, Mert Gürbüzbalaba, ad Alejadro Ribeiro Departmet of Electrical ad Systems Egieerig,

More information

Online appendices from The xva Challenge by Jon Gregory. APPENDIX 10A: Exposure and swaption analogy.

Online appendices from The xva Challenge by Jon Gregory. APPENDIX 10A: Exposure and swaption analogy. APPENDIX 10A: Exposure ad swaptio aalogy. Sorese ad Bollier (1994), effectively calculate the CVA of a swap positio ad show this ca be writte as: CVA swap = LGD V swaptio (t; t i, T) PD(t i 1, t i ). i=1

More information

Subject CT5 Contingencies Core Technical. Syllabus. for the 2011 Examinations. The Faculty of Actuaries and Institute of Actuaries.

Subject CT5 Contingencies Core Technical. Syllabus. for the 2011 Examinations. The Faculty of Actuaries and Institute of Actuaries. Subject CT5 Cotigecies Core Techical Syllabus for the 2011 Examiatios 1 Jue 2010 The Faculty of Actuaries ad Istitute of Actuaries Aim The aim of the Cotigecies subject is to provide a groudig i the mathematical

More information

Journal of Statistical Software

Journal of Statistical Software JSS Joural of Statistical Software Jue 2007, Volume 19, Issue 6. http://www.jstatsoft.org/ Ratioal Arithmetic Mathematica Fuctios to Evaluate the Oe-sided Oe-sample K-S Cumulative Samplig Distributio J.

More information

ACTUARIAL RESEARCH CLEARING HOUSE 1990 VOL. 2 INTEREST, AMORTIZATION AND SIMPLICITY. by Thomas M. Zavist, A.S.A.

ACTUARIAL RESEARCH CLEARING HOUSE 1990 VOL. 2 INTEREST, AMORTIZATION AND SIMPLICITY. by Thomas M. Zavist, A.S.A. ACTUARIAL RESEARCH CLEARING HOUSE 1990 VOL. INTEREST, AMORTIZATION AND SIMPLICITY by Thomas M. Zavist, A.S.A. 37 Iterest m Amortizatio ad Simplicity Cosider simple iterest for a momet. Suppose you have

More information

CHAPTER 2 PRICING OF BONDS

CHAPTER 2 PRICING OF BONDS CHAPTER 2 PRICING OF BONDS CHAPTER SUARY This chapter will focus o the time value of moey ad how to calculate the price of a bod. Whe pricig a bod it is ecessary to estimate the expected cash flows ad

More information

A New Approach to Obtain an Optimal Solution for the Assignment Problem

A New Approach to Obtain an Optimal Solution for the Assignment Problem Iteratioal Joural of Sciece ad Research (IJSR) ISSN (Olie): 231-7064 Idex Copericus Value (2013): 6.14 Impact Factor (2015): 6.31 A New Approach to Obtai a Optimal Solutio for the Assigmet Problem A. Seethalakshmy

More information

Sequences and Series

Sequences and Series Sequeces ad Series Matt Rosezweig Cotets Sequeces ad Series. Sequeces.................................................. Series....................................................3 Rudi Chapter 3 Exercises........................................

More information

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion Basic formula for the Chi-square test (Observed - Expected ) Expected Basic formula for cofidece itervals sˆ x ± Z ' Sample size adjustmet for fiite populatio (N * ) (N + - 1) Formulas for estimatig populatio

More information

AUTOMATIC GENERATION OF FUZZY PAYOFF MATRIX IN GAME THEORY

AUTOMATIC GENERATION OF FUZZY PAYOFF MATRIX IN GAME THEORY AUTOMATIC GENERATION OF FUZZY PAYOFF MATRIX IN GAME THEORY Dr. Farha I. D. Al Ai * ad Dr. Muhaed Alfarras ** * College of Egieerig ** College of Coputer Egieerig ad scieces Gulf Uiversity * Dr.farha@gulfuiversity.et;

More information

Lecture 4: Probability (continued)

Lecture 4: Probability (continued) Lecture 4: Probability (cotiued) Desity Curves We ve defied probabilities for discrete variables (such as coi tossig). Probabilities for cotiuous or measuremet variables also are evaluated usig relative

More information

FOUNDATION ACTED COURSE (FAC)

FOUNDATION ACTED COURSE (FAC) FOUNDATION ACTED COURSE (FAC) What is the Foudatio ActEd Course (FAC)? FAC is desiged to help studets improve their mathematical skills i preparatio for the Core Techical subjects. It is a referece documet

More information

Models of Asset Pricing

Models of Asset Pricing 4 Appedix 1 to Chapter Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

Production planning optimization in the wood remanufacturing mills using multi-stage stochastic programming

Production planning optimization in the wood remanufacturing mills using multi-stage stochastic programming roductio plaig oimizatio i the wood remaufacturig mills usig multi-stage stochastic programmig Rezva Rafiei 1, Luis Atoio De Sata-Eulalia 1, Mustapha Nourelfath 2 1 Faculté d admiistratio, Uiversité de

More information

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions A Empirical Study of the Behaviour of the Sample Kurtosis i Samples from Symmetric Stable Distributios J. Marti va Zyl Departmet of Actuarial Sciece ad Mathematical Statistics, Uiversity of the Free State,

More information

Calculation of the Annual Equivalent Rate (AER)

Calculation of the Annual Equivalent Rate (AER) Appedix to Code of Coduct for the Advertisig of Iterest Bearig Accouts. (31/1/0) Calculatio of the Aual Equivalet Rate (AER) a) The most geeral case of the calculatio is the rate of iterest which, if applied

More information

Many Techniques Developed. Uncertainty. Aspects of Uncertainty. Decision Theory = Probability + Utility Theory

Many Techniques Developed. Uncertainty. Aspects of Uncertainty. Decision Theory = Probability + Utility Theory Ucertait CSE 47 Ma Techiques Developed Fu Logic Certait Factors No-mootoic logic robabilit Ol oe has stood the test of time! UW CSE AI Facult Aspects of Ucertait Suppose ou have a flight at oo Whe should

More information

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i The iformatio required by the mea-variace approach is substatial whe the umber of assets is large; there are mea values, variaces, ad )/2 covariaces - a total of 2 + )/2 parameters. Sigle-factor model:

More information

Problem Set 1a - Oligopoly

Problem Set 1a - Oligopoly Advaced Idustrial Ecoomics Sprig 2014 Joha Steek 6 may 2014 Problem Set 1a - Oligopoly 1 Table of Cotets 2 Price Competitio... 3 2.1 Courot Oligopoly with Homogeous Goods ad Differet Costs... 3 2.2 Bertrad

More information

Topic 14: Maximum Likelihood Estimation

Topic 14: Maximum Likelihood Estimation Toic 4: November, 009 As before, we begi with a samle X = (X,, X of radom variables chose accordig to oe of a family of robabilities P θ I additio, f(x θ, x = (x,, x will be used to deote the desity fuctio

More information

Monetary Economics: Problem Set #5 Solutions

Monetary Economics: Problem Set #5 Solutions Moetary Ecoomics oblem Set #5 Moetary Ecoomics: oblem Set #5 Solutios This problem set is marked out of 1 poits. The weight give to each part is idicated below. Please cotact me asap if you have ay questios.

More information

living well in retirement Adjusting Your Annuity Income Your Payment Flexibilities

living well in retirement Adjusting Your Annuity Income Your Payment Flexibilities livig well i retiremet Adjustig Your Auity Icome Your Paymet Flexibilities what s iside 2 TIAA Traditioal auity Icome 4 TIAA ad CREF Variable Auity Icome 7 Choices for Adjustig Your Auity Icome 7 Auity

More information

Appendix 1 to Chapter 5

Appendix 1 to Chapter 5 Appedix 1 to Chapter 5 Models of Asset Pricig I Chapter 4, we saw that the retur o a asset (such as a bod) measures how much we gai from holdig that asset. Whe we make a decisio to buy a asset, we are

More information

Productivity depending risk minimization of production activities

Productivity depending risk minimization of production activities Productivity depedig risk miimizatio of productio activities GEORGETTE KANARACHOU, VRASIDAS LEOPOULOS Productio Egieerig Sectio Natioal Techical Uiversity of Athes, Polytechioupolis Zografou, 15780 Athes

More information

Marking Estimation of Petri Nets based on Partial Observation

Marking Estimation of Petri Nets based on Partial Observation Markig Estimatio of Petri Nets based o Partial Observatio Alessadro Giua ( ), Jorge Júlvez ( ) 1, Carla Seatzu ( ) ( ) Dip. di Igegeria Elettrica ed Elettroica, Uiversità di Cagliari, Italy {giua,seatzu}@diee.uica.it

More information

A point estimate is the value of a statistic that estimates the value of a parameter.

A point estimate is the value of a statistic that estimates the value of a parameter. Chapter 9 Estimatig the Value of a Parameter Chapter 9.1 Estimatig a Populatio Proportio Objective A : Poit Estimate A poit estimate is the value of a statistic that estimates the value of a parameter.

More information

Risk Assessment for Project Plan Collapse

Risk Assessment for Project Plan Collapse 518 Proceedigs of the 8th Iteratioal Coferece o Iovatio & Maagemet Risk Assessmet for Project Pla Collapse Naoki Satoh 1, Hiromitsu Kumamoto 2, Norio Ohta 3 1. Wakayama Uiversity, Wakayama Uiv., Sakaedai

More information

1 The Power of Compounding

1 The Power of Compounding 1 The Power of Compoudig 1.1 Simple vs Compoud Iterest You deposit $1,000 i a bak that pays 5% iterest each year. At the ed of the year you will have eared $50. The bak seds you a check for $50 dollars.

More information

Faculdade de Economia da Universidade de Coimbra

Faculdade de Economia da Universidade de Coimbra Faculdade de Ecoomia da Uiversidade de Coimbra Grupo de Estudos Moetários e Fiaceiros (GEMF) Av. Dias da Silva, 65 300-5 COIMBRA, PORTUGAL gemf@fe.uc.pt http://www.uc.pt/feuc/gemf PEDRO GODINHO Estimatig

More information

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval BIOSTATS 540 Fall 015 6. Estimatio Page 1 of 7 Uit 6. Estimatio Use at least twelve observatios i costructig a cofidece iterval - Gerald va Belle What is the mea of the blood pressures of all the studets

More information

The Valuation of the Catastrophe Equity Puts with Jump Risks

The Valuation of the Catastrophe Equity Puts with Jump Risks The Valuatio of the Catastrophe Equity Puts with Jump Risks Shih-Kuei Li Natioal Uiversity of Kaohsiug Joit work with Chia-Chie Chag Outlie Catastrophe Isurace Products Literatures ad Motivatios Jump Risk

More information

Unbiased estimators Estimators

Unbiased estimators Estimators 19 Ubiased estimators I Chapter 17 we saw that a dataset ca be modeled as a realizatio of a radom sample from a probability distributio ad that quatities of iterest correspod to features of the model distributio.

More information

Kernel Density Estimation. Let X be a random variable with continuous distribution F (x) and density f(x) = d

Kernel Density Estimation. Let X be a random variable with continuous distribution F (x) and density f(x) = d Kerel Desity Estimatio Let X be a radom variable wit cotiuous distributio F (x) ad desity f(x) = d dx F (x). Te goal is to estimate f(x). Wile F (x) ca be estimated by te EDF ˆF (x), we caot set ˆf(x)

More information

LESSON #66 - SEQUENCES COMMON CORE ALGEBRA II

LESSON #66 - SEQUENCES COMMON CORE ALGEBRA II LESSON #66 - SEQUENCES COMMON CORE ALGEBRA II I Commo Core Algebra I, you studied sequeces, which are ordered lists of umbers. Sequeces are extremely importat i mathematics, both theoretical ad applied.

More information

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS Lecture 4: Parameter Estimatio ad Cofidece Itervals GENOME 560 Doug Fowler, GS (dfowler@uw.edu) 1 Review: Probability Distributios Discrete: Biomial distributio Hypergeometric distributio Poisso distributio

More information

1 Estimating the uncertainty attached to a sample mean: s 2 vs.

1 Estimating the uncertainty attached to a sample mean: s 2 vs. Political Sciece 100a/200a Fall 2001 Cofidece itervals ad hypothesis testig, Part I 1 1 Estimatig the ucertaity attached to a sample mea: s 2 vs. σ 2 Recall the problem of descriptive iferece: We wat to

More information

Policy Improvement for Repeated Zero-Sum Games with Asymmetric Information

Policy Improvement for Repeated Zero-Sum Games with Asymmetric Information Policy Improvemet for Repeated Zero-Sum Games with Asymmetric Iformatio Malachi Joes ad Jeff S. Shamma Abstract I a repeated zero-sum game, two players repeatedly play the same zero-sum game over several

More information

Subject CT1 Financial Mathematics Core Technical Syllabus

Subject CT1 Financial Mathematics Core Technical Syllabus Subject CT1 Fiacial Mathematics Core Techical Syllabus for the 2018 exams 1 Jue 2017 Subject CT1 Fiacial Mathematics Core Techical Aim The aim of the Fiacial Mathematics subject is to provide a groudig

More information

11.7 (TAYLOR SERIES) NAME: SOLUTIONS 31 July 2018

11.7 (TAYLOR SERIES) NAME: SOLUTIONS 31 July 2018 .7 (TAYLOR SERIES NAME: SOLUTIONS 3 July 08 TAYLOR SERIES ( The power series T(x f ( (c (x c is called the Taylor Series for f(x cetered at x c. If c 0, this is called a Maclauri series. ( The N-th partial

More information

Twitter: @Owe134866 www.mathsfreeresourcelibrary.com Prior Kowledge Check 1) State whether each variable is qualitative or quatitative: a) Car colour Qualitative b) Miles travelled by a cyclist c) Favourite

More information

Research Article The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean

Research Article The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 70806, 8 pages doi:0.540/0/70806 Research Article The Probability That a Measuremet Falls withi a Rage of Stadard Deviatios

More information

These characteristics are expressed in terms of statistical properties which are estimated from the sample data.

These characteristics are expressed in terms of statistical properties which are estimated from the sample data. 0. Key Statistical Measures of Data Four pricipal features which characterize a set of observatios o a radom variable are: (i) the cetral tedecy or the value aroud which all other values are buched, (ii)

More information

of Asset Pricing R e = expected return

of Asset Pricing R e = expected return Appedix 1 to Chapter 5 Models of Asset Pricig EXPECTED RETURN I Chapter 4, we saw that the retur o a asset (such as a bod) measures how much we gai from holdig that asset. Whe we make a decisio to buy

More information