arxiv: v2 [cs.lg] 7 Oct 2016

Size: px
Start display at page:

Download "arxiv: v2 [cs.lg] 7 Oct 2016"

Transcription

1 Hadi Daeshmad Aurelie Lucchi Thomas Hofma Departmet of Computer Sciece, ETH Zurich, Switzerlad arxiv: v2 cs.lg] 7 Oct 216 Abstract For may machie learig problems, data is abudat ad it may be prohibitive to make multiple passes through the full traiig set. I this cotext, we ivestigate strategies for dyamically icreasig the effective sample size, whe usig iterative methods such as stochastic gradiet descet. Our iterest is motivated by the rise of variace-reduced methods, which achieve liear covergece rates that scale favorably for smaller sample sizes. Exploitig this feature, we show theoretically ad empirically how to obtai sigificat speed-ups with a ovel algorithm that reaches statistical accuracy o a -sample i 2, istead of log steps. 1. Itroductio I empirical risk miimizatio (ERM) (Vapik, 1998) the traiig set S is used to defie a sample risk R S, which is the miimized with regard to a pre-defied fuctio class. Oe effectively equates learig algorithms with optimizatio algorithms. However, for all practical purposes a approximate solutio of R S will be sufficiet, as log as the optimizatio error is small relative to the statistical accuracy at sample size := S. This is importat for massive data sets, where optimizatio to umerical precisio is ifeasible. Istead of performig early stoppig o black-box optimizatio, oe ought to uderstad the trade-offs betwee statistical ad computatioal accuracy, cf. (Chadrasekara & Jorda, 213). I this paper, we ivestigate a much eglected facet of this topic, amely how to dyamically cotrol the effective sample size i optimizatio. May large-scale optimizatio algorithms are iterative: they use sampled or aggregated data to perform a sequece Proceedigs of the 33 rd Iteratioal Coferece o Machie Learig, New York, NY, USA, 216. JMLR: W&CP volume 48. Copyright 216 by the author(s). of update steps. This icludes the popular family of gradiet descet methods. Ofte, the computatioal complexity icreases with the size of the traiig sample, e.g. i steepest-descet, where the cost of a gradiet computatio scales with. Does oe really eed a highly accurate gradiet though, i particular i the early phase of optimizatio? Why ot use subsets T t S which are icreased i size with the iteratio cout t, matchig-up statistical accuracy with optimizatio accuracy i a dyamic maer? This is the geeral program we pursue i this paper. I order to make this idea cocrete ad to reach competitive results, we focus o a recet variat of stochastic gradiet descet (SGD), which is kow as SAGA (Defazio et al., 214). As we will show, this algorithm has a particularly iterestig property i how its covergece rate depeds o Empirical Risk Miimizatio Formally, we assume that traiig examples x S X have bee draw i.i.d. from some uderlyig, but ukow probability distributio P. We fix a fuctio class F parametrized by weight vectorsw R d ad defie the expected risk asr(w) := Ef x (w), wheref is ax-idexed family of loss fuctios, ofte covex. We deote the miimum ad the miimizer of R(w) over F by R ad w, respectively. Give that P is ukow, ERM suggests to rely o the empirical (or sample) risk with regard tos R S (w) := 1 x S f x (w), ws := argmir S (w). (1) w F Note that oe may absorb a regularizer i the defiitio of the lossf x Geeralizatio bouds The relatio betwee w ad ws has bee widely studied i the literature o learig theory. It is usually aalysed with the help of uiform covergece bouds that take the geeric form (Bouchero et al., 25) ] E S sup R(w) R S (w) H(), (2) w F

2 where the expectatio is over a radom -sample S. Here H is a boud that depeds o, usually through a ratio /d, where d is the capacity of F (e.g. VC dimesio). This fast covergece rate has bee show to hold for a class of strictly covex loss fuctios such as quadratic, ad logistic loss (Bartlett et al., 26; 25). I the realizable case, we may be able to observe a favorable H() d/, whereas i the pessimistic case, we may oly be able to establish weaker bouds such as H() d/ (e.g. for liear fuctio classes); see also (Bousquet & Bottou, 28). We igore additioal log factors that ca be elimiated usig the chaiig techique (Bousquet, 22; Bousquet & Bottou, 28) Statistical efficiecy Assume ow that we have some approximate optimizatio algorithm, which give S produces solutios w S that are o averageǫ() optimal, i.e. E S R S (w S ) R S ] ǫ(). Oe ca the provide the followig quality guaratee i expectatio over sample sets S (Bousquet & Bottou, 28) E S R(w S ) R H()+ǫ(), (3) which is a additive decompositio of the expected solutio suboptimality ito a estimatio (or statistical) error H() ad a optimizatio (or computatioal) error ǫ(). For a give computatioal budget, oe typically fids that ǫ() is icreasig with, whereas H() is always decreasig. This hits at a trade-off, which may suggest to chose a sample size m <. Ituitively speakig, cocetratig the computatioal budget o fewer data may be better tha spreadig computatios too thily Stochastic Gradiet Optimizatio For large scale problems, stochastic gradiet descet is a method of choice i order to optimize problems of the form give i Eq. (1). Yet, while SGD update directios equal the true (egative) gradiet directio i expectatio, high variace typically leads to sub-liear covergece. This is where variace-reducig methods for ERM such as SAG (Roux et al., 212), SVRG (Johso & Zhag, 213), ad SAGA (Defazio et al., 214) come ito play. We focus o the latter here, where oe ca establish the followig result o the covergece rate (see appedix). Lemma 1. Let all f x be covex with L-Lipschitz cotiuous gradiets ad assume that R S is µ-strogly covex. The the suboptimality of the SAGA iteratew t aftertsteps is over a radomly sampled S bouded by ( E A RS (w t ) R S] ρ t 1 C S, ρ = 1 mi, µ ), L This highlights two differet regimes: For small, the coditio umber κ := L µ dictates how fast the optimizatio algorithm coverges. O the other had, for large, the covergece rate of SAGA becomesρ = Cotributios Our mai questio is: ca we obtai faster covergece to a statistically accurate solutio by ruig SAGA o a iitially smaller sample, whose size is the gradually icreased? Motivated by a simple, yet succict aalysis, we preset a ovel algorithm, called DYNASAGA that implemets this idea ad achieves ǫ() H() after oly 2 iteratios. 2. Related Work Stochastic approximatio is a powerful tool for miimizig objective Eq. (1) for covex loss fuctios. The pioeerig work of (Robbis & Moro, 1951) is essetially a streamig SGD method where each observatio is used oly oce. Aother major milestoes has bee the idea of iterate averagig (Polyak & Juditsky, 1992). A thorough theoretical aalysis of asymptotic covergece of SGD ca be foud i (Kusher & Yi, 23), whereas some o-asymptotic results have bee preseted i (Moulies & Bach, 211). A lie of recet work kow as variace-reduced SGD, e.g. (Roux et al., 212; Shalev-Shwartz & Zhag, 213; Johso & Zhag, 213; Defazio et al., 214; 215; Koečỳ & Richtárik, 213; Zhag et al., 213), has exploited the fiite sum structure of the empirical risk to establish liear covergece for strogly covex objectives ad also a a better covergece rate for purely covex objectives (Mahdavi et al., 213). There is also evidece of slightly improved statistical efficiecy (Babaezhad et al., 215). (Frostig et al., 215) provides a o-asymptotic aalysis of a streamig SVRG algorithm (SSVRG), for which a covergece rate approachig that of the ERM is established. There have also bee related data-adaptive samplig approaches, e.g. i the cotext of usupervised learig (Lucic et al., 215) or for o-uiform samplig of data poits (Schmidt et al., 213; He & Takác, 215) with the goal of samplig importat data poits more ofte. This directio is largely orthogoal to our dyamic sizig of the sample, which is purely based o radom subsamplig. Our samplig strategy is istead based o revisitig samples which has also bee explored i (Wag et al., 216) to empirically improve the covergece of certai variacereduced methods. where the expectatio is over the algorithmic radomess.

3 taied by differetiatig V with regard to 1/m ad solvig form(see Lemma 9 i appedix). ǫ(m )+H(m ) m ǫ(m) H(m) Figure 1. Tradeoff betwee sample statistical accuracy term H(m) ad optimizatio suboptimality ǫ(m) usig sample size m <. Note that ǫ(m) is draw by takig the first order approximatio of the upper boud Ce m. Here, m = O(/log) yields the best balace betwee these two terms. 3. Methodology 3.1. Settig ad Assumptios We work uder the assumptios made i Lemma 1 ad focus o the large data regime, where κ ad the geometric rate of covergece of SAGA depeds o through ρ = 1 1/. This is a iterestig regime as the guarateed progress per update is larger for smaller samples. This form of ρ implies for the case of performig t = iteratios, i.e. performig oe pass 1 : ( E A R S (w ) R S] 1 1 ) C S C S e. (4) So we are guarateed to improve the solutio suboptimality o average by a factor 1/e per pass. This i tur implies that i order to get to a guarateed accuracy O( α ), we eed O(α log ) update steps Sample Size Optimizatio For illustrative purposes, let us use the above result to select a sample size for SAGA, which yields the best guaratees. Propositio 2. Assume H(m) = D/m ad is give. DefieC to be a upper-boud oc S, S (from Lemma 1), the for m κ, V(m) := D m + Ce m provides a boud o the expected suboptimality of SAGA. It is miimized for the choice { } m = max κ, log+log C. D Proof. The first claim follows directly from the assumptios ad Lemma 1. Moreover the tightest boud is ob- 1 The SAGA aalysis holds for i.i.d. samplig, so strictly speakig this is ot a pass, but correspods to update steps. m The result implies that we will perform roughly log + log C D epochs o the optimally sized sample. Also the value of the boud is (for simplicity, assumig C = D) V(m ) = log + 1 V() = e, (5) showig that the sigle pass approximatio error o the full sample is too large (costat), relative to the statistical accuracy Dyamic Sample Growth As we have see, optimizig over a smaller sample ca be beeficial (if we believe the sigificace of the bouds). But why chose a sigle sample size oce ad for all? A smaller sample set seems advatageous early o, but as a optimizatio algorithm approaches the empirical miimizer, it is hit by the statistical accuracy limit. This suggests that we should dyamically icremet the size of the sample set. We illustrate this idea i Figure 2. I order to aalyze such a dyamic samplig scheme, we eed to relate the suboptimality o a sub-sample T to a suboptimality boud os. We establish a basic result i the followig theorem. Theorem 3. Let w be a (ǫ, T )-optimal solutio, i.e.r T (w) R T ǫ, wheret S,m := T, := S. The the suboptimality of w for R S is bouded w.h.p. i the choice of T as: E S R S (w) R S Proof. Cosider the followig equality m ] ǫ+ H(m). (6) R S (w) R S = R S (w) (1) R T (w) (2) R T (3) R S We boud the three ivolved differeces (i expectatio) as follows: (2): R T (w) R T ǫ by assumptio. (3): E S R T (wt ) R S(wS )] as T S. For (1) we apply the boud (see Lemma 1 i the appedix) E S T R S (w) R T (w)] m R(w) R T(w). Moreover E T R(w) R T (w)] sup w R(w ) R T (w ) H(m) by Eq. (2), which cocludes the proof. I plai Eglish, this result suggests the followig: If we have optimized w to (ǫ,t ) accuracy o a sub-sample T ad we wat to cotiue optimizig o a larger samples T, the we ca boud the suboptimality or S by the same ǫ plus a additioal switchig cost of( m)/ H(m).

4 R(w) H(/4) H(/3) H(/2) sample size H() Figure 2. Illustratio of a optimal progress path via sample size adjustmet. The vertical black lies show the progress made at each step, thus illustratig the faster covergece for smaller sample size. Table 1. Compariso of obtaied bouds for differet SAGA variats whe performig T κ update steps. METHOD OPTIMIZATION ERROR SAMPLES SAGA (oe pass) cost. T SAGA (optimal size) O(logT H(T)) T/logT DYNASAGA O(H(T)) T/2 4. Algorithms & Aalysis 4.1. Computatioal Limited Learig The work of (Bottou, 21) emphasized that for massive data sets the limitig factor of ay learig algorithm will be its computatioal complexityt, rather tha the umber of samples. For SGD this computatioal limit typically traslates ito the umber of stochastic gradiets evaluated by the algorithm, i.e. T becomes the umber of update steps. Oe obvious strategy with abudat data is to sample a ew data poit i every iteratio. There are asymptotic results establishig bouds for various SGD variats i (Bousquet & Bottou, 28). However, SAGA ad related algorithms rely o memorizig past stochastic gradiets, cf. (Hofma et al., 215), which makes it beeficial to revisit data poits, ad which is at the root of results such as Lemma 1. This leads to a qualitatively differet behavior ad our fidigs idicate that ideed, the trade-offs for large scale learig eed to be re-visited, cf. Table SAGA with Dyamic Sample Sizes We suggest to modify SAGA to work with a dyamic sample size schedule. Let us defie a schedule as a mootoic fuctio M : Z + Z +, where t is the iteratio umber Algorithm 2 DYNASAGA 1: Iput: traiig examplesx = (x 1,x 2,...,x ),x i P total umber of iteratios T (e.g. T = 2) startig poitw R d (e.gw = ) learig rateη > (e.g.η = 1 4L ) sample schedulem : 1 : T] 1 : ] 2: w w 3: fori = 1,..., do 4: α i f xi (w ) {ca also be doe o the fly} 5: ed for 6: fort = 1,...,T do 7: samplex i Uiform(x 1,...,x M(t) ) 8: g f xi (w t 1 ) 9: A M(t) j=1 α j/m(t){ca be doe icremetally} 1: w t w t 1 η(g α i +A) 11: α i g 12: ed for ad M(t) the effective sample size used at t. We assume that a sequece of data poits X = (x 1,...,x ) draw from P is give such that M iduces a ested sequece of samplest t := {x i : 1 i M(t)}. DYNASAGA geeralizes SAGA (Defazio et al., 214) i that it samples data poits o-uiformly at each iteratio. Specifically, for a give schedule M ad iteratio t, it samples uiformly from T t, but igores X T t. The pseudocode for DYNASAGA is show i Algorithm Upper Boud Recurrece Assume we are give a stochastic optimizatio method that guaratees a geometrical decay at each iteratio, i.e. E A RS (w t ) R S] ρ RS (w t 1 ) R ] S (7) where S = ad expectatio is over radomess of optimizatio process. 2 For acceleratio, we pursue the strategy of usig the basic iequalities obtaied so far ad to stitch them together i the form of a recurrece. At ay iteratiot we allow ourselves the choice to augmet the curret sample of size m by some icremet m. We defie a upper boud fuctio U as follows ρ U(t 1,) U(t,) = mi mi m< U(t,m)+ m H(m) ], such that U(,m) = ξ, where the iitial error ξ is defied as: ξ := 4L R(w ) R(w ) ]. (9) µ 2 Note that this assumptio is slightly stroger tha Lemma 1 but it leads to a much simpler proof techique. (8)

5 We refer the reader to Lemma 8 i the Appedix for further details o how to derive the expressio forξ. The costructio of Eq. (8) is motivated by the followig result: Propositio 4. W.h.p. over the radom -sample X, the iterate sequecew t geerated by DYNASAGA fulfils E X RT (w t ) R T ] U(t,). Proof. By iductio over t. The result for t = follows directly from Lemma 8. The first case i Eq. (8) for the iductio step (fixed sample size) follows from Eq. (7). The secod case holds by virtue of Theorem 3 for aym, hece also for the miimum. Although the U-recursio ca be solved for small usig dyamic programmig (assumig kowledge of all costats), we aalyse a much simpler heuristics ad its behavior. This leads to iterestig isights, while beig very practical. I particular, our algorithm is a aytime algorithm, which does ot require kowledge of the total umber of iteratiost ahead of time Sample Schedules I this sectio, we preset ad aalyse two adaptive sample-size schemes for DYNASAGA. We start with sample size κ ad perform 2κ steps. From the o, we add a ew sample every other iteratio. The effective sample size is thus M LIN (t) = max { 2κ, } t 2 (1) Note that this strategy defies a upper boud o U(2t, t) adu(2t+1,t). We have also implemeted a variat where we perform updates i alteratio: every other iteratio we sample a ew data poit, which is added to the set. However, we also force a update o this fresh sample. I alteratio, we simply re-sample a existig data poit uiformly at radom. We do ot provide a theoretical aalysis for this scheme but show experimetally that it slightly outperforms the strategy (see results i the appedix). We thus report results for the ALTERNAT- ING strategy i the experimetal sectio Aalysis We ow provide a aalysis that establishes the covergece rate of the strategy. Lemma 5. For H() = D α, < α 1, the strategy obtais the followig suboptimality U(2,) H()+ ξ 2 ( κ ) 2 (11) Proof. By iductio over. The base case follows from C m ξ. Usig Eq. (8) ad (11) for the iductive case, we get U(2(+1),+1) (8) ρ 2 +1 U(2,)+ 1 ] +1 H() (11) ξ ( ) 2 κ + 2 (+2) 2 +1 (+1) 3 H() Note that by defiitio of the logarithmic fuctio, log(+2)] < 2log(+1), ad moreover H() +1H(+1) = 1 α 1, (+1) 1 α which completes the proof. This meas that for large eoughthe strategy is able to approach the statistical accuracy with2 iteratios, i.e. two passes over the data. Note the very sigificat improvemet relative to the log factor iheret to the optimal fixed sample size choice (see Table 1 for a compariso of these two bouds). What does that imply for thet = case that we have bee emphasizig? It is simple to state a aswer as a corollary. Corollary 6. Uder the same assumptios as Lemma 5, it holds for eve U(,) ( 3 2 α 1) ( κ 2 H()+2ξ ) Proof. Note that with Eq. (8) (a) ad Lemma 5 (b) we get U(2,2) (a) U(2,)+ 1 (b) H() 3 ( κ ) H()+2ξ 2 The fact that H() = 2 α H(2) completes the proof. The proof of the above corollary suggests to oly use = T/2 samples, whe performig T steps ad to simply igore the other half (that potetially could have bee sampled). Oe might woder if a better strategy tha the oe could be defied, e.g. by iteratig more tha twice o each ewly added sample or by icreasig the sample size by more tha oe. The ext lemma aswers this questio ad proves that the strategy is optimal for large-scale datasets as log as H() 1/. Lemma 7. Assume that H() D/, the the strategy is optimal for all sample size > κ.

6 l R(w) R(w ) ] 9 11 κ = Suboptimality of Risk y = 1.4 x l() 9 11 κ =.75 Suboptimality of Risk y =.59 x l() Figure 3. Results o sythetic dataset. (left) Sice, the empirical suboptimality is 1/, we expect the slope measured o this plot to be close to oe. (right) Sice κ =.75 slows dow the covergece rate, the slope of this plot is less tha oe. Table 2. Details of the real datasets used i our experimets. All datasets were selected from the LIBSVM dataset collectio. DATASET SIZE NUMBER OF FEATURES RCV1.BINARY A9A W8A IJCNN REAL-SIM COVTYPE.BINARY SUSY 5 18 coditio umber κ. Proof. Here, we briefly state a sketch of the proof. The details are preseted i Appedix A.2. First, we reformulate the problem of the optimal sample size schedule i terms of umber of iteratios o each samples size. Give that this problem is covex, we ca use the KKT coditios to prove the optimality of icremetig by oe sample (see Lemma 12) ad iteratig twice o each sample size (see Lemma 13). 5. Experimetal Results We preset experimetal results o sythetic as well as real-world data, which largely cofirms the above aalysis Baselies We compare DYNASAGA (both the ad ALTER- NATING strategy) to various optimizatio methods preseted i Sectio 2. This icludes SGD (with costat ad decreasig step-size), SAGA, streamig SVRG (SSVRG) as well as the mixed SGD/SVRG approach preseted i (Babaezhad et al., 215) Experimet o sythetic data We cosider liear regressio, where iputs a R d are draw from a Gaussia distributio N(,Σ d d ) ad outputs are corrupted by additive oise y = x,w + ǫ, ǫ N (,σ 2). We are give i.i.d observatios of this model, S = {(a i,y i )} i=1, from which we compute the least squares riskr S (w) = 1 i=1 ( a i,w y i ) 2. By cosiderig the matrix A to be a row-wise arragemet of the iput vectors a i, we ca write the Hessia matrix of R (w) as Σ = 1 AT A. Whe d, the matrix Σ coverges to Σ ad we ca therefore assume that R (w) is µ-strogly covex ad L-Lipschitz where the costats µ ad L are the smallest ad largest eigevalues of Σ. We experimet with two differet values for the Case κ = : We use a diagoal Σ with elemets decreasig from 1 to 1, hece κ =. I this particular case the aalysis derived i Lemma 5 predicts a upper boud U(,) < O( 1 ) which is cofirmed by the results show i Figure 3. Case κ = 3 4 : Whe κ = 3 4, the term ( κ 2 ) is the domiatig term i the proposed upper-boud. ( I this case, 1 U(, ) is thus upper-bouded by O ), which is oce agai verified experimetally i Figure Experimets o Real Datasets We also ra experimets o several real-world datasets i order to compare the performace of DYNASAGA to stateof-the-art methods. The details of the datasets are show i Table 2. Throughout all the experimets we used the logistic loss with a regularizer λ = 1 3. Figures 4, ad 5 show the suboptimality o the empirical risk ad expected risk after a sigle pass over the datasets. The various parameters used for the baselie methods are described i Table 3. A critical factor i the performace of most baselies, especially SGD, is the selectio of the step-size. We picked the best-performig step-size withi the commo rage guided by existig theoretical aalyses, specifically η = 1/L ad η = C C+µt for various values of C. Overall, we ca see that DYNASAGA performs very well, both as a optimizatio as well as a learig algorithm. SGD is also very competitive ad typically achieves faster covergece tha the other baselies, however, its behaviour is ot stable throughout all the datasets. The SGD variat with decreasig step-size is typically very fast i the early stages but the slows dow after a certai umber of steps. The results o the RCV dataset are somehow surprisig as SGD with costat step-size clearly outperforms all methods but we show i the appedix that its behaviour 3 We also preset some additioal results for various regularizers of the form λ = 1 p,p < 1 i the appedix

7 SGD SAGA dyasaga SSVRG SGD:.5 SGD:.5 SGD/SVRG RCV 2. A9A W8A 4. IJCNN1 5. REAL-SIM x 1 5 x COVTYPE 7. SUSY Figure 4. Suboptimality ] o the empirical risk. The vertical axis shows the suboptimality of the empirical risk, i.e. log 2 E 1 RT(w t ) R T where the expectatio is take over 1 idepedet rus. The traiig set icludes 9% of the data. The vertical red dashed lie is draw after exactly oe epoch over the data. gets worse as we icrease the coditio umber. As ca be see very clearly, DYNASAGA yields excellet solutios i terms of expected risk after oe pass (see suboptimality values that itersect with the vertical red dashed lies). 6. Coclusio We have preseted a ew methodology to exploit the tradeoff betwee computatioal ad statistical complexity, i order to achieve fast covergece to a statistically efficiet solutio. Specifically, we have focussed o a modificatio of SAGA ad suggested a simple dyamic samplig schedule that adds oe ew data poit every other update step. Our aalysis shows competitive covergece rates both i term of suboptimality o the empirical risk as well as (more importatly) the expected risk i a oe pass or a two pass settig. These results have bee validated experimetally. Our approach depeds o the uderlyig optimizatio method oly through its covergece rate for miimizig a empirical risk. We thus suspect that a similar sample size adaptio is applicable to a much wider rage of algorithms, icludig to o-covex optimizatio methods for deep learig.

8 SGD SAGA dyasaga SSVRG SGD:.5 SGD:.5 SGD/SVRG RCV 2. A9A W8A 4. IJCNN1 5. REAL-SIM x 1 5 x COVTYPE 7. SUSY Figure 5. Suboptimality o the expected risk. The vertical axis shows the suboptimality of the expected risk, i.e. log 2 E 1 RS(w t ) R S(w T) ], where S is a test set which icludes 1% of the data ad w T is the optimum of the empirical risk o T. The vertical red dashed lie is draw after exactly oe epoch over the data. Refereces Babaezhad, Reza, Ahmed, Mohamed Osama, Virai, Alim, Schmidt, Mark, Koečỳ, Jakub, ad Sallie, Scott. Stop wastig my gradiets: Practical svrg. Advaces i Neural Iformatio Processig Systems, 215. Bartlett, Peter L, Bousquet, Olivier, ad Medelso, Shahar. Local rademacher complexities. Aals of Statistics, pp , 25. Bartlett, Peter L, Jorda, Michael I, ad McAuliffe, Jo D. Covexity, classificatio, ad risk bouds. Joural of the America Statistical Associatio, 11(473): , 26. Bottou, Léo. Large-scale machie learig with stochastic gradiet descet. I Proceedigs of COMP- STAT 21, pp Spriger, 21. Bouchero, Stéphae, Bousquet, Olivier, ad Lugosi, Gábor. Theory of classificatio: A survey of some recet advaces. ESAIM: probability ad statistics, 9: , 25. Bousquet, Olivier. Cocetratio iequalities ad empirical processes theory applied to the aalysis of learig algorithms. PhD thesis, Ecole Polytechique, 22. Bousquet, Olivier ad Bottou, Léo. The tradeoffs of large scale learig. I Advaces i Neural Iformatio Processig Systems, pp , 28.

9 Boyd, Stephe ad Vadeberghe, Lieve. Covex Optimizatio. Cambridge Uiversity Press, New York, NY, USA, 24. Chadrasekara, Vekat ad Jorda, Michael I. Computatioal ad statistical tradeoffs via covex relaxatio. Proceedigs of the Natioal Academy of Scieces, 11 (13):E1181 E119, 213. Defazio, Aaro, Bach, Fracis, ad Lacoste-Julie, Simo. Saga: A fast icremetal gradiet method with support for o-strogly covex composite objectives. I Advaces i Neural Iformatio Processig Systems, pp , 214. Defazio, Aaro J, Caetao, Tibério S, ad Domke, Justi. Fiito: A faster, permutable icremetal gradiet method for big data problems. I The iteratioal coferece o Machie learig, 215. Frostig, Roy, Ge, Rog, Kakade, Sham M., ad Sidford, Aaro. Competig with the empirical risk miimizer i a sigle pass. I The Coferece o Learig Theory, pp , 215. He, Xi ad Takác, Marti. Dual free SDCA for empirical risk miimizatio with adaptive probabilities. CoRR, abs/ , 215. Hofma, Thomas, Lucchi, Aurelie, Lacoste-Julie, Simo, ad McWilliams, Bria. Variace reduced stochastic gradiet descet with eighbors. I Advaces i Neural Iformatio Processig Systems 28, pp Curra Associates, Ic., 215. Johso, Rie ad Zhag, Tog. Acceleratig stochastic gradiet descet usig predictive variace reductio. I Advaces i Neural Iformatio Processig Systems, pp , 213. learig. I Advaces i Neural Iformatio Processig Systems, pp , 211. Polyak, Boris T ad Juditsky, Aatoli B. Acceleratio of stochastic approximatio by averagig. SIAM Joural o Cotrol ad Optimizatio, 3(4): , Robbis, Herbert ad Moro, Sutto. A stochastic approximatio method. The Aals of Mathematical Statistics, pp. 4 47, Roux, Nicolas L, Schmidt, Mark, ad Bach, Fracis R. A stochastic gradiet method with a expoetial covergece rate for fiite traiig sets. I Advaces i Neural Iformatio Processig Systems, pp , 212. Schmidt, Mark, Roux, Nicolas Le, ad Bach, Fracis. Miimizig fiite sums with the stochastic average gradiet. arxiv preprit arxiv: , 213. Shalev-Shwartz, Shai ad Zhag, Tog. Stochastic dual coordiate ascet methods for regularized loss. The Joural of Machie Learig Research, 14: , 213. Vapik, Vlamimir. Statistical learig theory, volume 1. Wiley New York, Wag, Jialei, Wag, Hai, ad Srebro, Natha. Reducig rutime by recyclig samples. arxiv preprit arxiv: , 216. Zhag, Liju, Mahdavi, Mehrdad, ad Ji, Rog. Liear covergece with coditio umber idepedet access of full gradiets. I Advaces i Neural Iformatio Processig Systems, pp , 213. Koečỳ, Jakub ad Richtárik, Peter. Semi-stochastic gradiet descet methods. arxiv preprit arxiv: , 213. Kusher, Harold J ad Yi, George. Stochastic approximatio ad recursive algorithms ad applicatios, volume 35. Spriger Sciece & Busiess Media, 23. Lucic, Mario, Ohaessia, Mesrob I, Karbasi, Ami, ad Krause, Adreas. Tradeoffs for space, time, data ad risk i usupervised learig. I AISTATS, 215. Mahdavi, Mehrdad, Zhag, Liju, ad Ji, Rog. Mixed optimizatio for smooth fuctios. I Advaces i Neural Iformatio Processig Systems, pp , 213. Moulies, Eric ad Bach, Fracis R. No-asymptotic aalysis of stochastic approximatio algorithms for machie

10 A. Appedix A.1. Proofs Proof of Lemma 1. Proof. We start with the covergece rate of SAGA established i (Defazio et al., 214) as E A w t ws 2] ρ t S w ws 2 S ( + RS (w ) R S (w µ S +L S),w ws R ) ] S. (12) We the use the L-smoothess assumptio of f x (w) to relate the suboptimality o the fuctio values to the boud i Eq. (12). E A RS (w t ) R S (w S ) ] = E A Ex S fx (w t ) ] E x S f x (w S )] ] L smoothess LE A w t w S 2] Eq. 12 ρ t S C S, wherec S is the iitial suboptimality o the empirical risk defied as: C S = L w ws 2 S ( + RS (w ) R S (w µ S +L S),w ws R ) ] S Note that this iitial error depeds o the set S ad its size S. I the followig Lemma, we propose a upper boud o this iitial error that is idepedet ofs Lemma 8. W.h.p, the iitial suboptimality error of samples is bouded by: C S ξ := 4L µ R(w ) R(w ) ] Proof. We first use the fact thatr S (w) is µ-strogly covex as well as the optimality ofws to boudc S as ( C S := L w ws S 2 + RS (w ) R S (ws µ S +L ),w ws R S(wS )]) L RS (w ) R S (w µ S) ] + S L RS (w ) R S (w µ S +L S),w ws R S (ws) ] L RS (w ) R S (w µ S) ] + S L RS (w ) R S (w µ S +L S) ] (L>) 2L RS (w ) R S (ws µ )] 2L ] R S (w ) 1] R(w ) 2] R(w ) 3] R(w µ S) R S (ws) We use the geeralizatio bouds i (Vapik, 1998) to upper boud 1] ad 2]. For 3], we used the uiform covergece rate of the ERM that implies (Vapik, 1998): where c is a costat. We the get C S w.h.p 2L µ R(wS) R(w ) csup R S (w) R(w), w H( S )+R(w ) R(w )+ch( S )+H( S ) ]. (13)

11 We also make the further assumptio that with high probability the iitial suboptimality is greater tha a costat factor of the statistical accuracy, i.e. R(w ) R(w ) > (2+c)H( S ). We ca the further upper boudc S as C S 4L µ R(w ) R(w ) ]. (14) Lemma 9 (for Propositio 2). V(m) := D m +Ce m, the argmi V(m) = <m log C D Proof. dv dm 1 = D Ce m! = e m = D C m = log C D Solvig for m, this ideed correspods to a miimum which ca be verified by checkig the boudary valuesm = ad m. Lemma 1 (for Theorem 3). Proof. E S T R S (w) R T (w)] m R(w) R T(w). E S T R S (w) R T (w)] = E S T T R S (w) R T (w)] = E S T 1 f x (w)+ f y (w) 1 f x (w) m x T y S T x T = m E S T 1 f y (w) R T (w) m y S T = m E S T 1 f y (w) R T (w) m y S T = m E S T R S T (w)] R T (w)] = m R(w) R T (w)]

12 A.2. Optimality of the Strategy Startig Small Learig with Adaptive Sample Sizes We here itroduce a ew otatio ad chose to represet a sample size schedule by a vector t = t m,m < where t m deotes the umber of iteratios o sample size m. Note that the total umber of iteratios up to the sample size is T = m< t m. We defie as the sample size that we iterate o immediately before sample size, i.e. = max{k < : t k > }. (15) We ow rewrite the suboptimality boud i terms of the sample size schedulet as where the secod equality is derived usig Lemma 1 ad Theorem 3. A(t ) = E S R S (w(t )) R S (w )] ) = ρ t (A(t )+ H( ), (16) Oe ca relate the upper boudu(,) toa(t ) usig the followig costraied program: U(,) = mi t A(t ) (17) Subject to m : t m t m = m I the followig we aim at showig that the Strategy is the optimal solutio of Equatio 16. We first prove a Lemma that will be used i the rest of our aalysis. Lemma 11 (Expasio ofa(t )). if H() = D/, the A(t ) := C(t )+ C(t ) := ξ i=m m=m +1 B m (t ), where (18) ( ) ti i 1, B m (t ) := i D (m 1)m ( ) ti i 1. (19) i=m i Proof. Although oe could paistakigly uroll the recursivity i Equatio 16, we here provide a simple iductio proof. First, oe ca easily verify that the equatio holds for = m. For the iductive step, we assume it holds for ad prove it holds for all {k : < k }. Accordig to the defiitio of, we havet k = for all < k <, ad therefore ρ t k k = k m= +1 ρ tm m. (2) We will also make use of the followig equality i our aalysis: k k H( ) = H( ) H(k) (H()=D/) = k m= +1 H(m 1) H(m). (21)

13 We are ow ready to prove the iductive step. ( A(t k EQ 16 ) = ρ t k k A(t )+ k ) H( ) k = ρ t k k C(t )+ EQ 19, 2 = C(t k )+ EQ 21 = C(t k )+ EQ 2 = C(t k )+ = C(t k )+ m=m +1 m=m +1 m=m +1 m=m +1 k m=m +1 B m (t )+ k k ( ) B m (t k )+ρ t k k k H( ) k B m (t k )+ρ t k k B m (t k )+ k m= +1 k m= +1 (22) H( ) (23) D (m 1)m (24) (25) B m (t k ) (26) B m (t k ) (27) Usig the defiitios provided i Lemma 11, we ivestigate the optimality coditios of the optimal sample size strategy. I the followig, we simplify our otatios ad writeb m adc istead ofb m (t ) adc(t ). As a first step i our aalysis, we itroduce the followig equatios based o the defiitios ofb m adc. B m = ( ) ti i 1 = i i=m 1 m(m 1) ( i 1 i i m ) ti = m+1 m 1 ( ) tm m 1 B m+1. (28) m ( ( (i 1 ) )) ti exp log = exp i i=m We ow compute the derivative ofa(t ) as ( A(t ) = log(1 1 t m m ) C(t )+ 1 m ( C + m k=m +1 B k ) m k=m +1 i=m B k (t ) ( t i log 1 1 i) ]. (29) ). (3) C(t ) adb m (t ) are log-covex (hece covex) fuctios with respect tot. Sice the sum operator preserves covexity (Boyd & Vadeberghe, 24), A(t ) is covex as well. Let λ i, ν deote the Lagragia coefficiets associated with the iequality ad equality costraits respectively. Accordig the KKT coditios (Boyd & Vadeberghe, 24) for the the optimal solutio, the followig iequalities hold: λ m (31) λ m t m = (32) A(t ) t m λ m +ν = (33) Accordig the above coditio there are two possible cases for the partial derivative A(t ) t m :

14 For the case oft m >, the slackess coditio 32 implies that λ m =. The, accordig to the coditio 33: A(t ) = ν t ( m EQ. 3 = 1 m C + m k=m +1 B k ) = ν (34) For the case oft m =,λ i > (a.) holds based o the complemetary slackess coditio 32. EQ. 3 = 1 m A(t ) t ( m C + = λ i ν (a.) > ν m k=m +1 B k ) < ν (35) I the followig two lemmas we use the coditios of optimality derived i Equatios 34 ad 35 to prove optimality of the Strategy. Specifically, we first prove that for the optimal strategy, t m > for m < m ad t m = for m >. We also prove the optimality of icremetig the sample size by oe. I the secod lemma, we show that t m 2. Lemma 12 (Optimality of sample size icremet). For large eoughm, a schedule witht m = adt m+1 > caot be optimal. Proof. Note that by repeated applicatio of Equatio (28) we obtai B m+1 < B m < < B m +1 EQ. 34 & 35 < ν (36) where optimality coditios a. t m > (EQ.34) ad b. t m +1 = (EQ.35) yeild the last iequality: B m +1 = a. = m +1 k=m +1 m +1 k=m +1 B k m k=m +1 B k C (37) B k +C mν (38) b. < (m+1)ν mν = ν (39) O the other had, optimality of a. t m+1 > (EQ.34) ad b. t m = (EQ.35) also imply B m+1 > ν which is i cotradictio with the previously establishedb m+1 < ν. Ideed, we have B m+1 = m+1 k=m +1 B k a. = (m+1)ν m k=m +1 m k=m +1 B k C (4) B k C (41) b. > (m+1)ν mν = ν (42) Lemma 13 (Optimality of two iteratios). Cosider t largem : m < m, t m 2. as the miimizer of the optimizatio problem 17. For sufficietly

15 Proof. Usig Lemma 12, t m > holds for m < m. We proceed with optimality coditios a. t m > ad b. t m 1 > i equatio 34. B m = m k=m +1 a. = mν B k m 1 k=m +1 m 1 k=m +1 B k C (43) B k C (44) b. = mν (m 1)ν = ν (45) Cosequetly,B m = B m+1 = ν. Usig Equatio 28, oe coclude that t m 2: ( ) ( ) t m 1 m 1 m+1 = m log 1 2 t m+1 m m = log ( ) 1 1 2m 2. (46) m+1 m

16 A.3. Additioal Experimetal results Startig Small Learig with Adaptive Sample Sizes A.3.1. COMPARISON OF THE TWO ADAPTIVE SAMPLE SIZE SCHEMES FOR DYNASAGA We here compare the ad schemes o the collectio of real datasets preseted i Table 2 for a regularizer λ = 1 2. The results for the empirical ad expected risk show i Figure 6 ad Figure 7 show that the scheme slightly outperforms the strategy x SUSY 2. RCV 3. A9A W8A 5. IJCNN1 6. REAL-SIM x COVTYPE Figure 6. Suboptimality o ] the empirical risk. The vertical axis shows the suboptimality of the empirical risk, i.e. log 2 E 1 RT(w t ) R T where the expectatio is take over 1 idepedet rus. The traiig set icludes 9% of the data. The vertical gree dashed lie is draw after exactly oe epoch over the data.

17 x SUSY 2. RCV 3. A9A W8A 5. IJCNN1 6. REAL-SIM x COVTYPE Figure 7. Suboptimality o the expected risk. The vertical axis shows the suboptimality of the expected risk, i.e. log 2 E 1 RS(w t ) R S(w T) ], where S is a test set which icludes 1% of the data ad w T is the optimum of the empirical risk o T. The vertical gree dashed lie is draw after exactly oe epoch over the data. A.3.2. EFFECT OF THE REGULARIZER We here preset additioal results for various regularizers of the form λ = 1,p < 1. I the iterest of clarity we oly p show results o four datasets. We ca see a similar tred to the mai results preseted i the paper for λ = 1 where DYNASAGAshows very fast covergece i terms of both empirical ad expected risk. SGD is also very competitive ad typically achieves faster covergece tha the other baselies, however, its behaviour is ot stable throughout all the datasets.

18 RCV 2. W8A SGD SAGA dyasaga SSVRG SGD:.5 SGD:.5 SGD/SVRG x IJCNN1 4. COVTYPE Figure 8. Suboptimality o the empirical risk with regularizer λ = RCV 2. W8A SGD SAGA dyasaga SSVRG SGD:.5 SGD:.5 SGD/SVRG x IJCNN1 4. COVTYPE Figure 9. Suboptimality o the expected risk with regularizer λ = 3

19 RCV 2. W8A SGD SAGA dyasaga SSVRG SGD:.5 SGD:.5 SGD/SVRG x IJCNN1 4. COVTYPE Figure 1. Suboptimality o the empirical risk with regularizer λ = RCV 2. W8A SGD SAGA dyasaga SSVRG SGD:.5 SGD:.5 SGD/SVRG x IJCNN1 4. COVTYPE Figure 11. Suboptimality o the expected risk with regularizer λ = 4

20 A.4. Details of Experimets The various parameters of all baselies ad DYNASAGA are represeted i Table 3. Table 3. Experimetal settig METHOD PARAMETER NOTATION VALUE.1 SGD STEP SIZE η t.1+µt.3 SAGA STEP SIZE η L+µ SSVRG AND SGD/SVRG FACTOR FOR INCREASING SAMPLE SIZE b 3 A CONSTANT PARAMETER p 2 1 STEP SIZE η 1b p INITIAL BATCH SIZE k κ κ NUMBER OF STEPS ON EACH BATCH SIZE m η SGD:.5 STEP SIZE η.5 SGD:.5 STEP SIZE η.5.3 DYNASAGA STEP SIZE FOR SAMPLE SIZE m η(m) L+µm INITIAL BATCH SIZE k κ NUMBER OF ITERATIONS ON SAMPLE SIZE m t(m) 2

A DOUBLE INCREMENTAL AGGREGATED GRADIENT METHOD WITH LINEAR CONVERGENCE RATE FOR LARGE-SCALE OPTIMIZATION

A DOUBLE INCREMENTAL AGGREGATED GRADIENT METHOD WITH LINEAR CONVERGENCE RATE FOR LARGE-SCALE OPTIMIZATION A DOUBLE INCREMENTAL AGGREGATED GRADIENT METHOD WITH LINEAR CONVERGENCE RATE FOR LARGE-SCALE OPTIMIZATION Arya Mokhtari, Mert Gürbüzbalaba, ad Alejadro Ribeiro Departmet of Electrical ad Systems Egieerig,

More information

5. Best Unbiased Estimators

5. Best Unbiased Estimators Best Ubiased Estimators http://www.math.uah.edu/stat/poit/ubiased.xhtml 1 of 7 7/16/2009 6:13 AM Virtual Laboratories > 7. Poit Estimatio > 1 2 3 4 5 6 5. Best Ubiased Estimators Basic Theory Cosider agai

More information

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions A Empirical Study of the Behaviour of the Sample Kurtosis i Samples from Symmetric Stable Distributios J. Marti va Zyl Departmet of Actuarial Sciece ad Mathematical Statistics, Uiversity of the Free State,

More information

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010

Combining imperfect data, and an introduction to data assimilation Ross Bannister, NCEO, September 2010 Combiig imperfect data, ad a itroductio to data assimilatio Ross Baister, NCEO, September 00 rbaister@readigacuk The probability desity fuctio (PDF prob that x lies betwee x ad x + dx p (x restrictio o

More information

1 Random Variables and Key Statistics

1 Random Variables and Key Statistics Review of Statistics 1 Radom Variables ad Key Statistics Radom Variable: A radom variable is a variable that takes o differet umerical values from a sample space determied by chace (probability distributio,

More information

Math 312, Intro. to Real Analysis: Homework #4 Solutions

Math 312, Intro. to Real Analysis: Homework #4 Solutions Math 3, Itro. to Real Aalysis: Homework #4 Solutios Stephe G. Simpso Moday, March, 009 The assigmet cosists of Exercises 0.6, 0.8, 0.0,.,.3,.6,.0,.,. i the Ross textbook. Each problem couts 0 poits. 0.6.

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpeCourseWare http://ocwmitedu 430 Itroductio to Statistical Methods i Ecoomics Sprig 009 For iformatio about citig these materials or our Terms of Use, visit: http://ocwmitedu/terms 430 Itroductio

More information

Statistics for Economics & Business

Statistics for Economics & Business Statistics for Ecoomics & Busiess Cofidece Iterval Estimatio Learig Objectives I this chapter, you lear: To costruct ad iterpret cofidece iterval estimates for the mea ad the proportio How to determie

More information

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL A SULEMENTAL MATERIAL Theorem (Expert pseudo-regret upper boud. Let us cosider a istace of the I-SG problem ad apply the FL algorithm, where each possible profile A is a expert ad receives, at roud, a

More information

Sequences and Series

Sequences and Series Sequeces ad Series Matt Rosezweig Cotets Sequeces ad Series. Sequeces.................................................. Series....................................................3 Rudi Chapter 3 Exercises........................................

More information

Maximum Empirical Likelihood Estimation (MELE)

Maximum Empirical Likelihood Estimation (MELE) Maximum Empirical Likelihood Estimatio (MELE Natha Smooha Abstract Estimatio of Stadard Liear Model - Maximum Empirical Likelihood Estimator: Combiatio of the idea of imum likelihood method of momets,

More information

Sampling Distributions and Estimation

Sampling Distributions and Estimation Cotets 40 Samplig Distributios ad Estimatio 40.1 Samplig Distributios 40. Iterval Estimatio for the Variace 13 Learig outcomes You will lear about the distributios which are created whe a populatio is

More information

Overlapping Generations

Overlapping Generations Eco. 53a all 996 C. Sims. troductio Overlappig Geeratios We wat to study how asset markets allow idividuals, motivated by the eed to provide icome for their retiremet years, to fiace capital accumulatio

More information

Estimation of Population Variance Utilizing Auxiliary Information

Estimation of Population Variance Utilizing Auxiliary Information Iteratioal Joural of Statistics ad Systems ISSN 0973-675 Volume 1, Number (017), pp. 303-309 Research Idia Publicatios http://www.ripublicatio.com Estimatio of Populatio Variace Utilizig Auxiliary Iformatio

More information

Lecture 9: The law of large numbers and central limit theorem

Lecture 9: The law of large numbers and central limit theorem Lecture 9: The law of large umbers ad cetral limit theorem Theorem.4 Let X,X 2,... be idepedet radom variables with fiite expectatios. (i) (The SLLN). If there is a costat p [,2] such that E X i p i i=

More information

Rafa l Kulik and Marc Raimondo. University of Ottawa and University of Sydney. Supplementary material

Rafa l Kulik and Marc Raimondo. University of Ottawa and University of Sydney. Supplementary material Statistica Siica 009: Supplemet 1 L p -WAVELET REGRESSION WITH CORRELATED ERRORS AND INVERSE PROBLEMS Rafa l Kulik ad Marc Raimodo Uiversity of Ottawa ad Uiversity of Sydey Supplemetary material This ote

More information

1 Estimating sensitivities

1 Estimating sensitivities Copyright c 27 by Karl Sigma 1 Estimatig sesitivities Whe estimatig the Greeks, such as the, the geeral problem ivolves a radom variable Y = Y (α) (such as a discouted payoff) that depeds o a parameter

More information

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

point estimator a random variable (like P or X) whose values are used to estimate a population parameter Estimatio We have oted that the pollig problem which attempts to estimate the proportio p of Successes i some populatio ad the measuremet problem which attempts to estimate the mea value µ of some quatity

More information

5 Statistical Inference

5 Statistical Inference 5 Statistical Iferece 5.1 Trasitio from Probability Theory to Statistical Iferece 1. We have ow more or less fiished the probability sectio of the course - we ow tur attetio to statistical iferece. I statistical

More information

Estimating Proportions with Confidence

Estimating Proportions with Confidence Aoucemets: Discussio today is review for midterm, o credit. You may atted more tha oe discussio sectio. Brig sheets of otes ad calculator to midterm. We will provide Scatro form. Homework: (Due Wed Chapter

More information

Parametric Density Estimation: Maximum Likelihood Estimation

Parametric Density Estimation: Maximum Likelihood Estimation Parametric Desity stimatio: Maimum Likelihood stimatio C6 Today Itroductio to desity estimatio Maimum Likelihood stimatio Itroducto Bayesia Decisio Theory i previous lectures tells us how to desig a optimal

More information

Productivity depending risk minimization of production activities

Productivity depending risk minimization of production activities Productivity depedig risk miimizatio of productio activities GEORGETTE KANARACHOU, VRASIDAS LEOPOULOS Productio Egieerig Sectio Natioal Techical Uiversity of Athes, Polytechioupolis Zografou, 15780 Athes

More information

ECON 5350 Class Notes Maximum Likelihood Estimation

ECON 5350 Class Notes Maximum Likelihood Estimation ECON 5350 Class Notes Maximum Likelihood Estimatio 1 Maximum Likelihood Estimatio Example #1. Cosider the radom sample {X 1 = 0.5, X 2 = 2.0, X 3 = 10.0, X 4 = 1.5, X 5 = 7.0} geerated from a expoetial

More information

Unbiased estimators Estimators

Unbiased estimators Estimators 19 Ubiased estimators I Chapter 17 we saw that a dataset ca be modeled as a realizatio of a radom sample from a probability distributio ad that quatities of iterest correspod to features of the model distributio.

More information

18.S096 Problem Set 5 Fall 2013 Volatility Modeling Due Date: 10/29/2013

18.S096 Problem Set 5 Fall 2013 Volatility Modeling Due Date: 10/29/2013 18.S096 Problem Set 5 Fall 2013 Volatility Modelig Due Date: 10/29/2013 1. Sample Estimators of Diffusio Process Volatility ad Drift Let {X t } be the price of a fiacial security that follows a geometric

More information

We analyze the computational problem of estimating financial risk in a nested simulation. In this approach,

We analyze the computational problem of estimating financial risk in a nested simulation. In this approach, MANAGEMENT SCIENCE Vol. 57, No. 6, Jue 2011, pp. 1172 1194 iss 0025-1909 eiss 1526-5501 11 5706 1172 doi 10.1287/msc.1110.1330 2011 INFORMS Efficiet Risk Estimatio via Nested Sequetial Simulatio Mark Broadie

More information

CAPITAL PROJECT SCREENING AND SELECTION

CAPITAL PROJECT SCREENING AND SELECTION CAPITAL PROJECT SCREEIG AD SELECTIO Before studyig the three measures of ivestmet attractiveess, we will review a simple method that is commoly used to scree capital ivestmets. Oe of the primary cocers

More information

Bayes Estimator for Coefficient of Variation and Inverse Coefficient of Variation for the Normal Distribution

Bayes Estimator for Coefficient of Variation and Inverse Coefficient of Variation for the Normal Distribution Iteratioal Joural of Statistics ad Systems ISSN 0973-675 Volume, Number 4 (07, pp. 7-73 Research Idia Publicatios http://www.ripublicatio.com Bayes Estimator for Coefficiet of Variatio ad Iverse Coefficiet

More information

A New Constructive Proof of Graham's Theorem and More New Classes of Functionally Complete Functions

A New Constructive Proof of Graham's Theorem and More New Classes of Functionally Complete Functions A New Costructive Proof of Graham's Theorem ad More New Classes of Fuctioally Complete Fuctios Azhou Yag, Ph.D. Zhu-qi Lu, Ph.D. Abstract A -valued two-variable truth fuctio is called fuctioally complete,

More information

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS Lecture 4: Parameter Estimatio ad Cofidece Itervals GENOME 560 Doug Fowler, GS (dfowler@uw.edu) 1 Review: Probability Distributios Discrete: Biomial distributio Hypergeometric distributio Poisso distributio

More information

The material in this chapter is motivated by Experiment 9.

The material in this chapter is motivated by Experiment 9. Chapter 5 Optimal Auctios The material i this chapter is motivated by Experimet 9. We wish to aalyze the decisio of a seller who sets a reserve price whe auctioig off a item to a group of bidders. We begi

More information

Hopscotch and Explicit difference method for solving Black-Scholes PDE

Hopscotch and Explicit difference method for solving Black-Scholes PDE Mälardale iversity Fiacial Egieerig Program Aalytical Fiace Semiar Report Hopscotch ad Explicit differece method for solvig Blac-Scholes PDE Istructor: Ja Röma Team members: A Gog HaiLog Zhao Hog Cui 0

More information

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices?

FINM6900 Finance Theory How Is Asymmetric Information Reflected in Asset Prices? FINM6900 Fiace Theory How Is Asymmetric Iformatio Reflected i Asset Prices? February 3, 2012 Referece S. Grossma, O the Efficiecy of Competitive Stock Markets where Traders Have Diverse iformatio, Joural

More information

Random Sequences Using the Divisor Pairs Function

Random Sequences Using the Divisor Pairs Function Radom Sequeces Usig the Divisor Pairs Fuctio Subhash Kak Abstract. This paper ivestigates the radomess properties of a fuctio of the divisor pairs of a atural umber. This fuctio, the atecedets of which

More information

EVEN NUMBERED EXERCISES IN CHAPTER 4

EVEN NUMBERED EXERCISES IN CHAPTER 4 Joh Riley 7 July EVEN NUMBERED EXERCISES IN CHAPTER 4 SECTION 4 Exercise 4-: Cost Fuctio of a Cobb-Douglas firm What is the cost fuctio of a firm with a Cobb-Douglas productio fuctio? Rather tha miimie

More information

x satisfying all regularity conditions. Then

x satisfying all regularity conditions. Then AMS570.01 Practice Midterm Exam Sprig, 018 Name: ID: Sigature: Istructio: This is a close book exam. You are allowed oe-page 8x11 formula sheet (-sided). No cellphoe or calculator or computer is allowed.

More information

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1 Chapter 8 Cofidece Iterval Estimatio Copyright 2015, 2012, 2009 Pearso Educatio, Ic. Chapter 8, Slide 1 Learig Objectives I this chapter, you lear: To costruct ad iterpret cofidece iterval estimates for

More information

Neighboring Optimal Solution for Fuzzy Travelling Salesman Problem

Neighboring Optimal Solution for Fuzzy Travelling Salesman Problem Iteratioal Joural of Egieerig Research ad Geeral Sciece Volume 2, Issue 4, Jue-July, 2014 Neighborig Optimal Solutio for Fuzzy Travellig Salesma Problem D. Stephe Digar 1, K. Thiripura Sudari 2 1 Research

More information

A New Approach to Obtain an Optimal Solution for the Assignment Problem

A New Approach to Obtain an Optimal Solution for the Assignment Problem Iteratioal Joural of Sciece ad Research (IJSR) ISSN (Olie): 231-7064 Idex Copericus Value (2013): 6.14 Impact Factor (2015): 6.31 A New Approach to Obtai a Optimal Solutio for the Assigmet Problem A. Seethalakshmy

More information

0.1 Valuation Formula:

0.1 Valuation Formula: 0. Valuatio Formula: 0.. Case of Geeral Trees: q = er S S S 3 S q = er S S 4 S 5 S 4 q 3 = er S 3 S 6 S 7 S 6 Therefore, f (3) = e r [q 3 f (7) + ( q 3 ) f (6)] f () = e r [q f (5) + ( q ) f (4)] = f ()

More information

Chapter 8: Estimation of Mean & Proportion. Introduction

Chapter 8: Estimation of Mean & Proportion. Introduction Chapter 8: Estimatio of Mea & Proportio 8.1 Estimatio, Poit Estimate, ad Iterval Estimate 8.2 Estimatio of a Populatio Mea: σ Kow 8.3 Estimatio of a Populatio Mea: σ Not Kow 8.4 Estimatio of a Populatio

More information

Online appendices from Counterparty Risk and Credit Value Adjustment a continuing challenge for global financial markets by Jon Gregory

Online appendices from Counterparty Risk and Credit Value Adjustment a continuing challenge for global financial markets by Jon Gregory Olie appedices from Couterparty Risk ad Credit Value Adjustmet a APPENDIX 8A: Formulas for EE, PFE ad EPE for a ormal distributio Cosider a ormal distributio with mea (expected future value) ad stadard

More information

Models of Asset Pricing

Models of Asset Pricing APPENDIX 1 TO CHAPTER4 Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

Models of Asset Pricing

Models of Asset Pricing APPENDIX 1 TO CHAPTER 4 Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

INTERVAL GAMES. and player 2 selects 1, then player 2 would give player 1 a payoff of, 1) = 0.

INTERVAL GAMES. and player 2 selects 1, then player 2 would give player 1 a payoff of, 1) = 0. INTERVAL GAMES ANTHONY MENDES Let I ad I 2 be itervals of real umbers. A iterval game is played i this way: player secretly selects x I ad player 2 secretly ad idepedetly selects y I 2. After x ad y are

More information

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i

r i = a i + b i f b i = Cov[r i, f] The only parameters to be estimated for this model are a i 's, b i 's, σe 2 i The iformatio required by the mea-variace approach is substatial whe the umber of assets is large; there are mea values, variaces, ad )/2 covariaces - a total of 2 + )/2 parameters. Sigle-factor model:

More information

Reinforcement Learning

Reinforcement Learning Reiforcemet Learig Ala Fer * Based i part o slides by Daiel Weld So far. Give a MDP model we kow how to fid optimal policies (for moderately-sized MDPs) Value Iteratio or Policy Iteratio Give just a simulator

More information

Asymptotics: Consistency and Delta Method

Asymptotics: Consistency and Delta Method ad Delta Method MIT 18.655 Dr. Kempthore Sprig 2016 1 MIT 18.655 ad Delta Method Outlie Asymptotics 1 Asymptotics 2 MIT 18.655 ad Delta Method Cosistecy Asymptotics Statistical Estimatio Problem X 1,...,

More information

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

A random variable is a variable whose value is a numerical outcome of a random phenomenon. The Practice of Statistics, d ed ates, Moore, ad Stares Itroductio We are ofte more iterested i the umber of times a give outcome ca occur tha i the possible outcomes themselves For example, if we toss

More information

MODIFICATION OF HOLT S MODEL EXEMPLIFIED BY THE TRANSPORT OF GOODS BY INLAND WATERWAYS TRANSPORT

MODIFICATION OF HOLT S MODEL EXEMPLIFIED BY THE TRANSPORT OF GOODS BY INLAND WATERWAYS TRANSPORT The publicatio appeared i Szoste R.: Modificatio of Holt s model exemplified by the trasport of goods by ilad waterways trasport, Publishig House of Rzeszow Uiversity of Techology No. 85, Maagemet ad Maretig

More information

Journal of Statistical Software

Journal of Statistical Software JSS Joural of Statistical Software Jue 2007, Volume 19, Issue 6. http://www.jstatsoft.org/ Ratioal Arithmetic Mathematica Fuctios to Evaluate the Oe-sided Oe-sample K-S Cumulative Samplig Distributio J.

More information

Subject CT1 Financial Mathematics Core Technical Syllabus

Subject CT1 Financial Mathematics Core Technical Syllabus Subject CT1 Fiacial Mathematics Core Techical Syllabus for the 2018 exams 1 Jue 2017 Subject CT1 Fiacial Mathematics Core Techical Aim The aim of the Fiacial Mathematics subject is to provide a groudig

More information

Department of Mathematics, S.R.K.R. Engineering College, Bhimavaram, A.P., India 2

Department of Mathematics, S.R.K.R. Engineering College, Bhimavaram, A.P., India 2 Skewess Corrected Cotrol charts for two Iverted Models R. Subba Rao* 1, Pushpa Latha Mamidi 2, M.S. Ravi Kumar 3 1 Departmet of Mathematics, S.R.K.R. Egieerig College, Bhimavaram, A.P., Idia 2 Departmet

More information

We learned: $100 cash today is preferred over $100 a year from now

We learned: $100 cash today is preferred over $100 a year from now Recap from Last Week Time Value of Moey We leared: $ cash today is preferred over $ a year from ow there is time value of moey i the form of willigess of baks, busiesses, ad people to pay iterest for its

More information

4.5 Generalized likelihood ratio test

4.5 Generalized likelihood ratio test 4.5 Geeralized likelihood ratio test A assumptio that is used i the Athlete Biological Passport is that haemoglobi varies equally i all athletes. We wish to test this assumptio o a sample of k athletes.

More information

Solutions to Problem Sheet 1

Solutions to Problem Sheet 1 Solutios to Problem Sheet ) Use Theorem.4 to prove that p log for all real x 3. This is a versio of Theorem.4 with the iteger N replaced by the real x. Hit Give x 3 let N = [x], the largest iteger x. The,

More information

Models of Asset Pricing

Models of Asset Pricing 4 Appedix 1 to Chapter Models of Asset Pricig I this appedix, we first examie why diversificatio, the holdig of may risky assets i a portfolio, reduces the overall risk a ivestor faces. The we will see

More information

The Limit of a Sequence (Brief Summary) 1

The Limit of a Sequence (Brief Summary) 1 The Limit of a Sequece (Brief Summary). Defiitio. A real umber L is a it of a sequece of real umbers if every ope iterval cotaiig L cotais all but a fiite umber of terms of the sequece. 2. Claim. A sequece

More information

AY Term 2 Mock Examination

AY Term 2 Mock Examination AY 206-7 Term 2 Mock Examiatio Date / Start Time Course Group Istructor 24 March 207 / 2 PM to 3:00 PM QF302 Ivestmet ad Fiacial Data Aalysis G Christopher Tig INSTRUCTIONS TO STUDENTS. This mock examiatio

More information

Monopoly vs. Competition in Light of Extraction Norms. Abstract

Monopoly vs. Competition in Light of Extraction Norms. Abstract Moopoly vs. Competitio i Light of Extractio Norms By Arkadi Koziashvili, Shmuel Nitza ad Yossef Tobol Abstract This ote demostrates that whether the market is competitive or moopolistic eed ot be the result

More information

STAT 135 Solutions to Homework 3: 30 points

STAT 135 Solutions to Homework 3: 30 points STAT 35 Solutios to Homework 3: 30 poits Sprig 205 The objective of this Problem Set is to study the Stei Pheomeo 955. Suppose that θ θ, θ 2,..., θ cosists of ukow parameters, with 3. We wish to estimate

More information

. (The calculated sample mean is symbolized by x.)

. (The calculated sample mean is symbolized by x.) Stat 40, sectio 5.4 The Cetral Limit Theorem otes by Tim Pilachowski If you have t doe it yet, go to the Stat 40 page ad dowload the hadout 5.4 supplemet Cetral Limit Theorem. The homework (both practice

More information

STRAND: FINANCE. Unit 3 Loans and Mortgages TEXT. Contents. Section. 3.1 Annual Percentage Rate (APR) 3.2 APR for Repayment of Loans

STRAND: FINANCE. Unit 3 Loans and Mortgages TEXT. Contents. Section. 3.1 Annual Percentage Rate (APR) 3.2 APR for Repayment of Loans CMM Subject Support Strad: FINANCE Uit 3 Loas ad Mortgages: Text m e p STRAND: FINANCE Uit 3 Loas ad Mortgages TEXT Cotets Sectio 3.1 Aual Percetage Rate (APR) 3.2 APR for Repaymet of Loas 3.3 Credit Purchases

More information

Appendix 1 to Chapter 5

Appendix 1 to Chapter 5 Appedix 1 to Chapter 5 Models of Asset Pricig I Chapter 4, we saw that the retur o a asset (such as a bod) measures how much we gai from holdig that asset. Whe we make a decisio to buy a asset, we are

More information

DESCRIPTION OF MATHEMATICAL MODELS USED IN RATING ACTIVITIES

DESCRIPTION OF MATHEMATICAL MODELS USED IN RATING ACTIVITIES July 2014, Frakfurt am Mai. DESCRIPTION OF MATHEMATICAL MODELS USED IN RATING ACTIVITIES This documet outlies priciples ad key assumptios uderlyig the ratig models ad methodologies of Ratig-Agetur Expert

More information

Lecture 4: Probability (continued)

Lecture 4: Probability (continued) Lecture 4: Probability (cotiued) Desity Curves We ve defied probabilities for discrete variables (such as coi tossig). Probabilities for cotiuous or measuremet variables also are evaluated usig relative

More information

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

Standard Deviations for Normal Sampling Distributions are: For proportions For means _ Sectio 9.2 Cofidece Itervals for Proportios We will lear to use a sample to say somethig about the world at large. This process (statistical iferece) is based o our uderstadig of samplig models, ad will

More information

Anomaly Correction by Optimal Trading Frequency

Anomaly Correction by Optimal Trading Frequency Aomaly Correctio by Optimal Tradig Frequecy Yiqiao Yi Columbia Uiversity September 9, 206 Abstract Uder the assumptio that security prices follow radom walk, we look at price versus differet movig averages.

More information

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India July 2012

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India July 2012 Game Theory Lecture Notes By Y. Narahari Departmet of Computer Sciece ad Automatio Idia Istitute of Sciece Bagalore, Idia July 01 Chapter 4: Domiat Strategy Equilibria Note: This is a oly a draft versio,

More information

An Improved Estimator of Population Variance using known Coefficient of Variation

An Improved Estimator of Population Variance using known Coefficient of Variation J. Stat. Appl. Pro. Lett. 4, No. 1, 11-16 (017) 11 Joural of Statistics Applicatios & Probability Letters A Iteratioal Joural http://dx.doi.org/10.18576/jsapl/04010 A Improved Estimator of Populatio Variace

More information

Introduction to Probability and Statistics Chapter 7

Introduction to Probability and Statistics Chapter 7 Itroductio to Probability ad Statistics Chapter 7 Ammar M. Sarha, asarha@mathstat.dal.ca Departmet of Mathematics ad Statistics, Dalhousie Uiversity Fall Semester 008 Chapter 7 Statistical Itervals Based

More information

CHANGE POINT TREND ANALYSIS OF GNI PER CAPITA IN SELECTED EUROPEAN COUNTRIES AND ISRAEL

CHANGE POINT TREND ANALYSIS OF GNI PER CAPITA IN SELECTED EUROPEAN COUNTRIES AND ISRAEL The 9 th Iteratioal Days of Statistics ad Ecoomics, Prague, September 0-, 05 CHANGE POINT TREND ANALYSIS OF GNI PER CAPITA IN SELECTED EUROPEAN COUNTRIES AND ISRAEL Lia Alatawa Yossi Yacu Gregory Gurevich

More information

CHAPTER 8 Estimating with Confidence

CHAPTER 8 Estimating with Confidence CHAPTER 8 Estimatig with Cofidece 8.2 Estimatig a Populatio Proportio The Practice of Statistics, 5th Editio Stares, Tabor, Yates, Moore Bedford Freema Worth Publishers Estimatig a Populatio Proportio

More information

SETTING GATES IN THE STOCHASTIC PROJECT SCHEDULING PROBLEM USING CROSS ENTROPY

SETTING GATES IN THE STOCHASTIC PROJECT SCHEDULING PROBLEM USING CROSS ENTROPY 19 th Iteratioal Coferece o Productio Research SETTING GATES IN THE STOCHASTIC PROJECT SCHEDULING PROBLEM USING CROSS ENTROPY I. Bedavid, B. Golay Faculty of Idustrial Egieerig ad Maagemet, Techio Israel

More information

CAPITAL ASSET PRICING MODEL

CAPITAL ASSET PRICING MODEL CAPITAL ASSET PRICING MODEL RETURN. Retur i respect of a observatio is give by the followig formula R = (P P 0 ) + D P 0 Where R = Retur from the ivestmet durig this period P 0 = Curret market price P

More information

The Valuation of the Catastrophe Equity Puts with Jump Risks

The Valuation of the Catastrophe Equity Puts with Jump Risks The Valuatio of the Catastrophe Equity Puts with Jump Risks Shih-Kuei Li Natioal Uiversity of Kaohsiug Joit work with Chia-Chie Chag Outlie Catastrophe Isurace Products Literatures ad Motivatios Jump Risk

More information

Topic-7. Large Sample Estimation

Topic-7. Large Sample Estimation Topic-7 Large Sample Estimatio TYPES OF INFERENCE Ò Estimatio: É Estimatig or predictig the value of the parameter É What is (are) the most likely values of m or p? Ò Hypothesis Testig: É Decidig about

More information

of Asset Pricing R e = expected return

of Asset Pricing R e = expected return Appedix 1 to Chapter 5 Models of Asset Pricig EXPECTED RETURN I Chapter 4, we saw that the retur o a asset (such as a bod) measures how much we gai from holdig that asset. Whe we make a decisio to buy

More information

Online appendices from The xva Challenge by Jon Gregory. APPENDIX 10A: Exposure and swaption analogy.

Online appendices from The xva Challenge by Jon Gregory. APPENDIX 10A: Exposure and swaption analogy. APPENDIX 10A: Exposure ad swaptio aalogy. Sorese ad Bollier (1994), effectively calculate the CVA of a swap positio ad show this ca be writte as: CVA swap = LGD V swaptio (t; t i, T) PD(t i 1, t i ). i=1

More information

Chapter 5: Sequences and Series

Chapter 5: Sequences and Series Chapter 5: Sequeces ad Series 1. Sequeces 2. Arithmetic ad Geometric Sequeces 3. Summatio Notatio 4. Arithmetic Series 5. Geometric Series 6. Mortgage Paymets LESSON 1 SEQUENCES I Commo Core Algebra I,

More information

An Empirical Study on the Contribution of Foreign Trade to the Economic Growth of Jiangxi Province, China

An Empirical Study on the Contribution of Foreign Trade to the Economic Growth of Jiangxi Province, China usiess, 21, 2, 183-187 doi:1.4236/ib.21.2222 Published Olie Jue 21 (http://www.scirp.org/joural/ib) 183 A Empirical Study o the Cotributio of Foreig Trade to the Ecoomic Growth of Jiagxi Provice, Chia

More information

Dr. Maddah ENMG 624 Financial Eng g I 03/22/06. Chapter 6 Mean-Variance Portfolio Theory

Dr. Maddah ENMG 624 Financial Eng g I 03/22/06. Chapter 6 Mean-Variance Portfolio Theory Dr Maddah ENMG 64 Fiacial Eg g I 03//06 Chapter 6 Mea-Variace Portfolio Theory Sigle Period Ivestmets Typically, i a ivestmet the iitial outlay of capital is kow but the retur is ucertai A sigle-period

More information

Subject CT5 Contingencies Core Technical. Syllabus. for the 2011 Examinations. The Faculty of Actuaries and Institute of Actuaries.

Subject CT5 Contingencies Core Technical. Syllabus. for the 2011 Examinations. The Faculty of Actuaries and Institute of Actuaries. Subject CT5 Cotigecies Core Techical Syllabus for the 2011 Examiatios 1 Jue 2010 The Faculty of Actuaries ad Istitute of Actuaries Aim The aim of the Cotigecies subject is to provide a groudig i the mathematical

More information

Optimizing of the Investment Structure of the Telecommunication Sector Company

Optimizing of the Investment Structure of the Telecommunication Sector Company Iteratioal Joural of Ecoomics ad Busiess Admiistratio Vol. 1, No. 2, 2015, pp. 59-70 http://www.aisciece.org/joural/ijeba Optimizig of the Ivestmet Structure of the Telecommuicatio Sector Compay P. N.

More information

Monetary Economics: Problem Set #5 Solutions

Monetary Economics: Problem Set #5 Solutions Moetary Ecoomics oblem Set #5 Moetary Ecoomics: oblem Set #5 Solutios This problem set is marked out of 1 poits. The weight give to each part is idicated below. Please cotact me asap if you have ay questios.

More information

of Asset Pricing APPENDIX 1 TO CHAPTER EXPECTED RETURN APPLICATION Expected Return

of Asset Pricing APPENDIX 1 TO CHAPTER EXPECTED RETURN APPLICATION Expected Return APPENDIX 1 TO CHAPTER 5 Models of Asset Pricig I Chapter 4, we saw that the retur o a asset (such as a bod) measures how much we gai from holdig that asset. Whe we make a decisio to buy a asset, we are

More information

The Time Value of Money in Financial Management

The Time Value of Money in Financial Management The Time Value of Moey i Fiacial Maagemet Muteau Irea Ovidius Uiversity of Costata irea.muteau@yahoo.com Bacula Mariaa Traia Theoretical High School, Costata baculamariaa@yahoo.com Abstract The Time Value

More information

Exam 1 Spring 2015 Statistics for Applications 3/5/2015

Exam 1 Spring 2015 Statistics for Applications 3/5/2015 8.443 Exam Sprig 05 Statistics for Applicatios 3/5/05. Log Normal Distributio: A radom variable X follows a Logormal(θ, σ ) distributio if l(x) follows a Normal(θ, σ ) distributio. For the ormal radom

More information

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge Biomial Model Stock Price Dyamics The value of a optio at maturity depeds o the price of the uderlyig stock at maturity. The value of the optio today depeds o the expected value of the optio at maturity

More information

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3) Today: Fiish Chapter 9 (Sectios 9.6 to 9.8 ad 9.9 Lesso 3) ANNOUNCEMENTS: Quiz #7 begis after class today, eds Moday at 3pm. Quiz #8 will begi ext Friday ad ed at 10am Moday (day of fial). There will be

More information

SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME

SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME All Right Reserved No. of Pages - 10 No of Questios - 08 SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME YEAR I SEMESTER I (Group B) END SEMESTER EXAMINATION

More information

Research Article The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean

Research Article The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 70806, 8 pages doi:0.540/0/70806 Research Article The Probability That a Measuremet Falls withi a Rage of Stadard Deviatios

More information

Simulation Efficiency and an Introduction to Variance Reduction Methods

Simulation Efficiency and an Introduction to Variance Reduction Methods Mote Carlo Simulatio: IEOR E4703 Columbia Uiversity c 2017 by Marti Haugh Simulatio Efficiecy ad a Itroductio to Variace Reductio Methods I these otes we discuss the efficiecy of a Mote-Carlo estimator.

More information

Institute of Actuaries of India Subject CT5 General Insurance, Life and Health Contingencies

Institute of Actuaries of India Subject CT5 General Insurance, Life and Health Contingencies Istitute of Actuaries of Idia Subject CT5 Geeral Isurace, Life ad Health Cotigecies For 2017 Examiatios Aim The aim of the Cotigecies subject is to provide a groudig i the mathematical techiques which

More information

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion Basic formula for the Chi-square test (Observed - Expected ) Expected Basic formula for cofidece itervals sˆ x ± Z ' Sample size adjustmet for fiite populatio (N * ) (N + - 1) Formulas for estimatig populatio

More information

Faculdade de Economia da Universidade de Coimbra

Faculdade de Economia da Universidade de Coimbra Faculdade de Ecoomia da Uiversidade de Coimbra Grupo de Estudos Moetários e Fiaceiros (GEMF) Av. Dias da Silva, 65 300-5 COIMBRA, PORTUGAL gemf@fe.uc.pt http://www.uc.pt/feuc/gemf PEDRO GODINHO Estimatig

More information

FOUNDATION ACTED COURSE (FAC)

FOUNDATION ACTED COURSE (FAC) FOUNDATION ACTED COURSE (FAC) What is the Foudatio ActEd Course (FAC)? FAC is desiged to help studets improve their mathematical skills i preparatio for the Core Techical subjects. It is a referece documet

More information

Twitter: @Owe134866 www.mathsfreeresourcelibrary.com Prior Kowledge Check 1) State whether each variable is qualitative or quatitative: a) Car colour Qualitative b) Miles travelled by a cyclist c) Favourite

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

Mixed and Implicit Schemes Implicit Schemes. Exercise: Verify that ρ is unimodular: ρ = 1.

Mixed and Implicit Schemes Implicit Schemes. Exercise: Verify that ρ is unimodular: ρ = 1. Mixed ad Implicit Schemes 3..4 The leapfrog scheme is stable for the oscillatio equatio ad ustable for the frictio equatio. The Euler forward scheme is stable for the frictio equatio but ustable for the

More information