Active Sensing. Abstract. 1 Introduction

Size: px
Start display at page:

Download "Active Sensing. Abstract. 1 Introduction"

Transcription

1 Atve Sensng Shpeng Yu, Balaj Krshnapuram, Romer Rosales, R. Bharat Rao CAD and Knowledge Solutons, Semens Medal Solutons USA, In. Abstrat Labels are often expensve to get, and ths motvates atve learnng whh hooses the most nformatve samples for label aquston. In ths paper we study atve sensng n a mult-vew settng, motvated from many problems where grouped features are also expensve to obtan and need to be aqured (or sensed atvely (e.g., n aner dagnoss eah patent mght go through many tests suh as CT, Ultrasound and MRI to get valuable features. The strength of ths model s that one atvely sensed (sample, vew par would mprove the jont mult-vew lassfaton on all the samples. For ths purpose we extend the Bayesan o-tranng framework suh that t an handle mssng vews n a prnpled way, and ntrodue two rtera for vew aquston. Experments on one toy data and two real-world medal problems show the effetveness of ths model. 1 Introduton Labeled data an be expensve to obtan n a varety of mahne learnng problems. Atve learnng addresses the problem of effently hoosng data samples to be labeled n order to mprove overall learnng performane. From a aner dagnoss perspetve, ths s equvalent to hoosng patents to do a bopsy suh that the tumor s orretly dagnosed (bengn/malgnant. In ths paper we onsder a related but dfferent problem, now motvated by the fat that features may also be expensve to obtan (muh n the Appearng n Proeedngs of the 1 th Internatonal Conferene on Artfal Intellgene and Statsts (AISTATS 009, Clearwater Beah, Florda, USA. Volume 5 of JMLR: W&CP 5. Copyrght 009 by the authors. Features L Vews L 1 Atve Learnng Atve Sensng Fgure 1: Settngs for atve learnng (left and atve sensng wth vews (rght. The L olumn denotes the labels (outputs. Lght blue bloks denote observed data, and red bloks denote mssng data. same way as labels. More generally we onsder subsets of features; we refer to them as vews. In aner dagnoss, features ould ome from dfferent magng modaltes suh as CT, Ultrasound and MRI. In problems where there exst dfferent vews of the data, some of these vews ould be mssng for ertan samples (due to, e.g., hgh ost or lmted budget. We all atve sensng the proess of effently hoosng what vews and samples to addtonally aqure to mprove the overall learnng performane (f. Fg. 1. Examples of the atve sensng settng desrbed above are abundant. For land mne deteton n a sensor network, we may have dfferent types of sensors (as dfferent vews deployed at one loaton, but some sensors may not be avalable for all loatons due to hgh ost. So the nterest s to dede whh loaton and whh type of sensor we should addtonally onsder to aheve better deteton auray. In the medal dagnoss senaro, our motvatng applaton, spealsts rely on dfferent sets of medal fators, suh as demographs, magng, and bo-markers, to make lnal desons. A patent does not undergo all possble tests 639

2 Atve Sensng at one (due to varous sde effets suh as radaton and ontrast, but these tests are seleted based on the evdene olleted up to a partular pont. 1 It s seen that standard atve learnng would not work n ths settng. When some vews are mssng, one soluton s to learn a model usng those samples for whh all the vews are avalable. Another soluton s to mpute the mssng features usng the observed features. However, as the man motvatng fator for mult-vew learnng approahes, the nformaton provded by the ombned set of vews (taken at one s n general larger than that provded by any algorthm that onsders the vews separately (e.g., vews an renfore eah other. We provde two approahes for effently hoosng the (sample, vew par, based on the mutual nformaton (nvolvng varous random varables and on the predtve unertanty, respetvely. We formalze these wthn the reently proposed Bayesan otranng framework (Yu et al., 008, wth an mportant extenson to aount for data wth mssng vew nformaton. Ths provdes an undreted graphal model representaton of the atve sensng problem. We also provde methods for addressng densty modelng and approxmated nferene sub-problems arsng n ths probablst settng. Empral studes usng one toy data and two real-world medal problems learly show the effetveness of ths model. The rest of the paper s organzed as follows. We survey the related lterature n Seton. The Bayesan o-tranng model s extended to handle mssng vews n Seton 3. Seton 4 desrbes two methods for atve sensng,.e., dedng whh nomplete samples should be further haraterzed, and whh sensors should be deployed on them. Expermental results are provded n Seton 5. We onlude wth a bref dsusson and future work n Seton 6. Related Work Atve sensng provdes a new senaro n atve data aquston. The present formulaton benefts from prevous work n experment desgn (Lndley, 1956; Fedorov, 197, atve learnng (MaKay, 199; Seung et al., 199, and sensor plaement (Krause et al., 008. Whle there exst a onsderable body of work on the general noton of atve data aquston, to the best of our knowledge, ths s the frst paper to fous on ths noton of atve sensng feature aquston spefally for mprovng mult-vew learnng jontly for all unlabeled samples. Feature aquston was addressed n, e.g., (Melvlle 1 Ths s normally referred to as dfferental dagnoss. et al., 004; Blg and Getoor, 007, but there s a lear dfferene to atve sensng. Prevous feature aquston only onsders one sample at a tme,.e., when one sample s n onsderaton, the other samples are not affeted. But n atve sensng, one atvely aqured (sample, vew par wll mprove the lassfaton performane of all the unlabeled samples va a o-tranng settng. A related yet dfferent problem was onsdered to dentfy the optmal spatal loatons for plang a sngle type of sensor to model spatally varyng phenomena (Krause et al., 008; however, ths work addressed the use of a sngle type of sensor, and dd not onsder the senaro of multple vews. Co-tranng (Blum and Mthell, 1998 s based on the dea that the error rate on unseen test samples an be upper bounded by the dsagreement between the lassfaton-desons obtaned from ndependent haraterzatons (vews of the data (Dasgupta et al., 001. Reently, Bayesan o-tranng (Yu et al., 008 was proposed whh defnes an undreted graphal model for o-tranng and provdes a prnpled soluton to mult-vew learnng. However t an only handle data wthout mssng vews, and n ths paper we extend t suh that t provdes the bass for atve sensng. One of our rtera for atve sensng s to hoose the (sample, vew par whh provdes the maxmum mutual nformaton (MI (Cover and Thomas, 1991 about the non-parametr lassfaton funton. In order to aomplsh ths, we use the D-optmalty rteron, whle other hoes suh as A-optmalty and E-optmalty are also avalable (Flaherty et al., 006. Apart from MI maxmzaton, other objetve rtera for atve learnng nlude unertanty samplng (Lews and Gale, 1994; Cohn et al., 1996 and performane optmzaton (e.g., (Roy and MCallum, 001. In ths paper we make two prnpal ontrbutons. Frst, we extend Bayesan o-tranng to allow for mssng vews, aommodatng nompletely haraterzed objets. Deployng addtonal sensors to haraterze an objet would naturally help mprovng lassfaton auray. Our seond ontrbuton s to dentfy whh objets should be haraterzed usng addtonal sensors n order to mprove the lassfaton of all the unlabeled data. Ths s sgnfantly dfferent from prevous feature aquston work. 3 Bayesan Co-Tranng wth Mssng Vews Bayesan o-tranng defnes an undreted graphal model for sem-supervsed mult-vew learnng (Yu 640

3 Yu, Krshnapuram, Rosales, Rao x 1 (1 f(x 1 y 1 x 1 (1 f 1 (x 1 (1 f (x 1 f (x 1 ( x 1 ( x j (1 x k (1 x n (1 f(x j f(x k f(x n y j y k y n x j (1 x k (1 x n (1 f 1 (x j (1 f 1 (x n (1 y 1 f (x j y j f (x n f (x j ( f 1 (x (1 k f (x k f (x ( k y k f (x n ( x j ( x k ( x n ( (a (b y n Fgure : Bayesan o-tranng fator graphs for (a one-vew and (b two-vew problems, wth mssng vews. Observed varables are marked as dark/bold, and unobserved ones are marked as red/non-bold, nludng funtons f 1, f, f (blue/non-bold. Unobserved varables n a dotted box (suh as x (1 j are potental observatons for atve sensng. All labels y are denoted as observed n the graph, but ths s not requred. et al., 008. The orgnal work assumes that the nput data are omplete,.e., all the vews are observed for every data sample, but for atve sensng we need to defne a o-tranng strategy for data wth nomplete or mssng vews. In ths seton we extend Bayesan o-tranng to the ase where there are mssng (sample, vew pars n the nput data. The same notatons as n (Yu et al., 008 are preserved unless otherwse mentoned. Suppose we have m dfferent vews of a set of n data samples. Let x (j R d j be the features for the th sample obtaned usng the jth vew, where d j s the dmensonalty of the nput spae for vew j. Let eah vew j be observed for a subset of n j samples, and let I j denote the ndes of these samples n the whole sample set. Fnally let y = (y 1,..., y n denote the labels for these samples. In ths paper we onsder a bnary lassfaton senaro where eah y { 1, +1}. In Bayesan o-tranng, let f j denote the latent funton for the jth vew, and f j GP(0, κ j be ts GP pror n vew j. The onsensus funton f s defned to ensure ondtonal ndependene between the output y and the m latent funtons {f j } (Yu et al., 008 (f. Fg. for the fator graph. The undreted graphal model leads to the followng jont probablty: p (y, f, f 1,..., f m = 1 n m ψ(y, f (x ψ(f j ψ(f j, f, (1 Z =1 j=1 where f = {f (x } n =1 and f j = {f j (x (j } Ij are Note that subsrpts ndex the data sample, and supersrpts (wth round brakets ndex the vew. olumn vetors of length n and n j, respetvely. Note that unlke n (Yu et al., 008, f j s only realzed on a subset of samples (as denoted n I j and s of length n j (nstead of n. The wthn-vew potental ψ(f j s defned va the GP pror, ψ(f j = exp( 1 f j K 1 j f j, where K j R n j n j s the ovarane matrx for vew j; the onsensus potental ψ(f j, f desrbes how eah latent funton f j s related to the onsensus funton f, whh we defne as follows: ( ψ(f j, f = exp f j f (I j / σj. ( Note that f (I j takes the length-n j subset of vetor f wth ndes gven by I j. The dea here s to defne the onsensus potental for vew j usng only the data samples observed n vew j. As n (Yu et al., 008, σ j > 0 quantfes how far the latent funton f j s apart from f, and the output potental ψ(y, f (x s defned as λ(y f (x wth logst funton λ(z = (1 + exp( z Co-Tranng Kernel wth Mssng Vews As n (Yu et al., 008, we an also derve a otranng kernel K by ntegratng out all the latent funtons {f j } n (1. It s alulated as K = Λ 1, Λ = m j=1 A j, and eah A j s a n n matrx as A j (I j, I j = (K j + σ j I 1, and 0 otherwse. (3 That s, A j s an expanson of one-vew nformaton matrx (K j + σ j I 1 to the full sze n n, wth the 641

4 Atve Sensng other (unndexed entres flled wth 0. It s easly seen that suh a kernel K s ndeed postve defnte as long as eah one-vew kernel K j s postve defnte. Very mportantly, we note that one addtonal observaton of a (sample, vew par wll affet all the elements of the o-tranng kernel. Ths s exatly the property we would lke to have n atve sensng. 3. Co-Regularzaton wth Mssng Vews To be omplete we also gve the margnalzaton result for o-regularzaton. Ignorng the output y for the moment, ntegratng out the onsensus vew f leads to the followng jont pror: { p(f 1,..., f m = 1 Z exp 1 m f j K 1 j f j j=1 [ 1 [f j (x f k (x] / ] } 1. j<k x I j I k σ j σ k σ I l x l The frst part regularzes the funtonal spae of eah vew, and the seond part onstrans that every par of vews need to agree on the outputs for o-observed samples (nversely weghted by vew varanes and the sum of presons of the vews n whh the sample s observed. 4 Atve Sensng In atve sensng, we are nterested n seletng the best unobserved (sample, vew par for sensng, or for vew aquston, whh wll mprove the overall lassfaton performane. In ths seton we manly dsuss an approah based on the mutual nformaton framework, whh measures the expeted nformaton gan after observng an addtonal (sample, vew par. Another approah based on the predtve unertanty s also brefly dsussed n Seton 4.5. In the followng let D O and D U denote the observed and unobserved (sample, vew pars, respetvely. 4.1 Laplae Approxmaton To alulate the mutual nformaton we need to alulate the dfferental entropy of the onsensus vew funton f. Wth o-tranng kernel and the logst regresson loss, Laplae approxmaton an be appled to approxmate the posteror dstrbuton of f as a Gaussan dstrbuton. In partular, let the pror of the onsensus vew take the GP pror wth o-tranng kernel,.e., f N (0, K. Wth the logst regresson loss, the a posteror dstrbuton of f, p(f D O, y, s approxmately N (ˆf, ( post 1, (4 where ˆf s the maxmum a posteror (MAP estmate of f, and the posteror preson matrx post = K 1 + Φ, wth Φ the Hessan of the negatve loglkelhood. It turns out that Φ s a dagonal matrx, wth Φ(, = η (1 η where η = λ(ˆf (x. The dfferental entropy of f under ths Laplae approxmaton s H(f = n log(πe 1 log det( post, where det( denote matrx determnant. 4. Mutual Informaton for Atve Sensng Remnd that x (j denote the features n the jth vew for the th sample. In atve sensng, the mutual nformaton (MI between the onsensus vew funton f and the unobserved (sample, vew par x (j D U s the expeted derease n entropy of f when x (j s observed, I(f, x (j = E[H(f ] E[H(f x (j ] = 1 log det( post + 1 E [log det( x(,j post ], where the expetaton s wth respet to p(x (j D O, y, the dstrbuton of the unobserved (sample, vew par gven all the observed pars and avalable outputs. x(,j post s the a posteror preson matrx, derved from Seton 4.1, after one par x (j s observed. The maxmum MI rteron has been used before to dentfy the best unlabeled sample n atve learnng (MaKay, 199. Here we adopt ths rteron and hoose the unobserved par whh maxmzes MI: (, j = arg max I(f, x (j D U x (j = arg max E [log det( x(,j post ]. (5 D U x (j 4.3 Densty Modelng In order to alulate the expetaton n (5, we need a ondtonal densty model for the unobserved pars,.e., p(x (j D O, y. Ths of ourse depends on the type of the features n eah vew, and n ths paper we use a speal Gaussan mxture model (GMM. Let the jont nput densty be p(x (1,..., x (m = p(y = +1p(x (1,..., x (m y = +1 + p(y = 1p(x (1,..., x (m y = 1, and eah ondtonal densty takes a omponent-wse fatorzed GMM form, e.g., for postve lass, p(x (1,..., x (m y = +1 = π + N (x (j µ +(j, Σ +(j. j 64

5 Yu, Krshnapuram, Rosales, Rao Here µ +(j and Σ +(j are the mean and ovarane for vew j n omponent, and π + > 0, π+ = 1 are the mxture weghts for the postve lass. Note that although the ondtonal densty for eah mxture omponent s deoupled for dfferent vews, the jont ondtonal densty s not. 3 Under ths model, the jont densty p(x (1,..., x (m s also a GMM, and any margnal (ondtoned on y or not densty s stll a GMM, e.g., p(x (j y = +1 = π+ N (x (j µ +(j, Σ +(j. Now t s easy to alulate p(x (j D O, y. Let x (O be the set of observed vews for x, we need to dstngush two dfferent settngs. When the label y s avalable, e.g., y = +1, we have p(x (j D O, y = p(x (j x (O, y = +1 = π +(j (x (O N (x (j µ +(j, Σ +(j, (6 whh s agan a GMM model, wth the mxng weghts π +(j (x (O = π + k O N (x(k µ +(k, Σ +(k p(x (O y = +1 When the label y s not avalable, we need to ntegrate out the labelng unertanty and ompute p(x (j D O, y = p(x (j x (O = p(y = +1p(x (j x (O, y = +1 + p(y = 1p(x (j x (O, y = 1, whh s a GMM model as well, as seen from ( Expetaton Calulaton We are now ready to ompute the expetaton n (5. The a posteror preson matrx after one (sample, vew par x (j s observed, x(,j post, s alulated as x(,j post = Φ + (K x(,j 1 = Φ + A x(,j j + k j A k, (7 where K x(,j are the new K and A j matres after the new par s observed. Based on (3, to alulate A x(,j j and A x(,j j we need to realulate the kernel for the jth vew, K j, after an addtonal par x (j s observed. Ths s smply done by addng one more row and olumn to the old K j as: [ K x(,j Kj b j j = 3 A straghtforward EM algorthm an be derved to estmate all these parameters. When labels are only avalable for a very lmted number of samples, one mght assume a full generatve GMM model negletng the dependeny on labels (nstead of a ondtonal GMM model. b j a j ],. where a j = κ j (x (j, x (j lth entry as κ j (x (j l part of A x(,j j R, and b j R nj has the, x (j. Then from (3, the non-zero s alulated as ( [ 1 K x(,j j + σj Kj + σ I = j I b j b j a j + σj ] 1 (8 [ Γj + λ = j Γ j b j b j Γ j λ j Γ j b j λ j b j Γ j λ j usng the blok-matrx nverse formula, where Γ j = (K j + σ j I 1 and λ j = 1 a j +σ j b j Γ jb j. As seen from (7 and (8, t s dffult to dretly alulate the expetaton n (5. Sne for any matrx Q, E [log det(q] log det(e [Q] due to the onavty of log det(, we alternatvely take the upper bound log det( E [ x(,j post ] as the seleton rtera. From (7 and (8, ths redues to omputng E[λ j ], E[λ j b j ] and E[λ j b j b j ], where the expetatons are wth respet to p(x (j D O, y, a GMM model (f. Seton 4.3. In general one needs to alulate these expetatons numerally, as dfferent kernel funtons lead to dfferent ntegrals. As another approxmaton one mght assume eah of the GMM omponent s a pont-mass suh that the mean s used for the alulaton. 4.5 Dsusson The mutual nformaton based approah dretly measures the expeted nformaton gan for every (sample, vew par. A dfferent (and smpler approah s based on the predtve unertanty, n whh the most unertan sample (after the urrent lassfer s traned s seleted for vew aquston (see also (Melvlle et al., 004. Ths unertanty (.e., predtve varane s estmated as the dagonal entres of the a posteror ovarane matrx ( post 1, as seen from (4. However t s not lear what vew to aqure for ths sample (f more than one vew s mssng for the sample. The advantage of ths approah s that no densty modelng s neessary for unobserved vews. 5 Empral Study For the followng experments we are gven a lassfaton task wth mssng vews. At eah teraton we are allowed to selet an unobserved (sample, vew par for sensng (.e., feature aquston. We ompare the lassfaton performane on unlabeled data usng the followng three sensng approahes: Atve Sensng MI: The par s seleted based on the mutual nformaton rtera (5. Atve Sensng VAR: A sample s seleted frst whh has the maxmal predtve varane and has ], 643

6 Atve Sensng x ( x (1 x ( x (1 Test AUC Sore Atve Sensng MI Atve Sensng VAR Random Sensng Learn wth Full Features Number of aqured (sample, vew pars n order Fgure 3: Toy example for atve sensng (left. Bg red-square/blue-trangular markers denote +1/ 1 labeled ponts; remanng ponts are unlabeled. Data are sampled from two Gaussans wth mean (,, (, and unt varane. After hdng some of the features the data look lke (mddle wth removed features replaed wth 0. Comparson of atve sensng wth random sensng s shown on the rght. mssng vews, and then one of the mssng vews s randomly seleted for sensng. Random Sensng: A random unobserved (sample, vew par s seleted for sensng. After the par s aqured n eah teraton, learnng s done usng the Bayesan o-tranng model. Note that for all the three approahes, the aqured (sample, vew par wll affet all the samples n the next teraton (va the o-tranng kernel. In atve sensng wth MI, we use EM algorthm to learn the GMM struture wth mssng entres, and the GMM model s re-estmated after eah par s seleted and flled n (ths s fast thanks to the nremental updates n the EM algorthm. 5.1 Toy Data We frst llustrate atve sensng wth a toy example. Fg. 3 (left shows a well separated two-lass problem whh was used n (Yu et al., 008, wth bg squares and trangles representng the labeled postve and negatve samples, and blak dots denotng unlabeled ponts. To smulate our atve sensng experment, we randomly hde one of the two features of eah sample wth 40% probablty eah, and wth 0% probablty observe both features. The fnal nomplete tranng data are shown n Fg. 3 (mddle wth the nomplete samples shown along the frst or seond axs. It an be seen that only fully observed postve and negatve samples are avalable. For atve sensng MI we use the Gaussan kernel wth wdth 0.5, and let the GMM hoose the number of lusters automatally. Standard transdutve settng s appled where all the unlabeled data are avalable for o-tranng kernel alulaton. In Fg. 3 (rght we ompare atve sensng wth random sensng, usng the Area Under the ROC Curve (AUC for the unlabeled data. The x-axs labels eah aqured par n order. Ths ndates that atve sensng s muh better than random sensng n mprovng the lassfaton performane. The Bayes optmal auray (reahable when there s no mssng data s reahed by the 16th query by atve sensng whereas random sensng mproves muh slower wth the number of aqured pars. The two atve sensng algorthms show smlar results. 5. Survval Predton for Lung Caner We onsder -year survval predton for advaned non-small ell lung aner (NSCLC patents treated wth (hemo-radotherapy. Ths s urrently a very hallengng problem n lnal researh, sne the prognoss of ths group of patents s very poor (less than 40% survve two years. Currently most models n the lterature rely on varous lnal fators of the patent suh as gender and the WHO performane status. Very reently, magng-related fators suh as the sze of the tumor and the number of postve lymph node statons are shown to be better predtors (Dehng-Oberje et al., 009. However, t s expensve to obtan the mages and to manually measure these fators. Therefore we study how to selet the best set of patents to go through magng to get addtonal features. All the relevant fators are lsted n Fg. 4 (left wth short desrptons. These fators are all known to be predtve (Dehng-Oberje et al., 009. From Bayesan o-tranng pont of vew we have vews, wth 3 features n the frst (lnal feature vew and features n the seond (magng-based feature vew. Our study ontans 33 advaned NSCLC patents treated at the MAASTRO Cln n the Netherlands from 00 to 006, among whh 77 survved years (labeled +1. All the features are avalable for these patents, and are normalzed to have zero mean and 644

7 Yu, Krshnapuram, Rosales, Rao 0.67 Features for NSCLC -years Survval Predton Feature Desrpton Vew GENDER 1-Male, -Female 1st WHO WHO performane status 1st FEV1 Fored expratory volume n 1 seond 1st GTV Gross tumor volume nd NPLN Number of postve lymph node statons nd Test AUC Sore Atve Sensng MI Atve Sensng VAR Random Sensng Number of aqured (sample, vew pars n order Fgure 4: Experments on NSCLC survval predton. The features for the vews are lsted n the left table, and the performane omparson of atve sensng and random sensng s shown n the rght fgure. As baselnes, tranng wth full features (.e., no sensng needed yelds 0.73; tranng wth mean mputaton (.e., usng the mean of eah feature to fll n the mssng entres yelds 0.6. unt varane before tranng. We randomly hoose 30% of the patents as tranng samples (wth labels known, and the rest 70% as unlabeled samples. We use lnear kernel for eah vew, and let the GMM algorthm automatally hoose the number of lusters. As the atve sensng setup, the frst vew s avalable for all the patents, and the seond vew s avalable only for randomly hosen 50% patents. So our goal s to sequentally selet patents to aqure features n vew, suh that the overall lassfer performane s maxmzed. Fg. 4 (rght shows the test AUC sores (wth error-bars of atve sensng and random sensng, wth dfferent number of aqured pars. Performane s averaged over 0 runs wth randomly hosen 50% patents at the start. Atve sensng n general yelds better performane, and s sgnfantly better after 5 frst pars. Atve sensng based on MI and VAR agan yeld very smlar results. We have also tested other expermental settngs, and the omparson s not senstve to ths setup. 5.3 Pathologal Complete Response (pcr Predton for Retal Caner Our seond example s to predt tumor response after hemo-radotherapy for loally advaned retal aner. Ths s mportant n ndvdualzng treatment strateges, sne patents wth a patholog omplete response (pcr after therapy,.e., wth no evdene of vable tumor on patholog analyss, would need less nvasve surgery or another radotherapy strategy nstead of reseton. Most avalable models ombne lnal fators suh as gender and age, and pretreatment magng-based fators suh as tumor length and SUV max (from CT/PET magng, but t s expeted that addng magng data olleted after therapy would lead to a better predtve model (though wth a hgher ost. In ths study we show how to effetvely selet patents to go through pre-treatment and post-treatment magng to better predt pcr. We use the data from (Capr et al., 007 whh ontans 78 prospetvely olleted retal aner patents. All patents underwent a CT/PET san before treatment and 4 days after treatment, and 1 of them had pcr (labeled +1. We splt all the features nto 3 vews (lnal, pre-treatment magng, post-treatment magng, and the features are lsted n Fg. 5 (left. For atve sensng, we assume that all the (labeled or unlabeled patents have vew 1 features avalable, 70% of the patents have vew features avalable, and 40% of the patents have vew 3 features avalable. Ths s to aount for the fat that vew 3 features are most expensve to get. All the other settngs are the same as the NSCLC survval predton study. Fg. 5 (rght shows the performane omparson of atve sensng wth random sensng, and t s seen that after about 18 par aqustons, atve sensng s sgnfantly better than random sensng. Atve sensng MI and VAR share a smlar trend, and the MI based atve sensng s overall better than VAR based atve sensng. The dfferene s however not statstally sgnfant. The optmal AUC (when there are no mssng features s shown as a dotted lne, and we see that wth around 34 atvely aqured pars, atve sensng an almost aheve the optmum. It takes however muh longer for random sensng to reah ths performane. 6 Conluson and Future Work Ths paper makes two prmary ontrbutons. Frst of all, for the purpose of atve sensng we extend 645

8 Atve Sensng Features for pcr Predton n Retal Caner Feature Desrpton Vew GENDER 1-Male, -Female 1st AGE Age n years 1st STAGE Stagng of aner 1st LENGTH Max dameter of the tumor nd SUVPre SUV max before treatment nd SUV Absolute dfferene of SUV max before and after treatment 3rd RI Response Index, SUV n % 3rd Test AUC Sore Atve Sensng MI Atve Sensng VAR Random Sensng Learn wth Full Features Number of aqured (sample, vew pars n order Fgure 5: Experments on pcr predton for retal aner. The features for the 3 vews are lsted n the left table, and the performane omparson of atve sensng and random sensng s shown n the rght fgure. As baselnes, tranng wth full features (.e., no sensng needed yelds 0.74 (shown as a dotted lne; tranng wth mean mputaton (.e., usng the mean of eah feature to fll n the mssng entres yelds 0.55 (not shown. the Bayesan o-tranng framework to handle real-lfe data where objets are often nompletely haraterzed,.e., only a subset of vews are avalable for ertan samples. Seond, we ntrodue two approahes for atve sensng, based on mutual nformaton and predtve varane, respetvely, whh automatally dedes whh (sample, vew par should be aqured further to get the most beneft. Note that one atvely aqured par would mprove the overall mult-vew lassfaton performane for all the unlabeled samples. Expermental results on two real medal lassfaton problems ndate that the proposed approah s ndeed more aurate than randomly aqurng unobserved (sample, vew pars. As part of the future work, we wll take nto aount the atual ost nvolved n the vew aquston for better deson makng. Ths mght be mportant, for nstane, n medal dagnoss where Ultrasound and MRI ndue qute dfferent osts. Another step s to ombne atve sensng wth atve learnng, suh that one an query both an unobserved (sample, vew par, and an unobserved label. Referenes M. Blg and L. Getoor. VOILA: Effent Feature-value Aquston for Classfaton. In AAAI, 007. A. Blum and T. Mthell. Combnng labeled and unlabeled data wth o-tranng. In COLT, C. Capr, L. Rampn, P. Erba, F. Galeott, G. Crepald, E. Bant, M. Gava, S. Fant, G. Maran, P. Muzzo, and D. Rubello. Sequental FDG-PET/CT relably predts response of loally advaned retal aner to neoadjuvant hemo-radaton therapy. Eur J Nul Med Mol Imagng, 34, 007. D. Cohn, Z. Ghahraman, and M. Jordan. Atve learnng wth statstal models. Journal of Artfal Intellgene Researh, 4:19 145, T. Cover and J. Thomas. Elements of Informaton Theory. Wley Intersene, S. Dasgupta, M. Lttman, and D. MAllester. PAC generalzaton bounds for o-tranng. In NIPS, 001. C. Dehng-Oberje, S. Yu, D. De Ruyssher, S. Meershout, K. van Beek, Y. Levens, J. van Meerbeek, W. de Neve, G. Fung, B. Rao, S. Krshnan, H. van der Wede, and P. Lambn. Development and external valdaton of a predton model for -year survval of non-small ell lung aner patents treated wth (hemo radotherapy. To appear n Int J Radat Onol Bol Phys, 009. V. Fedorov. Theory of Optmal Experments. Aadem Press, 197. P. Flaherty, M. Jordan, and A. Arkn. Robust desgn of bologal experments. In NIPS, 006. A. Krause, A. Sngh, and C. Guestrn. Near-optmal sensor plaements n gaussan proesses:theory, effent algorthms and empral studes. JMLR, 9:35 84, 008. D. Lews and W. Gale. A sequental algorthm for tranng text lassfers. In SIGIR, pages 3 1, D. Lndley. On a measure of the nformaton provded by an experment. Ann. Math. Stat, 7: , D. MaKay. Informaton-based objetve funtons for atve data seleton. Neural Comp., 4: , 199. P. Melvlle, M. Saar-Tsehansky, F. Provost, and R. Mooney. Atve feature-value aquston for lassfer nduton. In ICDM, 004. N. Roy and A. MCallum. Toward optmal atve learnng through samplng estmaton of error reduton. In ICML, pages , 001. S. Seung, M. Opper, and H. Sompolnsky. Query by ommttee. In Ffth Workshop on Computatonal Learnng Theory, pages 87 94, 199. S. Yu, B. Krshnapuram, R. Rosales, H. Stek, and B. Rao. Bayesan o-tranng. In NIPS,

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers II. Random Varables Random varables operate n much the same way as the outcomes or events n some arbtrary sample space the dstncton s that random varables are smply outcomes that are represented numercally.

More information

/ Computational Genomics. Normalization

/ Computational Genomics. Normalization 0-80 /02-70 Computatonal Genomcs Normalzaton Gene Expresson Analyss Model Computatonal nformaton fuson Bologcal regulatory networks Pattern Recognton Data Analyss clusterng, classfcaton normalzaton, mss.

More information

Introduction to PGMs: Discrete Variables. Sargur Srihari

Introduction to PGMs: Discrete Variables. Sargur Srihari Introducton to : Dscrete Varables Sargur srhar@cedar.buffalo.edu Topcs. What are graphcal models (or ) 2. Use of Engneerng and AI 3. Drectonalty n graphs 4. Bayesan Networks 5. Generatve Models and Samplng

More information

Tests for Two Correlations

Tests for Two Correlations PASS Sample Sze Software Chapter 805 Tests for Two Correlatons Introducton The correlaton coeffcent (or correlaton), ρ, s a popular parameter for descrbng the strength of the assocaton between two varables.

More information

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture #21 Scribe: Lawrence Diao April 23, 2013

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture #21 Scribe: Lawrence Diao April 23, 2013 COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture #21 Scrbe: Lawrence Dao Aprl 23, 2013 1 On-Lne Log Loss To recap the end of the last lecture, we have the followng on-lne problem wth N

More information

Putting Your Money Where Your Mouth Is A Betting Platform for Better Prediction. Abstract

Putting Your Money Where Your Mouth Is A Betting Platform for Better Prediction. Abstract Revew of Network Eonoms Vol.6, Issue June 007 Puttng Your Money Where Your Mouth Is A Bettng Platform for Better Predton FANG FANG College of Busness Admnstraton, Calforna State Unversty at San Maros MAXWELL

More information

TCOM501 Networking: Theory & Fundamentals Final Examination Professor Yannis A. Korilis April 26, 2002

TCOM501 Networking: Theory & Fundamentals Final Examination Professor Yannis A. Korilis April 26, 2002 TO5 Networng: Theory & undamentals nal xamnaton Professor Yanns. orls prl, Problem [ ponts]: onsder a rng networ wth nodes,,,. In ths networ, a customer that completes servce at node exts the networ wth

More information

NCCI S 2007 HAZARD GROUP MAPPING

NCCI S 2007 HAZARD GROUP MAPPING NCCI S 2007 HAZARD GROUP MAPPING by John P. Robertson ABSTRACT At the begnnng of 2007, the NCCI mplemented a new seven-hazard-group system, replang the prevous four-hazard-group system. Ths artle desrbes

More information

Sensitivity of health-related scales is a non-decreasing function of

Sensitivity of health-related scales is a non-decreasing function of Senstvty of health-related sales s a non-dereasng funton of ther lasses. Vasleos Maroulas and Demosthenes B. Panagotaos 2,* Insttute for Mathemats and ts Applatons, Unversty of Mnnesota, Mnneapols, USA.

More information

Study on Trade Restrictiveness of Agricultural Policies in Iran

Study on Trade Restrictiveness of Agricultural Policies in Iran Internatonal Journal of Agrultural Sene and Researh Volume, Number 1, Wnter 011(Seral #) Study on Trade Restrtveness of Agrultural Poles n Iran G. rouz 1 *; R. Moghaddas 1 ; S. Yazdan 1: Department of

More information

Strategic Dynamic Sourcing from Competing Suppliers with Transferable Capacity Investment

Strategic Dynamic Sourcing from Competing Suppliers with Transferable Capacity Investment Strateg Dynam Sourng from Competng Supplers wth Transferable Capaty nvestment Cuhong L Laurens G. Debo Shool of Busness, Unversty of Connetut, Storrs, CT 0669 The Booth Shool of Busness, Unversty of Chago,

More information

Appendix - Normally Distributed Admissible Choices are Optimal

Appendix - Normally Distributed Admissible Choices are Optimal Appendx - Normally Dstrbuted Admssble Choces are Optmal James N. Bodurtha, Jr. McDonough School of Busness Georgetown Unversty and Q Shen Stafford Partners Aprl 994 latest revson September 00 Abstract

More information

An Economic Analysis of Interconnection Arrangements between Internet Backbone Providers

An Economic Analysis of Interconnection Arrangements between Internet Backbone Providers ONLINE SUPPLEMENT TO An Eonom Analyss of Interonneton Arrangements between Internet Bakbone Provders Yong Tan Unversty of Washngton Busness Shool Box 353 Seattle Washngton 9895-3 ytan@uwashngtonedu I Robert

More information

3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. Truncated standard normal distribution for a = 0.5, 0, and 0.5. CDS Mphil Econometrics

3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. Truncated standard normal distribution for a = 0.5, 0, and 0.5. CDS Mphil Econometrics Lmted Dependent Varable Models: Tobt an Plla N 1 CDS Mphl Econometrcs Introducton Lmted Dependent Varable Models: Truncaton and Censorng Maddala, G. 1983. Lmted Dependent and Qualtatve Varables n Econometrcs.

More information

ENHANCING BASEL METHOD VIA CONDITIONAL DISTRIBUTIONS THAT CAPTURE STRONGER CONNECTION AMONG CREDIT LOSSES IN DOWNTURNS

ENHANCING BASEL METHOD VIA CONDITIONAL DISTRIBUTIONS THAT CAPTURE STRONGER CONNECTION AMONG CREDIT LOSSES IN DOWNTURNS ENHANCING BASEL METHOD VIA CONDITIONAL DISTRIBUTIONS THAT CAPTURE STRONGER CONNECTION AMONG CREDIT LOSSES IN DOWNTURNS Fernando F. Morera Credt Researh Centre, Unversty of Ednburgh Emal: F.F.Morera@sms.ed.a.uk

More information

Linear Combinations of Random Variables and Sampling (100 points)

Linear Combinations of Random Variables and Sampling (100 points) Economcs 30330: Statstcs for Economcs Problem Set 6 Unversty of Notre Dame Instructor: Julo Garín Sprng 2012 Lnear Combnatons of Random Varables and Samplng 100 ponts 1. Four-part problem. Go get some

More information

15-451/651: Design & Analysis of Algorithms January 22, 2019 Lecture #3: Amortized Analysis last changed: January 18, 2019

15-451/651: Design & Analysis of Algorithms January 22, 2019 Lecture #3: Amortized Analysis last changed: January 18, 2019 5-45/65: Desgn & Analyss of Algorthms January, 09 Lecture #3: Amortzed Analyss last changed: January 8, 09 Introducton In ths lecture we dscuss a useful form of analyss, called amortzed analyss, for problems

More information

Multifactor Term Structure Models

Multifactor Term Structure Models 1 Multfactor Term Structure Models A. Lmtatons of One-Factor Models 1. Returns on bonds of all maturtes are perfectly correlated. 2. Term structure (and prces of every other dervatves) are unquely determned

More information

The vertical differentiation model in the insurance market

The vertical differentiation model in the insurance market The vertal dfferentaton model n the nsurane market Mahto Okura * bstrat Ths note exlores the vertal dfferentaton model n the nsurane market. The man results are as follows. Frst, the eulbrum re dfferental

More information

Reducing the Space-Time Complexity of the CMA-ES

Reducing the Space-Time Complexity of the CMA-ES Redung the Spae-Tme Complexty of the -ES James N Knght and Monte Lunaek Computer Sene Department Colorado State Unversty Fort Collns, CO 8053-1873 nate@solostateedu, lunaek@solostateedu ABSTRACT A lmted

More information

Supplementary material for Non-conjugate Variational Message Passing for Multinomial and Binary Regression

Supplementary material for Non-conjugate Variational Message Passing for Multinomial and Binary Regression Supplementary materal for Non-conjugate Varatonal Message Passng for Multnomal and Bnary Regresson October 9, 011 1 Alternatve dervaton We wll focus on a partcular factor f a and varable x, wth the am

More information

Elements of Economic Analysis II Lecture VI: Industry Supply

Elements of Economic Analysis II Lecture VI: Industry Supply Elements of Economc Analyss II Lecture VI: Industry Supply Ka Hao Yang 10/12/2017 In the prevous lecture, we analyzed the frm s supply decson usng a set of smple graphcal analyses. In fact, the dscusson

More information

Quiz on Deterministic part of course October 22, 2002

Quiz on Deterministic part of course October 22, 2002 Engneerng ystems Analyss for Desgn Quz on Determnstc part of course October 22, 2002 Ths s a closed book exercse. You may use calculators Grade Tables There are 90 ponts possble for the regular test, or

More information

Raising Food Prices and Welfare Change: A Simple Calibration. Xiaohua Yu

Raising Food Prices and Welfare Change: A Simple Calibration. Xiaohua Yu Rasng Food Prces and Welfare Change: A Smple Calbraton Xaohua Yu Professor of Agrcultural Economcs Courant Research Centre Poverty, Equty and Growth Unversty of Göttngen CRC-PEG, Wlhelm-weber-Str. 2 3773

More information

Mode is the value which occurs most frequency. The mode may not exist, and even if it does, it may not be unique.

Mode is the value which occurs most frequency. The mode may not exist, and even if it does, it may not be unique. 1.7.4 Mode Mode s the value whch occurs most frequency. The mode may not exst, and even f t does, t may not be unque. For ungrouped data, we smply count the largest frequency of the gven value. If all

More information

OCR Statistics 1 Working with data. Section 2: Measures of location

OCR Statistics 1 Working with data. Section 2: Measures of location OCR Statstcs 1 Workng wth data Secton 2: Measures of locaton Notes and Examples These notes have sub-sectons on: The medan Estmatng the medan from grouped data The mean Estmatng the mean from grouped data

More information

Foundations of Machine Learning II TP1: Entropy

Foundations of Machine Learning II TP1: Entropy Foundatons of Machne Learnng II TP1: Entropy Gullaume Charpat (Teacher) & Gaétan Marceau Caron (Scrbe) Problem 1 (Gbbs nequalty). Let p and q two probablty measures over a fnte alphabet X. Prove that KL(p

More information

Consumption Based Asset Pricing

Consumption Based Asset Pricing Consumpton Based Asset Prcng Mchael Bar Aprl 25, 208 Contents Introducton 2 Model 2. Prcng rsk-free asset............................... 3 2.2 Prcng rsky assets................................ 4 2.3 Bubbles......................................

More information

OPERATIONS RESEARCH. Game Theory

OPERATIONS RESEARCH. Game Theory OPERATIONS RESEARCH Chapter 2 Game Theory Prof. Bbhas C. Gr Department of Mathematcs Jadavpur Unversty Kolkata, Inda Emal: bcgr.umath@gmal.com 1.0 Introducton Game theory was developed for decson makng

More information

Data Mining Linear and Logistic Regression

Data Mining Linear and Logistic Regression 07/02/207 Data Mnng Lnear and Logstc Regresson Mchael L of 26 Regresson In statstcal modellng, regresson analyss s a statstcal process for estmatng the relatonshps among varables. Regresson models are

More information

Enhancment of Inventory Management Approaches in Vehicle Routing-Cross Docking Problems

Enhancment of Inventory Management Approaches in Vehicle Routing-Cross Docking Problems Enhanment of Inventory Management Approahes n Vehle Routng-Cross Dong Problems Mahd Alnaghan*, Hamed Amanpour*, Erfan Babaee rolaee* *Department of Industral and Systems Engneerng, Isfahan Unversty of

More information

Applications of Myerson s Lemma

Applications of Myerson s Lemma Applcatons of Myerson s Lemma Professor Greenwald 28-2-7 We apply Myerson s lemma to solve the sngle-good aucton, and the generalzaton n whch there are k dentcal copes of the good. Our objectve s welfare

More information

Scribe: Chris Berlind Date: Feb 1, 2010

Scribe: Chris Berlind Date: Feb 1, 2010 CS/CNS/EE 253: Advanced Topcs n Machne Learnng Topc: Dealng wth Partal Feedback #2 Lecturer: Danel Golovn Scrbe: Chrs Berlnd Date: Feb 1, 2010 8.1 Revew In the prevous lecture we began lookng at algorthms

More information

Lecture 7. We now use Brouwer s fixed point theorem to prove Nash s theorem.

Lecture 7. We now use Brouwer s fixed point theorem to prove Nash s theorem. Topcs on the Border of Economcs and Computaton December 11, 2005 Lecturer: Noam Nsan Lecture 7 Scrbe: Yoram Bachrach 1 Nash s Theorem We begn by provng Nash s Theorem about the exstance of a mxed strategy

More information

3: Central Limit Theorem, Systematic Errors

3: Central Limit Theorem, Systematic Errors 3: Central Lmt Theorem, Systematc Errors 1 Errors 1.1 Central Lmt Theorem Ths theorem s of prme mportance when measurng physcal quanttes because usually the mperfectons n the measurements are due to several

More information

Elton, Gruber, Brown, and Goetzmann. Modern Portfolio Theory and Investment Analysis, 7th Edition. Solutions to Text Problems: Chapter 9

Elton, Gruber, Brown, and Goetzmann. Modern Portfolio Theory and Investment Analysis, 7th Edition. Solutions to Text Problems: Chapter 9 Elton, Gruber, Brown, and Goetzmann Modern Portfolo Theory and Investment Analyss, 7th Edton Solutons to Text Problems: Chapter 9 Chapter 9: Problem In the table below, gven that the rskless rate equals

More information

The Integration of the Israel Labour Force Survey with the National Insurance File

The Integration of the Israel Labour Force Survey with the National Insurance File The Integraton of the Israel Labour Force Survey wth the Natonal Insurance Fle Natale SHLOMO Central Bureau of Statstcs Kanfey Nesharm St. 66, corner of Bach Street, Jerusalem Natales@cbs.gov.l Abstact:

More information

MgtOp 215 Chapter 13 Dr. Ahn

MgtOp 215 Chapter 13 Dr. Ahn MgtOp 5 Chapter 3 Dr Ahn Consder two random varables X and Y wth,,, In order to study the relatonshp between the two random varables, we need a numercal measure that descrbes the relatonshp The covarance

More information

Analysis of Variance and Design of Experiments-II

Analysis of Variance and Design of Experiments-II Analyss of Varance and Desgn of Experments-II MODULE VI LECTURE - 4 SPLIT-PLOT AND STRIP-PLOT DESIGNS Dr. Shalabh Department of Mathematcs & Statstcs Indan Insttute of Technology Kanpur An example to motvate

More information

Likelihood Fits. Craig Blocker Brandeis August 23, 2004

Likelihood Fits. Craig Blocker Brandeis August 23, 2004 Lkelhood Fts Crag Blocker Brandes August 23, 2004 Outlne I. What s the queston? II. Lkelhood Bascs III. Mathematcal Propertes IV. Uncertantes on Parameters V. Mscellaneous VI. Goodness of Ft VII. Comparson

More information

CHAPTER 3: BAYESIAN DECISION THEORY

CHAPTER 3: BAYESIAN DECISION THEORY CHATER 3: BAYESIAN DECISION THEORY Decson makng under uncertanty 3 rogrammng computers to make nference from data requres nterdscplnary knowledge from statstcs and computer scence Knowledge of statstcs

More information

A Case Study for Optimal Dynamic Simulation Allocation in Ordinal Optimization 1

A Case Study for Optimal Dynamic Simulation Allocation in Ordinal Optimization 1 A Case Study for Optmal Dynamc Smulaton Allocaton n Ordnal Optmzaton Chun-Hung Chen, Dongha He, and Mchael Fu 4 Abstract Ordnal Optmzaton has emerged as an effcent technque for smulaton and optmzaton.

More information

CHAPTER 9 FUNCTIONAL FORMS OF REGRESSION MODELS

CHAPTER 9 FUNCTIONAL FORMS OF REGRESSION MODELS CHAPTER 9 FUNCTIONAL FORMS OF REGRESSION MODELS QUESTIONS 9.1. (a) In a log-log model the dependent and all explanatory varables are n the logarthmc form. (b) In the log-ln model the dependent varable

More information

Chapter 5 Student Lecture Notes 5-1

Chapter 5 Student Lecture Notes 5-1 Chapter 5 Student Lecture Notes 5-1 Basc Busness Statstcs (9 th Edton) Chapter 5 Some Important Dscrete Probablty Dstrbutons 004 Prentce-Hall, Inc. Chap 5-1 Chapter Topcs The Probablty Dstrbuton of a Dscrete

More information

CS 286r: Matching and Market Design Lecture 2 Combinatorial Markets, Walrasian Equilibrium, Tâtonnement

CS 286r: Matching and Market Design Lecture 2 Combinatorial Markets, Walrasian Equilibrium, Tâtonnement CS 286r: Matchng and Market Desgn Lecture 2 Combnatoral Markets, Walrasan Equlbrum, Tâtonnement Matchng and Money Recall: Last tme we descrbed the Hungaran Method for computng a maxmumweght bpartte matchng.

More information

Problem Set 6 Finance 1,

Problem Set 6 Finance 1, Carnege Mellon Unversty Graduate School of Industral Admnstraton Chrs Telmer Wnter 2006 Problem Set 6 Fnance, 47-720. (representatve agent constructon) Consder the followng two-perod, two-agent economy.

More information

Final Exam. 7. (10 points) Please state whether each of the following statements is true or false. No explanation needed.

Final Exam. 7. (10 points) Please state whether each of the following statements is true or false. No explanation needed. Fnal Exam Fall 4 Econ 8-67 Closed Book. Formula Sheet Provded. Calculators OK. Tme Allowed: hours Please wrte your answers on the page below each queston. (5 ponts) Assume that the rsk-free nterest rate

More information

Finance 402: Problem Set 1 Solutions

Finance 402: Problem Set 1 Solutions Fnance 402: Problem Set 1 Solutons Note: Where approprate, the fnal answer for each problem s gven n bold talcs for those not nterested n the dscusson of the soluton. 1. The annual coupon rate s 6%. A

More information

Ch Rival Pure private goods (most retail goods) Non-Rival Impure public goods (internet service)

Ch Rival Pure private goods (most retail goods) Non-Rival Impure public goods (internet service) h 7 1 Publc Goods o Rval goods: a good s rval f ts consumpton by one person precludes ts consumpton by another o Excludable goods: a good s excludable f you can reasonably prevent a person from consumng

More information

Appendix for Solving Asset Pricing Models when the Price-Dividend Function is Analytic

Appendix for Solving Asset Pricing Models when the Price-Dividend Function is Analytic Appendx for Solvng Asset Prcng Models when the Prce-Dvdend Functon s Analytc Ovdu L. Caln Yu Chen Thomas F. Cosmano and Alex A. Hmonas January 3, 5 Ths appendx provdes proofs of some results stated n our

More information

CS54701: Information Retrieval

CS54701: Information Retrieval CS54701: Informaton Retreval Federated Search 22 March 2016 Prof. Chrs Clfton Federated Search Outlne Introducton to federated search Man research problems Resource Representaton Resource Selecton Results

More information

Bayesian belief networks

Bayesian belief networks CS 2750 achne Learnng Lecture 12 ayesan belef networks los Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square CS 2750 achne Learnng Densty estmaton Data: D { D1 D2.. Dn} D x a vector of attrbute values ttrbutes:

More information

A Comparison of Statistical Methods in Interrupted Time Series Analysis to Estimate an Intervention Effect

A Comparison of Statistical Methods in Interrupted Time Series Analysis to Estimate an Intervention Effect Transport and Road Safety (TARS) Research Joanna Wang A Comparson of Statstcal Methods n Interrupted Tme Seres Analyss to Estmate an Interventon Effect Research Fellow at Transport & Road Safety (TARS)

More information

A Bayesian Classifier for Uncertain Data

A Bayesian Classifier for Uncertain Data A Bayesan Classfer for Uncertan Data Bao Qn, Yun Xa Department of Computer Scence Indana Unversty - Purdue Unversty Indanapols, USA {baoqn, yxa}@cs.upu.edu Fang L Department of Mathematcal Scences Indana

More information

Equilibrium in Prediction Markets with Buyers and Sellers

Equilibrium in Prediction Markets with Buyers and Sellers Equlbrum n Predcton Markets wth Buyers and Sellers Shpra Agrawal Nmrod Megddo Benamn Armbruster Abstract Predcton markets wth buyers and sellers of contracts on multple outcomes are shown to have unque

More information

Random Variables. b 2.

Random Variables. b 2. Random Varables Generally the object of an nvestgators nterest s not necessarly the acton n the sample space but rather some functon of t. Techncally a real valued functon or mappng whose doman s the sample

More information

occurrence of a larger storm than our culvert or bridge is barely capable of handling? (what is The main question is: What is the possibility of

occurrence of a larger storm than our culvert or bridge is barely capable of handling? (what is The main question is: What is the possibility of Module 8: Probablty and Statstcal Methods n Water Resources Engneerng Bob Ptt Unversty of Alabama Tuscaloosa, AL Flow data are avalable from numerous USGS operated flow recordng statons. Data s usually

More information

Fast Laplacian Solvers by Sparsification

Fast Laplacian Solvers by Sparsification Spectral Graph Theory Lecture 19 Fast Laplacan Solvers by Sparsfcaton Danel A. Spelman November 9, 2015 Dsclamer These notes are not necessarly an accurate representaton of what happened n class. The notes

More information

An Application of Alternative Weighting Matrix Collapsing Approaches for Improving Sample Estimates

An Application of Alternative Weighting Matrix Collapsing Approaches for Improving Sample Estimates Secton on Survey Research Methods An Applcaton of Alternatve Weghtng Matrx Collapsng Approaches for Improvng Sample Estmates Lnda Tompkns 1, Jay J. Km 2 1 Centers for Dsease Control and Preventon, atonal

More information

EDC Introduction

EDC Introduction .0 Introducton EDC3 In the last set of notes (EDC), we saw how to use penalty factors n solvng the EDC problem wth losses. In ths set of notes, we want to address two closely related ssues. What are, exactly,

More information

Information Flow and Recovering the. Estimating the Moments of. Normality of Asset Returns

Information Flow and Recovering the. Estimating the Moments of. Normality of Asset Returns Estmatng the Moments of Informaton Flow and Recoverng the Normalty of Asset Returns Ané and Geman (Journal of Fnance, 2000) Revsted Anthony Murphy, Nuffeld College, Oxford Marwan Izzeldn, Unversty of Lecester

More information

FORD MOTOR CREDIT COMPANY SUGGESTED ANSWERS. Richard M. Levich. New York University Stern School of Business. Revised, February 1999

FORD MOTOR CREDIT COMPANY SUGGESTED ANSWERS. Richard M. Levich. New York University Stern School of Business. Revised, February 1999 FORD MOTOR CREDIT COMPANY SUGGESTED ANSWERS by Rchard M. Levch New York Unversty Stern School of Busness Revsed, February 1999 1 SETTING UP THE PROBLEM The bond s beng sold to Swss nvestors for a prce

More information

Production and Supply Chain Management Logistics. Paolo Detti Department of Information Engeneering and Mathematical Sciences University of Siena

Production and Supply Chain Management Logistics. Paolo Detti Department of Information Engeneering and Mathematical Sciences University of Siena Producton and Supply Chan Management Logstcs Paolo Dett Department of Informaton Engeneerng and Mathematcal Scences Unversty of Sena Convergence and complexty of the algorthm Convergence of the algorthm

More information

Graphical Methods for Survival Distribution Fitting

Graphical Methods for Survival Distribution Fitting Graphcal Methods for Survval Dstrbuton Fttng In ths Chapter we dscuss the followng two graphcal methods for survval dstrbuton fttng: 1. Probablty Plot, 2. Cox-Snell Resdual Method. Probablty Plot: The

More information

Economic Design of Short-Run CSP-1 Plan Under Linear Inspection Cost

Economic Design of Short-Run CSP-1 Plan Under Linear Inspection Cost Tamkang Journal of Scence and Engneerng, Vol. 9, No 1, pp. 19 23 (2006) 19 Economc Desgn of Short-Run CSP-1 Plan Under Lnear Inspecton Cost Chung-Ho Chen 1 * and Chao-Yu Chou 2 1 Department of Industral

More information

Measures of Spread IQR and Deviation. For exam X, calculate the mean, median and mode. For exam Y, calculate the mean, median and mode.

Measures of Spread IQR and Deviation. For exam X, calculate the mean, median and mode. For exam Y, calculate the mean, median and mode. Part 4 Measures of Spread IQR and Devaton In Part we learned how the three measures of center offer dfferent ways of provdng us wth a sngle representatve value for a data set. However, consder the followng

More information

Tests for Two Ordered Categorical Variables

Tests for Two Ordered Categorical Variables Chapter 253 Tests for Two Ordered Categorcal Varables Introducton Ths module computes power and sample sze for tests of ordered categorcal data such as Lkert scale data. Assumng proportonal odds, such

More information

Problem Set #4 Solutions

Problem Set #4 Solutions 4.0 Sprng 00 Page Problem Set #4 Solutons Problem : a) The extensve form of the game s as follows: (,) Inc. (-,-) Entrant (0,0) Inc (5,0) Usng backwards nducton, the ncumbent wll always set hgh prces,

More information

Chapter 3 Descriptive Statistics: Numerical Measures Part B

Chapter 3 Descriptive Statistics: Numerical Measures Part B Sldes Prepared by JOHN S. LOUCKS St. Edward s Unversty Slde 1 Chapter 3 Descrptve Statstcs: Numercal Measures Part B Measures of Dstrbuton Shape, Relatve Locaton, and Detectng Outlers Eploratory Data Analyss

More information

ECE 586GT: Problem Set 2: Problems and Solutions Uniqueness of Nash equilibria, zero sum games, evolutionary dynamics

ECE 586GT: Problem Set 2: Problems and Solutions Uniqueness of Nash equilibria, zero sum games, evolutionary dynamics Unversty of Illnos Fall 08 ECE 586GT: Problem Set : Problems and Solutons Unqueness of Nash equlbra, zero sum games, evolutonary dynamcs Due: Tuesday, Sept. 5, at begnnng of class Readng: Course notes,

More information

Chapter 3 Student Lecture Notes 3-1

Chapter 3 Student Lecture Notes 3-1 Chapter 3 Student Lecture otes 3-1 Busness Statstcs: A Decson-Makng Approach 6 th Edton Chapter 3 Descrbng Data Usng umercal Measures 005 Prentce-Hall, Inc. Chap 3-1 Chapter Goals After completng ths chapter,

More information

MULTIPLE CURVE CONSTRUCTION

MULTIPLE CURVE CONSTRUCTION MULTIPLE CURVE CONSTRUCTION RICHARD WHITE 1. Introducton In the post-credt-crunch world, swaps are generally collateralzed under a ISDA Master Agreement Andersen and Pterbarg p266, wth collateral rates

More information

Carbon Price Risk Influence on GenCo s Portfolio Optimization

Carbon Price Risk Influence on GenCo s Portfolio Optimization Carbon Pre Rsk nfluene on enco s Portfolo Optmzaton Parul Mathura Department of Eletral Engneerng, Malavya Natonal nsttute of Tehnology Japur, Japur, nda parulvj4@gmal.om Abstrat Worldwde stress on low

More information

Financial mathematics

Financial mathematics Fnancal mathematcs Jean-Luc Bouchot jean-luc.bouchot@drexel.edu February 19, 2013 Warnng Ths s a work n progress. I can not ensure t to be mstake free at the moment. It s also lackng some nformaton. But

More information

A Set of new Stochastic Trend Models

A Set of new Stochastic Trend Models A Set of new Stochastc Trend Models Johannes Schupp Longevty 13, Tape, 21 th -22 th September 2017 www.fa-ulm.de Introducton Uncertanty about the evoluton of mortalty Measure longevty rsk n penson or annuty

More information

- contrast so-called first-best outcome of Lindahl equilibrium with case of private provision through voluntary contributions of households

- contrast so-called first-best outcome of Lindahl equilibrium with case of private provision through voluntary contributions of households Prvate Provson - contrast so-called frst-best outcome of Lndahl equlbrum wth case of prvate provson through voluntary contrbutons of households - need to make an assumpton about how each household expects

More information

Physics 4A. Error Analysis or Experimental Uncertainty. Error

Physics 4A. Error Analysis or Experimental Uncertainty. Error Physcs 4A Error Analyss or Expermental Uncertanty Slde Slde 2 Slde 3 Slde 4 Slde 5 Slde 6 Slde 7 Slde 8 Slde 9 Slde 0 Slde Slde 2 Slde 3 Slde 4 Slde 5 Slde 6 Slde 7 Slde 8 Slde 9 Slde 20 Slde 2 Error n

More information

Two-Settlement Electricity Markets with Price Caps and Cournot Generation Firms

Two-Settlement Electricity Markets with Price Caps and Cournot Generation Firms Two-Settlement Eletrty Markets wth Pre Caps and Cournot Generaton Frms Jan Yao, Shmuel S. Oren, Ilan Adler Department of Industral Engneerng and Operatons Researh 4141 Etheverry Hall, Unversty of Calforna

More information

Using Expenditure PPPs for Sectoral Output and Productivity Comparisons

Using Expenditure PPPs for Sectoral Output and Productivity Comparisons Internatonal Comparson Program Chapter 24 Usng Expendture PPPs for Setoral Output and Produtvty Comparsons Robert Inklaar and Marel P. Tmmer Measurng the Sze of the World Eonomy ICP Book Contents Methodology

More information

Optimising a general repair kit problem with a service constraint

Optimising a general repair kit problem with a service constraint Optmsng a general repar kt problem wth a servce constrant Marco Bjvank 1, Ger Koole Department of Mathematcs, VU Unversty Amsterdam, De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands Irs F.A. Vs Department

More information

Introduction to game theory

Introduction to game theory Introducton to game theory Lectures n game theory ECON5210, Sprng 2009, Part 1 17.12.2008 G.B. Ashem, ECON5210-1 1 Overvew over lectures 1. Introducton to game theory 2. Modelng nteractve knowledge; equlbrum

More information

Teaching Note on Factor Model with a View --- A tutorial. This version: May 15, Prepared by Zhi Da *

Teaching Note on Factor Model with a View --- A tutorial. This version: May 15, Prepared by Zhi Da * Copyrght by Zh Da and Rav Jagannathan Teachng Note on For Model th a Ve --- A tutoral Ths verson: May 5, 2005 Prepared by Zh Da * Ths tutoral demonstrates ho to ncorporate economc ves n optmal asset allocaton

More information

A new balancing approach in Balanced Scorecard by applying cooperative game theory

A new balancing approach in Balanced Scorecard by applying cooperative game theory IEOM 011 A new balanng approah n Balaned Soreard by applyng ooperatve game theory M. Jafar-Eskandar Department of Industral Engneerng Iran Unversty of Sene and Tehnology, Narmak, Tehran 16846-13114, Iran

More information

Taxation and Externalities. - Much recent discussion of policy towards externalities, e.g., global warming debate/kyoto

Taxation and Externalities. - Much recent discussion of policy towards externalities, e.g., global warming debate/kyoto Taxaton and Externaltes - Much recent dscusson of polcy towards externaltes, e.g., global warmng debate/kyoto - Increasng share of tax revenue from envronmental taxaton 6 percent n OECD - Envronmental

More information

Parallel Prefix addition

Parallel Prefix addition Marcelo Kryger Sudent ID 015629850 Parallel Prefx addton The parallel prefx adder presented next, performs the addton of two bnary numbers n tme of complexty O(log n) and lnear cost O(n). Lets notce the

More information

On the Block-cut Transformation Graphs

On the Block-cut Transformation Graphs Journal of Couter and Matheatal Senes, Vol.66,354-36, June 015 An Internatonal Researh Journal, www.oath-ournal.org ISSN 0976-577 Prnt ISSN 319-8133 Onlne On the Blok-ut Transforaton Grahs B. Basaanagoud

More information

A Bootstrap Confidence Limit for Process Capability Indices

A Bootstrap Confidence Limit for Process Capability Indices A ootstrap Confdence Lmt for Process Capablty Indces YANG Janfeng School of usness, Zhengzhou Unversty, P.R.Chna, 450001 Abstract The process capablty ndces are wdely used by qualty professonals as an

More information

Price and Quantity Competition Revisited. Abstract

Price and Quantity Competition Revisited. Abstract rce and uantty Competton Revsted X. Henry Wang Unversty of Mssour - Columba Abstract By enlargng the parameter space orgnally consdered by Sngh and Vves (984 to allow for a wder range of cost asymmetry,

More information

Using Conditional Heteroskedastic

Using Conditional Heteroskedastic ITRON S FORECASTING BROWN BAG SEMINAR Usng Condtonal Heteroskedastc Varance Models n Load Research Sample Desgn Dr. J. Stuart McMenamn March 6, 2012 Please Remember» Phones are Muted: In order to help

More information

Online Appendix for Merger Review for Markets with Buyer Power

Online Appendix for Merger Review for Markets with Buyer Power Onlne Appendx for Merger Revew for Markets wth Buyer Power Smon Loertscher Lesle M. Marx July 23, 2018 Introducton In ths appendx we extend the framework of Loertscher and Marx (forthcomng) to allow two

More information

ISyE 512 Chapter 9. CUSUM and EWMA Control Charts. Instructor: Prof. Kaibo Liu. Department of Industrial and Systems Engineering UW-Madison

ISyE 512 Chapter 9. CUSUM and EWMA Control Charts. Instructor: Prof. Kaibo Liu. Department of Industrial and Systems Engineering UW-Madison ISyE 512 hapter 9 USUM and EWMA ontrol harts Instructor: Prof. Kabo Lu Department of Industral and Systems Engneerng UW-Madson Emal: klu8@wsc.edu Offce: Room 317 (Mechancal Engneerng Buldng) ISyE 512 Instructor:

More information

IND E 250 Final Exam Solutions June 8, Section A. Multiple choice and simple computation. [5 points each] (Version A)

IND E 250 Final Exam Solutions June 8, Section A. Multiple choice and simple computation. [5 points each] (Version A) IND E 20 Fnal Exam Solutons June 8, 2006 Secton A. Multple choce and smple computaton. [ ponts each] (Verson A) (-) Four ndependent projects, each wth rsk free cash flows, have the followng B/C ratos:

More information

Capability Analysis. Chapter 255. Introduction. Capability Analysis

Capability Analysis. Chapter 255. Introduction. Capability Analysis Chapter 55 Introducton Ths procedure summarzes the performance of a process based on user-specfed specfcaton lmts. The observed performance as well as the performance relatve to the Normal dstrbuton are

More information

Creating a zero coupon curve by bootstrapping with cubic splines.

Creating a zero coupon curve by bootstrapping with cubic splines. MMA 708 Analytcal Fnance II Creatng a zero coupon curve by bootstrappng wth cubc splnes. erg Gryshkevych Professor: Jan R. M. Röman 0.2.200 Dvson of Appled Mathematcs chool of Educaton, Culture and Communcaton

More information

BROOKINGS DISCUSSION PAPERS IN INTERNATIONAL ECONOMICS

BROOKINGS DISCUSSION PAPERS IN INTERNATIONAL ECONOMICS BROOKINGS DISCUSSION PAPERS IN INTERNATIONAL ECONOMICS No. 55 CHANGES IN EQUITY RISK PERCEPTIONS: GLOBAL CONSEQUENCES AND POLICY RESPONSES Warwk J. MKbbn and Davd Vnes February Warwk J. MKbbn s a professor

More information

Probability distribution of multi-hop-distance in one-dimensional sensor networks q

Probability distribution of multi-hop-distance in one-dimensional sensor networks q Computer etworks (7) 77 79 www.elsever.com/locate/comnet Probablty dstrbuton of mult-hop-dstance n one-dmensonal sensor networks q Serdar Vural *, Eylem Ekc Department of Electrcal and Computer Engneerng,

More information

Radiative Transfer Equations General

Radiative Transfer Equations General obtan the total ntensty I(r, l, t) as Radatve Transfer Equatons General varables IEF P n physs After Kato, S. et al. 28, Blak-Hole Areton Dsks In hghly energet astronomal systems, suh as areton dsks, matter

More information

Introduction. Chapter 7 - An Introduction to Portfolio Management

Introduction. Chapter 7 - An Introduction to Portfolio Management Introducton In the next three chapters, we wll examne dfferent aspects of captal market theory, ncludng: Brngng rsk and return nto the pcture of nvestment management Markowtz optmzaton Modelng rsk and

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) May 17, 2016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston 2 A B C Blank Queston

More information

Which of the following provides the most reasonable approximation to the least squares regression line? (a) y=50+10x (b) Y=50+x (d) Y=1+50x

Which of the following provides the most reasonable approximation to the least squares regression line? (a) y=50+10x (b) Y=50+x (d) Y=1+50x Whch of the followng provdes the most reasonable approxmaton to the least squares regresson lne? (a) y=50+10x (b) Y=50+x (c) Y=10+50x (d) Y=1+50x (e) Y=10+x In smple lnear regresson the model that s begn

More information