Top-down particle filtering for Bayesian decision trees

Size: px
Start display at page:

Download "Top-down particle filtering for Bayesian decision trees"

Transcription

1 Top-down particle filtering for Bayesian decision trees Balaji Lakshminarayanan 1, Daniel M. Roy 2 and Yee Whye Teh 3 1. Gatsby Unit, UCL, 2. University of Cambridge and 3. University of Oxford

2 Outline Introduction Sequential prior over decision trees Bayesian inference: Top-down particle filtering Experiments Design choices in the SMC algorithm SMC vs MCMC Conclusion

3 Outline Introduction Sequential prior over decision trees Bayesian inference: Top-down particle filtering Experiments Design choices in the SMC algorithm SMC vs MCMC Conclusion

4 Introduction Input: attributes X = {x i } N i=1, labels Y = {y i} N i=1 (i.i.d) y i {1,..., K} (classification) or y i R (regression) Goal: Model p(y x)

5 Introduction Input: attributes X = {x i } N i=1, labels Y = {y i} N i=1 (i.i.d) y i {1,..., K} (classification) or y i R (regression) Goal: Model p(y x) Assume p(y x) is specified by decision tree T Bayesian decision trees: Posterior: p(t Y, X ) p(y T, X ) p(t X ) }{{}}{{} likelihood prior Prediction: p(y x ) = T p(t Y, X )p(y x, T )

6 Example: Classification tree x 1 > θ 0 x 2 > θ 10 θ 11 1 x 2 0 B 0 B 11 B 10 x 1 1 θ: Multinomial parameters at leaf nodes

7 Example: Regression tree x 1 > θ 0 x 2 > θ 10 θ 11 1 x B 0 B 11 7 B 10 x θ: Gaussian parameters at leaf nodes

8 Motivation Classic non-bayesian induction algorithms (e.g. CART) learn a single tree in a top-down manner using greedy heuristics (post-pruning and/or bagging necessary) MCMC for Bayesian decision trees: [Chipman et al., 1998]: local Monte Carlo modifications to the tree structure (less prone to over fitting but slow to mix) Our contribution: Sequential Monte Carlo (SMC) algorithm that approximates the posterior, in a top-down manner Take home message: SMC provides better computation vs predictive performance tradeoff than MCMC

9 Bayesian decision trees: likelihood p(t Y, X ) p(y T, X ) p(t X ) }{{}}{{} likelihood prior

10 Likelihood Assume x n falls in the j th leaf node of T Likelihood for n th data point: p(y n x n, T, θ) = p(y n θ j, x n ) p(y T, X, Θ) = p(y n x n, T, θ) = n p(y n θ j ) j leaves(t) n N(j)

11 Likelihood Assume x n falls in the j th leaf node of T Likelihood for n th data point: p(y n x n, T, θ) = p(y n θ j, x n ) p(y T, X, Θ) = p(y n x n, T, θ) = n p(y n θ j ) j leaves(t) n N(j) Better: integrate out θ j, use marginal likelihood p(y T, X ) = p(y n θ j )p(θ j )dθ j j leaves(t) θ j n N(j) Classification: Dirichlet - Multinomial Regression: Normal - Normal Inverse Gamma

12 Outline Introduction Sequential prior over decision trees Bayesian inference: Top-down particle filtering Experiments Design choices in the SMC algorithm SMC vs MCMC Conclusion

13 Bayesian decision trees: prior p(t Y, X ) p(y T, X ) p(t X ) }{{}}{{} likelihood prior

14 Partial trees 0. Start with empty tree. 1 ɛ x 2 0 B ɛ x 1 1

15 Partial trees 1. Choose to split root node with feature 1 and threshold ɛ : x 1 > x 2 0 B 0 B 1 x 1 1

16 Partial trees 2. Choose to not split node 0. 1 ɛ : x 1 > x 2 0 B 0 B 1 x 1 1

17 Partial trees 3. Choose to split node 1 with with feature 2 and threshold ɛ : x 1 > : x 2 > x 2 0 B 0 B 11 B 10 x 1 1

18 Partial trees 4. Choose to not split node Choose to not split node 11. ɛ : x 1 > : x 2 > x 2 0 B 0 B 11 B 10 x 1 1

19 Sequence of random variables for a tree ɛ : x 1 > : x 2 > ρ ɛ = 1, κ ɛ = 1, τ ɛ = ρ 0 = 0 3. ρ 1 = 1, κ 1 = 2, τ 1 = ρ 10 = 0 5. ρ 11 = 0

20 Sequential prior over decision trees Probability of split (assuming a valid split exists): ( βs p(j split) = α s 1 depth(j)) α s (0, 1), β s [0, ) κ j, τ j sampled uniformly from the range of valid splits

21 Sequential prior over decision trees Probability of split (assuming a valid split exists): ( βs p(j split) = α s 1 depth(j)) α s (0, 1), β s [0, ) κ j, τ j sampled uniformly from the range of valid splits Prior distribution: p(t, κ, τ X ) = j leaves(t) j nonleaves(t) p(j not split) p(j split)p(κ j, τ j )

22 Outline Introduction Sequential prior over decision trees Bayesian inference: Top-down particle filtering Experiments Design choices in the SMC algorithm SMC vs MCMC Conclusion

23 Bayesian decision trees: posterior p(t Y, X ) p(y T, X ) p(t X ) }{{}}{{} likelihood prior

24 SMC algorithm for Bayesian decision trees Importance sampler: Draw T (c) q( ) p(y X ) = T p(y, T X ) C 1 C c=1 p(t (c) ) q(t (c) ) p(y X, T (c) ) } {{ } w (c)

25 SMC algorithm for Bayesian decision trees Importance sampler: Draw T (c) q( ) p(y X ) = T p(y, T X ) C 1 C c=1 p(t (c) ) q(t (c) ) p(y X, T (c) ) } {{ } w (c) Normalize: w (c) = Approximate posterior: w (c) c w (c ) p(t Y, X ) c w (c) δ(t = T (c) )

26 SMC algorithm for Bayesian decision trees (contd.) Sequential importance sampler (SIS): n n p(t n ) = p(t 0 ) p(t n T n 1) q(t n ) = q 0 (T 0 ) q n (T n T n 1) n =1 n =1 p(y X, T n ) = p(y X, T 0 ) p(y X, T 1 ) p(y X, T 0 ) p(y X, T n ) p(y X, T n 1 )

27 SMC algorithm for Bayesian decision trees (contd.) Sequential importance sampler (SIS): n n p(t n ) = p(t 0 ) p(t n T n 1) q(t n ) = q 0 (T 0 ) q n (T n T n 1) n =1 n =1 p(y X, T n ) = p(y X, T 0 ) p(y X, T 1 ) p(y X, T 0 ) p(y X, T n ) p(y X, T n 1 ) w = 1 C = w 0 n p(t n ) q(t n ) p(y X, T n) n =1 p(t n T n 1) p(y X, T n ) q n (T n T n 1) p(y X, T n 1) }{{} local likelihood Sequential Monte Carlo (SMC): SIS adaptive resampling steps Every node is processed just once: no multi-path issues

28 Outline Introduction Sequential prior over decision trees Bayesian inference: Top-down particle filtering Experiments Design choices in the SMC algorithm SMC vs MCMC Conclusion

29 Experimental setup Datasets: magic-04: N = 19K, D = 10, K = 2. pendigits: N = 11K, D = 16, K = % - 30% train-test split Numbers averaged across 10 different initializations

30 Outline Introduction Sequential prior over decision trees Bayesian inference: Top-down particle filtering Experiments Design choices in the SMC algorithm SMC vs MCMC Conclusion

31 SMC design choices Proposals prior proposal: qn (ρ j, κ j, τ j ) = p(ρ j, κ j, τ j ) optimal proposal: q n (ρ j = stop) p(j not split)p(y N(j) X N(j) ), q n (ρ j = split, κ j, τ j ) p(j split)p(κ j, τ j ) p(y N(j0) X N(j0) ) p(y N(j1) X N(j1) ). }{{}}{{} left child right child Set of nodes considered for expansion at iteration n node-wise: next node layer-wise: all nodes at depth n Multinomial resampling

32 Effect of SMC design choices log p(y X) (test) SMC optimal [node] SMC prior [node] SMC optimal [layer] SMC prior [layer] Mean Time (s) log p(y X) (test) SMC optimal [node] SMC prior [node] SMC optimal [layer] SMC prior [layer] Number of particles Figure: Results on magic-04 dataset

33 Effect of irrelevant features on SMC design choices madelon: N = 2.6K, D = 500, K = 2 (96% of the features are irrelevant) log p(y X) (test) SMC optimal [node] SMC prior [node] Mean Time (s) log p(y X) (test) SMC optimal [node] SMC prior [node] Number of particles Figure: Results on madelon dataset

34 Outline Introduction Sequential prior over decision trees Bayesian inference: Top-down particle filtering Experiments Design choices in the SMC algorithm SMC vs MCMC Conclusion

35 Predictive performance vs computation: SMC vs MCMC Fix hyper parameters α = 5, α s = 0.95, β s = 0.5 MCMC [Chipman et al., 1998]: one of the 4 proposals: grow prune change swap MCMC averages predictions over all previous trees Vary number of particles in SMC, number of MCMC iterations and compare runtime vs performance

36 Predictive performance vs computation: SMC vs MCMC log p(y X) (test) Mean Time (s) SMC optimal [node] SMC prior [node] Chipman-MCMC CART (gini) CART (entropy) Accuracy (test) Mean Time (s) SMC optimal [node] SMC prior [node] Chipman-MCMC CART (gini) CART (entropy) Figure: Results on magic-04 dataset

37 Take home message SMC (prior, node-wise) is at least an order of magnitude faster than MCMC

38 Outline Introduction Sequential prior over decision trees Bayesian inference: Top-down particle filtering Experiments Design choices in the SMC algorithm SMC vs MCMC Conclusion

39 Conclusion SMC for fast Bayesian inference for decision trees mimick the top-down generative process of decision trees use local likelihoods resampling steps to guide tree growth For a fixed computational budget, SMC outperforms MCMC

40 Conclusion SMC for fast Bayesian inference for decision trees mimick the top-down generative process of decision trees use local likelihoods resampling steps to guide tree growth For a fixed computational budget, SMC outperforms MCMC Future directions Particle-MCMC for Bayesian Additive Regression Trees Mondrian process prior: projective and exchangeable prior for decision trees [Roy and Teh, 2009]

41 Thank you! Code available at

42 Chipman, H. A., George, E. I., and McCulloch, R. E. (1998). Bayesian CART model search. J. Am. Stat. Assoc., pages Roy, D. M. and Teh, Y. W. (2009). The Mondrian process. In Adv. Neural Information Proc. Systems, volume 21, pages

ECS171: Machine Learning

ECS171: Machine Learning ECS171: Machine Learning Lecture 15: Tree-based Algorithms Cho-Jui Hsieh UC Davis March 7, 2018 Outline Decision Tree Random Forest Gradient Boosted Decision Tree (GBDT) Decision Tree Each node checks

More information

Adaptive Experiments for Policy Choice. March 8, 2019

Adaptive Experiments for Policy Choice. March 8, 2019 Adaptive Experiments for Policy Choice Maximilian Kasy Anja Sautmann March 8, 2019 Introduction The goal of many experiments is to inform policy choices: 1. Job search assistance for refugees: Treatments:

More information

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties Posterior Inference Example. Consider a binomial model where we have a posterior distribution for the probability term, θ. Suppose we want to make inferences about the log-odds γ = log ( θ 1 θ), where

More information

Machine Learning for Quantitative Finance

Machine Learning for Quantitative Finance Machine Learning for Quantitative Finance Fast derivative pricing Sofie Reyners Joint work with Jan De Spiegeleer, Dilip Madan and Wim Schoutens Derivative pricing is time-consuming... Vanilla option pricing

More information

Introduction to Sequential Monte Carlo Methods

Introduction to Sequential Monte Carlo Methods Introduction to Sequential Monte Carlo Methods Arnaud Doucet NCSU, October 2008 Arnaud Doucet () Introduction to SMC NCSU, October 2008 1 / 36 Preliminary Remarks Sequential Monte Carlo (SMC) are a set

More information

Extending MCTS

Extending MCTS Extending MCTS 2-17-16 Reading Quiz (from Monday) What is the relationship between Monte Carlo tree search and upper confidence bound applied to trees? a) MCTS is a type of UCT b) UCT is a type of MCTS

More information

Relevant parameter changes in structural break models

Relevant parameter changes in structural break models Relevant parameter changes in structural break models A. Dufays J. Rombouts Forecasting from Complexity April 27 th, 2018 1 Outline Sparse Change-Point models 1. Motivation 2. Model specification Shrinkage

More information

Estimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs. SS223B-Empirical IO

Estimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs. SS223B-Empirical IO Estimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs SS223B-Empirical IO Motivation There have been substantial recent developments in the empirical literature on

More information

Lecture 17: More on Markov Decision Processes. Reinforcement learning

Lecture 17: More on Markov Decision Processes. Reinforcement learning Lecture 17: More on Markov Decision Processes. Reinforcement learning Learning a model: maximum likelihood Learning a value function directly Monte Carlo Temporal-difference (TD) learning COMP-424, Lecture

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 40 Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: Chapter 7 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods:

More information

Decision Trees An Early Classifier

Decision Trees An Early Classifier An Early Classifier Jason Corso SUNY at Buffalo January 19, 2012 J. Corso (SUNY at Buffalo) Trees January 19, 2012 1 / 33 Introduction to Non-Metric Methods Introduction to Non-Metric Methods We cover

More information

Credit Card Default Predictive Modeling

Credit Card Default Predictive Modeling Credit Card Default Predictive Modeling Background: Predicting credit card payment default is critical for the successful business model of a credit card company. An accurate predictive model can help

More information

Analysis of the Bitcoin Exchange Using Particle MCMC Methods

Analysis of the Bitcoin Exchange Using Particle MCMC Methods Analysis of the Bitcoin Exchange Using Particle MCMC Methods by Michael Johnson M.Sc., University of British Columbia, 2013 B.Sc., University of Winnipeg, 2011 Project Submitted in Partial Fulfillment

More information

The exam is closed book, closed calculator, and closed notes except your three crib sheets.

The exam is closed book, closed calculator, and closed notes except your three crib sheets. CS 188 Spring 2016 Introduction to Artificial Intelligence Final V2 You have approximately 2 hours and 50 minutes. The exam is closed book, closed calculator, and closed notes except your three crib sheets.

More information

Likelihood-based Optimization of Threat Operation Timeline Estimation

Likelihood-based Optimization of Threat Operation Timeline Estimation 12th International Conference on Information Fusion Seattle, WA, USA, July 6-9, 2009 Likelihood-based Optimization of Threat Operation Timeline Estimation Gregory A. Godfrey Advanced Mathematics Applications

More information

High Frequency Trading Strategy Based on Prex Trees

High Frequency Trading Strategy Based on Prex Trees High Frequency Trading Strategy Based on Prex Trees Yijia Zhou, 05592862, Financial Mathematics, Stanford University December 11, 2010 1 Introduction 1.1 Goal I am an M.S. Finanical Mathematics student

More information

The Monte Carlo Method in High Performance Computing

The Monte Carlo Method in High Performance Computing The Monte Carlo Method in High Performance Computing Dieter W. Heermann Monte Carlo Methods 2015 Dieter W. Heermann (Monte Carlo Methods)The Monte Carlo Method in High Performance Computing 2015 1 / 1

More information

-divergences and Monte Carlo methods

-divergences and Monte Carlo methods -divergences and Monte Carlo methods Summary - english version Ph.D. candidate OLARIU Emanuel Florentin Advisor Professor LUCHIAN Henri This thesis broadly concerns the use of -divergences mainly for variance

More information

Inverse reinforcement learning from summary data

Inverse reinforcement learning from summary data Inverse reinforcement learning from summary data Antti Kangasrääsiö, Samuel Kaski Aalto University, Finland ECML PKDD 2018 journal track Published in Machine Learning (2018), 107:1517 1535 September 12,

More information

Investing through Economic Cycles with Ensemble Machine Learning Algorithms

Investing through Economic Cycles with Ensemble Machine Learning Algorithms Investing through Economic Cycles with Ensemble Machine Learning Algorithms Thomas Raffinot Silex Investment Partners Big Data in Finance Conference Thomas Raffinot (Silex-IP) Economic Cycles-Machine Learning

More information

Computer Vision Group Prof. Daniel Cremers. 7. Sequential Data

Computer Vision Group Prof. Daniel Cremers. 7. Sequential Data Group Prof. Daniel Cremers 7. Sequential Data Bayes Filter (Rep.) We can describe the overall process using a Dynamic Bayes Network: This incorporates the following Markov assumptions: (measurement) (state)!2

More information

Extracting Information from the Markets: A Bayesian Approach

Extracting Information from the Markets: A Bayesian Approach Extracting Information from the Markets: A Bayesian Approach Daniel Waggoner The Federal Reserve Bank of Atlanta Florida State University, February 29, 2008 Disclaimer: The views expressed are the author

More information

Financial Time Series Volatility Analysis Using Gaussian Process State-Space Models

Financial Time Series Volatility Analysis Using Gaussian Process State-Space Models 15 IEEE Global Conference on Signal and Information Processing (GlobalSIP) Financial Time Series Volatility Analysis Using Gaussian Process State-Space Models Jianan Han, Xiao-Ping Zhang Department of

More information

Lecture 9: Classification and Regression Trees

Lecture 9: Classification and Regression Trees Lecture 9: Classification and Regression Trees Advanced Applied Multivariate Analysis STAT 2221, Spring 2015 Sungkyu Jung Department of Statistics, University of Pittsburgh Xingye Qiao Department of Mathematical

More information

(5) Multi-parameter models - Summarizing the posterior

(5) Multi-parameter models - Summarizing the posterior (5) Multi-parameter models - Summarizing the posterior Spring, 2017 Models with more than one parameter Thus far we have studied single-parameter models, but most analyses have several parameters For example,

More information

BAYESIAN STATISTICAL PROCESS ADJUSTMENT FOR UNKNOWN PARAMETER SYSTEMS

BAYESIAN STATISTICAL PROCESS ADJUSTMENT FOR UNKNOWN PARAMETER SYSTEMS The Pennsylvania State University The Graduate School Department of Industrial & Manufacturing Engineering BAYESIAN STATISTICAL PROCESS ADJUSTMENT FOR UNKNOWN PARAMETER SYSTEMS A Thesis in Industrial Engineering

More information

Comparison of Pricing Approaches for Longevity Markets

Comparison of Pricing Approaches for Longevity Markets Comparison of Pricing Approaches for Longevity Markets Melvern Leung Simon Fung & Colin O hare Longevity 12 Conference, Chicago, The Drake Hotel, September 30 th 2016 1 / 29 Overview Introduction 1 Introduction

More information

Pattern Recognition Chapter 5: Decision Trees

Pattern Recognition Chapter 5: Decision Trees Pattern Recognition Chapter 5: Decision Trees Asst. Prof. Dr. Chumphol Bunkhumpornpat Department of Computer Science Faculty of Science Chiang Mai University Learning Objectives How decision trees are

More information

American Option Pricing: A Simulated Approach

American Option Pricing: A Simulated Approach Utah State University DigitalCommons@USU All Graduate Plan B and other Reports Graduate Studies 5-2013 American Option Pricing: A Simulated Approach Garrett G. Smith Utah State University Follow this and

More information

Sequential Monte Carlo Samplers

Sequential Monte Carlo Samplers Sequential Monte Carlo Samplers Pierre Del Moral Université Nice Sophia Antipolis, France Arnaud Doucet University of British Columbia, Canada Ajay Jasra University of Oxford, UK Summary. In this paper,

More information

Lecture 7: Bayesian approach to MAB - Gittins index

Lecture 7: Bayesian approach to MAB - Gittins index Advanced Topics in Machine Learning and Algorithmic Game Theory Lecture 7: Bayesian approach to MAB - Gittins index Lecturer: Yishay Mansour Scribe: Mariano Schain 7.1 Introduction In the Bayesian approach

More information

Down-Up Metropolis-Hastings Algorithm for Multimodality

Down-Up Metropolis-Hastings Algorithm for Multimodality Down-Up Metropolis-Hastings Algorithm for Multimodality Hyungsuk Tak Stat310 24 Nov 2015 Joint work with Xiao-Li Meng and David A. van Dyk Outline Motivation & idea Down-Up Metropolis-Hastings (DUMH) algorithm

More information

Bayesian Normal Stuff

Bayesian Normal Stuff Bayesian Normal Stuff - Set-up of the basic model of a normally distributed random variable with unknown mean and variance (a two-parameter model). - Discuss philosophies of prior selection - Implementation

More information

CS188 Spring 2012 Section 4: Games

CS188 Spring 2012 Section 4: Games CS188 Spring 2012 Section 4: Games 1 Minimax Search In this problem, we will explore adversarial search. Consider the zero-sum game tree shown below. Trapezoids that point up, such as at the root, represent

More information

Non-informative Priors Multiparameter Models

Non-informative Priors Multiparameter Models Non-informative Priors Multiparameter Models Statistics 220 Spring 2005 Copyright c 2005 by Mark E. Irwin Prior Types Informative vs Non-informative There has been a desire for a prior distributions that

More information

Q1. [?? pts] Search Traces

Q1. [?? pts] Search Traces CS 188 Spring 2010 Introduction to Artificial Intelligence Midterm Exam Solutions Q1. [?? pts] Search Traces Each of the trees (G1 through G5) was generated by searching the graph (below, left) with a

More information

Estimation of the Markov-switching GARCH model by a Monte Carlo EM algorithm

Estimation of the Markov-switching GARCH model by a Monte Carlo EM algorithm Estimation of the Markov-switching GARCH model by a Monte Carlo EM algorithm Maciej Augustyniak Fields Institute February 3, 0 Stylized facts of financial data GARCH Regime-switching MS-GARCH Agenda Available

More information

A new look at tree based approaches

A new look at tree based approaches A new look at tree based approaches Xifeng Wang University of North Carolina Chapel Hill xifeng@live.unc.edu April 18, 2018 Xifeng Wang (UNC-Chapel Hill) Short title April 18, 2018 1 / 27 Outline of this

More information

Multidimensional Monotonicity Discovery with mbart

Multidimensional Monotonicity Discovery with mbart Multidimensional Monotonicity Discovery with mart Rob McCulloch Arizona State Collaborations with: Hugh Chipman (Acadia), Edward George (Wharton, University of Pennsylvania), Tom Shively (UT Austin) October

More information

Bayesian course - problem set 3 (lecture 4)

Bayesian course - problem set 3 (lecture 4) Bayesian course - problem set 3 (lecture 4) Ben Lambert November 14, 2016 1 Ticked off Imagine once again that you are investigating the occurrence of Lyme disease in the UK. This is a vector-borne disease

More information

Monotonically Constrained Bayesian Additive Regression Trees

Monotonically Constrained Bayesian Additive Regression Trees Constrained Bayesian Additive Regression Trees Robert McCulloch University of Chicago, Booth School of Business Joint with: Hugh Chipman (Acadia), Ed George (UPenn, Wharton), Tom Shively (U Texas, McCombs)

More information

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior (5) Multi-parameter models - Summarizing the posterior Models with more than one parameter Thus far we have studied single-parameter models, but most analyses have several parameters For example, consider

More information

1 Bayesian Bias Correction Model

1 Bayesian Bias Correction Model 1 Bayesian Bias Correction Model Assuming that n iid samples {X 1,...,X n }, were collected from a normal population with mean µ and variance σ 2. The model likelihood has the form, P( X µ, σ 2, T n >

More information

Notes on the EM Algorithm Michael Collins, September 24th 2005

Notes on the EM Algorithm Michael Collins, September 24th 2005 Notes on the EM Algorithm Michael Collins, September 24th 2005 1 Hidden Markov Models A hidden Markov model (N, Σ, Θ) consists of the following elements: N is a positive integer specifying the number of

More information

# generate data num.obs <- 100 y <- rnorm(num.obs,mean = theta.true, sd = sqrt(sigma.sq.true))

# generate data num.obs <- 100 y <- rnorm(num.obs,mean = theta.true, sd = sqrt(sigma.sq.true)) Posterior Sampling from Normal Now we seek to create draws from the joint posterior distribution and the marginal posterior distributions and Note the marginal posterior distributions would be used to

More information

Enforcing monotonicity of decision models: algorithm and performance

Enforcing monotonicity of decision models: algorithm and performance Enforcing monotonicity of decision models: algorithm and performance Marina Velikova 1 and Hennie Daniels 1,2 A case study of hedonic price model 1 Tilburg University, CentER for Economic Research,Tilburg,

More information

Modeling skewness and kurtosis in Stochastic Volatility Models

Modeling skewness and kurtosis in Stochastic Volatility Models Modeling skewness and kurtosis in Stochastic Volatility Models Georgios Tsiotas University of Crete, Department of Economics, GR December 19, 2006 Abstract Stochastic volatility models have been seen as

More information

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting Quantile Regression By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting Agenda Overview of Predictive Modeling for P&C Applications Quantile

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 31 : Estimation Sections 7.1 Statistical Inference Bayesian Methods: 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods: 7.5 Maximum Likelihood

More information

Stochastic Volatility and Jumps: Exponentially Affine Yes or No? An Empirical Analysis of S&P500 Dynamics

Stochastic Volatility and Jumps: Exponentially Affine Yes or No? An Empirical Analysis of S&P500 Dynamics Stochastic Volatility and Jumps: Exponentially Affine Yes or No? An Empirical Analysis of S&P5 Dynamics Katja Ignatieva Paulo J. M. Rodrigues Norman Seeger This version: April 3, 29 Abstract This paper

More information

Numerical Methods for Pricing Energy Derivatives, including Swing Options, in the Presence of Jumps

Numerical Methods for Pricing Energy Derivatives, including Swing Options, in the Presence of Jumps Numerical Methods for Pricing Energy Derivatives, including Swing Options, in the Presence of Jumps, Senior Quantitative Analyst Motivation: Swing Options An electricity or gas SUPPLIER needs to be capable,

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 14th February 2006 Part VII Session 7: Volatility Modelling Session 7: Volatility Modelling

More information

CS340 Machine learning Bayesian model selection

CS340 Machine learning Bayesian model selection CS340 Machine learning Bayesian model selection Bayesian model selection Suppose we have several models, each with potentially different numbers of parameters. Example: M0 = constant, M1 = straight line,

More information

Laplace approximation

Laplace approximation NPFL108 Bayesian inference Approximate Inference Laplace approximation Filip Jurčíček Institute of Formal and Applied Linguistics Charles University in Prague Czech Republic Home page: http://ufal.mff.cuni.cz/~jurcicek

More information

Session 5. Predictive Modeling in Life Insurance

Session 5. Predictive Modeling in Life Insurance SOA Predictive Analytics Seminar Hong Kong 29 Aug. 2018 Hong Kong Session 5 Predictive Modeling in Life Insurance Jingyi Zhang, Ph.D Predictive Modeling in Life Insurance JINGYI ZHANG PhD Scientist Global

More information

Adjusted Priors for Bayes Factors Involving Reparameterized Order Constraints

Adjusted Priors for Bayes Factors Involving Reparameterized Order Constraints Adjusted Priors for Bayes Factors Involving Reparameterized Order Constraints Supplementary Material Daniel W. Heck & Eric-Jan Wagenmakers April 29, 2016 Contents 1 The Product-Binomial Model 2 1.1 Parameter

More information

Importance sampling and Monte Carlo-based calibration for time-changed Lévy processes

Importance sampling and Monte Carlo-based calibration for time-changed Lévy processes Importance sampling and Monte Carlo-based calibration for time-changed Lévy processes Stefan Kassberger Thomas Liebmann BFS 2010 1 Motivation 2 Time-changed Lévy-models and Esscher transforms 3 Applications

More information

Recent Advances in Fractional Stochastic Volatility Models

Recent Advances in Fractional Stochastic Volatility Models Recent Advances in Fractional Stochastic Volatility Models Alexandra Chronopoulou Industrial & Enterprise Systems Engineering University of Illinois at Urbana-Champaign IPAM National Meeting of Women in

More information

Option Pricing Using Bayesian Neural Networks

Option Pricing Using Bayesian Neural Networks Option Pricing Using Bayesian Neural Networks Michael Maio Pires, Tshilidzi Marwala School of Electrical and Information Engineering, University of the Witwatersrand, 2050, South Africa m.pires@ee.wits.ac.za,

More information

Reinforcement Learning and Simulation-Based Search

Reinforcement Learning and Simulation-Based Search Reinforcement Learning and Simulation-Based Search David Silver Outline 1 Reinforcement Learning 2 3 Planning Under Uncertainty Reinforcement Learning Markov Decision Process Definition A Markov Decision

More information

Bayesian inference of Gaussian mixture models with noninformative priors arxiv: v1 [stat.me] 19 May 2014

Bayesian inference of Gaussian mixture models with noninformative priors arxiv: v1 [stat.me] 19 May 2014 Bayesian inference of Gaussian mixture models with noninformative priors arxiv:145.4895v1 [stat.me] 19 May 214 Colin J. Stoneking May 21, 214 Abstract This paper deals with Bayesian inference of a mixture

More information

Implementing Models in Quantitative Finance: Methods and Cases

Implementing Models in Quantitative Finance: Methods and Cases Gianluca Fusai Andrea Roncoroni Implementing Models in Quantitative Finance: Methods and Cases vl Springer Contents Introduction xv Parti Methods 1 Static Monte Carlo 3 1.1 Motivation and Issues 3 1.1.1

More information

Monte-Carlo Planning Look Ahead Trees. Alan Fern

Monte-Carlo Planning Look Ahead Trees. Alan Fern Monte-Carlo Planning Look Ahead Trees Alan Fern 1 Monte-Carlo Planning Outline Single State Case (multi-armed bandits) A basic tool for other algorithms Monte-Carlo Policy Improvement Policy rollout Policy

More information

Using Halton Sequences. in Random Parameters Logit Models

Using Halton Sequences. in Random Parameters Logit Models Journal of Statistical and Econometric Methods, vol.5, no.1, 2016, 59-86 ISSN: 1792-6602 (print), 1792-6939 (online) Scienpress Ltd, 2016 Using Halton Sequences in Random Parameters Logit Models Tong Zeng

More information

COS 513: Gibbs Sampling

COS 513: Gibbs Sampling COS 513: Gibbs Sampling Matthew Salesi December 6, 2010 1 Overview Concluding the coverage of Markov chain Monte Carlo (MCMC) sampling methods, we look today at Gibbs sampling. Gibbs sampling is a simple

More information

Monte Carlo Methods (Estimators, On-policy/Off-policy Learning)

Monte Carlo Methods (Estimators, On-policy/Off-policy Learning) 1 / 24 Monte Carlo Methods (Estimators, On-policy/Off-policy Learning) Julie Nutini MLRG - Winter Term 2 January 24 th, 2017 2 / 24 Monte Carlo Methods Monte Carlo (MC) methods are learning methods, used

More information

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION Vol. 6, No. 1, Summer 2017 2012 Published by JSES. SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN Fadel Hamid Hadi ALHUSSEINI a Abstract The main focus of the paper is modelling

More information

Estimation Appendix to Dynamics of Fiscal Financing in the United States

Estimation Appendix to Dynamics of Fiscal Financing in the United States Estimation Appendix to Dynamics of Fiscal Financing in the United States Eric M. Leeper, Michael Plante, and Nora Traum July 9, 9. Indiana University. This appendix includes tables and graphs of additional

More information

Bayesian Estimation of the Markov-Switching GARCH(1,1) Model with Student-t Innovations

Bayesian Estimation of the Markov-Switching GARCH(1,1) Model with Student-t Innovations Bayesian Estimation of the Markov-Switching GARCH(1,1) Model with Student-t Innovations Department of Quantitative Economics, Switzerland david.ardia@unifr.ch R/Rmetrics User and Developer Workshop, Meielisalp,

More information

SIMULATION METHOD FOR SOLVING HYBRID INFLUENCE DIAGRAMS IN DECISION MAKING. Xi Chen Enlu Zhou

SIMULATION METHOD FOR SOLVING HYBRID INFLUENCE DIAGRAMS IN DECISION MAKING. Xi Chen Enlu Zhou Proceedings of the 2010 Winter Simulation Conference B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan, and E. Yücesan, eds. SIMULATION METHOD FOR SOLVING HYBRID INFLUENCE DIAGRAMS IN DECISION MAKING

More information

Bayesian Computation in Finance

Bayesian Computation in Finance Bayesian Computation in Finance Satadru Hore 1, Michael Johannes 2 Hedibert Lopes 3,Robert McCulloch 4, and Nicholas Polson 5 Abstract In this paper we describe the challenges of Bayesian computation in

More information

Estimation after Model Selection

Estimation after Model Selection Estimation after Model Selection Vanja M. Dukić Department of Health Studies University of Chicago E-Mail: vanja@uchicago.edu Edsel A. Peña* Department of Statistics University of South Carolina E-Mail:

More information

Probabilistic Meshless Methods for Bayesian Inverse Problems. Jon Cockayne July 8, 2016

Probabilistic Meshless Methods for Bayesian Inverse Problems. Jon Cockayne July 8, 2016 Probabilistic Meshless Methods for Bayesian Inverse Problems Jon Cockayne July 8, 2016 1 Co-Authors Chris Oates Tim Sullivan Mark Girolami 2 What is PN? Many problems in mathematics have no analytical

More information

Markov Decision Processes

Markov Decision Processes Markov Decision Processes Robert Platt Northeastern University Some images and slides are used from: 1. CS188 UC Berkeley 2. AIMA 3. Chris Amato Stochastic domains So far, we have studied search Can use

More information

"Pricing Exotic Options using Strong Convergence Properties

Pricing Exotic Options using Strong Convergence Properties Fourth Oxford / Princeton Workshop on Financial Mathematics "Pricing Exotic Options using Strong Convergence Properties Klaus E. Schmitz Abe schmitz@maths.ox.ac.uk www.maths.ox.ac.uk/~schmitz Prof. Mike

More information

The Loans_processed.csv file is the dataset we obtained after the pre-processing part where the clean-up python code was used.

The Loans_processed.csv file is the dataset we obtained after the pre-processing part where the clean-up python code was used. Machine Learning Group Homework 3 MSc Business Analytics Team 9 Alexander Romanenko, Artemis Tomadaki, Justin Leiendecker, Zijun Wei, Reza Brianca Widodo The Loans_processed.csv file is the dataset we

More information

Binary and Binomial Heaps. Disclaimer: these slides were adapted from the ones by Kevin Wayne

Binary and Binomial Heaps. Disclaimer: these slides were adapted from the ones by Kevin Wayne Binary and Binomial Heaps Disclaimer: these slides were adapted from the ones by Kevin Wayne Priority Queues Supports the following operations. Insert element x. Return min element. Return and delete minimum

More information

A Macro-Finance Model of the Term Structure: the Case for a Quadratic Yield Model

A Macro-Finance Model of the Term Structure: the Case for a Quadratic Yield Model Title page Outline A Macro-Finance Model of the Term Structure: the Case for a 21, June Czech National Bank Structure of the presentation Title page Outline Structure of the presentation: Model Formulation

More information

Random Tree Method. Monte Carlo Methods in Financial Engineering

Random Tree Method. Monte Carlo Methods in Financial Engineering Random Tree Method Monte Carlo Methods in Financial Engineering What is it for? solve full optimal stopping problem & estimate value of the American option simulate paths of underlying Markov chain produces

More information

Outline. Objective. Previous Results Our Results Discussion Current Research. 1 Motivation. 2 Model. 3 Results

Outline. Objective. Previous Results Our Results Discussion Current Research. 1 Motivation. 2 Model. 3 Results On Threshold Esteban 1 Adam 2 Ravi 3 David 4 Sergei 1 1 Stanford University 2 Harvard University 3 Yahoo! Research 4 Carleton College The 8th ACM Conference on Electronic Commerce EC 07 Outline 1 2 3 Some

More information

Using Agent Belief to Model Stock Returns

Using Agent Belief to Model Stock Returns Using Agent Belief to Model Stock Returns America Holloway Department of Computer Science University of California, Irvine, Irvine, CA ahollowa@ics.uci.edu Introduction It is clear that movements in stock

More information

2.1 Mathematical Basis: Risk-Neutral Pricing

2.1 Mathematical Basis: Risk-Neutral Pricing Chapter Monte-Carlo Simulation.1 Mathematical Basis: Risk-Neutral Pricing Suppose that F T is the payoff at T for a European-type derivative f. Then the price at times t before T is given by f t = e r(t

More information

Quarterly Storage Model of U.S. Cotton Market: Estimation of the Basis under Rational Expectations. Oleksiy Tokovenko 1 Lewell F.

Quarterly Storage Model of U.S. Cotton Market: Estimation of the Basis under Rational Expectations. Oleksiy Tokovenko 1 Lewell F. Quarterly Storage Model of U.S. Cotton Market: Estimation of the Basis under Rational Expectations Oleksiy Tokovenko 1 Lewell F. Gunter Selected Paper prepared for presentation at the American Agricultural

More information

Tree Diagram. Splitting Criterion. Splitting Criterion. Introduction. Building a Decision Tree. MS4424 Data Mining & Modelling Decision Tree

Tree Diagram. Splitting Criterion. Splitting Criterion. Introduction. Building a Decision Tree. MS4424 Data Mining & Modelling Decision Tree Introduction MS4424 Data Mining & Modelling Decision Tree Lecturer : Dr Iris Yeung Room No : P7509 Tel No : 2788 8566 Email : msiris@cityu.edu.hk decision tree is a set of rules represented in a tree structure

More information

LendingClub Loan Default and Profitability Prediction

LendingClub Loan Default and Profitability Prediction LendingClub Loan Default and Profitability Prediction Peiqian Li peiqian@stanford.edu Gao Han gh352@stanford.edu Abstract Credit risk is something all peer-to-peer (P2P) lending investors (and bond investors

More information

Application of MCMC Algorithm in Interest Rate Modeling

Application of MCMC Algorithm in Interest Rate Modeling Application of MCMC Algorithm in Interest Rate Modeling Xiaoxia Feng and Dejun Xie Abstract Interest rate modeling is a challenging but important problem in financial econometrics. This work is concerned

More information

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMSN50)

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMSN50) Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMSN50) Magnus Wiktorsson Centre for Mathematical Sciences Lund University, Sweden Lecture 6 Sequential Monte Carlo methods II February

More information

Identifying Long-Run Risks: A Bayesian Mixed-Frequency Approach

Identifying Long-Run Risks: A Bayesian Mixed-Frequency Approach Identifying : A Bayesian Mixed-Frequency Approach Frank Schorfheide University of Pennsylvania CEPR and NBER Dongho Song University of Pennsylvania Amir Yaron University of Pennsylvania NBER February 12,

More information

Technical Appendix: Policy Uncertainty and Aggregate Fluctuations.

Technical Appendix: Policy Uncertainty and Aggregate Fluctuations. Technical Appendix: Policy Uncertainty and Aggregate Fluctuations. Haroon Mumtaz Paolo Surico July 18, 2017 1 The Gibbs sampling algorithm Prior Distributions and starting values Consider the model to

More information

Algorithms and Networking for Computer Games

Algorithms and Networking for Computer Games Algorithms and Networking for Computer Games Chapter 4: Game Trees http://www.wiley.com/go/smed Game types perfect information games no hidden information two-player, perfect information games Noughts

More information

Multi-Armed Bandit, Dynamic Environments and Meta-Bandits

Multi-Armed Bandit, Dynamic Environments and Meta-Bandits Multi-Armed Bandit, Dynamic Environments and Meta-Bandits C. Hartland, S. Gelly, N. Baskiotis, O. Teytaud and M. Sebag Lab. of Computer Science CNRS INRIA Université Paris-Sud, Orsay, France Abstract This

More information

Computational Methods in Finance

Computational Methods in Finance Chapman & Hall/CRC FINANCIAL MATHEMATICS SERIES Computational Methods in Finance AM Hirsa Ltfi) CRC Press VV^ J Taylor & Francis Group Boca Raton London New York CRC Press is an imprint of the Taylor &

More information

Bayesian Analysis of Structural Credit Risk Models with Microstructure Noises

Bayesian Analysis of Structural Credit Risk Models with Microstructure Noises Bayesian Analysis of Structural Credit Risk Models with Microstructure Noises Shirley J. HUANG, Jun YU November 2009 Paper No. 17-2009 ANY OPINIONS EXPRESSED ARE THOSE OF THE AUTHOR(S) AND NOT NECESSARILY

More information

Computational Statistics Handbook with MATLAB

Computational Statistics Handbook with MATLAB «H Computer Science and Data Analysis Series Computational Statistics Handbook with MATLAB Second Edition Wendy L. Martinez The Office of Naval Research Arlington, Virginia, U.S.A. Angel R. Martinez Naval

More information

Action Selection for MDPs: Anytime AO* vs. UCT

Action Selection for MDPs: Anytime AO* vs. UCT Action Selection for MDPs: Anytime AO* vs. UCT Blai Bonet 1 and Hector Geffner 2 1 Universidad Simón Boĺıvar 2 ICREA & Universitat Pompeu Fabra AAAI, Toronto, Canada, July 2012 Online MDP Planning and

More information

Particle Learning for Fat-tailed Distributions 1

Particle Learning for Fat-tailed Distributions 1 Particle Learning for Fat-tailed Distributions 1 Hedibert F. Lopes and Nicholas G. Polson University of Chicago Booth School of Business Abstract It is well-known that parameter estimates and forecasts

More information

Monte-Carlo Planning Look Ahead Trees. Alan Fern

Monte-Carlo Planning Look Ahead Trees. Alan Fern Monte-Carlo Planning Look Ahead Trees Alan Fern 1 Monte-Carlo Planning Outline Single State Case (multi-armed bandits) A basic tool for other algorithms Monte-Carlo Policy Improvement Policy rollout Policy

More information

Conjugate Models. Patrick Lam

Conjugate Models. Patrick Lam Conjugate Models Patrick Lam Outline Conjugate Models What is Conjugacy? The Beta-Binomial Model The Normal Model Normal Model with Unknown Mean, Known Variance Normal Model with Known Mean, Unknown Variance

More information

Bandit algorithms for tree search Applications to games, optimization, and planning

Bandit algorithms for tree search Applications to games, optimization, and planning Bandit algorithms for tree search Applications to games, optimization, and planning Rémi Munos SequeL project: Sequential Learning http://sequel.futurs.inria.fr/ INRIA Lille - Nord Europe Journées MAS

More information

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Solutions to Final Exam

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Solutions to Final Exam Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay Solutions to Final Exam Problem A: (30 pts) Answer briefly the following questions. 1. Suppose that

More information