Uncertainty Analysis with UNICORN

Similar documents
Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Web Extension: Continuous Distributions and Estimating Beta with a Calculator

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

M249 Diagnostic Quiz

Software Tutorial ormal Statistics

Describing Uncertain Variables

Introduction to vine copulas

MBA 7020 Sample Final Exam

Monte Carlo Simulation (General Simulation Models)

Diploma in Financial Management with Public Finance

Gamma Distribution Fitting

Probability and Statistics

Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty

MAS187/AEF258. University of Newcastle upon Tyne

Chapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.

Random Variables and Probability Distributions

ก ก ก ก ก ก ก. ก (Food Safety Risk Assessment Workshop) 1 : Fundamental ( ก ( NAC 2010)) 2 3 : Excel and Statistics Simulation Software\

Getting started with WinBUGS

the display, exploration and transformation of the data are demonstrated and biases typically encountered are highlighted.

The following content is provided under a Creative Commons license. Your support

Kevin Dowd, Measuring Market Risk, 2nd Edition

CHAPTER 2 Describing Data: Numerical

UPDATED IAA EDUCATION SYLLABUS

ELEMENTS OF MONTE CARLO SIMULATION

CS 361: Probability & Statistics

DATA SUMMARIZATION AND VISUALIZATION

SIMULATION OF ELECTRICITY MARKETS

Lecture 6: Non Normal Distributions

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Appendix A. Selecting and Using Probability Distributions. In this appendix

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Probability. An intro for calculus students P= Figure 1: A normal integral

CS 237: Probability in Computing

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Introduction to Population Modeling

WEB APPENDIX 8A 7.1 ( 8.9)

Testing the significance of the RV coefficient

Market Risk Analysis Volume I

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

EENG473 Mobile Communications Module 3 : Week # (11) Mobile Radio Propagation: Large-Scale Path Loss

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

Statistical Modeling Techniques for Reserve Ranges: A Simulation Approach

CHAPTERS 5 & 6: CONTINUOUS RANDOM VARIABLES

1 Describing Distributions with numbers

To keep our co-ordinates organised in Mathematical Literacy, we will always use a table. R4,50 R9,00 R22,50

Maximum Likelihood Estimation

Market Risk Analysis Volume II. Practical Financial Econometrics

Portfolio Sharpening

Chapter 2 Uncertainty Analysis and Sampling Techniques

Operational Risk Aggregation

Jacob: The illustrative worksheet shows the values of the simulation parameters in the upper left section (Cells D5:F10). Is this for documentation?

8. From FRED, search for Canada unemployment and download the unemployment rate for all persons 15 and over, monthly,

Operational Risk Modeling

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

DECISION SUPPORT Risk handout. Simulating Spreadsheet models

MAS187/AEF258. University of Newcastle upon Tyne

Alternative VaR Models

Measuring Risk Dependencies in the Solvency II-Framework. Robert Danilo Molinari Tristan Nguyen WHL Graduate School of Business and Economics

A First Course in Probability

A New Hybrid Estimation Method for the Generalized Pareto Distribution

Chapter 7: Estimation Sections

INTERNATIONAL JOURNAL FOR INNOVATIVE RESEARCH IN MULTIDISCIPLINARY FIELD ISSN Volume - 3, Issue - 2, Feb

DYNAMIC ECONOMETRIC MODELS Vol. 8 Nicolaus Copernicus University Toruń Mateusz Pipień Cracow University of Economics

Prepared By. Handaru Jati, Ph.D. Universitas Negeri Yogyakarta.

International Financial Markets 1. How Capital Markets Work

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

Chapter 6 Analyzing Accumulated Change: Integrals in Action

Decision Support Models 2012/2013

Counting Basics. Venn diagrams

Chapter 1 Microeconomics of Consumer Theory

appstats5.notebook September 07, 2016 Chapter 5

MVE051/MSG Lecture 7

Math of Finance Exponential & Power Functions

Survey of Math Chapter 21: Savings Models Handout Page 1

The misleading nature of correlations

3.2 Aids to decision making

Properties of IRR Equation with Regard to Ambiguity of Calculating of Rate of Return and a Maximum Number of Solutions

Gujarat University Choice Based Credit System (CBCS) Syllabus for Statistics (UG) B. Sc. Semester III and IV Effective from June, 2018.

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

Question from Session Two

Lecture 3: Factor models in modern portfolio choice

Clark. Outside of a few technical sections, this is a very process-oriented paper. Practice problems are key!

Adjusting Nominal Values to

Probability & Statistics Modular Learning Exercises

Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance

On the Use of Stock Index Returns from Economic Scenario Generators in ERM Modeling

Today's Agenda Hour 1 Correlation vs association, Pearson s R, non-linearity, Spearman rank correlation,

Lecture 2 Describing Data

Economic Capital. Implementing an Internal Model for. Economic Capital ACTUARIAL SERVICES

Statistical Methods in Practice STAT/MATH 3379

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

Introduction to Algorithmic Trading Strategies Lecture 8

February 2010 Office of the Deputy Assistant Secretary of the Army for Cost & Economics (ODASA-CE)

Hints on Some of the Exercises

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL

One note for Session Two

Rapid computation of prices and deltas of nth to default swaps in the Li Model

Transcription:

Uncertainty Analysis with UNICORN D.A.Ababei D.Kurowicka R.M.Cooke D.A.Ababei@ewi.tudelft.nl D.Kurowicka@ewi.tudelft.nl R.M.Cooke@ewi.tudelft.nl Delft Institute for Applied Mathematics Delft University of Technology The Netherlands 1 Introduction UNICORN is a standalone uncertainty analysis software package. The name of the package stands for UNcertainty analysis with CORrelatioNs and its main focus is dependence modelling for high dimensional distributions. Random variables can be coupled using a number of dependence structures (such as dependence trees, vines, Bayesian belief nets). An extended formula parser is available for defining the model output, as are a number of post-processing options such as report generation, various graphical interpretations, conditioning, sensitivity analysis and probabilistic inversion. We intend to explore some of the more important facets of the software by means of a simple example. The theoretical background as well as more detailed examples and Unicorn exercises can be found in Kurowicka and Cooke (2006). 2 Example: Investment We invest $1000 for five years. The yearly interest (V i; i=1,...,5) is uniform on the interval [0.05, 0.15]. Successive yearly interests have a rank correlation of 0.70. We will try to model this scenario in Unicorn. The random variables necessary as input for the model are created (Section 2.1) and the model output is defined, as a function of the input variables (Section 2.2). We try to find a dependence structure across the random variables that best suits the problem (Section 2.3). Once the model is thus defined, it is sampled (Section 2.4), and we explore a few post-processing techniques available for analysing the results (Section 2.5). Bayesian belief net functionality is described in Section 2.6. 2.1 Random Variables The creation of input random variables is the first step in a Unicorn project, and this takes place in the Random Variable View. Each variable can be assigned any of the Correspondence to: D.A.Ababei, Delft University of Technology, Mekelweg 4, 2628 CD Delft, The Netherlands, telephone: +31 1527 87262. 1

following distributions (each with its own set of parameters): constant, uniform, loguniform, normal, log-normal, exponential, Gamma, Weibull, Beta, triangular or discrete. Alternatively a random variable s distribution can be imported from a distribution file created elsewhere. We create six random variables, one constant (1000) and five uniforms on [0.05, 0.15]. Figure 2.1: Random Variable View 2.2 Formulas (User Defined Functions) We proceed to add a formula, in the Formula/User Defined Function View, a function of the input random variables that defines the model output. We create a formula named 5yrReturn with the expression start (1 + V 1) (1 + V 2) (1 + V 3) (1 + V 4) (1 + V 5). Figure 2.2: Formula/User Defined Function View If the random variables V i were independent, this would suffice for our model. However, input variables very rarely are. Our exercise is no exception: its random variables are dependent, and we must incorporate the dependence information in our model. 2.3 Dependence Structures We know the marginal distribution of the interest variables, and that successive variables are correlated with a rank correlation of 0.7. This does not fully specify the joint distribution, and we explore a few different dependence structures (the Dependence View). First we construct a symmetric structure, where all interests are correlated to eachother at 0.70: a tree where all yearly interests are correlated to a latent variable, with a rank correlation equal to the square root of the desired correlation (Figure 2.3). We first 2

Figure 2.3: Tree for Investment with latent variable Figure 2.4: Tree for Investment add the latent random variable to the list of random variables (we can go back to the Random Variables View and specify new variables at any time). It does however seem unlikely that the strength of the dependence between all yearly interests would be the same. We construct the tree in Figure 2.4, which is a model where the correlation between two random variables decreases with the distance between them in the tree (e.g. the correlation between V 1 and V 5, equal to 0.24, is a lot smaller than that between V 1 and V 2) while fulfilling our condition of a 0.70 rank correlation between successive yearly interests. Further we may reflect that, if the interest is high in one year, it is unlikely to be high in both the year before and the year after it. This relation between two yearly interests, conditional on a third, cannot be described in a probability tree, but it can be captured with a D-vine with a negative conditional rank correlation between V i and V i + 2, given V i + 1, i = 1, 2, 3 (we take 0.7 as this correlation). Figure 2.5: D-vine for Investment, in the Dependence View A glance at the distribution of the output variable, 5yrReturn, reveals that it is different for each of the three dependence structures described above. The means and standard deviations only give us an idea of the differences (Table 1). Dependence Structure 5yrReturn mean 5yrReturn standard deviation Tree with latent variable 1630 185.7 Tree with 0.7 correlations 1620 167.6 D-vine 1610 130.8 Table 1: 5yrReturn distribution details for different dependence structures It is also possible to create a hybrid structure, in which one or more of a tree s nodes are vines themselves (Kurowicka and Cooke, 2006, Ch.6). Other dependence structures can be specified. Canonical vines (C-vines) are also available, as are Bayesian belief nets (which we will treat separately at the end of this paper). Whatever the structure, though, the choice of copula for realising the specified rank correlations is important, and in the 3

case of trees and vines we can choose between the diagonal band, Frank s and elliptical copulae. Two other copulae are additionally available, namely the minimum information copula for trees and the normal copula for vines. Figure 2.6: The diagonal band, elliptical, Frank s and minimum information copulae with correlation 0.7 (scatter plot rendered by Unicorn s graphical interpretation tool) 2.4 Sampling In order to evaluate our model, we will sample the joint distribution described by the random variables marginal distributions and the dependence structure that we have chosen. We have control over the number of samples to be generated, as well as the format(s) of the output sample file(s). 2.5 Post Processing After the joint distribution has been sampled, we have a few options open to us for analysing the result. Reporting summarises information about the input and output data. Various properties of the output distributions are described (percentiles, moments, etc) as is the correlation matrix (rank or product moment) of the entire joint distribution. The graphical interpretation tool allows a visual exploration of the joint distribution, with scatter plots (Figure 2.6), density plots, histograms and cobweb plots available. In a cobweb plot, variables are represented as vertical lines (the percentile, natural or logarithmic scale can be chosen for the variables). Each sample realizes one value of each variable. Connecting these values by line segments, one sample is represented as a jagged line intersecting all the vertical lines (Figure 2.7). Figure 2.7: Cobweb plot for the D-vine for Investment Figure 2.8: Cobweb plot for the D-vine for Investment, conditionalised on the upper 5% of V 1 s distribution 4

An interesting feature of the cobweb plot representation is that it is interactive, in the sense that it allows us to conditionalise on certain ranges. Suppose that the interest for the third year, V 3, is in the upper 5% of its distribution. Figure 2.8 shows the result of this conditionalising in the D-vine. Note that the very high values of V 3 imply high values for 5yrReturn. V 3 was chosen for a reason: after examining the five cobweb plots conditionalised on the upper 5% of the distribution of each of the interest variables, we have noticed that V 3 yields the highest values for 5yrReturn. In this sense, the graphical interpretation tool can also be seen as providing local sensitivity analysis. For global sensitivity measures, the sensitivity analysis module is available. It assesses the importance of the input variables for the output variable(s), by calculating various statistics and sensitivity measures (such as partial correlations, correlation ratios, etc.) based on the joint distribution sample. For our exercise, the highest correlation ratio (at 0.775) is shown to be between V 3 and 5yrReturn, which confirms what we have noticed in the conditional cobweb plots. We also have a probabilistic inversion tool at our disposal, for determining input variable distributions given specific constraints on the output distribution(s). 1 2.6 Bayesian Belief Nets One of the dependence structures that we can specify across the random variables in Unicorn is the Bayesian belief net (BBN). This case is treated separately because, currently, choosing this particular dependence structure allows access to functionality which is otherwise unavailable, such as analytical conditioning. The functionality associated with the other dependence structures will also be extended over time (the Unicorn software package is being continuously developed). In Unicorn, the BBN nodes are associated to either continuous or discrete random variables, and the BBN arcs correspond to rank correlations and conditional rank correlations. In the investment example we wish to include a new influence in the model: at the end of the third year the government plans to reform the tax system, which would influence interest rates. This can easily be captured, together with the previous constraints, in a BBN (Figure 2.9). The part of the BBN containing the interest rates V i is equivalent to the D-vine in Section 2.3. A new random variable has been added: T ax, normally distributed, with mean 20 and standard deviation 2 2. Of course we have to determine the values for the influences of the tax on the interest for the fourth and fifth year (which are represented in the BBN by two conditional rank correlation coefficients). After the BBN structure has been decided, evidence can be propagated through the graph, via analytical conditioning. In Unicorn, one or more of the variables can be set to a point value within their range, and the BBN can then be updated. When working with BBNs the marginal distributions of the nodes can be visualised directly in the graph, a feature which is particularly useful when conditioning the structure. Suppose we now 1 More information about probabilistic inversion, some applications and examples can be found in Chapter 9 of Kurowicka and Cooke (2006). 2 This distribution is arbitrarily chosen, any other distribution could have been considered. 5

Figure 2.9: Bayesian belief net for Investment learn that the interests for the first and second year, V 1 and V 2, have a value of 0.14 and 0.1 respectively. We can see how this affects the other interest rates (Figure 2.10). Figure 2.10: Conditionalised BBN for Investment, with V 1 = 0.14 and V 2 = 0.1 The grey distributions in the background are the unconditional marginal distributions, provided for comparison. The conditional means and standard deviations are displayed under the histograms. The joint distribution described by a BBN (both unconditional and conditional) can be sampled exactly as in the case of the dependence structures discussed earlier. The distribution of 5yrReturn can also be analysed in both cases. We find out that its mean and standard deviation are 1610 and 130.4 for the unconditional BBN, and 1550 and 68.7 in the case of the conditional BBN. The decrease in mean is consistent with the conditional marginal distributions seen for V 3, V 4 and V 5 (Figure 2.10) while the decrease in standard deviation was to be expected as evidence was introduced, and hence there is less uncertainty in the model. 3 Conclusion The UNICORN software package is a flexible tool for dependence modelling for high dimensional distributions, incorporating a number of dependence structures which make it suitable for modelling an ample range of scenarios, as well as various post processing tools for analysing the model output. Its continuing development within Delft s Institute of Applied Mathematics ensures it stays abreast with scientific research. References Kurowicka, D. and Cooke, R. (2006). Uncertainty Analysis with High Dimensional Dependence Modelling. Wiley. 6