Section 1.3: More Probability and Decisions: Linear Combinations and Continuous Random Variables

Similar documents
Section 0: Introduction and Review of Basic Concepts

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Section 1.4: Learning from data

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Lecture 3: Return vs Risk: Mean-Variance Analysis

Chapter 16. Random Variables. Copyright 2010 Pearson Education, Inc.

Discrete probability distributions

Review. Binomial random variable

Business Statistics 41000: Probability 4

Populations and Samples Bios 662

Lecture 4: Return vs Risk: Mean-Variance Analysis

15.063: Communicating with Data Summer Recitation 3 Probability II

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

Math 5760/6890 Introduction to Mathematical Finance

Business Statistics 41000: Probability 3

Statistics for Business and Economics

A useful modeling tricks.

Chapter 16. Random Variables. Copyright 2010, 2007, 2004 Pearson Education, Inc.

STA Module 3B Discrete Random Variables

Chapter 4 Continuous Random Variables and Probability Distributions

1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by. Cov(X, Y ) = E(X E(X))(Y E(Y ))

Standard Normal, Inverse Normal and Sampling Distributions

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Chapter 4 Continuous Random Variables and Probability Distributions

Numerical Descriptive Measures. Measures of Center: Mean and Median

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Statistic Midterm. Spring This is a closed-book, closed-notes exam. You may use any calculator.

Continuous Probability Distributions & Normal Distribution

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

This chapter reviews basic probability concepts that are necessary for the modeling and statistical analysis of financial data.

Continuous random variables

The Normal Distribution

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

AP Statistics Chapter 6 - Random Variables

Discrete Random Variables

4.1 Introduction Estimating a population mean The problem with estimating a population mean with a sample mean: an example...

Random Variables. Copyright 2009 Pearson Education, Inc.

Module 4: Probability

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Honor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination.

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Statistics 511 Supplemental Materials

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

NORMAL APPROXIMATION. In the last chapter we discovered that, when sampling from almost any distribution, e r2 2 rdrdϕ = 2π e u du =2π.

Discrete Random Variables and Probability Distributions

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

X Prob

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

E509A: Principle of Biostatistics. GY Zou

Chapter 5. Statistical inference for Parametric Models

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Chapter 7: Point Estimation and Sampling Distributions

Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

6. Continous Distributions

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

Data Analysis and Statistical Methods Statistics 651

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Chapter 8 Statistical Intervals for a Single Sample

Statistics and Probability

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Examples of continuous probability distributions: The normal and standard normal

Random variables. Discrete random variables. Continuous random variables.

Probability: Week 4. Kwonsang Lee. University of Pennsylvania February 13, 2015

Sampling Distribution

Economics 430 Handout on Rational Expectations: Part I. Review of Statistics: Notation and Definitions

Descriptive Statistics (Devore Chapter One)

Data Analysis and Statistical Methods Statistics 651

Chapter 5: Statistical Inference (in General)

Review of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet...

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

Law of Large Numbers, Central Limit Theorem

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

Prob and Stats, Nov 7

Sampling and sampling distribution

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

Calculating VaR. There are several approaches for calculating the Value at Risk figure. The most popular are the

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

FEEG6017 lecture: The normal distribution, estimation, confidence intervals. Markus Brede,

The Binomial Distribution

Risk and Return and Portfolio Theory

Martingales, Part II, with Exercise Due 9/21

Lecture 1: The Econometrics of Financial Returns

BIOL The Normal Distribution and the Central Limit Theorem

5.3 Statistics and Their Distributions

The Binomial Distribution

Statistics, Their Distributions, and the Central Limit Theorem

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

ECON Introductory Econometrics. Lecture 1: Introduction and Review of Statistics

Standard Normal Calculations

6. THE BINOMIAL DISTRIBUTION

Describing Data: One Quantitative Variable

Transcription:

Section 1.3: More Probability and Decisions: Linear Combinations and Continuous Random Variables Jared S. Murray The University of Texas at Austin McCombs School of Business OpenIntro Statistics, Chapters 2.4.2, 2.4.3, and 3.1-3.2 1

Introduction We ve seen how the expected value (our best prediction) and variance/standard deviation (how risky our best prediction is) help us think about uncertainty and make decisions in simple scenarios We need some more tools for thinking about 1. Multiple random variables (sources of uncertainty) 2. Other kinds of random variables - continuous outcomes 2

Covariance A measure of dependence between two random variables... It tells us how two unknown quantities tend to move together: Positive One goes up (down), the other tends to go up (down). Negative One goes down (up), the other tends to go up (down). If X and Y are independent, Cov(X, Y ) = 0 BUT Cov(X, Y ) = 0 does not mean X and Y are independent (more on this later). The Covariance is defined as (for discrete X and Y ): Cov(X, Y ) = n m Pr(x i, y j ) [x i E(X )] [y j E(Y )] i=1 j=1 3

Ford vs. Tesla Assume a very simple joint distribution of monthly returns for Ford (F ) and Tesla (T ): t=-7% t=0% t=7% Pr(F=f) f=-4% 0.06 0.07 0.02 0.15 f=0% 0.03 0.62 0.02 0.67 f=4% 0.00 0.11 0.07 0.18 Pr(T=t) 0.09 0.80 0.11 1 Let s summarize this table with some numbers... 4

Example: Ford vs. Tesla t=-7% t=0% t=7% Pr(F=f) f=-4% 0.06 0.07 0.02 0.15 f=0% 0.03 0.62 0.02 0.67 f=4% 0.00 0.11 0.07 0.18 Pr(T=t) 0.09 0.80 0.11 1 E(F ) = 0.12, E(T ) = 0.14 Var(F ) = 5.25, sd(f ) = 2.29, Var(T ) = 9.76, sd(t ) = 3.12 What is the better stock? 5

Example: Ford vs. Tesla t=-7% t=0% t=7% Pr(F=f) f=-4% 0.06 0.07 0.02 0.15 f=0% 0.03 0.62 0.02 0.67 f=4% 0.00 0.11 0.07 0.18 Pr(T=t) 0.09 0.80 0.11 1 Cov(F, T ) =( 7 0.14)( 4 0.12)0.06 + ( 7 0.14)(0 0.12)0.03+ ( 7 0.14)(4 0.12)0.00+(0 0.14)( 4 0.12)0.07+ (0 0.14)(0 0.12)0.62 + (0 0.14)(4 0.12)0.11+ (7 0.14)( 4 0.12)0.02 + (7 0.14)(0 0.12)0.02+ (7 0.14)(4 0.12)0.07 = 3.063 Okay, the covariance in positive... makes sense, but can we get a more intuitive number? 6

Correlation Corr(X, Y ) = Cov(X, Y ) sd(x )sd(y ) What are the units of Corr(X, Y )? It doesn t depend on the units of X or Y! 1 Corr(X, Y ) 1 In our Ford vs. Tesla example: Corr(F, T ) = 3.063 = 0.428 (not too strong!) 2.29 3.12 7

Linear Combination of Random Variables Is it better to hold Ford or Tesla? How about half and half? To answer this question we need to understand the behavior of the weighted sum (linear combinations) of two random variables... Let X and Y be two random variables: E(aX + by + c) = ae(x ) + be(y ) + c Var(aX +by +c) = a 2 Var(X )+b 2 Var(Y )+2ab Cov(X, Y ) 8

Linear Combination of Random Variables Applying this to the Ford vs. Tesla example... E(0.5F + 0.5T ) = 0.5E(F ) + 0.5E(T ) = 0.5 0.12 + 0.5 0.14 = 0.13 Var(0.5F + 0.5T ) = (0.5) 2 Var(F ) + (0.5) 2 Var(T ) + 2(0.5)(0.5) Cov(F, T ) = (0.5) 2 (5.25) + (0.5) 2 (9.76) + 2(0.5)(0.5) 3.063 = 5.28 sd(0.5f + 0.5T ) = 2.297 so, what is better? Holding Ford, Tesla or the combination? 9

Risk Adjustment: Sharpe Ratio The Sharpe ratio is a unitless quantity used to compare investments: (average return) - (return on a risk-free investment) standard deviation of returns Idea: Standardize the average excess return by the amount of risk. ( Risk adjusted returns ) Ignoring the risk-free investment, what are the Sharpe ratios for Ford, Tesla, and the 50-50 portfolio? 10

Linear Combination of Random Variables More generally... E(w 1 X 1 + w 2 X 2 +...w p X p + c) = w 1 E(X 1 ) + w 2 E(X 2 ) +... + w p E(X p ) + c = p i=1 w ie(x i ) + c Var(w 1 X 1 + w 2 X 2 +...w p X p + c) = w 2 1 Var(X 1) + w 2 2 Var(X 2) +...+wp 2 Var(X p )+2w 1 w 2 Cov(X 1, X 2 )+2w 1 w 3 Cov(X 1, X 3 )+... = p i=1 w i 2Var(X i) + p i=1 j i w iw j Cov(X i, X j ) where w 1, w 2,..., w p and c are constants 11

Continuous Random Variables Suppose we are trying to predict tomorrow s return on the S&P500 (Or on a real Ford/Tesla portfolio)... Question: What is the random variable of interest? What are its possible outcomes? Could you list them? Question: How can we describe our uncertainty about tomorrow s outcome? 12

Continuous Random Variables Recall: a random variable is a number about which we re uncertain, but can describe the possible outcomes. Listing all possible values isn t possible for continuous random variables, we have to use intervals. The probability the r.v. falls in an interval is given by the area under the probability density function. For a continuous r.v., the probability assigned to any single value is zero! 13

The Normal Distribution The Normal distribution is the most used probability distribution to describe a continuous random variable. Its probability density function (pdf) is symmetric and bell-shaped. The probability the number ends up in an interval is given by the area under the pdf. standard normal pdf 0.0 0.1 0.2 0.3 0.4 4 2 0 2 4 14

The Normal Distribution The standard Normal distribution has mean 0 and has variance 1. Notation: If Z N(0, 1) (Z is the random variable) Pr( 1 < Z < 1) = 0.68 Pr( 1.96 < Z < 1.96) = 0.95 standard normal pdf 0.0 0.1 0.2 0.3 0.4 standard normal pdf 0.0 0.1 0.2 0.3 0.4 4 2 0 2 4 z 4 2 0 2 4 z 15

The Normal Distribution Note: For simplicity we will often use P( 2 < Z < 2) 0.95 Questions: What is Pr(Z < 2)? How about Pr(Z 2)? What is Pr(Z < 0)? 16

The Normal Distribution The standard normal is not that useful by itself. When we say the normal distribution, we really mean a family of distributions. We obtain pdfs in the normal family by shifting the bell curve around and spreading it out (or tightening it up). 17

The Normal Distribution We write X N(µ, σ 2 ). X has a Normal distribution with mean µ and variance σ 2. The parameter µ determines where the curve is. The center of the curve is µ. The parameter σ determines how spread out the curve is. The area under the curve in the interval (µ 2σ, µ + 2σ) is 95%. Pr(µ 2 σ < X < µ + 2 σ) 0.95 µ 2σ µ σ µ µ + σ µ + 2σ 18

Recall: Mean and Variance of a Random Variable For the normal family of distributions we can see that the parameter µ determines where the distribution is located or centered. The expected value µ is usually our best guess for a prediction. The parameter σ (the standard deviation) indicates how spread out the distribution is. This gives us and indication about how uncertain or how risky our prediction is. 19

The Normal Distribution Example: Below are the pdfs of X 1 N(0, 1), X 2 N(3, 1), and X 3 N(0, 16). Which pdf goes with which X? 8 6 4 2 0 2 4 6 8 20

The Normal Distribution Example Assume the annual returns on the SP500 are normally distributed with mean 6% and standard deviation 15%. SP500 N(6, 225). (Notice: 15 2 = 225). Two questions: (i) What is the chance of losing money in a given year? (ii) What is the value such that there s only a 2% chance of losing that or more? Lloyd Blankfein: I spend 98% of my time thinking about.02 probability events! (i) Pr(SP500 < 0) and (ii) Pr(SP500 <?) = 0.02 21

The Normal Distribution Example prob less than 0 prob is 2% 0.000 0.010 0.020 40 20 0 20 40 60 sp500 0.000 0.010 0.020 40 20 0 20 40 60 sp500 (i) Pr(SP500 < 0) = 0.35 and (ii) Pr(SP500 < 25) = 0.02 22

The Normal Distribution in R In R, calculations with the normal distribution are easy! (Remember to use SD, not Var) To compute Pr(SP500 < 0) =?: pnorm(0, mean = 6, sd = 15) ## [1] 0.3445783 To solve Pr(SP500 <?) = 0.02: qnorm(0.02, mean = 6, sd = 15) ## [1] -24.80623 23

The Normal Distribution 1. Note: In X N(µ, σ 2 ) µ is the mean and σ 2 is the variance. 2. Standardization: if X N(µ, σ 2 ) then Z = X µ σ N(0, 1) 3. Summary: X N(µ, σ 2 ): µ: where the curve is σ: how spread out the curve is 95% chance X µ ± 2σ. 24

The Normal Distribution Another Example Prior to the 1987 crash, monthly S&P500 returns (r) followed (approximately) a normal with mean 0.012 and standard deviation equal to 0.043. How extreme was the crash of -0.2176? The standardization helps us interpret these numbers... r N(0.012, 0.043 2 ) For the crash, z = r 0.012 0.043 N(0, 1) z = 0.2176 0.012 0.043 = 5.27 How extreme is this zvalue? 5 standard deviations away!! 25

Portfolios, once again... As before, let s assume that the annual returns on the SP500 are normally distributed with mean 6% and standard deviation of 15%, i.e., SP500 N(6, 15 2 ) Let s also assume that annual returns on bonds are normally distributed with mean 2% and standard deviation 5%, i.e., Bonds N(2, 5 2 ) What is the best investment? What else do I need to know if I want to consider a portfolio of SP500 and bonds? 26

Portfolios once again... Additionally, let s assume the correlation between the returns on SP500 and the returns on bonds is -0.2. How does this information impact our evaluation of the best available investment? Recall that for two random variables X and Y : E(aX + by ) = ae(x ) + be(y ) Var(aX + by ) = a 2 Var(X ) + b 2 Var(Y ) + 2ab Cov(X, Y ) One more very useful property... sum of normal random variables is a new normal random variable! 27

Portfolios once again... What is the behavior of the returns of a portfolio with 70% in the SP500 and 30% in Bonds? E(0.7SP500 + 0.3Bonds) = 0.7E(SP500) + 0.3E(Bonds) = 0.7 6 + 0.3 2 = 4.8 Var(0.7SP500 + 0.3Bonds) = (0.7) 2 Var(SP500) + (0.3) 2 Var(Bonds) + 2(0.7)(0.3) Corr(SP500, Bonds) sd(sp500) sd(bonds) = (0.7) 2 (15 2 ) + (0.3) 2 (5 2 ) + 2(0.7)(0.3) 0.2 15 5 = 106.2 Portfolio N(4.8, 10.3 2 ) What do you think about this portfolio? Is there a better set of weights? 28

Simulating Normal Random Variables Imagine you invest $1 in the SP500 today and want to know how much money you are going to have in 20 years. We can assume, once again, that the returns on the SP500 on a given year follow N(6, 15 2 ) Let s also assume returns are independent year after year... Are my total returns just the sum of returns over 20 years? Not quite... compounding gets in the way. Let s simulate potential futures 29

Simulating one normal r.v. At the end of the first year I have $(1 (1 + pct return/100)). val = 1 + rnorm(1, 6, 15)/100 print(val) ## [1] 0.9660319 rnorm(n, mu, sigma) draws n samples from a normal distribution with mean µ and standard deviation σ. 30

Simulating compounding We reinvest our earnings in year 2, and every year after that: for(year in 2:20) { val = val*(1 + rnorm(1, 6, 15)/100) } print(val) ## [1] 4.631522 31

Simulating a few more futures We did pretty well - our $1 has grown to $4.63, but is that typical? Let s do a few more simulations: Value of $1 1 2 3 4 5 0 5 10 15 20 year 32

More efficient simulations Let s simulate 10,000 futures under this model. Recall the value of my investment at time T is T (1 + r t /100) t=1 where r t is the percent return in year t library(mosaic) num.sim = 10000 num.years = 20 values = do(num.sim) * { prod(1 + rnorm(num.years, 6, 15)/100) } 33

Simulation results Now we can answer all kinds of questions: What is the mean value of our investment after 20 years? vals = values$result mean(vals) ## [1] 3.187742 What s the probability we beat a fixed-income investment (say at 2%)? sum(vals > 1.02^20)/num.sim ## [1] 0.8083 34

Simulation results What s the median value? median(vals) ## [1] 2.627745 (Recall: The median of a probability distribution (say m) is the point such that Pr(X m) = 0.5 and Pr(X > m) = 0.5 when X has the given distribution). Remember the mean of our simulated values was 3.19... 35

Median and skewness For symmetric distributions, the expected value (mean) and the median are the same... look at all of our normal distribution examples. But sometimes, distributions are skewed, i.e., not symmetric. In those cases the median becomes another helpful summary! 36

Probability density function of our wealth at T = 20 We see the estimated distribution is skewed to the right if we use the simulations to estimate the pdf: Value of $1 in 20 years 0.00 0.10 0.20 mean ( 3.19 ) median ( 2.63 ) 0 5 10 15 20 25 $$ 37

What s next? What s mising from this picture? Where did SP500 s 6% returns with an SD of 15% come from? Up next: Learning parameters from data (statistics!), and uncertainty in parameters 38