Universal Portfolios

Similar documents
Chapter 3 - Lecture 4 Moments and Moment Generating Funct

ECON 6022B Problem Set 2 Suggested Solutions Fall 2011

Scenario Generation and Sampling Methods

CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION

Optimal Stopping. Nick Hay (presentation follows Thomas Ferguson s Optimal Stopping and Applications) November 6, 2008

Lecture 17: More on Markov Decision Processes. Reinforcement learning

Copyright (C) 2001 David K. Levine This document is an open textbook; you can redistribute it and/or modify it under the terms of version 1 of the

Homework Assignments

Session 9: The expected utility framework p. 1

Lecture 23: April 10

Chapter 8. Markowitz Portfolio Theory. 8.1 Expected Returns and Covariance

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Asymptotic results discrete time martingales and stochastic algorithms

Probability. An intro for calculus students P= Figure 1: A normal integral

Simulation Wrap-up, Statistics COS 323

u (x) < 0. and if you believe in diminishing return of the wealth, then you would require

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Notes on Auctions. Theorem 1 In a second price sealed bid auction bidding your valuation is always a weakly dominant strategy.

Advanced Risk Management

Lecture Notes 1

Asymmetric Information: Walrasian Equilibria, and Rational Expectations Equilibria

Economics 2010c: Lecture 4 Precautionary Savings and Liquidity Constraints

Theoretical Statistics. Lecture 3. Peter Bartlett

Final exam solutions

Managerial Economics Uncertainty

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2017

18.440: Lecture 32 Strong law of large numbers and Jensen s inequality

Lecture 7: Bayesian approach to MAB - Gittins index

Models and Decision with Financial Applications UNIT 1: Elements of Decision under Uncertainty

Information Theory and Networks

Risk aversion and choice under uncertainty

TUFTS UNIVERSITY DEPARTMENT OF CIVIL AND ENVIRONMENTAL ENGINEERING ES 152 ENGINEERING SYSTEMS Spring Lesson 16 Introduction to Game Theory

Expected Utility and Risk Aversion

1 Consumption and saving under uncertainty

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

Casino gambling problem under probability weighting

QDquaderni. Defensive Online Portfolio Selection E. Fagiuoli, F. Stella, A. Ventura. research report n. 1 july university of milano bicocca

Lecture 19: March 20

Multi-armed bandit problems

The method of Maximum Likelihood.

Problem Set 3. Thomas Philippon. April 19, Human Wealth, Financial Wealth and Consumption

Chapter 1. Utility Theory. 1.1 Introduction

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

STOCHASTIC CALCULUS AND BLACK-SCHOLES MODEL

EE266 Homework 5 Solutions

Microeconomic Theory III Spring 2009

MS-E2114 Investment Science Lecture 5: Mean-variance portfolio theory

Lecture 10: The knapsack problem

CS364B: Frontiers in Mechanism Design Lecture #18: Multi-Parameter Revenue-Maximization

FE 5204 Stochastic Differential Equations

Definition 9.1 A point estimate is any function T (X 1,..., X n ) of a random sample. We often write an estimator of the parameter θ as ˆθ.

Optimization Models in Financial Mathematics

16 MAKING SIMPLE DECISIONS

Probability without Measure!

Multi-period mean variance asset allocation: Is it bad to win the lottery?

Sublinear Time Algorithms Oct 19, Lecture 1

TOPIC: PROBABILITY DISTRIBUTIONS

(Practice Version) Midterm Exam 1

Optimization Problem In Single Period Markets

The Conservative Expected Value: A New Measure with Motivation from Stock Trading via Feedback

Chapter 8: Sampling distributions of estimators Sections

Characterization of the Optimum

Lecture Quantitative Finance Spring Term 2015

X ln( +1 ) +1 [0 ] Γ( )

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 11 10/9/2013. Martingales and stopping times II

Lecture 11: Bandits with Knapsacks

X i = 124 MARTINGALES

Advanced Topics in Derivative Pricing Models. Topic 4 - Variance products and volatility derivatives

Lifetime Portfolio Selection: A Simple Derivation

16 MAKING SIMPLE DECISIONS

Sampling Distribution

Optimizing Portfolios

Stochastic Optimal Control

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

Stat 6863-Handout 1 Economics of Insurance and Risk June 2008, Maurice A. Geraghty

The value of foresight

A lower bound on seller revenue in single buyer monopoly auctions

Mechanism Design and Auctions

Equity correlations implied by index options: estimation and model uncertainty analysis

Mean-Variance Analysis

6. Martingales. = Zn. Think of Z n+1 as being a gambler s earnings after n+1 games. If the game if fair, then E [ Z n+1 Z n

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

An Introduction to Stochastic Calculus

LECTURE NOTES 3 ARIEL M. VIALE

FURTHER ASPECTS OF GAMBLING WITH THE KELLY CRITERION. We consider two aspects of gambling with the Kelly criterion. First, we show that for

Time Resolution of the St. Petersburg Paradox: A Rebuttal

LECTURE 2: MULTIPERIOD MODELS AND TREES

5. In fact, any function of a random variable is also a random variable

Chapter 7: Point Estimation and Sampling Distributions

Bernoulli and Binomial Distributions

1.1 Basic Financial Derivatives: Forward Contracts and Options

Performance of Stochastic Programming Solutions

7. Infinite Games. II 1

market opportunity line fair odds line Example 6.6, p. 120.

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture #24 Scribe: Jordan Ash May 1, 2014

A Comparison of Universal and Mean-Variance Efficient Portfolios p. 1/28

STATE UNIVERSITY OF NEW YORK AT ALBANY Department of Economics. Ph. D. Comprehensive Examination: Macroeconomics Spring, 2009

PORTFOLIO THEORY. Master in Finance INVESTMENTS. Szabolcs Sebestyén

SF2972 GAME THEORY Infinite games

KIER DISCUSSION PAPER SERIES

Transcription:

CS28B/Stat24B (Spring 2008) Statistical Learning Theory Lecture: 27 Universal Portfolios Lecturer: Peter Bartlett Scribes: Boriska Toth and Oriol Vinyals Portfolio optimization setting Suppose we have m instruments (i.e. stocks). At each period of time t, we have wealth V t, and we invest a proportion p t,i in each instrument i. Instrument i increases in value by a factor x t,i 0. Our capital at V period t thus increases by a factor t V t = i x t,ip t,i = x tp t. p t is in the simplex m, and x t is a random m-dimensional vector. The first question to consider is: what is a fair bet? Is E x b the right measure? Should one be satisfied making an investment of b, when E x b >? In this setting of investment, one cares greatly about risk, in other words, the variance and not just the expectation of returns. The famous example below demonstrates. Example. St. Petersburg paradox Suppose x is a r.v. such that, at each time t =, 2,.. there is a coin flip. Let t denote the first time a heads comes up. Then the payout is 2 t for t = t, and 0 otherwise. Then E t= 2 i 2 i = t= = Even though this procedure has infinite expected return, it doesn t make sense to invest much money in it, as there is very low probability of a large return. (Note that here we are using an additive and not multiplicative model of increases in wealth at each time.) Four main approaches to portfolio optimization, that all capture the tradeoff between expected wealth and risk, are:. Markovitz approach 2. Utility function approach 3. Optimizing for optimal growth rate 4. Constant rebalanced portfolio (CRP) approach. Markovitz optimization This approach was developed by Markovitz in the 50 s. Var(X) = Σ. We have returns X R m +, with EX = µ and

2 Universal Portfolios We maximize expected return, subject to a constraint on the risk (this example doesn t contain multiple time steps): max EX : max µ p : p m p m Var(X p) R p Σp R Without the constraints that bound variance, we have a linear optimization [max µ p] over a simplex m. The optimal solution to that is always to pick a vertex of the simplex, which corresponds to the pure strategy of investing in one instrument. The constraint p Σp R restricts p m to also be within an ellipsoid, so that the solution might involve mixing over more than one instrument (diversification helps control risk)..2 Utility function approach This approach consists of defining a utility function U() of values, chosen to take into account the tradeoff between expected wealth and risk, so that we maximize E U(x p). This is the solution to the St. Petersburg problem proposed by Bernoulli. To capture the criterion that a given expected gain with higher variance is worth less than one with lower variance, we should use a concave function with EU(V ) U(EV ) Taking the second order Taylor approximation to U(V ) at V = EV, and taking the expectation of both sides, we have EU(V ) U(EV ) + 2 U (EV )Var(V ) For a concave function, U (EV ) 0, so we have a term that penalizes variance in optimizing the expected utility. The log() function is used widely as a utility function, as it is in the constant rebalanced portfolio approach..3 Example using log utility function Suppose we have 2 instruments, with one of them risk-free, such that at all times: Pr(X t, = ) = Pr(X t,2 = 0) = p < 2 Pr(X t,2 = 2) = p We start with > 0 capital. Putting all our capital on 2, we have E V T = (2( p)) T. This goes to infinity in the limit of T. Now suppose we want to maximize the asymptotic growth rate of the log-of-return. Denote by b t the proportion of capital bet on instrument 2 at time t. Also define W t = [X t,2 = 2] ( win ). Then V t+ = V t ( + b t ) Wt ( b t ) Wt

Universal Portfolios 3 The asymptotic growth rate of the log-of-return is: G = lim T T log V T 2 = lim T T T t= W t log 2 ( + b t ) + ( W t ) log 2 ( b t ) This equality comes from expanding V T as a telescoping product. Note that if Z t Z, t =, 2,.. are i.i.d. random variables, and f() a real function on Z, then with probability we have lim T T T t= f(z t) = E(Z t ) So we have that with probability G = ( p) log 2 ( + b t ) + p log 2 ( b t ) = E log V t+ V t It is straightforward to see that the optimal investment at any time t is b t = 2p. Note that in this example, since we have i.i.d. returns, it doesn t actually make a difference in expectation to average growth rate over time. Both the expected growth rate, and the optimal b t, are the same in each period. Also note that there is a close connection between maximizing the expected log return, and finding the ideal channel capacity in information theory. 2 Constant rebalanced portfolios Definition. For random returns x t R m +, a strategy b is log optimal if b t (x, x 2,... x t ) = argmax b m E[log(b x t ) x 0,... x t ]. To motivate this definition, suppose we want to maximize E (log V T ) by choosing strategy b t (x, x 2,... x t ) that can depend on all known return values. Expressing V T as a telescoping product, and using Vt V t = b tx t, we have Thus we choose each b t as max E [ T log(b tx t )] {b t} t= b t = argmax bt(x,... x t ) E [log(b tx t )] = argmax bt E [log(b tx t ) x,... x t ] Note that if the returns are i.i.d., then b t (x, x 2,... x t ) = argmax b m E[log(b x], so we have a constant log optimal strategy b at each time. This is called a constant rebalanced portfolio (CPR). This means we must rebalance our investment, after the stocks have grown at non-uniform rates to yield a different balance than b, such that we use strategy b at each discrete time interval.

4 Universal Portfolios Theorem 2.. Given i.i.d. returns x t, if the log optimal strategy b has capital growth, V,... V T and some strategy b has growth, V,... V T, then lim sup T T log V T V T We can see that this makes sense because log V T V T 0 = log V T almost surely log V T that lim T T T t= log(x tb ) E log(x b), which is maximized by b. = T t= [log(x tb) log(x tb )]. We saw Thus, in the i.i.d. case, choosing the log optimal portfolio b as a CPR is optimal. To summarize, in the i.i.d. case, there is a constant strategy vector b, such that if the investment is rebalanced at each step to strategy b, then we have optimal asymptotic growth rate of log returns. This overall investment strategy of finding a log optimal investment vector, and rebalancing to keep investment constant, is called the CRP. 3 CRP b We define the CRP b as the strategy described above of rebalancing a portfolio at each time step to the constant investment vector b m (here we will not just be considering the case of i.i.d. returns and log optimal b). E.g. : Let {( ) ( 2 ) ( ) } X t =, 2 2,,... 2 2 2 If we pick the constant rebalanced strategy of b = ( 2 2), that is, uniform allocation of money, the growth is exponential with rate= 5 4. On the other hand, it is easy to check that any non-rebalanced strategy, where money is not moved between the two instruments after the first investment, achieves a growth of only. E.g. 2: Dow Jones Industrial Averages (DJIA), which is a commonly used index to evaluate the US market performance and which equally distributes the weight among 30 companies, is another example of CRP. The year-to-date change for 2008 is.56 %. Theorem 3.. There is a strategy b U (universal strategy) for which, for all X,..., X T, log(v T (b U )) log(v T (b )) (m ) log(t + ), where CRP b is the CRP having optimal log returns (best constant strategy in retrospect). Before giving proof, we first demonstrate a simplified version of this theorem. Consider the m pure CRP s: CRP b, b {e i, i =... m}. Suppose we want to do reasonably well (in the usual, log optimal sense) as compared to the best of these CRP s. A good strategy then is to use CRP bw, where b W places m in each instrument and does not rebalance. Then m T T log(v T (b W )) = log X t,j /m log max X t,j /m = = max log j ( T t= j= t= j t= ) X t,j log m = max log(v T (e j )) log m j

Universal Portfolios 5 which means that, by not knowing which is the best instrument, we pay at most a price of log m. Proof. The universal strategy b U consists of allocating capital uniformly across the simplex m at the first time step, so that we have dµ(b) capital at each strategy b m. Once we allocate capital uniformly across the simplex, money is never moved between the different portfolios b, so the different portfolios grow at different rates and don t have uniform allocation across the simplex after the first iteration. Note that the money invested in each portfolio b m is rebalanced at each step. Further, let V t (b) denote the ratio of capital at time t to capital at time 0 resulting from an investment of CRP b. By definition, (b) = b. Thus, at time t, we have V t (b) dµ(b) capital on strategy b. The actual strategy chosen at each time t is found by integrating across the simplex as follows: b t = m bv t (b) dµ(b) m V t (b) dµ(b) = m bv t (b)dµ(b) m V t (b)dµ(b) Let us now consider the set S a ball (in a topological sense) on the simplex centered on the optimal solution b and with radius ε, For b S, V (b) S = {( ε)b + εa : a m } = V (( ε)b + εa) ( ε) V (b ) and, using the same inequality for times t, t and telescoping the product, we get V T (b) V T (b ) ( ε)t. So far, we ve seen that choosing b S is not a bad choice. Also, the proportion of allocated to S is µ(s) = µ({εa : a m }) = ε m. It is straightforward to see that log V T (b U ) V T (b ) log(( ε)t ε m ) If we choose ε = T +, log(v T ) log(v T (b )) (m ) log(t + ). Some references here are Kelly, Breiman, Algoet, and Cover. 4 Prediction with log loss Given a distribution X t, we want to predict it using an estimation p t. The log loss is defined as: l( p t, X t ) = log( p t (X t )) In the i.i.d. case, the expected log loss and the KL divergence are closely related (they are the same except for an additional term of the entropy of the true distribution). If we have several predictions of the probability function p t,,..., p t,m, we can formulate the problem to find the optimal mixture weight in terms of log loss (eg. language models). In particular, we want to find the optimal b m to minimize the following loss function:

6 Universal Portfolios ( m ) l( p t, X t ) = log b i p t,i (X t ) This is equivalent to the problem presented in this lecture and, therefore, we can use portfolio strategies to solve it. i=