Dynamic Portfolio Execution Detailed Proofs

Similar documents
Dynamic Portfolio Execution

Asymmetric Information: Walrasian Equilibria, and Rational Expectations Equilibria

The Optimization Process: An example of portfolio optimization

Stochastic Optimal Control

Martingale Pricing Theory in Discrete-Time and Discrete-Space Models

3.4 Copula approach for modeling default dependency. Two aspects of modeling the default times of several obligors

Lecture 7: Bayesian approach to MAB - Gittins index

Optimizing Portfolios

Characterization of the Optimum

Chapter 7: Portfolio Theory

An Approximation Algorithm for Capacity Allocation over a Single Flight Leg with Fare-Locking

Order book resilience, price manipulations, and the positive portfolio problem

Price manipulation in models of the order book

Part 1: q Theory and Irreversible Investment

Dynamic Portfolio Choice II

Arbitrage Conditions for Electricity Markets with Production and Storage

D.1 Sufficient conditions for the modified FV model

Course information FN3142 Quantitative finance

Asymptotic methods in risk management. Advances in Financial Mathematics

MACROECONOMICS. Prelim Exam

Portfolio Management and Optimal Execution via Convex Optimization

Black-Litterman Model

Moral Hazard: Dynamic Models. Preliminary Lecture Notes

CSCI 1951-G Optimization Methods in Finance Part 07: Portfolio Optimization

Lecture Notes 1

MATH3075/3975 FINANCIAL MATHEMATICS TUTORIAL PROBLEMS

1 Consumption and saving under uncertainty

Stock Repurchase with an Adaptive Reservation Price: A Study of the Greedy Policy

MS-E2114 Investment Science Lecture 5: Mean-variance portfolio theory

Optimization Problem In Single Period Markets

Quantitative Risk Management

Optimal order execution

Computer Vision Group Prof. Daniel Cremers. 7. Sequential Data

Online Appendix for The Political Economy of Municipal Pension Funding

Finite Memory and Imperfect Monitoring

Financial Giffen Goods: Examples and Counterexamples

Introduction to Probability Theory and Stochastic Processes for Finance Lecture Notes

What can we do with numerical optimization?

B. Online Appendix. where ɛ may be arbitrarily chosen to satisfy 0 < ɛ < s 1 and s 1 is defined in (B1). This can be rewritten as

Supplementary online material to Information tradeoffs in dynamic financial markets

Techniques for Calculating the Efficient Frontier

OPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF FINITE

Exponential utility maximization under partial information

Outline. 1 Introduction. 2 Algorithms. 3 Examples. Algorithm 1 General coordinate minimization framework. 1: Choose x 0 R n and set k 0.

The Correlation Smile Recovery

Lecture IV Portfolio management: Efficient portfolios. Introduction to Finance Mathematics Fall Financial mathematics

Self-Exciting Corporate Defaults: Contagion or Frailty?

Lecture 3: Factor models in modern portfolio choice

Carnegie Mellon University Graduate School of Industrial Administration

Eco504 Spring 2010 C. Sims FINAL EXAM. β t 1 2 φτ2 t subject to (1)

Lecture 5: Iterative Combinatorial Auctions

Chapter 9 Dynamic Models of Investment

Pricing Problems under the Markov Chain Choice Model

Asymptotic results discrete time martingales and stochastic algorithms

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices

Technical Note: Multi-Product Pricing Under the Generalized Extreme Value Models with Homogeneous Price Sensitivity Parameters

Web Appendix: Proofs and extensions.

Revenue Management Under the Markov Chain Choice Model

e-companion ONLY AVAILABLE IN ELECTRONIC FORM

Optimal liquidation with market parameter shift: a forward approach

Log-Robust Portfolio Management

Optimal Security Liquidation Algorithms

Online Appendix. ( ) =max

Optimal Trading Strategy With Optimal Horizon

Question 1 Consider an economy populated by a continuum of measure one of consumers whose preferences are defined by the utility function:

Financial Risk Management

CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION

Chapter 14. The Multi-Underlying Black-Scholes Model and Correlation

MSc Financial Engineering CHRISTMAS ASSIGNMENT: MERTON S JUMP-DIFFUSION MODEL. To be handed in by monday January 28, 2013

Evaluation of proportional portfolio insurance strategies

QI SHANG: General Equilibrium Analysis of Portfolio Benchmarking

Price Impact and Optimal Execution Strategy

Budget Setting Strategies for the Company s Divisions

Supplemental Materials for What is the Optimal Trading Frequency in Financial Markets? Not for Publication. October 21, 2016

FE570 Financial Markets and Trading. Stevens Institute of Technology

Information aggregation for timing decision making.

STATE UNIVERSITY OF NEW YORK AT ALBANY Department of Economics. Ph. D. Comprehensive Examination: Macroeconomics Fall, 2010

6.231 DYNAMIC PROGRAMMING LECTURE 5 LECTURE OUTLINE

Modelling Anti-Terrorist Surveillance Systems from a Queueing Perspective

BROWNIAN MOTION II. D.Majumdar

A Macroeconomic Framework for Quantifying Systemic Risk. June 2012

An Intertemporal Capital Asset Pricing Model

Strategic Trading of Informed Trader with Monopoly on Shortand Long-Lived Information

6. Continous Distributions

Statistical Inference and Methods

Portfolio Choice and Permanent Income

Dynamic Replication of Non-Maturing Assets and Liabilities

Hedging under Arbitrage

THE OPTIMAL ASSET ALLOCATION PROBLEMFOR AN INVESTOR THROUGH UTILITY MAXIMIZATION

Eco 504, Macroeconomic Theory II Final exam, Part 1, Monetary Theory and Policy, with Solutions

2.1 Mean-variance Analysis: Single-period Model

Valuing volatility and variance swaps for a non-gaussian Ornstein-Uhlenbeck stochastic volatility model

Approximate Revenue Maximization with Multiple Items

Solving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function?

13.3 A Stochastic Production Planning Model

STOCHASTIC CALCULUS AND BLACK-SCHOLES MODEL

Appendix: Common Currencies vs. Monetary Independence

IEOR E4703: Monte-Carlo Simulation

Multi-armed bandits in dynamic pricing

EC316a: Advanced Scientific Computation, Fall Discrete time, continuous state dynamic models: solution methods

Transcription:

Dynamic Portfolio Execution Detailed Proofs Gerry Tsoukalas, Jiang Wang, Kay Giesecke March 16, 2014 1 Proofs Lemma 1 (Temporary Price Impact) A buy order of size x being executed against i s ask-side inventory qi a, displaces the best ask price from a i,n a i,n, according to: a i,n (x) a i,n q a i (p)dp = x. Combining this expression with Assumption 1, we have a i,n (x) a i,n q a i 1 p ai,n dp = q a i ( ) a i,n (x) a i,n = x a i,n (x) = a i,n + x qi a. Therefore for x = x + i,n, we have a i,n = a i,n + x+ i,n q a i as fi,n a = a i,n a i,n = x+ i,n qi a. Similarly, for a sell order of size x, we have and the temporary price impact displacement is defined bi,n b i,n (x) q b i 1 p bi,n dp = q b i ( bi,n b i,n(x) ) = x b i,n (x) = b i,n x qi b. Therefore for x = x i,n, we have b i,n = b i,n x i,n q b i and b i,n b i,n = x i,n qi a. Lemma 2 (Best Prevailing Bid/Ask-Prices) We present below the derivation for the dynamics of the best ask price; the dynamics for the best bid price are derived in a similar way. The exponential decay Assumption 4, specifies the dynamics of the best ask price over period τ. The best ask price at n, before the trade at n arrives, depends on the previous displacement Tsoukalas (gtsouk@wharton.upenn.edu) is from the Wharton School, University of Pennsylvania. Wang (wangj@mit.edu) is from MIT Sloan School of Management, CAFR and NBER. Giesecke (giesecke@stanford.edu) is from Stanford University, MS&E. 1

from the trade at n 1, a i,n 1, and on how quickly new limit orders arrives and push the book towards the next steady state, a i,n. The notation n defines the time immediately preceding both the arrival of the trade and the realization of the random walk at n, i.e. a i,n = a i,n + ɛ i,n. More specifically, we have a i,n = a i,n + (a i,n 1 a i,n )e ρa i τ. (25) Executing an order at n 1, the temporary price impact denoted fi,n 1 a = x+ i,n qi a to will move i s best ask price a i,n 1 = a i,n 1 + f a i,n 1. (26) The executed order also has a net permanent price impact denoted g i,n 1 = λ ij (x + j,n 1 x j,n 1 ) on i s mid-price. Further, the new steady-state ask price at time n, before the trade arrives at n, is given by Plugging the expressions from (26) and (27) into equation (25), we obtain a i,n = a i,n 1 + g i,n 1, (27) a i,n = a i,n + (a i,n 1 a i,n )e ρa i τ = a i,n + (a i,n 1 + f a i,n 1 (a i,n 1 + g i,n 1 ))e ρa i τ = a i,n + (a i,n 1 a i,n 1 + f a i,n 1 g i,n 1 )e ρa i τ. Then, defining the functions d a i,n 1 = a i,n 1 a i,n 1 (28) and κ a i (x + i,n 1, x i,n 1 ) = κa i (x ± i,n 1 ) = f a i,n 1 g i,n 1, (29) the price a i,n can be written as a i,n = a i,n + (d a i,n 1 + κ a i (x ± i,n 1 ))e ρa i τ. (30) From equation (30), we have a i,n a i,n = (da i,n 1 + κa i (x± i,n 1 ))e ρa i τ and from equation (28), we have a i,n a i,n = d a i,n. (31) Therefore, combining these two equations, we obtain d a i,n = (d a i,n 1 + κ a i (x ± i,n 1 ))e ρa i τ, (32) 2

which is the recursive form of the state variable d a i,n given in Assumption 4. Assuming all the order books are originally full, i.e., d a i,0 = 0, i I M, we equivalently have in non-recursive form Note, κ a i (x± i,n 1 ) is given by d a i,n = κ a i (x ± i,n 1 )e ρa i (n k+1)τ. κ a i (x ± i,n 1 ) = f a i,k 1 g i,k 1 = x+ i,k 1 q a i λ ij (x + j,k 1 x j,k 1 ) = x + i,k 1 q a i λ ij δx j,k 1, (33) where δx j,k 1 = x + j,k 1 x j,k 1. Therefore, replacing κa i (x± i,n 1 ) by its explicit form in equation (33), we have d a i,n = x+ i,k 1 qi a λ ij δx j,k 1 e ρa i (n k+1)τ. (34) Next, we proceed to calculate the best ask price at a general time n. From equation (31), we have a i,n = a i,n + da i,n, where the steady-state price a i,n is given in Assumption 3. Specifically, a i,n = v i,n + 1 2 s i where v i,n is defined in Assumption 5 and takes the form Therefore, we have v i,n = u i,n + λ ij a i,n = a i,n + d a i,n = u i,n + 1 2 s i + ( ) x + j,k 1 x j,k 1. λ ij ( ) x + j,k 1 x j,k 1 + d a i,n. Finally, combining the above with the expression in equation (34), and extending the analysis to m assets and to the bid side, we have a i,n = u i,n + 1 2 s i + b i,n = u i,n 1 2 s i + λ ij δx j,k 1 + λ ij δx j,k 1 + x+ i,k 1 q a i x i,k 1 q b i λ ij δx j,k 1 e ρa i (n k+1)τ, (35) λ ij δx j,k 1 e ρb i (n k+1)τ. (36) Next, we proceed to rewrite these expressions in vector form, utilizing the recursive expressions. The state vectors z and d introduced in the text follow naturally by looking at the terms in the two previous equations (35) and (36). Letting z ;n be the state vector of net shares remaining to be purchased right before the next 3

order arrives at time n, we have by definition z ;n = z ;0 (x + ;k 1 x ;k 1 ) = z ;0 M x ;k 1. Recursively, we can write z ;n = z ;n 1 M x ;n 1. Using this recursive form, the PPI term in the price process equations (35) and (36) can be written as λ ij δx j,k 1 = Λ(z ;0 z ;n )] i, where ] i is an operator which returns the i-th line of a matrix and Λ is the matrix of PPI factors. Using Assumption 4, the second state vector can be written in vector form as d a ;n = (d a ;n 1 + κ a x ;n 1 )e ρaτ. Similarly for the bid side we have d b ;n = (d b ;n 1 + κb x ;n 1 )e ρbτ. Combining the above results, and identifying the terms in equations (35) and (36), we have the final vector forms for the best available ask and bid prices a ;n = u ;n + 1 2 s ;n + Λ(z ;0 z ;n ) + d a ;n, b ;n = u ;n 1 2 s ;n + Λ(z ;0 z ;n ) + d b ;n. Lemma 3 (Execution Costs/Revenues) Following an executed order, the associated costs/revenues can simply be calculated by integrating the best available bid/ask prices over the total amount of units executed x. It follows that c i,n (x) = x 0 a i,n(u)du and r i,n (x) = where a i,n (x) and b i,n (x) are given in Lemma 1. Therefore, we have c i,n (x) = x 0 a i,n(u)du = x 0 (a i,n + u q a i x 0 b i,n(u)du, )du = (a i,n x + x2 2qi a ), r i,n (x) = x 0 b i,n(u)du = x 0 (b i,n u q b i )du = (b i,n x x2 2qi a ). 4

Specifically, for an incoming buy order x = x + i,n or sell order x = x i,n, we have costs and revenues given by c i,n (x + i,n ) = ( a i,n + x+ i,n 2q a i ) x +i,n and r i,n(x i,n ) = ( The vector forms across all assets at time n can then be expressed as Proposition 1 (Path Independence) b i,n x i,n 2q b i c n = x + ;n (a ;n + Q a x + ;n) and r n = x ;n (b ;n Q b x ;n). We first prove that path independence holds for a simplified unconstrained version of the problem. From there, it becomes clear that this result can be extended to the original constrained problem. Key to the proof will be a separability property of the wealth and value functions at each period, which can be separated into a linear stochastic term and a deterministic function of the controls and state variables. The reward associated with the manager s buy and sell orders at n is ) x i,n. π n = x ;n (b ;n Q b x ;n) x + ;n (a ;n + Q a x + ;n) = x ;np ;n x ;nqx ;n, where we introduce the following notation: x ;n = x + ;n x ;n ] ; Q = Q a 0 0 Q b ] ; p ;n = a ;n b ;n ] ] (u ;n + 1 = 2 s ;n + Λ(z ;0 z ;n ) + d a ;n) u ;n 1 2 s ;n + Λ(z ;0 z ;n ) + d b. ;n Isolating the terms which depend explicitly on the noise term u ;n, the reward can be written as π n = θ n x ;n M u ;n, (37) where the function θ n = θ n (z ;n, d ;n, x ;n ) is by construction, a deterministic function linear in the state vectors, and quadratic in the controls at n. Formally, θ n represents the manager s execution costs for an order at time n, net of the exogenous stochastic parameters of the problem: θ n = x ;np ;n x ;nqx ;n +x ;n M u ;n. It will be useful for the proof to write θ n in a quadratic form given by θ n = x ;nψ θn x ;n + x ;nψ θn, for appropriately chosen matrix Ψ θn and vector ψ θn. The manager s cumulative wealth at an arbitrary time n is W n = n j=0 π j. Recursively, we have The manager s optimization problem over his terminal wealth is W n = W n 1 + π n. (38) J 0 = max x E 0 e αw N ], 5

where J 0 ( ) denotes the value function at 0. We can gain some useful insights into the properties of the value function and optimal control by looking at the first two iterations, starting at N. Analysis at the boundary By definition, at the final time step, the optimal policy is equal to the state vector of remaining trades, i.e., x ;N M = z ;N. Then, using the equations (37) and (38), the value function at N is J N = e αw N = e α(w N 1+π N ) = e α(w N 1+θ N x ;N M u ;N ) = e α(w N 1+θ N z ;N u ;N ), (39) where π N = θ N x ;N Mu ;N and θ N = θ N(z ;N, d ;N, x ;N ) = θ N (z ;N, d ;N ). Rolling one step back, we show that J N 1 only depends on the known cumulative wealth from the previous period, N 2, and the state vectors of the problem at N 1. Given x ;N, the value function at N 1 is J N 1 = max E N 1 e αw N ]. From equation (38), we have W N = W N 2 + π N 1 + πn. Then, applying equation (37) we obtain J N 1 = max E N 1 e α(w N 2+(θ N 1 x ;N 1 M u ;N 1 )+(θn x ;N M u ;N )) ]. (40) At this stage, we can remove the dependence on the state at time N by using the recursive state equations: We know that x ;N M = z ;N = z ;N 1 x ;N 1 M. Similarly, the vector θn re-written as introduced earlier, can be θ N = θ N (z ;N, d ;N, x ;N) = θ N (z ;N 1 M, (d ;N 1 + κ )e ρτ, z ;N 1 M ) = φ N 1 (z ;N 1, d ;N 1, ), where we introduced φ N 1 = x ;N 1 Ψ φ N 1 + x ;N 1 ψ φ N 1, a deterministic function of the state vectors at N 1 with the same quadratic properties as θn. It follows that ] J N 1 = max E N 1 e α(w N 2+θ N 1 +φ N 1 x ;N 1 M u ;N 1 (z ;N 1 x ;N 1 M )u ;N ). Lastly, replacing u ;N by its recursive form u ;N = u ;N 1 + ɛ ;n, we obtain ] J N 1 = max E N 1 e α(w N 2+θ N 1 +φ N 1 z ;N 1 u ;N 1 (z ;N 1 x ;N 1 M )ɛ ;N ). (41) At this stage, we highlight several important properties: The expectation is conditional on the adapted filtration F N 1, implying that the only stochastic term is ɛ ;N, which is normally distributed. Furthermore, the path-dependent term x ;N 1 Mu ;N 1 cancels out during the last operation. This will have important consequences on the structure of the optimal policy at N 1, as we will show in the first-order conditions. 6

Next, we separate the deterministic and stochastic terms by defining the functions V = W N 2 + θ N 1 + φ N 1 z ;N 1u ;N 1, (42) Ṽ = (z ;N 1 x ;N 1 M )ɛ ;N. (43) To summarize, we have shown that the value function at N 1 can be written as J N 1 = max E N 1 e α(v Ṽ )], where by construction, V is a deterministic function, conditional on F N 1, and Ṽ is normally distributed. Given these properties, and using the identity EeṼ ] = e EṼ ]+ 1 2 VarṼ ] for any normal distributed variable Ṽ, we have J N 1 = max e α E N 1V Ṽ ]+ α 2 2 Var N 1V Ṽ ]. (44) By monotonicity of the exponential, the optimal policy at N 1 can be obtained by solving the equivalent optimization problem given by x ;N 1 = arg max E N 1 V Ṽ ] α 2 Var N 1V Ṽ ]. Thus, we can see that at N 1, optimizing over an exponential utility is equivalent to optimizing over a mean-variance objective. The next step is to gain some insights into the properties of the optimal control at N 1. For this, we will continue working with the mean-variance form. Following equations (42) and (43), the mean is simply E N 1 V Ṽ ] = W N 2 + θ N 1 + φ N 1 z ;N 1u ;N 1, while the variance is Var N 1 V Ṽ ] = Var N 1Ṽ ] = Var N 1(z ;N 1 x ;N 1 M )ɛ ;N ] = (z ;N 1 x ;N 1 M )(τσ ɛ )(z ;N 1 M ). We can see that the N 1 variance only depends on the state vector of remaining trades and the control at N 1. Combining the mean and variance expressions we have x ;N 1 = arg max W N 2 + θ N 1 + φ N 1 z ;N 1u ;N 1 1 2 α(z ;N 1 x ;N 1 M )(τσ ɛ )(z ;N 1 M ). (45) By construction, the objective in equation (45) is quadratic in the controls and the state, but is not necessarily concave in for all possible parameter values of the problem. Concavity conditions follow by imposing 7

negative semidefiniteness of the matrix Ψ θn 1 + Ψ φn 1 1 2 ατ MΣ ɛ M.1 Assuming concavity holds, the first order conditions at this stage give 0 = x;n 1 (φ N 1 + θ N 1 ) + α(z ;N 1 x ;N 1 M )(τσ ɛ ). Looking at the system of equations obtained from the first-order conditions, it is clear that x ;N 1 is not path-dependent. Indeed, there is no term which depends explicitly on the realization of the random walk u ;N 1, at N 1. Furthermore, since θ N 1 and φ N 1 are, by construction, quadratic functions of and only depend on the state vectors at N 1, then x ;N 1 will simply be a function of the two state vectors z ;N 1 and d ;N 1. We write the general form as x ;N 1 = H ;N 1 (z ;N 1, d ;N 1 ), (46) where H ;N 1 is a deterministic function of the state vectors at N 1 whose exact expression is not relevant for the proof. It follows that plugging this back into J N 1 will yield a value function which takes the form JN 1 = e α(w N 2+Θ N 1 z ;N 1 u ;N 1), where we have separated the exponential into a deterministic term: Θ N 1 = Θ N 1 (z ;N 1, d ;N 1, x ;N 1) = Θ N 1 (z ;N 1, d ;N 1, H ;N 1 (z ;N 1, d ;N 1 )) = Θ N 1(z ;N 1, d ;N 1 ) = φ N 1 + θ N 1 1 2 α(z ;N 1 x ;N 1 M )(τσ ɛ )(z ;N 1 Mx ;N 1), and a path-dependent term: z ;N 1 u ;N 1. We proceed to prove by induction that this separability property is conserved for all times, leading to optimal controls that are path-independent. General proof by induction. Let J n = J n (z ;n, d ;n, W n 1, n) be the value function at time n. The equivalent DP of problem (18) without inequality constraints, is given by J n 1 = max x ;n 1 E n 1 J n]. (47) 1 It is relatively straightforward to check that a non-empty set of parameters exists, for which concavity holds. A trivial case is when α = 0 and there is no permanent and cross-impact, in which case, all matrices at each time step are diagonal by construction, with negative eigenvalues that are proportional to the inverse densities 1/q a and 1/q b of the ask and bid sides. 8

Assume that the value function and optimal policy at n take the following forms: J n = e α(w n 1+Θ n z ;nu ;n), (48) x ;n = H ;n (z ;n, d ;n ), (49) where Θ n = Θ n(z ;n, d ;n ) = Θ n (z ;n, d ;n, x ;n). We need several properties to hold. The first is that if the value function has this form at time n, it will lead to an optimal control x ;n 1 that is path-independent at time n 1. The second is that at n 1, J n 1 will conserve this separable form. This would also imply that x ;n 2 would be path-independent, and so forth. The third property is to check that this holds true at the boundary, which we have already confirmed through our previous analysis at times N and N 1. The fourth and final property is to impose concavity of the objective at each time n. Next, we look at an arbitrary time n 1. From the induction assumption (48), we have J n = e α(w n 1+Θ n z ;n u;n), which after using Assumption 2 and equation (38), can be written as J n = e α(w n 2+θ n 1 +Θ n z ;n 1 u ;n 1 (z ;n 1 x ;n 1 M )ɛ ;n). Using the recursive state equations we can express Θ n as a function of the states at n 1. so that Θ n(z ;n, d ;n ) = Θ n(z ;n 1 x ;n 1, (d ;n 1 + κx ;n 1 )e ρτ ) = Φ n 1 (z ;n 1, d ;n 1, x ;n 1 ) = Φ n 1, J n = e α(w n 2+θ n 1 +Φ n 1 z ;n 1 u ;n 1 (z ;n 1 x ;n 1 M )ɛ ;n). Next, we look at the dynamic programming equation (47). We have J n 1 = max x ;n 1 E n 1 J n] ] = max E n 1 e α(w n 2+θ n 1 +Φ n 1 z ;n 1 u ;n 1 (z ;n 1 x ;n 1 M )ɛ ;n) x ;n 1 = max x ;n 1 e α E n 1W n 2 +θ n 1 +Φ n 1 z ;n 1 u ;n 1 (z ;n 1 x ;n 1 M )ɛ ;n] + α2 2 Var n 1W n 2 +θ n 1 +Φ n 1 z ;n 1 u ;n 1 (z ;n 1 x ;n 1 M )ɛ ;n]. 9

After taking the mean and the variance at n 1, the optimal policy at this stage can be obtained by solving the equivalent mean-variance optimization problem x ;n 1 = arg max W n 2 + θ n 1 + Φ n 1 z ;n 1u ;n 1 x ;n 1 1 2 α(z ;n 1 x ;n 1 M )τσ ɛ (z ;n 1 Mx ;n 1 ), (50) which is quadratic in x ;n 1 as well as in the state vectors at n 1. Concavity in x ;n 1 is imposed by requiring that the matrix Ψ θn 1 + Ψ Φn 1 1 2 ατ MΣ ɛ M is negative semidefinite. The first order conditions will yield an optimal control which is clearly path-independent as the only term which explicitly depends on u ;n 1 is not a function of x ;n 1. Letting x ;n 1 = H ;n 1(z ;n 1, d ;n 1 ) be the optimal solution at n 1, we complete the induction proof by setting Θ n 1 = θ n 1 + Φ n 1 1 2 α(z ;n 1 x ;n 1 M )τσ ɛ (z ;n 1 M x ;n 1), which gives the form we need, namely J n 1 = e α(w n 2+Θ n 1 z ;n 1 u ;n 1). We can thus conclude that the optimal control at the previous time step is path-independent and separability is preserved. Inequality constrained problem. To complete the proof, we still need to look at the case with inequality constraints on the control variables x ;n 0. Let ν ;n be the associated positivity multiplier vector at time n. To establish path-independence for the optimal solution in this case, we need to show that both x ;n and ν ;n are path-independent. The formal proof for this follows and is similar to the unconstrained problem. We can obtain the necessary insights by looking at the two last periods of the problem. At time N, we have by definition 0 x ;N and x ;N M = z ;N. Looking at the problem at N 1, and using equation (44), we have J N 1 = max e α E N 1V Ṽ ]+ α 2 2 Var N 1V Ṽ ] subject to 0. (51) We can then write an equivalent maximization problem to the Problem (51), by taking the logarithm and plugging in the forms from equations (42) and (43). We obtain: max W N 2 + θ N 1 + φ N 1 z ;N 1u ;N 1 1 2 α(z ;N 1 x ;N 1 M )(τσ ɛ )(z ;N 1 M ) subject to 0. 10

To deal with the inequality constraint, we introduce the Lagrange positivity multiplier ν ;N 1. The problem then becomes: max W N 2 + θ N 1 + φ N 1 z ;N 1u ;N 1 1 2 α(z ;N 1 x ;N 1 M )(τσ ɛ )(z ;N 1 M ) + ν ;N 1. Imposing concavity on the previous equation, the first order conditions at this stage give: 0 = x;n 1 (φ N 1 + θ N 1 ) + α(z ;N 1 x ;N 1 M )(τσ ɛ ) + ν ;N 1, (52) where, by definition of φ N 1 and θ N 1, the term x;n 1 (φ N 1 + θ N 1 ) is a deterministic linear function of. This is also clearly the case for the second term α(z ;N 1 x ;N 1 M)(τΣ ɛ ) which only depends on the control and state vectors at N 1 and the stationary covariance matrix. As there are no stochastic terms in the system of equations, we can conclude that the multipliers at N 1 (which can be calculated by considering the complementary slackness and dual feasibility conditions for each asset at N 1) are necessarily deterministic. More specifically, from equation (52), we can write the form of the optimal control at N 1 as x ;N 1 = H ;N 1 (z ;N 1, d ;N 1, ν ;N 1 ), where as before, the exact expression of H ;N 1 is not relevant for the proof. Then, plugging this back into the objective function, it becomes clear that this new form does not affect the separability property. Indeed, the function Θ N 1 remains deterministic, but now depends on the multipliers: Θ N 1 = Θ N 1 (z ;N 1, d ;N 1, x ;N 1) = Θ N 1 (z ;N 1, d ;N 1, H ;N 1 (z ;N 1, d ;N 1, ν ;N 1 )) = Θ N 1(z ;N 1, d ;N 1, ν ;N 1 ). From here, we could proceed using the same induction arguments that we developed for the unconstrained problem to show that the first-order conditions at each time period lead to deterministic optimal controls and multipliers at all periods. We conclude that ν ;n will preserve this path-independence property for all n. Our path-independence result can be compared to other types of price impact models that have been developed in the literature. In particular, Almgren & Chriss (2000) and Huberman & Stanzl (2005) show that a similar static policy exists in their mean-variance framework. However, it is worth mentioning that this result is not generally robust to the type of noise process assumed in the model. For example, if we wanted to include serial correlation in our framework, we could show that this would lead to an optimal policy which is path-dependent (i.e., not static). In this case, we would need to develop a different solution methodology without being able to rely on static equivalence. In other words, the optimal policy would be adaptive. Other examples of adaptive optimal liquidation policies can be found in Lorenz & Almgren (2012). 11

Lemma 4 (Equivalent Wealth Formulation) We proceed via verification, starting with the inferred form, and showing that by expansion, we obtain the desired expressions equivalent to (35) and (36). Expanding the wealth process we have ] ] ] ] ] x + D a D ab x + c a x + W n = x D ba D b x c b x ( = x + D a x + x + D ab x x D ba x + + x D b x ) (u + 1 2 s)x+ + (u 1 2 s)x. Focusing on the ask side, we can show after some algebra that x + D a x + = x + 1 where expanding each term, we have x i D a iix + i = N n=0 (D a 11 x + 1 + Da 12x + 2 +... ) + + x+ M (D a M1 x + 1 + Da M2x + 2 +... ), x + i,n ( 1 2q l i Similarly, for the cross terms we have x i D a ijx + j = N n=0 x + i,n + x + i,n ( ) ) x + λ i x + i,k 1 + i,k 1 qi l λ i x + i,k 1 e τρli (n k+1). ( ) λ ij x + j,k 1 λ ijx + j,k 1 e τρl i (n k+1). Combining the above expressions and summing over i and j, we obtain x + D a x + = x 1 (D a 11x + 1 + Da 12x + 2 +... ) + + x M(D a M1x + 1 + Da M2x + 2 +... ) M N = x + 1 i,n 2q l x + i,n + λ ij x + j,k 1 + x+ i,k 1 i n i q l λ ij x + j,k 1 e τρl i (n k+1). j i j Furthermore, we also need to include the cross-terms coming from the opposite side of the book: x + D ab x = M i N n x + i,n λ ij x j,k 1 λ ij x j,k 1 e τρl i (n k+1). Lastly, we also provide the equivalence for the linear terms which is straightforward: ( c a ) x + = ( u + 1 2 s) x + = j = i u 1 u M j s 1 s M. + 1 2. (u i,n + 1 2 s i)x + i,n. n x + 12

By identification, we recover the complete form of the ask-side price process (the bid-side is derived in a similar way). The form for the costs follows immediately. Equations (20) and (21) (Expectation and Variance) Following Proposition 1, the optimal control vector x is deterministic. Therefore we can remove it from the operator and take the expectation directly on the linear term in equation (19). EW N ] = E x Dx c x] = x Dx E c x]. After some algebra, the expectation over the stochastic linear term can be expressed as E c x] = u ;01 Kx + 1 2 s K + x, where u ;0 = u 1,0 ;... ; u M,0 ] and u ;0 1 = u 1,0 1 N+1 ;... ; u M,0 1 N+1 ]. The expected wealth then becomes EW N ] = x Dx u ;0 1 Kx 1 2 s K + x. We can show that the variance term reduces to VarW N ] = Var x Dx c x] = Var ( u + 1 2 s ) x + ( u 1 2 s ) x ] = Varu Kx]. Which gives VarW N ] = x K Σ u K x. Proposition 2 (Equivalent Quadratic Program) Combining the equations we obtain for the mean and variance, the problem (18) can be written as maximize x 0 x Dx u ;01 Kx 1 2 s K + x 1 2 αx K Σ u Kx subject to 1 Kx = z ;0. The above problem can be equivalently written as a minimization problem over the risk-adjusted execution shortfall (i.e., net execution cost). The execution shortfall is defined as the difference between the preexecution market value of the portfolio (W 0 ), and the expected post-execution wealth (µ WN ), i.e., it is equal to W 0 µ WN, where W 0 = u ;0 z ;0 = u ;0 (1 Kx) is constant and can thus be added to the objective function without affecting the optimal solution. The problem then becomes minimize x 0 u ;01 Kx + x Dx + u ;01 Kx + 1 2 s K + x + 1 2 αx K Σ u Kx subject to 1 Kx = z ;0. 13

Let D = D + 1 2 α KΣ u K. Since we have we set the symmetric form 1 2 D = x D x = x ( D + D ( D +D 2 2 ) x, ). So finally, the problem (18) is equivalent to minimize x 0 1 2 x Dx + c x subject to 1 Kx = z ;0, where c = 1 2 s K + and D = ((D + 1 2 α KΣ u K ) + (D + 1 2 α KΣ u K ) ). References Almgren, Robert & Neil Chriss (2000), Optimal execution of portfolio transactions, Journal of Risk 3(2), 5 29. Huberman, Gur & Wener Stanzl (2005), Optimal liquidity trading, Review of Finance 9(2), 165 200. Lorenz, Julian & Robert Almgren (2012), Mean-variance optimal adaptive execution, Applied Mathematical Finance 18(5), 395 422. 14