High Dimensional Edgeworth Expansion. Applications to Bootstrap and Its Variants

Similar documents
On Complexity of Multistage Stochastic Programs

arxiv: v1 [math.st] 18 Sep 2018

The rth moment of a real-valued random variable X with density f(x) is. x r f(x) dx

A New Multivariate Kurtosis and Its Asymptotic Distribution

PORTFOLIO THEORY. Master in Finance INVESTMENTS. Szabolcs Sebestyén

Asymptotic methods in risk management. Advances in Financial Mathematics

Decision theoretic estimation of the ratio of variances in a bivariate normal distribution 1

Fast Convergence of Regress-later Series Estimators

A New Test for Correlation on Bivariate Nonnormal Distributions

Asymptotic results discrete time martingales and stochastic algorithms

Martingales. by D. Cox December 2, 2009

IEOR E4602: Quantitative Risk Management

Cumulants and triangles in Erdős-Rényi random graphs

Replication under Price Impact and Martingale Representation Property

An Improved Saddlepoint Approximation Based on the Negative Binomial Distribution for the General Birth Process

Time-changed Brownian motion and option pricing

A Robust Option Pricing Problem

Portfolio Optimization. Prof. Daniel P. Palomar

On the Distribution of Multivariate Sample Skewness for Assessing Multivariate Normality

Copulas? What copulas? R. Chicheportiche & J.P. Bouchaud, CFM

STATS 200: Introduction to Statistical Inference. Lecture 4: Asymptotics and simulation

STAT/MATH 395 PROBABILITY II

An Introduction to Stochastic Calculus

Auction. Li Zhao, SJTU. Spring, Li Zhao Auction 1 / 35

Asymptotic study of an inhomogeneous Markov jump process

Math 489/Math 889 Stochastic Processes and Advanced Mathematical Finance Dunbar, Fall 2007

12 The Bootstrap and why it works

In Discrete Time a Local Martingale is a Martingale under an Equivalent Probability Measure

STAT 830 Convergence in Distribution

Multi-armed bandits in dynamic pricing

Large Deviations and Stochastic Volatility with Jumps: Asymptotic Implied Volatility for Affine Models

Portfolio Management and Optimal Execution via Convex Optimization

3 Arbitrage pricing theory in discrete time.

M5MF6. Advanced Methods in Derivatives Pricing

Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models

Limit Theorems for the Empirical Distribution Function of Scaled Increments of Itô Semimartingales at high frequencies

Improved Inference for Signal Discovery Under Exceptionally Low False Positive Error Rates

IEOR E4703: Monte-Carlo Simulation

4: SINGLE-PERIOD MARKET MODELS

Markets with convex transaction costs

UNIFORM BOUNDS FOR BLACK SCHOLES IMPLIED VOLATILITY

Equity correlations implied by index options: estimation and model uncertainty analysis

Math 416/516: Stochastic Simulation

Multivariate Skewness: Measures, Properties and Applications

Quantitative Risk Management

Lecture Note 9 of Bus 41914, Spring Multivariate Volatility Models ChicagoBooth

LECTURE NOTES 10 ARIEL M. VIALE

Sampling Distribution

Optimal Security Liquidation Algorithms

Risk Aggregation with Dependence Uncertainty

BROWNIAN MOTION Antonella Basso, Martina Nardon

All Investors are Risk-averse Expected Utility Maximizers. Carole Bernard (UW), Jit Seng Chen (GGY) and Steven Vanduffel (Vrije Universiteit Brussel)

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Parametric Inference and Dynamic State Recovery from Option Panels. Nicola Fusari

All Investors are Risk-averse Expected Utility Maximizers

The ruin probabilities of a multidimensional perturbed risk model

Lecture IV Portfolio management: Efficient portfolios. Introduction to Finance Mathematics Fall Financial mathematics

Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making

CSCI 1951-G Optimization Methods in Finance Part 07: Portfolio Optimization

Using radial basis functions for option pricing

MATH 5510 Mathematical Models of Financial Derivatives. Topic 1 Risk neutral pricing principles under single-period securities models

LECTURE 3: FREE CENTRAL LIMIT THEOREM AND FREE CUMULANTS

Populations and Samples Bios 662

Implied Systemic Risk Index (work in progress, still at an early stage)

Modeling Co-movements and Tail Dependency in the International Stock Market via Copulae

Math489/889 Stochastic Processes and Advanced Mathematical Finance Homework 5

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

RECURSIVE VALUATION AND SENTIMENTS

Short-time asymptotics for ATM option prices under tempered stable processes

Lecture outline. Monte Carlo Methods for Uncertainty Quantification. Importance Sampling. Importance Sampling

Pricing Algorithms for financial derivatives on baskets modeled by Lévy copulas

ADVANCED OPERATIONAL RISK MODELLING IN BANKS AND INSURANCE COMPANIES

Consumption- Savings, Portfolio Choice, and Asset Pricing

The stochastic calculus

S t d with probability (1 p), where

Introducing Expected Returns into Risk Parity Portfolios: A New Framework for Asset Allocation

Drunken Birds, Brownian Motion, and Other Random Fun

Square-Root Measurement for Ternary Coherent State Signal

Valuation of performance-dependent options in a Black- Scholes framework

Convergence Analysis of Monte Carlo Calibration of Financial Market Models

Stochastic Programming and Financial Analysis IE447. Midterm Review. Dr. Ted Ralphs

GENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy

2.1 Mean-variance Analysis: Single-period Model

Approximations of Stochastic Programs. Scenario Tree Reduction and Construction

Toward a coherent Monte Carlo simulation of CVA

Forecast Horizons for Production Planning with Stochastic Demand

3.4 Copula approach for modeling default dependency. Two aspects of modeling the default times of several obligors

Modeling of Price. Ximing Wu Texas A&M University

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices

On the Distribution of Kurtosis Test for Multivariate Normality

F19: Introduction to Monte Carlo simulations. Ebrahim Shayesteh

Comparison of Pricing Approaches for Longevity Markets

Polynomial processes in stochastic portofolio theory

Portfolio Optimization with Alternative Risk Measures

3.2 No-arbitrage theory and risk neutral probability measure

Worst-Case Value-at-Risk of Non-Linear Portfolios

REINSURANCE RATE-MAKING WITH PARAMETRIC AND NON-PARAMETRIC MODELS

Rohini Kumar. Statistics and Applied Probability, UCSB (Joint work with J. Feng and J.-P. Fouque)

The Conservative Expected Value: A New Measure with Motivation from Stock Trading via Feedback

Trust Region Methods for Unconstrained Optimisation

Transcription:

With Applications to Bootstrap and Its Variants Department of Statistics, UC Berkeley Stanford-Berkeley Colloquium, 2016

Francis Ysidro Edgeworth (1845-1926) Peter Gavin Hall (1951-2016)

Table of Contents 1 Background 2 High Dimensional Settings 3 Main Results

Table of Contents 1 Background 2 High Dimensional Settings 3 Main Results

Setup Given the data X = (X 1,..., X n ) i.i.d. F ; X i R p, EX i = 0, EX i X T i = V ; Sample mean X = 1 n n i=1 X i; Goal: approximate the distribution of X, i.e. approx. ( nv ) P 1 2 X A for A C where C denote the collection of all covex sets in R p.

CLT with Fixed Dimensions Let Φ be the measure of N(0, I p p ). When the dimension p is fixed: Central Limit Theorem (CLT): ( nv ) sup P 1 2 X A A C Φ(A) = o(1);

CLT with Fixed Dimensions Let Φ be the measure of N(0, I p p ). When the dimension p is fixed: Central Limit Theorem (CLT): ( nv ) sup P 1 2 X A A C Φ(A) = o(1); Berry-Esseen Bound (with third-order moments): ( nv ) sup P 1 2 X A Φ(A) = O A C ( n 1 2 ) ;

Edgeworth Expansion with Fixed Dimensions Edgeworth Expansion (with (ν + 3)-order moments): ( nv ) sup P 1 2 X A Φ(A) A C ν n ( j 2 Pj (A) = O j=1 n ν+1 2 where P j ( ) are sign measures determined by the cumulants of F. When ν = 1 and p = 1, ( nv ) sup P 1 2 X A Φ(A) n 1 2 P1 (A) = O ( n 1) A C where P 1 (A) has a density p 1 (x) = 1 6 (x 3 x) 1 2π e x2 2. ) ;

Edgeworth Expansion and Bootstrap Draw a bootstrap sample X1,..., X n of X 1,..., X n ; i.i.d. ˆF n where ˆF n is the ecdf

Edgeworth Expansion and Bootstrap Draw a bootstrap sample X1,..., X n of X 1,..., X n ; i.i.d. ˆF n where ˆF n is the ecdf Heuristically, a first-order edgeworth expansion implies ( n(v sup P ) 1 2 ( X X ) A ) Φ(A) n X 1 2 P 1 (A) = O ( n 1) ; A C where V = Var(X1 ) and P 1 ( ) is determined by the cumulants of X1.

Edgeworth Expansion and Bootstrap Draw a bootstrap sample X1,..., X n of X 1,..., X n ; i.i.d. ˆF n where ˆF n is the ecdf Heuristically, a first-order edgeworth expansion implies ( n(v sup P ) 1 2 ( X X ) A ) Φ(A) n X 1 2 P 1 (A) = O ( n 1) ; A C where V = Var(X1 ) and P 1 ( ) is determined by the cumulants of X1. Recall that ( nv ) sup P 1 2 X A Φ(A) n 1 2 P1 (A) = O(n 1 ); A C

Edgeworth Expansion and Bootstrap The cumulants of F and those of ˆF n are closed and thus ( ) sup P 1 (A) P1 (A) = O A C n 1 2.

Edgeworth Expansion and Bootstrap The cumulants of F and those of ˆF n are closed and thus ( ) sup P 1 (A) P1 (A) = O A C n 1 2. As a consequence, ( n(v sup P ) 1 2 ( X ( nv X ) A ) P X 1 2 X A) = O ( n 1). A C

Edgeworth Expansion and Bootstrap The cumulants of F and those of ˆF n are closed and thus ( ) sup P 1 (A) P1 (A) = O A C n 1 2. As a consequence, ( n(v sup P ) 1 2 ( X ( nv X ) A ) P X 1 2 X A) = O ( n 1). A C This is called Higher-Order Accuracy (Hall, 1992).

Table of Contents 1 Background 2 High Dimensional Settings 3 Main Results

CLT in High Dimensions The CLT in high dimensions has been investigated since 60 s, e.g. Sazonov, 1968; Portnoy, 1986; Gotze, 1991.

CLT in High Dimensions The CLT in high dimensions has been investigated since 60 s, e.g. Sazonov, 1968; Portnoy, 1986; Gotze, 1991. Sharp result is obtained by Bentkus (2003), ( nv ) sup P 1 2 X A Φ(A) = O A C ( ) p 7 4 ; n

CLT in High Dimensions The CLT in high dimensions has been investigated since 60 s, e.g. Sazonov, 1968; Portnoy, 1986; Gotze, 1991. Sharp result is obtained by Bentkus (2003), ( nv ) sup P 1 2 X A Φ(A) = O A C ( ) p 7 4 ; n Fundamental limit: p = o(n 2 7 ) for CLT to hold;

Edgeworth Expansion in High Dimensions In contrast to CLT, very few works on edgeworth expansion in high dimensions; Some results on Banach space but focus on Ef ( X ) for smooth f instead of the law of X (Gotze, 1981; Bentkus, 1984).

Edgeworth Expansion in High Dimensions In contrast to CLT, very few works on edgeworth expansion in high dimensions; Some results on Banach space but focus on Ef ( X ) for smooth f instead of the law of X (Gotze, 1981; Bentkus, 1984). Using existing techniques (Bhattacharya & Rao, 1986): p PolyLog(n);

Edgeworth Expansion in High Dimensions In contrast to CLT, very few works on edgeworth expansion in high dimensions; Some results on Banach space but focus on Ef ( X ) for smooth f instead of the law of X (Gotze, 1981; Bentkus, 1984). Using existing techniques (Bhattacharya & Rao, 1986): p PolyLog(n); Fundamental limit: n 1 2 P 1 (A) is of order n 1 2 p 3, without further constraints, p n 1 6.

Edgeworth Expansion in High Dimensions In contrast to CLT, very few works on edgeworth expansion in high dimensions; Some results on Banach space but focus on Ef ( X ) for smooth f instead of the law of X (Gotze, 1981; Bentkus, 1984). Using existing techniques (Bhattacharya & Rao, 1986): p PolyLog(n); Fundamental limit: n 1 2 P 1 (A) is of order n 1 2 p 3, without further constraints, p n 1 6. Question: How fast can the dimension grow with n?

Table of Contents 1 Background 2 High Dimensional Settings 3 Main Results

Edgeworth Expansion in High Dimension Theorem 1. Let X 1,..., X n be i.i.d. samples with zero mean and covariance matrix V. Assume that 1 λ max (V ) = O(1), λ min (V ) = Ω(1); 2 p = O(n γ ) for some γ < 1 14 ; 3 X ij B = O(1); Then for any positive integer S, ( ν (p ) ν+1 ) sup P( nv 1 2 X A) Φ(A) n j 9 2 Pj (A) = O 2. n A C n j=1

Here C n includes all convex sets plus all sets with form {F 1 (A) : A is convex} for arbitrary smooth non-linear functions F : R p R.

Here C n includes all convex sets plus all sets with form {F 1 (A) : A is convex} for arbitrary smooth non-linear functions F : R p R. As a corollary, for a smooth F, P(F ( ν n X ) A) Φ 0,V (F 1 (A)) + n j 2 Pj (V 1 2 F 1 (A)). This gives an edgeworth expansion for smooth transform of mean. j=1

Bootstrap in High Dimension Theorem 2. Let X 1,..., X n be i.i.d. samples with zero mean and covariance matrix V and X1,..., X n i.i.d. ˆF n. Assume that 1 λ max (V ) = O(1), λ min (V ) = Ω(1); 2 p = Θ(n γ ) for some 0 < γ < 1 17 ; 3 X ij B = O(1). Then with probability 1 exp { Ω (n γ )}, ( sup P V ( 1 2 X X ) ( ) A) P V 1 2 X A Cp9 n. A C n This is strictly better than the bound given by CLT since n << p 7 4, when p = O(n 1 n p 9 17 ).

Thanks!

References Bentkus, V. (1984). Asymptotic expansions in the central limit theorem in hilbert space. Lithuanian Mathematical Journal, 24(3), 210 225. Bentkus, V. (2003). On the dependence of the berry esseen bound on dimension. Journal of Statistical Planning and Inference, 113(2), 385 402. Bhattacharya, R. N., & Rao, R. R. (1986). Normal approximation and asymptotic expansions (Vol. 64). SIAM. Gotze, F. (1981). On edgeworth expansions in banach spaces. The Annals of Probability, 852 859. Gotze, F. (1991). On the rate of convergence in the multivariate clt. The Annals of Probability, 724 739. Hall, P. (1992). The bootstrap and edgeworth expansion. Portnoy, S. (1986). On the central limit theorem in r p when p. Probability theory and related fields, 73(4), 571 583.

References Sazonov, V. (1968). On the multi-dimensional central limit theorem. Sankhyā: The Indian Journal of Statistics, Series A, 181 204.