An introduction to game-theoretic probability from statistical viewpoint

Similar documents
Comparison of proof techniques in game-theoretic probability and measure-theoretic probability

Laws of probabilities in efficient markets

A new formulation of asset trading games in continuous time with essential forcing of variation exponent

Introduction to Game-Theoretic Probability

MATHEMATICAL ENGINEERING TECHNICAL REPORTS. A new formulation of asset trading games in continuous time with essential forcing of variation exponent

The Game-Theoretic Framework for Probability

Probability without Measure!

Probability, Price, and the Central Limit Theorem. Glenn Shafer. Rutgers Business School February 18, 2002

GAME-THEORETIC DERIVATION OF DISCRETE DISTRIBUTIONS AND DISCRETE PRICING FORMULAS

Outline of Lecture 1. Martin-Löf tests and martingales

X i = 124 MARTINGALES

Chapter 5. Sampling Distributions

Game-Theoretic Probability and Defensive Forecasting

Introduction Random Walk One-Period Option Pricing Binomial Option Pricing Nice Math. Binomial Models. Christopher Ting.

MATH3075/3975 FINANCIAL MATHEMATICS TUTORIAL PROBLEMS

Finish what s been left... CS286r Fall 08 Finish what s been left... 1

Lecture 17: More on Markov Decision Processes. Reinforcement learning

Lecture 7: Bayesian approach to MAB - Gittins index

Introduction to Probability Theory and Stochastic Processes for Finance Lecture Notes

On Existence of Equilibria. Bayesian Allocation-Mechanisms

Martingale Measure TA

Homework Assignments

Department of Mathematics. Mathematics of Financial Derivatives

Chapter 7: Estimation Sections

Basic Arbitrage Theory KTH Tomas Björk

4: SINGLE-PERIOD MARKET MODELS

Module 10:Application of stochastic processes in areas like finance Lecture 36:Black-Scholes Model. Stochastic Differential Equation.

A game-theoretic ergodic theorem for imprecise Markov chains

arxiv: v1 [cs.lg] 21 May 2011

Mathematics of Finance Final Preparation December 19. To be thoroughly prepared for the final exam, you should

Thursday, March 3

Martingale Pricing Theory in Discrete-Time and Discrete-Space Models

Using the Maximin Principle

Pricing theory of financial derivatives

6: MULTI-PERIOD MARKET MODELS

3 Stock under the risk-neutral measure

TABLE OF CONTENTS - VOLUME 2

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Convergence. Any submartingale or supermartingale (Y, F) converges almost surely if it satisfies E Y n <. STAT2004 Martingale Convergence

Approximate Revenue Maximization with Multiple Items

Maximum Likelihood Estimation

3 Arbitrage pricing theory in discrete time.

Lecture 6: Option Pricing Using a One-step Binomial Tree. Thursday, September 12, 13

Lecture 19: March 20

Defensive Forecasting

MATH20180: Foundations of Financial Mathematics

Lecture Notes for Chapter 6. 1 Prototype model: a one-step binomial tree

Rationalizable Strategies

Optimal stopping problems for a Brownian motion with a disorder on a finite interval

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 11 10/9/2013. Martingales and stopping times II

IEOR 3106: Introduction to OR: Stochastic Models. Fall 2013, Professor Whitt. Class Lecture Notes: Tuesday, September 10.

CONVERGENCE OF OPTION REWARDS FOR MARKOV TYPE PRICE PROCESSES MODULATED BY STOCHASTIC INDICES

Risk Neutral Valuation

PhD Qualifier Examination

FE 5204 Stochastic Differential Equations

Black-Scholes and Game Theory. Tushar Vaidya ESD

Binomial Random Variables. Binomial Random Variables

A GENERALIZED MARTINGALE BETTING STRATEGY

4 Martingales in Discrete-Time

Replication and Absence of Arbitrage in Non-Semimartingale Models

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Martingales. by D. Cox December 2, 2009

INTRODUCTION TO ARBITRAGE PRICING OF FINANCIAL DERIVATIVES

Chapter 7: Estimation Sections

Asymptotic results discrete time martingales and stochastic algorithms

Math 489/Math 889 Stochastic Processes and Advanced Mathematical Finance Dunbar, Fall 2007

The Capital Asset Pricing Model as a corollary of the Black Scholes model

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2017

Probability. An intro for calculus students P= Figure 1: A normal integral

CS 361: Probability & Statistics

The Accrual Anomaly in the Game-Theoretic Setting

CS134: Networks Spring Random Variables and Independence. 1.2 Probability Distribution Function (PDF) Number of heads Probability 2 0.

Remarks on Probability

On the Lower Arbitrage Bound of American Contingent Claims

FURTHER ASPECTS OF GAMBLING WITH THE KELLY CRITERION. We consider two aspects of gambling with the Kelly criterion. First, we show that for

Best-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015

A First Course in Probability

Arbitrage Theory without a Reference Probability: challenges of the model independent approach

MATH 264 Problem Homework I

MATH 5510 Mathematical Models of Financial Derivatives. Topic 1 Risk neutral pricing principles under single-period securities models

Optimizing S-shaped utility and risk management

LECTURE 2: MULTIPERIOD MODELS AND TREES

For more information about how to cite these materials visit

Speculative Bubbles, Heterogeneous Beliefs, and Learning

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی

Why Bankers Should Learn Convex Analysis

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 23

MATH 425 EXERCISES G. BERKOLAIKO

Non-semimartingales in finance

1.1 Interest rates Time value of money

The value of foresight

Math-Stat-491-Fall2014-Notes-V

Equivalence between Semimartingales and Itô Processes

Comparing Allocations under Asymmetric Information: Coase Theorem Revisited

Sy D. Friedman. August 28, 2001

Reasoning with Probabilities. Compactness. Dutch Book. Dutch Book. Synchronic Dutch Book Diachronic Dutch Book. Some Puzzles.

Prediction Market Prices as Martingales: Theory and Analysis. David Klein Statistics 157

Mathematics in Finance

Finite Additivity in Dubins-Savage Gambling and Stochastic Games. Bill Sudderth University of Minnesota

Transcription:

.. An introduction to game-theoretic probability from statistical viewpoint Akimichi Takemura (joint with M.Kumon, K.Takeuchi and K.Miyabe) University of Tokyo May 14, 2013 RPTC2013 Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 1 / 26

Contents of this talk.1 List of Tokyo papers.2 Background on game-theoretic probability (GTP).3 Introduction to game-theoretic probability using a coin-tossing game.4 Bayesian Skeptic for the coin-tossing game.5 Non-negative martingales and likelihood ratios Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 2 / 26

List of Tokyo papers Introduction (in http://www.probabilityandfinance.com/). 1 On a simple strategy weakly forcing the strong law of large numbers in the bounded forecasting game, Kumon and Takemura. Ann. Inst. Stat. Math., 60, 801 812. 2008.. 2 Game theoretic derivation of discrete distributions and discrete pricing formulas, Takemura and Taiji Suzuki. J. Japan Stat. Soc., 37, 87 104. 2007.. 3 Capital process and optimality properties of Bayesian Skeptic in the fair and biased coin games, Kumon, Takemura and Takeuchi. Stochastic Analysis and Applications, 26, 1161 1180. 2008.. 4 Game-theoretic versions of strong law of large numbers for unbounded variables, Kumon, Takemura and A.Takeuchi. Stochastics, 79, 449 468. 2007.. 5 Implications of contrarian and one-sided strategies for the fair-coin game, Yasunori Horikoshi and Takemura. Stochastic Processes and their Applications, 118, 2125 2142. 2008.. 6 A new formulation of asset trading games in continuous time with essential forcing of variation exponent, Takeuchi, Kumon and Takemura. Bernoulli, 15, 1243 1258. 2009. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 3 / 26

List of Tokyo papers Introduction. 7 Multistep Bayesian strategy in coin-tossing games and its application to asset trading games in continuous time, Takeuchi, Kumon and Takemura. Stochastic Analysis and Applications, 28, 842 861. 2010.. 8 The generality of the zero-one laws, by Takemura, V.Vovk and G.Shafer. Ann. Inst. Stat. Math., 63, 873-886. 2011.. 9 New procedures for testing whether stock price processes are martingales, Takeuchi, Takemura and Kumon. Computational Economics, 37, No.1, 67 88. 2010.. 10 Sequential optimizing strategy in multi-dimensional bounded forecasting games, Kumon, Takemura and Takeuchi. Stochastic Processes and their Applications, 121, 155 183. 2011.. 11 Sequential optimizing investing strategy with neural networks, Ryo Adachi and A.Takemura. Expert Systems With Applications. 38, 12991 12998. 2011.. 12 Approximations and asymptotics of upper hedging prices in multinomial models, by Ryuichi Nakajima, Masayuki Kumon, A.Takemura and Kei Takeuchi. Japan Journal of Industrial and Applied Mathematics, 25, 1 21. 2012. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 4 / 26

List of Tokyo papers Introduction. 13 Convergence of random series and the rate of convergence of strong law of large numbers in game-theoretic probability, by Kenshi Miyabe and A.Takemura. Stochastic Processes and their Applications, 122, 1 30. 2012.. 14 Bayesian logistic betting strategy against probability forecasting, Stochastic Analysis and Applications, 31, 214 234. Masayuki Kumon, Jing Li, A.Takemura and Kei Takeuchi. 2013.. 15 The law of the iterated logarithm in game-theoretic probability with quadratic and stronger hedges, Stochastic Processes and their Applications. Kenshi Miyabe and A.Takemura. 2013. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 5 / 26

Background on game-theoretic probability (GTP) Kolmogorov s Grundbegriffe (1933) established measure theoretic probability. It justifies mathematical operations such as limiting operations. On this firm ground, probability theory found applications in many fields. Axiomatic construction: probability is not defined by itself, like points or lines. This actually broadened the applicability of probability theory. Probability is just the Lebesgue measure, K.Ito, 1944. On the other hand, foundational arguments, such as Richard von Mises s collectives, have been almost forgotten by probabilists. Kolmogorov himself was somewhat hesitant: proposal of Kolmogorov complexity Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 6 / 26

Shafer and Vovk (2001) Shafer and Vovk (2001) Probability and Finance, It s Only a Game! appeared. Vladimir Vovk (PhD, 1988, Moscow State U) is one of the last students of Kolmogorov. Around 2003, Takeuchi started to tell me that the book is very interesting. I gave a course on GTP for studying the book. In my opinion, at present it is the only alternative framework to measure-theoretic probability. Important theorems, such as the strong law of large numbers (SLLN), central limit theorem (CLT), the law of the iterated logarithm (LIL), can be proved in game-theoretic probability without requiring measure theory. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 7 / 26

Strength and weakness of game-theoretic probability (GTP) Strength Some clever proofs are very short. For example, even high school students can understand game-theoretic proof of SLLN. Black-Scholes formula and CLT are equivalent. In Shafer and Vovk, CLT and the Black-Scholes formula are proved simultaneously. Their proof shows that these are equivalent. (They do not use characteristic functions, but use the heat equation.) In GTP, the set of measure-zero is often more explicitly treated, by an explicit betting strategy with its capital diverging to + on the set. Probability is not assume a priori. A game is assumed. Under the game, the players are forced to act probabilistically. (Why stock prices look random?) Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 8 / 26

Strength and weakness of GTP Weakness Some proofs are, of course, almost the same in measure-theoretic probability and GTP. Some simple notions under usual probability, such as independence, identical distribution, are not easy to formulate. (GTP inherently assumes martingale.) In 2001 book, continuous stochastic processes were treated by nonstandard analysis, which was probably not very convincing to many people. This difficulty was overcome based on the idea in A new formulation of asset trading games in continuous time... by Takeuchi, Kumon and Takemura, Bernoulli, 2009, and completely generalized in Continuous-time trading and the emergence of probability by Vladimir Vovk, Finance and Stochastics, 2012. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 9 / 26

Introduction to GTP by a coin-tossing game Complete information game between players (two players version) Skeptic (statistician, investor) bets on some outcome. Reality (nature, market) decides the outcome. Skeptic Reality S R. They play in turn. One round: (Skeptic s turn, Reality s turn) in this order n = 1, 2,... denote rounds. Skeptic s initial capital: K 0 = 1 At each round, Skeptic first announces how much he bets: M n R. M n can be any real number and can be arbitrarily small. Negative M n allowed (short selling). Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 10 / 26

Introduction to GTP by a coin-tossing game After knowing M n, Reality chooses the outcome x n = 0 or x n = 1. Payoff to Skeptic : M n (x n p), where the price 0 < p < 1 of the ticket is given before the game. p is the success probability or the risk neutral probability. Skeptic s capital changes as In summary: K 0 = 1, 0 < p < 1: given FOR n = 1, 2,... Skeptic announces M n R. Reality announces x n {0, 1}. K n := K n 1 + M n (x n p). END FOR K n = K n 1 + M n (x n p). Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 11 / 26

Introduction to GTP by a coin-tossing game Reality can choose the sign of x n p as the opposite of the sign of M n. Therefore Reality can always decrease Skeptic s capital. No-win situation for Skeptic? But then Reality is forced to observe SLLN! Theorem There exists Skeptic s strategy P. (He can announce P even before the start of the game.) If Skeptic uses P, then he is never bankrupt and whenever Reality violates then 1 lim n n (x 1 + + x n ) = p, lim K n =. n Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 12 / 26

Introduction to GTP by a coin-tossing game In the coin-tossing game there exists a non-negative martingale which succeeds on the set { x 1 x 2... x 1 + + x n n p} {0, 1} (1) We say that in the coin-tossing game Skeptic can force SLLN. Reality can also have strategies (not fully explored yet). Bounded forecasting game: (1) is still true even if Reality can choose any real number in [0, 1], and {0, 1} is replaced by [0, 1]. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 13 / 26

Coin-tossing game with the third player Complete information game between three players. Forecaster decides the price of the ticket Skeptic bets on the outcome. Reality decides the outcome. K 0 = 1: given FOR n = 1, 2,... Forecaster announces p n [0, 1]. Skeptic announces M n R. Reality announces x n {0, 1}. K n := K n 1 + M n (x n p n ). END FOR Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 14 / 26

Coin-tossing game with the third player In this game Skeptic can force 1 lim n n n (x i p i ) = 0 i=1 He can also force p n < n n x n < Forecaster can also have strategies. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 15 / 26

Bayesian Skeptic for a coin-tossing game (without Forecaster) Although the proof of SLLN in Shafer and Vovk (2001) is short, we gave an alternative proof (just for a coin-tossing game) based on Bayesian Skeptic (in Stochastic Analysis and Applications, 2008). We found that the strategy was already discussed in Jean Ville (1939 ls ) Étude critique de la notion de collectif (English translation by G.Shafer). Kullback-Leibler divergence very naturally comes out from our strategy. So Ville might have known KL-divergence. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 16 / 26

Bayesian Skeptic for a coin-tossing game (without Forecaster) We suppose that Skeptic uses a strategy based on a beta prior distribution for p p p α 1 (1 p) β 1 /B(α, β), where α, β are prior numbers of heads and tails. Then his prediction of success probability for the n-th round is ˆp n = Consider Skeptics strategy Number of heads up to n 1 + α. n 1 + α + β ˆp n p P : M n = K n 1 p(1 p) In the following we let 1 = α = β for notational simplicity (uniform prior). Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 17 / 26

Bayesian Skeptic for a coin-tossing game If Skeptic uses this P, then his capital at time n is explicitly given as K n = 1 h n!t n! (n + 1)!p hn (1 p) = 0 phn (1 p) tn dp, (2) tn p hn (1 p) tn where h n = x 1 + + x n (# of heads), and t n = n h n. Proof is easy by induction This is a likelihood ratio of Bayes marginal distribution and the binomial distribution with the risk neutral probability p. In general, a capital process K n is always a likelihood ratio. LR process is a non-negative martingale process. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 18 / 26

KL divergence and capital process Stirling s formula for x! ( log x! = x + 1 ) log x x + O(1) = x log x x + O(log x) 2 Asymptotic behavior of log K n log K n = log h n! + log t n! log(n + 1)! h n log p t n log(1 p) = h n log h n + t n log t n n log n (h n + t n n) h n log p t n log(1 p) + O(log n) = h n log h n np + t t n n log + O(log n). n(1 p) Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 19 / 26

KL divergence and capital process The sum of the first two terms is the KL divergence. Hence log K n = nd ( h n p) + O(log n). n If h n /n deviates from p, then Skeptic s capital K n grows exponentially with the rate D ( h n n p ). This is the large deviation principle. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 20 / 26

Non-negative martingales and likelihood ratios As a standard textbook material, it can be easily checked that in the measure-theoretic framework the following two things are equivalent..1 Non-negative martingales with expected value 1..2 Likelihood ratios Martingale LR Let F n, n = 0, 1, 2,... be a filtration. Let F = F be the smallest σ-field containing them. Fix a probability measure P on F and let K n, n = 0, 1, 2,... be a non-negative martingale under P with E(K n ) = 1, n. Define Q n on F n by Q n (A) = A K n dp, A F n. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 21 / 26

Non-negative martingales and likelihood ratios Then it is an easy exercise to show that Q n s are a consistent (i.e. Q n (A) = Q n+1 (A), A F n ) family of distributions and K n is the likelihood ratio: K n = dq n /dp n. LR Martingale Let Q 1, Q 2,... be a consistent family of probability distributions on F n, n = 0, 1,..., such that each Q n is absolutely continuous with P. Define Then E(K n ) = Ω K n = dq n dp. dq n dp dp = dq n = Q n (Ω) = 1. Ω Furthermore it can be easily shown that E(K n+1 F n ) = K n and this is equivalent to the consistency condition. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 22 / 26

Non-negative martingales and likelihood ratio (GTP) From GTP, the capital process K n 0 is a non-negative martingale with expected value 1 under any risk neutral probability measure. However not all non-negative measure-theoretic martingales with expected value 1 can be realized as a capital process. It depends on how rich is the move space of Skeptic, i.e., what kind of strategies are allowed to Skeptic. If the game is complete, such as the coin-tossing game, then the converse is true. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 23 / 26

A sequential test can be constructed from betting Let K n be a non-negative martingale with E(K n ) = 1. By Markov inequality P(sup K n 1/α) α. n Hence a sequential testing procedure with the level of significance α is constructed by rejecting the null hypothesis as soon as K n 1/α. Suppose that the data generating process for X 1, X 2,..., is given as a null hypothesis. If you are allowed to bet on X 1, X 2,... and if you can multiply your capital 20-fold, then the null hypothesis is rejected with the significance level of 5%. See for example, New procedures for testing whether stock price processes are martingales in Computational Economics, 2010. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 24 / 26

A sequential test can be constructed from betting In this sequential setting, betting strategies need not be formal or fully specified. Any betting is OK as long as the future observations are never used (of course). On the other hand, when we obtain a batch sample of size n, we often have to be careful that we should have decided to use a particular procedure before seeing the actual data. This hindsight effect even exists in the maximized likelihood. Then various information criteria are needed to take the hindsight effect into consideration. Compared to the standard batch sample setting, use of betting in a sequential test can be more informal. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 25 / 26

Summary of the talk Introduction I have discussed background for game-theoretic probability. I did some mathematics of Bayesian betting strategy for coin-tossing games. I tried to explain why likelihood ratio appears as the capital process. I indicated that capital process (i.e. LR) can be used as a measure of departure from the null hypothesis, leading to a simple sequential test. Takemura (Univ. of Tokyo) Game-theoretic probability May 14, U.Tokyo 26 / 26