Challenges in Computational Finance and Financial Data Analysis

Similar documents
The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam

Energy Price Processes

Chapter 18 Volatility Smiles

Statistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay Solutions to Final Exam

The Black-Scholes Model

Statistical methods for financial models driven by Lévy processes

Are stylized facts irrelevant in option-pricing?

The Black-Scholes Model

Computational and Statistical Methods in Finance

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Solutions to Final Exam

Valuing Stock Options: The Black-Scholes-Merton Model. Chapter 13

Rough volatility models: When population processes become a new tool for trading and risk management

Financial Engineering. Craig Pirrong Spring, 2006

Z. Wahab ENMG 625 Financial Eng g II 04/26/12. Volatility Smiles

Black Scholes Equation Luc Ashwin and Calum Keeley

INVESTMENTS Class 2: Securities, Random Walk on Wall Street

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay. Solutions to Final Exam.

Chapter 15: Jump Processes and Incomplete Markets. 1 Jumps as One Explanation of Incomplete Markets

Dependence Structure and Extreme Comovements in International Equity and Bond Markets

Financial Models with Levy Processes and Volatility Clustering

WANTED: Mathematical Models for Financial Weapons of Mass Destruction

1 Introduction. 2 Old Methodology BOARD OF GOVERNORS OF THE FEDERAL RESERVE SYSTEM DIVISION OF RESEARCH AND STATISTICS

Lecture 9: Practicalities in Using Black-Scholes. Sunday, September 23, 12

1 Volatility Definition and Estimation

Modeling via Stochastic Processes in Finance

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Final Exam

Financial Returns: Stylized Features and Statistical Models

Volatility of Asset Returns

BUSM 411: Derivatives and Fixed Income

Econophysics V: Credit Risk

Stochastic Processes and Stochastic Calculus - 9 Complete and Incomplete Market Models

Lecture 3: Probability Distributions (cont d)

Pricing and hedging with rough-heston models

The Impact of Volatility Estimates in Hedging Effectiveness

Fin285a:Computer Simulations and Risk Assessment Section 3.2 Stylized facts of financial data Danielson,

Queens College, CUNY, Department of Computer Science Computational Finance CSCI 365 / 765 Fall 2017 Instructor: Dr. Sateesh Mane.

Chapter 4 Variability

Assessing Regime Switching Equity Return Models

Using Fractals to Improve Currency Risk Management Strategies

The Merton Model. A Structural Approach to Default Prediction. Agenda. Idea. Merton Model. The iterative approach. Example: Enron

Optimal Placement of a Small Order Under a Diffusive Limit Order Book (LOB) Model

Two Hours. Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER. 22 January :00 16:00

IMPA Commodities Course: Introduction

Mixing Di usion and Jump Processes

Smile in the low moments

Market Volatility and Risk Proxies

Pricing Volatility Derivatives with General Risk Functions. Alejandro Balbás University Carlos III of Madrid

Implied Phase Probabilities. SEB Investment Management House View Research Group

FE570 Financial Markets and Trading. Stevens Institute of Technology

Mathematics of Finance Final Preparation December 19. To be thoroughly prepared for the final exam, you should

The Brattle Group 1 st Floor 198 High Holborn London WC1V 7BD

Computer Exercise 2 Simulation

VOLATILITY AND COST ESTIMATING

FINANCIAL ECONOMETRICS AND EMPIRICAL FINANCE MODULE 2

Lecture Note 8 of Bus 41202, Spring 2017: Stochastic Diffusion Equation & Option Pricing

Stochastic Differential Equations in Finance and Monte Carlo Simulations

Modelling the Term Structure of Hong Kong Inter-Bank Offered Rates (HIBOR)

The Black-Scholes Model

Toward Formal Dualities in Asset-Liability Modeling

Foreign Fund Flows and Asset Prices: Evidence from the Indian Stock Market

(A note) on co-integration in commodity markets

Assicurazioni Generali: An Option Pricing Case with NAGARCH

The Importance (or Non-Importance) of Distributional Assumptions in Monte Carlo Models of Saving. James P. Dow, Jr.

Introduction to Game-Theoretic Probability

Results for option pricing

Practical Hedging: From Theory to Practice. OSU Financial Mathematics Seminar May 5, 2008

This homework assignment uses the material on pages ( A moving average ).

1. What is Implied Volatility?

A Scholar s Introduction to Stocks, Bonds and Derivatives

ANSWERS TO END-OF-CHAPTER QUESTIONS

STOR Lecture 15. Jointly distributed Random Variables - III

Principal Component Analysis of the Volatility Smiles and Skews. Motivation

Introduction Credit risk

[AN INTRODUCTION TO THE BLACK-SCHOLES PDE MODEL]

Martingales, Part II, with Exercise Due 9/21

Random Walk for Stock Price

Absolute Return Volatility. JOHN COTTER* University College Dublin

FMS161/MASM18 Financial Statistics Lecture 1, Introduction and stylized facts. Erik Lindström

Statistics and Finance

6.2 Normal Distribution. Normal Distributions

arxiv: v1 [q-fin.cp] 6 Feb 2018

CFE: Level 1 Exam Sample Questions

About Black-Sholes formula, volatility, implied volatility and math. statistics.

Statistacal Self-Similarity:Fractional Brownian Motion

Queens College, CUNY, Department of Computer Science Computational Finance CSCI 365 / 765 Fall 2017 Instructor: Dr. Sateesh Mane.

NEWCASTLE UNIVERSITY SCHOOL OF MATHEMATICS, STATISTICS & PHYSICS SEMESTER 1 SPECIMEN 2 MAS3904. Stochastic Financial Modelling. Time allowed: 2 hours

Machine Learning and the Insurance Industry Prof. John D. Kelleher

Pricing and Risk Management of guarantees in unit-linked life insurance

Linda Allen, Jacob Boudoukh and Anthony Saunders, Understanding Market, Credit and Operational Risk: The Value at Risk Approach

Monte Carlo Simulations

Module 10:Application of stochastic processes in areas like finance Lecture 36:Black-Scholes Model. Stochastic Differential Equation.

1.1 Interest rates Time value of money

Beyond the Black-Scholes-Merton model

Project Proposals for MS&E 444. Lisa Borland and Jeremy Evnine. Evnine and Associates, Inc. April 2008

Assessing Regime Switching Equity Return Models

FIN FINANCIAL INSTRUMENTS SPRING 2008

An Analysis of a Dynamic Application of Black-Scholes in Option Trading

TRADING PAST THE MARKET NOISE

Rough Heston models: Pricing, hedging and microstructural foundations

Transcription:

Challenges in Computational Finance and Financial Data Analysis James E. Gentle Department of Computational and Data Sciences George Mason University jgentle@gmu.edu http:\\mason.gmu.edu/~jgentle 1

Outline Financial data ² Mining nancial data Why we're interested The pro Stylized facts about V ext 2

Outline Financial data Mining financial data 2-a

Outline Financial data Mining financial data Why we re interested 2-b

Outline Financial data Mining financial data Why we re interested The data generating process 2-c

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data 2-d

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns 2-e

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns Text analysis 2-f

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns Text analysis 3

Data I consider whatever can be encoded and stored in the computer to be data. That is, information is data; knowledge is data; a computer program is data; text documents are data; images are data. 4

Financial Data Financial data include balance sheet and earnings statement data officers and directors news items relative to activities of the company or of its competiters etc. etc. etc. stock prices trading volume etc. etc. etc. 5

Data on Trades of Financial Assets I will limit the discussion to data relating to trades of publicly-traded financial assets, or securities. A security may be a share in a corporation, it may be an option on a number of shares, it may be a bond, it may be share in a portfolio of other securities, and so on. There are approximately 2,800 different securities (corporate shares or portfolio shares) traded on the NY Stock Exchange. Each trading day on the NYSE, approximately 2 billion individual shares are traded in approximately 6 million trades for a total of approximately 75 billion dollars. By most measures, the NYSE is the largest market, but there are several others in the US, including the NASDAQ, at which securities similar to those on the NYSE are traded, and various commodities and futures markets. 6

Data on Trades of Financial Assets The primary data are the multivariate time series of price and volume of every trade for each security. In the US, this may be 20 10 6 bivariate time-stamped points (price and volume) daily. This is not extremely large as datasets go nowadays. And unlike the case in the physical sciences, the amount of data does not depend on the number of experiments the scientist is able to do or on the number of sensors or satellites that are deployed to collect the data. Additional data describe activities of companies or other news items that may affect the price. This rather amorphous set of data is quite huge. 7

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns Text analysis 8

Data Mining and Knowledge Discovery In the early 1980s it was discovered that when the winner of the Super Bowl was a team from the old American Football League, the market went up for the rest of the year. Who would have expected such a relationship? It could have been discovered by mining of large and disparate datasets. It is knowledge discovery! (It actually happened.) It is interesting! Unfortunately, it is worthless. Data mining and knowledge discovery must be kept in context. 9

Data Mining and Knowledge Discovery: The January Effect Several years ago, it was discovered that there are anomalies in security prices during the first few days of January. The year after the discovery, the anomalies disappeared (although they re still being discussed). Duh! In the field of finance there is an interesting variation on the uncertainty prinicple. The market is efficient! (If you believe that, you probably believe the tooth fairly is what makes the market efficient.) If there was a systemic reason for the January effect, might that cause result in a cyclic, but attenuated anomaly? 10

Technical Analysis: A Venerable Application of Data Mining Technical analysis (as distinguished from fundamental analysis ) is based only on price data. The assumption is that future price changes are related to patterns of past price changes. Momentum or just a random walk? Head and shoulders or just a random walk? Broadening Top or just a random walk? What happens after one of these quaint patterns? 11

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns Text analysis 12

Why Are We Interested in This Kind of Data? Understanding of the data can help regulators ensure that the trades are fair. Most markets now have in place diagnostic programs that identify suspicious trading activity. The programs are rather primitive. (They work by detecting anomalous data; but to do that we need good models of non-anomalous data.) The ability to mine the potentially relevant text data is lacking. Orderly markets are desirable. Understanding the large volatility swings would help preserve confidence in the markets. 13

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns Text analysis 14

Pricing Models A stochastic model of the price of a stock may view the price as a random variable that depends on previous prices and some characteristic parameters of the particular stock. For example, in discrete time: S t+1 = f(s t, µ, σ) where t indexes time, µ and σ are parameters, and f is some function that contains a random component. The randomness in f may be assumed to reflect all variation in the price that is not accounted for in the model. 15

Pricing Models The model S t+1 = f(s t, µ, σ) is usually given one of two forms, either a time series model, such as a GARCH model, or a stochastic diffusion model driven by Brownian motion. A simple form of the latter type of model, is geometric Brownian motion, ds(t) = µs(t)dt + σs(t)db(t), in which µ and σ are constants, characteristic of the particular stock being modeled. Use of this model, although a somewhat crude approximation, led to a revolution in the pricing of derivative assets. 16

Pricing Models There are several aspects of observational data that indicate that the simple geometric Brownian motion model does not describe the data generating process very well. One approach would be to substitute some other distribution for the Gaussian. Another would be to superimpose some kind of jump process. Whatever kind of model may work best, it is clear that a key component of the model standard deviation of the rate of return (the σ in the geometric Brownian motion model). The is what financial analysts call risk or volatility. 17

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns Text analysis 18

Rates of return do not fit a Gaussian distribution well. Heavy tails. The frequency distribution of rates of return decrease more slowly than exp( x 2 /2). Asymmetry in rates of return. Rates of return are slightly negatively skewed. (Because traders react more strongly to negative information than to positive information.) Asymmetry in lagged correlations. Coarse volatility predicts fine volatility better than the other way around. Aggregational normality. Quasi long range dependence. Seasonality. Custering of volatility. 19

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns Text analysis 20

Volatility Volatility is the standard deviation of the rate of return. A sample standard deviation can usually be used to estimate a model standard deviation. The problem is that it is not constant. Developing a meaningful way to measure volatility in such streaming data is a very interesting research project. The study of volatility, including meaningful ways to measure it, should be a fruitful area for cyber-enabled discovery. 21

A Surrogate for Volatility In the meantime, those who study volatility use the volatility implied by a modified Black-Scholes formula applied to options on the S&P500. It s called the VIX ( volatility index ). Just like other indexes, you can trade futures on it. 22

Volatility 10 20 30 40 0 1000 2000 3000 4000 Time: Daily Jan 2, 1990 Oct 10, 2007 23

Volatility 15 20 25 30 35 0 50 100 150 200 Time: Daily Jan 3, 2007 Oct 10, 2007 24

Volatility Clustering What is the meaning of the clusters of volatility? If we look at the volatility of individual securities, we find a similar clustering. Are volatilities of individual securities positively correlated? (Yes, even if their prices are negatively correlated.) How do you measure correlation of standard deviations? Can increases in volatility of some securities indicate future increased volatility in the index? Can volatility be related to the derivatives market? Can volatility be related to global markets? Volatility patterns suggest constrained clustering. 25

Volatility Clustering Can this swarming behavior be understood? Are there leading indicators of it? Is the most fruitful approach to seek explanations in basic human nature? or, perhaps are there exogenous economic events that trigger volatility increases? or, can an accumulation of various analysts discussions or touts predict increased volatility, perhaps beginning in one sector. 26

Outline Financial data Mining financial data Why we re interested The data generating process Stylized facts about financial data Volatility patterns Text analysis 27

Text Mining There are thousands of documents related to financial assets generated daily. These come in a variety of forms and from a variety of sources. Developing some taxonomy of relevant documents would be a useful exercise. An initial approach would be to limit the catalogue to a small number of documents from a few large financial research houses, and develop methods for relating their content to asset prices. 28

Data Mining of Financial Data Financial data presents a number of challenges for mining. Much of the data mining in this area has yielded only meaningless relationships. Meaningful progress must come from an integrated exploration of data from a wide range of sources, both price/volume data from multiple markets and text data from a variety of sources. 29