Topic-based vector space modeling of Twitter data with application in predictive analytics

Size: px
Start display at page:

Download "Topic-based vector space modeling of Twitter data with application in predictive analytics"

Transcription

1 Topic-based vector space modeling of Twitter data with application in predictive analytics Guangnan Zhu (U ) Australian National University COMP4560 Individual Project Presentation Supervisor: Dr. Timothy Graham

2 2

3 Stock prediction is Magic!!! 3

4 4 Module Outline Motivation and Background Goal Methods Experiments Future Work

5 5 Motivation People want to make money from stock market, Stock Price Prediction is attractive Prediction of Stock Price is challenging (some researches believe that stock price follow random walk) Research on relationship between Social Media and Stock Price Movement

6 6 Background Bag of words, Word2vec Problems: language is not merely a bag of words but a tool with particular properties which have been fashioned in the course of its use. The language is "due to meaning" in one sense[1]. Size too big (Document Term Matrix) [1] Zellig S. Harris (1954) Distributional Structure, WORD, 10:2-3, , DOI: /

7 7 Background Topics <- Topic-based Modeling Topic Modeling based can work for Stock Market Prediction [1] LDA can be used as an effective dimension reduction method for text modeling and extract topics from the text [2] [1] Topic Modeling based Sentiment Analysis on Social Media for Stock Market Prediction (Thien Hai Nguyen, 2015) [2] An empirical study of text classification using Latent Dirichlet Allocation, Lei Li; Yimeng Zhang

8 8 Goal Hypothesis: The 'bag of words' model performs worse Topic-based modeling: How can LDA topic modelling be used a feature extraction technique for supervised machine learning using social media data? How accurately can a topic-based vector space model predict Google's stock market prices? What is the relationship between topics and stock price movements? Construct a good prediction model: Improve ML Model

9 9 Dataset Dataset Social Media Dataset 3-years ( ) tweet text of Google collected using the Twitter API Historical Price Dataset: End of day: 3-years ( ) data of Google from NASDAQ Intra-day: 7-months (01/01/ /07/2016) data of Google from NASDAQ Price(Pi+1, Pi) -> Labels -> (Up, Fair, Down) represent price movement

10 10 Methods Combine Tweet Combine In Days (End-of-Day closing price) Stock market open and close in one day Pros: Combined Text is long and easy to extract topics Cons: No. of Instances is small (22 open days per month) Combine In Hours (Intra-day stock prices) Combine every hours Pros: No. of Instances is large (22*24 = 528 hours per month) Cons: Combine Text is short and hard to extract topics

11 11 Method Topic Model Unsupervised topic modeling Latent Dirichlet Allocation(LDA) LDA represents documents as mixtures of topics that spit out words with certain probabilities. α : per-document topic distributions; β : per-topic word distribution

12 12 Method Topic Model Gamma Function construct a document-topic matrix

13 13 Method Topic Model Black Box Topic model is a unsupervised and it is a black box. We don t know the what exactly the topics are. But we can have a look in which words are belongs to the topic.

14 14 Method Machine Learning Prediction using ML ML Methods: Support Vector Machine (SVM) XGBoost

15 15 Result Combine Methods 56% in prediction -> satisfied result ---- (Schumaker and Chen, 2009b; Si et al., 2013; Tsibouris and Zeidenberg, 1995)

16 16 Result No. of Topics (Value of K) (End-of-day data) Accuracy k

17 17 Result ML Methods & Predicted Result (End-ofday data.) 56% in prediction -> satisfied result ---- (Schumaker and Chen, 2009b; Si et al., 2013; Tsibouris and Zeidenberg, 1995)

18 Result ML Methods & Predicted Result (End-ofday data) value_d 0.4 Evaluation_Measures SVM_Acc SVM_F_up_Measure XGB_Acc XGB_F_up_Measure k_value 18

19 19 Conclusion Topics discussed on Twitter can predict stock price movements LDA topic models can be used as input feature for supervised machine learning, and achieve close to stateof-the-art accuracy SVM tends to have better performance compared to more advanced algorithms such as XGBoost Topic-based vector space model performs better than a BoW model

20 20 Future Work LDA just focus on topics of texts. Need to consider more factors like opinions, mood and so on LDA should specific parameter K first. An non-parameter technique is needed The result is not stable. Sometimes may work really bad. It is not suitable for real world prediction. A more stable technique is needed. Prediction of stock price needs more factors. Different models should be fused together.

21 21 Thank you. Any Questions.

The Influence of News Articles on The Stock Market.

The Influence of News Articles on The Stock Market. The Influence of News Articles on The Stock Market. COMP4560 Presentation Supervisor: Dr Timothy Graham U6015364 Zhiheng Zhou Australian National University At Ian Ross Design Studio On 2018-5-18 Motivation

More information

Stock Prediction Using Twitter Sentiment Analysis

Stock Prediction Using Twitter Sentiment Analysis Problem Statement Stock Prediction Using Twitter Sentiment Analysis Stock exchange is a subject that is highly affected by economic, social, and political factors. There are several factors e.g. external

More information

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

Available online at  ScienceDirect. Procedia Computer Science 89 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 89 (2016 ) 441 449 Twelfth International Multi-Conference on Information Processing-2016 (IMCIP-2016) Prediction Models

More information

ECS171: Machine Learning

ECS171: Machine Learning ECS171: Machine Learning Lecture 15: Tree-based Algorithms Cho-Jui Hsieh UC Davis March 7, 2018 Outline Decision Tree Random Forest Gradient Boosted Decision Tree (GBDT) Decision Tree Each node checks

More information

HKUST CSE FYP , TEAM RO4 OPTIMAL INVESTMENT STRATEGY USING SCALABLE MACHINE LEARNING AND DATA ANALYTICS FOR SMALL-CAP STOCKS

HKUST CSE FYP , TEAM RO4 OPTIMAL INVESTMENT STRATEGY USING SCALABLE MACHINE LEARNING AND DATA ANALYTICS FOR SMALL-CAP STOCKS HKUST CSE FYP 2017-18, TEAM RO4 OPTIMAL INVESTMENT STRATEGY USING SCALABLE MACHINE LEARNING AND DATA ANALYTICS FOR SMALL-CAP STOCKS MOTIVATION MACHINE LEARNING AND FINANCE MOTIVATION SMALL-CAP MID-CAP

More information

Novel Approaches to Sentiment Analysis for Stock Prediction

Novel Approaches to Sentiment Analysis for Stock Prediction Novel Approaches to Sentiment Analysis for Stock Prediction Chris Wang, Yilun Xu, Qingyang Wang Stanford University chrwang, ylxu, iriswang @ stanford.edu Abstract Stock market predictions lend themselves

More information

Background for Case Study Used in Workshop

Background for Case Study Used in Workshop Background for Case Study Used in Workshop Fethi Rabhi School of Computer Science and Engineering University of New South Wales Sydney Australia 1 Preliminaries Purpose of lecture Look at domains involved

More information

Stock Trading Following Stock Price Index Movement Classification Using Machine Learning Techniques

Stock Trading Following Stock Price Index Movement Classification Using Machine Learning Techniques Stock Trading Following Stock Price Index Movement Classification Using Machine Learning Techniques 6.1 Introduction Trading in stock market is one of the most popular channels of financial investments.

More information

Can Twitter predict the stock market?

Can Twitter predict the stock market? 1 Introduction Can Twitter predict the stock market? Volodymyr Kuleshov December 16, 2011 Last year, in a famous paper, Bollen et al. (2010) made the claim that Twitter mood is correlated with the Dow

More information

Leverage Financial News to Predict Stock Price Movements Using Word Embeddings and Deep Neural Networks

Leverage Financial News to Predict Stock Price Movements Using Word Embeddings and Deep Neural Networks Leverage Financial News to Predict Stock Price Movements Using Word Embeddings and Deep Neural Networks Yangtuo Peng A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES IN PARTIAL FULFILLMENT OF THE

More information

THE investment in stock market is a common way of

THE investment in stock market is a common way of PROJECT REPORT, MACHINE LEARNING (COMP-652 AND ECSE-608) MCGILL UNIVERSITY, FALL 2018 1 Comparison of Different Algorithmic Trading Strategies on Tesla Stock Price Tawfiq Jawhar, McGill University, Montreal,

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18,   ISSN Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL NETWORKS K. Jayanthi, Dr. K. Suresh 1 Department of Computer

More information

An introduction to Machine learning methods and forecasting of time series in financial markets

An introduction to Machine learning methods and forecasting of time series in financial markets An introduction to Machine learning methods and forecasting of time series in financial markets Mark Wong markwong@kth.se December 10, 2016 Abstract The goal of this paper is to give the reader an introduction

More information

Foreign Exchange Forecasting via Machine Learning

Foreign Exchange Forecasting via Machine Learning Foreign Exchange Forecasting via Machine Learning Christian González Rojas cgrojas@stanford.edu Molly Herman mrherman@stanford.edu I. INTRODUCTION The finance industry has been revolutionized by the increased

More information

Text Mining Part 2. Opinion Mining / Sentiment Analysis. Combining Text procession with Machine Learning

Text Mining Part 2. Opinion Mining / Sentiment Analysis. Combining Text procession with Machine Learning Text Mining Part 2 Opinion Mining / Sentiment Analysis Combining Text procession with Machine Learning Data Mining Data Mining is the non-trivial extraction of previously unknown and potentially useful

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18,   ISSN International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL

More information

Predicting stock prices for large-cap technology companies

Predicting stock prices for large-cap technology companies Predicting stock prices for large-cap technology companies 15 th December 2017 Ang Li (al171@stanford.edu) Abstract The goal of the project is to predict price changes in the future for a given stock.

More information

Forecasting Agricultural Commodity Prices through Supervised Learning

Forecasting Agricultural Commodity Prices through Supervised Learning Forecasting Agricultural Commodity Prices through Supervised Learning Fan Wang, Stanford University, wang40@stanford.edu ABSTRACT In this project, we explore the application of supervised learning techniques

More information

Beyond GLMs. Xavier Conort & Colin Priest

Beyond GLMs. Xavier Conort & Colin Priest Beyond GLMs Xavier Conort & Colin Priest 1 Agenda 1. GLMs and Actuaries 2. Extensions to GLMs 3. Automating GLM model building 4. Best practice predictive modelling 5. Conclusion 2 1) GLMs Linear models

More information

Classification of trading strategies of agents in a competitive market

Classification of trading strategies of agents in a competitive market Classification of trading strategies of agents in a competitive market CS 689 - Machine Learning Final Project presentation Mark Gruman Manjunath Narayana 12/12/27 Application CAT tournament Objective

More information

Top-down particle filtering for Bayesian decision trees

Top-down particle filtering for Bayesian decision trees Top-down particle filtering for Bayesian decision trees Balaji Lakshminarayanan 1, Daniel M. Roy 2 and Yee Whye Teh 3 1. Gatsby Unit, UCL, 2. University of Cambridge and 3. University of Oxford Outline

More information

Beating the market, using linear regression to outperform the market average

Beating the market, using linear regression to outperform the market average Radboud University Bachelor Thesis Artificial Intelligence department Beating the market, using linear regression to outperform the market average Author: Jelle Verstegen Supervisors: Marcel van Gerven

More information

Are New Modeling Techniques Worth It?

Are New Modeling Techniques Worth It? Are New Modeling Techniques Worth It? Tom Zougas PhD PEng, Manager Data Science, TransUnion TORONTO SAS USER GROUP MAY 2, 2018 Are New Modeling Techniques Worth It? Presenter Tom Zougas PhD PEng, Manager

More information

Lazy Prices: Vector Representations of Financial Disclosures and Market Outperformance

Lazy Prices: Vector Representations of Financial Disclosures and Market Outperformance Lazy Prices: Vector Representations of Financial Disclosures and Market Outperformance Kuspa Kai kuspakai@stanford.edu Victor Cheung hoche@stanford.edu Alex Lin alin719@stanford.edu Abstract The Efficient

More information

Session 5. Predictive Modeling in Life Insurance

Session 5. Predictive Modeling in Life Insurance SOA Predictive Analytics Seminar Hong Kong 29 Aug. 2018 Hong Kong Session 5 Predictive Modeling in Life Insurance Jingyi Zhang, Ph.D Predictive Modeling in Life Insurance JINGYI ZHANG PhD Scientist Global

More information

Stock Market Real Time Recommender Model Using Apache Spark Framework

Stock Market Real Time Recommender Model Using Apache Spark Framework Stock Market Real Time Recommender Model Using Apache Spark Framework Mostafa Mohamed Seif ( ), Essam M. Ramzy Hamed ( ), and Abd El Fatah Abdel Ghfar Hegazy ( ) Arab Academy for Science, Technology and

More information

Application of selected methods of statistical analysis and machine learning. learning in predictions of EURUSD, DAX and Ether prices

Application of selected methods of statistical analysis and machine learning. learning in predictions of EURUSD, DAX and Ether prices Application of selected methods of statistical analysis and machine learning in predictions of EURUSD, DAX and Ether prices Mateusz M.@mini.pw.edu.pl Faculty of Mathematics and Information Science Warsaw

More information

Support Vector Machines: Training with Stochastic Gradient Descent

Support Vector Machines: Training with Stochastic Gradient Descent Support Vector Machines: Training with Stochastic Gradient Descent Machine Learning Spring 2018 The slides are mainly from Vivek Srikumar 1 Support vector machines Training by maximizing margin The SVM

More information

INDIAN STOCK MARKET PREDICTOR SYSTEM

INDIAN STOCK MARKET PREDICTOR SYSTEM INDIAN STOCK MARKET PREDICTOR SYSTEM 1 VIVEK JOHN GEORGE, 2 DARSHAN M. S, 3 SNEHA PRICILLA, 4 ARUN S, 5 CH. VANIPRIYA Department of Computer Science and Engineering, Sir M Visvesvarya Institute of Technology,

More information

Boom or Ruin Does it Make a Difference? Using Text Mining and Sentiment Analysis to Support Intraday Investment Decisions

Boom or Ruin Does it Make a Difference? Using Text Mining and Sentiment Analysis to Support Intraday Investment Decisions 2012 45th Hawaii International Conference on System Sciences Boom or Ruin Does it Make a Difference? Using Text Mining and Sentiment Analysis to Support Intraday Investment Decisions Michael Siering Goethe-University

More information

A Big Data Analytical Framework For Portfolio Optimization

A Big Data Analytical Framework For Portfolio Optimization A Big Data Analytical Framework For Portfolio Optimization (Presented at Workshop on Internet and BigData Finance (WIBF 14) in conjunction with International Conference on Frontiers of Finance, City University

More information

Analyzing Representational Schemes of Financial News Articles

Analyzing Representational Schemes of Financial News Articles Analyzing Representational Schemes of Financial News Articles Robert P. Schumaker Information Systems Dept. Iona College, New Rochelle, New York 10801, USA rschumaker@iona.edu Word Count: 2460 Abstract

More information

Do Media Sentiments Reflect Economic Indices?

Do Media Sentiments Reflect Economic Indices? Do Media Sentiments Reflect Economic Indices? Munich, September, 1, 2010 Paul Hofmarcher, Kurt Hornik, Stefan Theußl WU Wien Hofmarcher/Hornik/Theußl Sentiment Analysis 1/15 I I II Text Mining Sentiment

More information

Big Data, Small Data, Medium-sized Data

Big Data, Small Data, Medium-sized Data Big Data, Small Data, Medium-sized Data Making the most of what you ve got 19 April 2016 Phil Joubert William Chan phil.joubert@hk.ey.com William-KW.Chan@hk.ey.com A Big Data timeline Google trends Big

More information

An Introduction to Opinion Mining and its Applications. Ana Valdivia Granada, 17/11/2016

An Introduction to Opinion Mining and its Applications. Ana Valdivia Granada, 17/11/2016 Sentiment Analysis An Introduction to Opinion Mining and its Applications Ana Valdivia Granada, 17/11/2016 About me Ana Valdivia Degree in Mathematics (UPC) MSc in Data Science (UGR) Paper about museums:

More information

Portfolio Recommendation System Stanford University CS 229 Project Report 2015

Portfolio Recommendation System Stanford University CS 229 Project Report 2015 Portfolio Recommendation System Stanford University CS 229 Project Report 205 Berk Eserol Introduction Machine learning is one of the most important bricks that converges machine to human and beyond. Considering

More information

ScienceDirect. Detecting the abnormal lenders from P2P lending data

ScienceDirect. Detecting the abnormal lenders from P2P lending data Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 91 (2016 ) 357 361 Information Technology and Quantitative Management (ITQM 2016) Detecting the abnormal lenders from P2P

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 2, Mar Apr 2017

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 2, Mar Apr 2017 RESEARCH ARTICLE Stock Selection using Principal Component Analysis with Differential Evolution Dr. Balamurugan.A [1], Arul Selvi. S [2], Syedhussian.A [3], Nithin.A [4] [3] & [4] Professor [1], Assistant

More information

Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms

Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms Volume 119 No. 12 2018, 15395-15405 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms 1

More information

BUZ. Powered by Artificial Intelligence. BUZZ US SENTIMENT LEADERS ETF INVESTMENT PRIMER: DECEMBER 2017 NYSE ARCA

BUZ. Powered by Artificial Intelligence. BUZZ US SENTIMENT LEADERS ETF INVESTMENT PRIMER: DECEMBER 2017 NYSE ARCA BUZZ US SENTIMENT LEADERS ETF INVESTMENT PRIMER: DECEMBER 2017 BUZ NYSE ARCA Powered by Artificial Intelligence. www.alpsfunds.com 855.215.1425 Investors have not previously had a way to capitalize on

More information

Model 0: We start with a linear regression model: log Y t = β 0 + β 1 (t 1980) + ε, with ε N(0,

Model 0: We start with a linear regression model: log Y t = β 0 + β 1 (t 1980) + ε, with ε N(0, Stat 534: Fall 2017. Introduction to the BUGS language and rjags Installation: download and install JAGS. You will find the executables on Sourceforge. You must have JAGS installed prior to installing

More information

10 Things We Don t Understand About Finance. 3: The CAPM Is Missing Something!

10 Things We Don t Understand About Finance. 3: The CAPM Is Missing Something! 10 Things We Don t Understand About Finance 3: The CAPM Is Missing Something! Models Need two features Simple enough to understand Complex enough to be generally applicable Does the CAPM satisfy these?

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue IV, April 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue IV, April 18,  ISSN STOCK MARKET PREDICTION USING ARIMA MODEL Dr A.Haritha 1 Dr PVS Lakshmi 2 G.Lakshmi 3 E.Revathi 4 A.G S S Srinivas Deekshith 5 1,3 Assistant Professor, Department of IT, PVPSIT. 2 Professor, Department

More information

Problem set 1 Answers: 0 ( )= [ 0 ( +1 )] = [ ( +1 )]

Problem set 1 Answers: 0 ( )= [ 0 ( +1 )] = [ ( +1 )] Problem set 1 Answers: 1. (a) The first order conditions are with 1+ 1so 0 ( ) [ 0 ( +1 )] [( +1 )] ( +1 ) Consumption follows a random walk. This is approximately true in many nonlinear models. Now we

More information

Session 5. A brief introduction to Predictive Modeling

Session 5. A brief introduction to Predictive Modeling SOA Predictive Analytics Seminar Malaysia 27 Aug. 2018 Kuala Lumpur, Malaysia Session 5 A brief introduction to Predictive Modeling Lichen Bao, Ph.D A Brief Introduction to Predictive Modeling LICHEN BAO

More information

Exploiting Topic based Twitter Sentiment for Stock Prediction

Exploiting Topic based Twitter Sentiment for Stock Prediction Exploiting Topic based Twitter Sentiment for Stock Prediction Jianfeng Si * Arjun Mukherjee Bing Liu Qing Li * Huayi Li Xiaotie Deng * Department of Computer Science, City University of Hong Kong, Hong

More information

Practical example of an Economic Scenario Generator

Practical example of an Economic Scenario Generator Practical example of an Economic Scenario Generator Martin Schenk Actuarial & Insurance Solutions SAV 7 March 2014 Agenda Introduction Deterministic vs. stochastic approach Mathematical model Application

More information

Mean Reverting Asset Trading. Research Topic Presentation CSCI-5551 Grant Meyers

Mean Reverting Asset Trading. Research Topic Presentation CSCI-5551 Grant Meyers Mean Reverting Asset Trading Research Topic Presentation CSCI-5551 Grant Meyers Table of Contents 1. Introduction + Associated Information 2. Problem Definition 3. Possible Solution 1 4. Problems with

More information

Panic Indicator for Measurements of Pessimistic Sentiments from Business News

Panic Indicator for Measurements of Pessimistic Sentiments from Business News International Business Research; Vol. 7, No. 5; 2014 ISSN 1913-9004 E-ISSN 1913-9012 Published by Canadian Center of Science and Education Panic Indicator for Measurements of Pessimistic Sentiments from

More information

Comparative study of credit rating of SMEs based on AHP and KMV. model

Comparative study of credit rating of SMEs based on AHP and KMV. model Joint International Social Science, Education, Language, Management and Business Conference (JISEM 2015) Comparative study of credit rating of SMEs based on AHP and KMV model Gao Jia-ni1, a*, Gui Yong-ping2,

More information

Supervised Learning, Part 1: Regression

Supervised Learning, Part 1: Regression Supervised Learning, Part 1: Max Planck Summer School 2017 Dierent Methods for Dierent Goals Supervised: Pursuing a known goal prediction or classication. Unsupervised: Unknown goal, let the computer summarize

More information

Bayesian Finance. Christa Cuchiero, Irene Klein, Josef Teichmann. Obergurgl 2017

Bayesian Finance. Christa Cuchiero, Irene Klein, Josef Teichmann. Obergurgl 2017 Bayesian Finance Christa Cuchiero, Irene Klein, Josef Teichmann Obergurgl 2017 C. Cuchiero, I. Klein, and J. Teichmann Bayesian Finance Obergurgl 2017 1 / 23 1 Calibrating a Bayesian model: a first trial

More information

We are not saying it s easy, we are just trying to make it simpler than before. An Online Platform for backtesting quantitative trading strategies.

We are not saying it s easy, we are just trying to make it simpler than before. An Online Platform for backtesting quantitative trading strategies. We are not saying it s easy, we are just trying to make it simpler than before. An Online Platform for backtesting quantitative trading strategies. Visit www.kuants.in to get your free access to Stock

More information

$tock Forecasting using Machine Learning

$tock Forecasting using Machine Learning $tock Forecasting using Machine Learning Greg Colvin, Garrett Hemann, and Simon Kalouche Abstract We present an implementation of 3 different machine learning algorithms gradient descent, support vector

More information

distribution of the best bid and ask prices upon the change in either of them. Architecture Each neural network has 4 layers. The standard neural netw

distribution of the best bid and ask prices upon the change in either of them. Architecture Each neural network has 4 layers. The standard neural netw A Survey of Deep Learning Techniques Applied to Trading Published on July 31, 2016 by Greg Harris http://gregharris.info/a-survey-of-deep-learning-techniques-applied-t o-trading/ Deep learning has been

More information

Financial Statements: Modeling and Analytics. Bio Overview Help Prerequisites Materials Assignments Grading Topics

Financial Statements: Modeling and Analytics. Bio Overview Help Prerequisites Materials Assignments Grading Topics Financial Statements: Modeling and Analytics Financial Statements: Modeling and Analytics Bio Overview Help Prerequisites Materials Assignments Grading Topics Spring 2019: Finance and Risk Engineering

More information

By-Peril Deductible Factors

By-Peril Deductible Factors By-Peril Deductible Factors Luyang Fu, Ph.D., FCAS Jerry Han, Ph.D., ASA March 17 th 2010 State Auto is one of only 13 companies to earn an A+ Rating by AM Best every year since 1954! Agenda Introduction

More information

News and narratives in financial Systems

News and narratives in financial Systems News and narratives in financial Systems Exploiting big data for systemic risk assessment Rickard Nyman**, David Gregory*, Sujit Kapadia*, Robert Smith** & David Tuckett** 04/04/2014 *Bank of England,

More information

Understanding delegation in the European Union through machine learning

Understanding delegation in the European Union through machine learning Understanding delegation in the European Union through machine learning Jason Anastasopoulos ljanastas@princeton.edu Anthony M. Bertelli bertelli@nyu.edu September 6, 2017 Abstract The delegation of powers

More information

CS 475 Machine Learning: Final Project Dual-Form SVM for Predicting Loan Defaults

CS 475 Machine Learning: Final Project Dual-Form SVM for Predicting Loan Defaults CS 475 Machine Learning: Final Project Dual-Form SVM for Predicting Loan Defaults Kevin Rowland Johns Hopkins University 3400 N. Charles St. Baltimore, MD 21218, USA krowlan3@jhu.edu Edward Schembor Johns

More information

Exploiting Market Sentiment to Create Daily Trading Signals

Exploiting Market Sentiment to Create Daily Trading Signals Exploiting Market Sentiment to Create Daily Trading Signals Presented by: Dr Xiang Yu LT-Accelerate 22 November 2016, Brussels OptiRisk Systems Ltd. OptiRisk specializes in optimization and risk analytics

More information

Multiple regression - a brief introduction

Multiple regression - a brief introduction Multiple regression - a brief introduction Multiple regression is an extension to regular (simple) regression. Instead of one X, we now have several. Suppose, for example, that you are trying to predict

More information

Mixture Models and Gibbs Sampling

Mixture Models and Gibbs Sampling Mixture Models and Gibbs Sampling October 12, 2009 Readings: Hoff CHapter 6 Mixture Models and Gibbs Sampling p.1/16 Eyes Exmple Bowmaker et al (1985) analyze data on the peak sensitivity wavelengths for

More information

Risk and Risk Management in the Credit Card Industry

Risk and Risk Management in the Credit Card Industry Risk and Risk Management in the Credit Card Industry F. Butaru, Q. Chen, B. Clark, S. Das, A. W. Lo and A. Siddique Discussion by Richard Stanton Haas School of Business MFM meeting January 28 29, 2016

More information

Predicting the Success of a Retirement Plan Based on Early Performance of Investments

Predicting the Success of a Retirement Plan Based on Early Performance of Investments Predicting the Success of a Retirement Plan Based on Early Performance of Investments CS229 Autumn 2010 Final Project Darrell Cain, AJ Minich Abstract Using historical data on the stock market, it is possible

More information

INTELIGENCIA ARTIFICIAL. Machine Learning-Based Analysis of the Association between Online Texts and Stock Price Movements

INTELIGENCIA ARTIFICIAL. Machine Learning-Based Analysis of the Association between Online Texts and Stock Price Movements Inteligencia Artificial 21(61), 95-110 doi: 10.4114/intartif.vol21iss61pp95-110 INTELIGENCIA ARTIFICIAL http://journal.iberamia.org/ Machine Learning-Based Analysis of the Association between Online Texts

More information

State Switching in US Equity Index Returns based on SETAR Model with Kalman Filter Tracking

State Switching in US Equity Index Returns based on SETAR Model with Kalman Filter Tracking State Switching in US Equity Index Returns based on SETAR Model with Kalman Filter Tracking Timothy Little, Xiao-Ping Zhang Dept. of Electrical and Computer Engineering Ryerson University 350 Victoria

More information

Sentiment Extraction from Stock Message Boards The Das and

Sentiment Extraction from Stock Message Boards The Das and Sentiment Extraction from Stock Message Boards The Das and Chen Paper University of Washington Linguistics 575 Tuesday 6 th May, 2014 Paper General Factoids Das is an ex-wall Streeter and a finance Ph.D.

More information

Dynamic Resource Allocation for Spot Markets in Cloud Computi

Dynamic Resource Allocation for Spot Markets in Cloud Computi Dynamic Resource Allocation for Spot Markets in Cloud Computing Environments Qi Zhang 1, Quanyan Zhu 2, Raouf Boutaba 1,3 1 David. R. Cheriton School of Computer Science University of Waterloo 2 Department

More information

Improving Stock Price Prediction with SVM by Simple Transformation: The Sample of Stock Exchange of Thailand (SET)

Improving Stock Price Prediction with SVM by Simple Transformation: The Sample of Stock Exchange of Thailand (SET) Thai Journal of Mathematics Volume 14 (2016) Number 3 : 553 563 http://thaijmath.in.cmu.ac.th ISSN 1686-0209 Improving Stock Price Prediction with SVM by Simple Transformation: The Sample of Stock Exchange

More information

Using Twitter to Analyze Stock Market and Assist Stock and Options Trading

Using Twitter to Analyze Stock Market and Assist Stock and Options Trading University of Connecticut DigitalCommons@UConn Doctoral Dissertations University of Connecticut Graduate School 12-17-2015 Using Twitter to Analyze Stock Market and Assist Stock and Options Trading Yuexin

More information

Credit Scoring. from Concept to Reality. Credit & Collections Conference Boston: June 11 th, 2007

Credit Scoring. from Concept to Reality. Credit & Collections Conference Boston: June 11 th, 2007 Credit Scoring from Concept to Reality Credit & Collections Conference Boston: June 11 th, 2007 2 Agenda 1) Developing & Launching the Credit Scoring Plan Tom Kritzer Navistar Financial Corporation 2)

More information

Machine Learning in Finance

Machine Learning in Finance Machine Learning in Finance Dragana Radojičić Thorsten Rheinländer Simeon Kredatus TU Wien, Vienna University of Technology October 27, 2018 Dragana Radojičić (TU Wien) October 27, 2018 1 / 16 Outline

More information

How Good is 1/n Portfolio?

How Good is 1/n Portfolio? How Good is 1/n Portfolio? at Hausdorff Research Institute for Mathematics May 28, 2013 Woo Chang Kim wkim@kaist.ac.kr Assistant Professor, ISysE, KAIST Along with Koray D. Simsek, and William T. Ziemba

More information

Breaking News: The Influence of the Twitter Community on Investor Behaviour

Breaking News: The Influence of the Twitter Community on Investor Behaviour II Breaking News: The Influence of the Twitter Community on Investor Behaviour Bachelorarbeit zur Erlangung des akademischen Grades Bachelor of Science (B. Sc.) im Studiengang Wirtschaftsingenieur der

More information

Data Adaptive Stock Recommendation

Data Adaptive Stock Recommendation IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Volume 13, PP 06-10 www.iosrjen.org Data Adaptive Stock Recommendation Mayank H. Mehta 1, Kamakshi P. Banavalikar 2, Jigar

More information

Risk profile clustering strategy in portfolio diversification

Risk profile clustering strategy in portfolio diversification Risk profile clustering strategy in portfolio diversification Cathy Yi-Hsuan Chen Wolfgang Karl Härdle Alla Petukhina Ladislaus von Bortkiewicz Chair of Statistics Humboldt-Universität zu Berlin lvb.wiwi.hu-berlin.de

More information

Application of Data Mining Technology in the Loss of Customers in Automobile Insurance Enterprises

Application of Data Mining Technology in the Loss of Customers in Automobile Insurance Enterprises International Journal of Data Science and Analysis 2018; 4(1): 1-5 http://www.sciencepublishinggroup.com/j/ijdsa doi: 10.11648/j.ijdsa.20180401.11 ISSN: 2575-1883 (Print); ISSN: 2575-1891 (Online) Application

More information

Sentiment Analysis of Twitter and RSS News Feeds and Its Impact on Stock Market Prediction

Sentiment Analysis of Twitter and RSS News Feeds and Its Impact on Stock Market Prediction Received: July 12, 2017 68 Sentiment Analysis of Twitter and RSS News Feeds and Its Impact on Stock Market Prediction Shri Bharathi 1* Angelina Geetha 1 Revathi Sathiynarayanan 1 1 Department of Computer

More information

Improving Long Term Stock Market Prediction with Text Analysis

Improving Long Term Stock Market Prediction with Text Analysis Western University Scholarship@Western Electronic Thesis and Dissertation Repository May 2017 Improving Long Term Stock Market Prediction with Text Analysis Tanner A. Bohn The University of Western Ontario

More information

Data Science using R, Minitab & XLMiner

Data Science using R, Minitab & XLMiner Data Science using R, Minitab & XLMiner R, Minitab XLMiner for Forecasting AGENDA Data Mining Supervised & Unsupervised (Machine Learning) Text Mining & NLP Data Visualization usingtableau AGENDA What

More information

A Multi-topic Approach to Building Quant Models. Bringing Semantic Intelligence to Financial Markets

A Multi-topic Approach to Building Quant Models. Bringing Semantic Intelligence to Financial Markets A Multi-topic Approach to Building Quant Models Bringing Semantic Intelligence to Financial Markets Data is growing at an incredible speed Source: IDC - 2014, Structured Data vs. Unstructured Data: The

More information

ADVANCED OPERATIONAL RISK MODELLING IN BANKS AND INSURANCE COMPANIES

ADVANCED OPERATIONAL RISK MODELLING IN BANKS AND INSURANCE COMPANIES Small business banking and financing: a global perspective Cagliari, 25-26 May 2007 ADVANCED OPERATIONAL RISK MODELLING IN BANKS AND INSURANCE COMPANIES C. Angela, R. Bisignani, G. Masala, M. Micocci 1

More information

Modelling economic scenarios for IFRS 9 impairment calculations. Keith Church 4most (Europe) Ltd AUGUST 2017

Modelling economic scenarios for IFRS 9 impairment calculations. Keith Church 4most (Europe) Ltd AUGUST 2017 Modelling economic scenarios for IFRS 9 impairment calculations Keith Church 4most (Europe) Ltd AUGUST 2017 Contents Introduction The economic model Building a scenario Results Conclusions Introduction

More information

Automated Options Trading Using Machine Learning

Automated Options Trading Using Machine Learning 1 Automated Options Trading Using Machine Learning Peter Anselmo and Karen Hovsepian and Carlos Ulibarri and Michael Kozloski Department of Management, New Mexico Tech, Socorro, NM 87801, U.S.A. We summarize

More information

DATA AND TEXT MINING OF FINANCIAL MARKETS USING NEWS AND SOCIAL MEDIA

DATA AND TEXT MINING OF FINANCIAL MARKETS USING NEWS AND SOCIAL MEDIA DATA AND TEXT MINING OF FINANCIAL MARKETS USING NEWS AND SOCIAL MEDIA A DISSERTATION SUBMITTED TO THE UNIVERSITY OF MANCHESTER FOR THE DEGREE OF MASTER OF SCIENCE IN THE FACULTY OF ENGINEERING AND PHYSICAL

More information

******************************* The multi-period binomial model generalizes the single-period binomial model we considered in Section 2.

******************************* The multi-period binomial model generalizes the single-period binomial model we considered in Section 2. Derivative Securities Multiperiod Binomial Trees. We turn to the valuation of derivative securities in a time-dependent setting. We focus for now on multi-period binomial models, i.e. binomial trees. This

More information

Tests for Two Variances

Tests for Two Variances Chapter 655 Tests for Two Variances Introduction Occasionally, researchers are interested in comparing the variances (or standard deviations) of two groups rather than their means. This module calculates

More information

Adjusting for earnings volatility in earnings forecast models

Adjusting for earnings volatility in earnings forecast models Uppsala University Department of Business Studies Spring 14 Bachelor thesis Supervisor: Joachim Landström Authors: Sandy Samour & Fabian Söderdahl Adjusting for earnings volatility in earnings forecast

More information

INTRODUCTION AND OVERVIEW

INTRODUCTION AND OVERVIEW CHAPTER ONE INTRODUCTION AND OVERVIEW 1.1 THE IMPORTANCE OF MATHEMATICS IN FINANCE Finance is an immensely exciting academic discipline and a most rewarding professional endeavor. However, ever-increasing

More information

Large-Scale SVM Optimization: Taking a Machine Learning Perspective

Large-Scale SVM Optimization: Taking a Machine Learning Perspective Large-Scale SVM Optimization: Taking a Machine Learning Perspective Shai Shalev-Shwartz Toyota Technological Institute at Chicago Joint work with Nati Srebro Talk at NEC Labs, Princeton, August, 2008 Shai

More information

Engaging Pension Plan Participants Using Text Mining to better Understand Participants Thomas Post (Maastricht University and Netspar)

Engaging Pension Plan Participants Using Text Mining to better Understand Participants Thomas Post (Maastricht University and Netspar) Engaging Pension Plan Participants Using Text Mining to better Understand Participants Thomas Post (Maastricht University and Netspar) What is the Problem? Don t know their expected pension benefits 56%

More information

Stock Price Prediction using Deep Learning

Stock Price Prediction using Deep Learning San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Spring 2018 Stock Price Prediction using Deep Learning Abhinav Tipirisetty San Jose State University

More information

International Journal of Advance Engineering and Research Development REVIEW ON PREDICTION SYSTEM FOR BANK LOAN CREDIBILITY

International Journal of Advance Engineering and Research Development REVIEW ON PREDICTION SYSTEM FOR BANK LOAN CREDIBILITY Scientific Journal of Impact Factor (SJIF): 4.72 International Journal of Advance Engineering and Research Development Volume 4, Issue 12, December -2017 e-issn (O): 2348-4470 p-issn (P): 2348-6406 REVIEW

More information

Statistical estimation

Statistical estimation Statistical estimation Statistical modelling: theory and practice Gilles Guillot gigu@dtu.dk September 3, 2013 Gilles Guillot (gigu@dtu.dk) Estimation September 3, 2013 1 / 27 1 Introductory example 2

More information

Relative and absolute equity performance prediction via supervised learning

Relative and absolute equity performance prediction via supervised learning Relative and absolute equity performance prediction via supervised learning Alex Alifimoff aalifimoff@stanford.edu Axel Sly axelsly@stanford.edu Introduction Investment managers and traders utilize two

More information

Statistical Models of Word Frequency and Other Count Data

Statistical Models of Word Frequency and Other Count Data Statistical Models of Word Frequency and Other Count Data Martin Jansche 2004-02-12 Motivation Item counts are commonly used in NLP as independent variables in many applications: information retrieval,

More information

Multi-factor Stock Selection Model Based on Kernel Support Vector Machine

Multi-factor Stock Selection Model Based on Kernel Support Vector Machine Journal of Mathematics Research; Vol. 10, No. 5; October 2018 ISSN 1916-9795 E-ISSN 1916-9809 Published by Canadian Center of Science and Education Multi-factor Stock Selection Model Based on Kernel Support

More information

Machine Learning Performance over Long Time Frame

Machine Learning Performance over Long Time Frame Machine Learning Performance over Long Time Frame Yazhe Li, Tony Bellotti, Niall Adams Imperial College London yli16@imperialacuk Credit Scoring and Credit Control Conference, Aug 2017 Yazhe Li (Imperial

More information

Data Mining: A Closer Look. 2.1 Data Mining Strategies 8/30/2011. Chapter 2. Data Mining Strategies. Market Basket Analysis. Unsupervised Clustering

Data Mining: A Closer Look. 2.1 Data Mining Strategies 8/30/2011. Chapter 2. Data Mining Strategies. Market Basket Analysis. Unsupervised Clustering Data Mining: A Closer Look Chapter 2 2.1 Data Mining Strategies Data Mining Strategies Unsupervised Clustering Supervised Learning Market Basket Analysis Classification Estimation Prediction Figure 2.1

More information