Text Mining Part 2. Opinion Mining / Sentiment Analysis. Combining Text procession with Machine Learning

Similar documents
Stock Prediction Using Twitter Sentiment Analysis

MagicBreakout Forex Trading Strategy

Session 3. Life/Health Insurance technical session

Using data mining to detect insurance fraud

Price Action Breakdown. Exclusive Price Action Trading Approach to Financial Markets. by Laurentiu Damir

AI Strategies in Insurance

Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms

Binary Options Trading Strategies How to Become a Successful Trader?

UK Business and Charity Digital Index 2018 Appendix. The fifth edition Benchmarking the digital capability and skills of UK SMEs and charities

Get Smarter. Data Analytics in the Canadian Life Insurance Industry. Introduction. Highlights. Financial Services & Insurance White Paper

As our brand migration will be gradual, you will see traces of our past through documentation, videos, and digital platforms.

FAQs & Required PFOREX Assist Info for Higher efficiency

BUZ. Powered by Artificial Intelligence. BUZZ US SENTIMENT LEADERS ETF INVESTMENT PRIMER: DECEMBER 2017 NYSE ARCA

Predictive Analytics in Insurance Getting it right when your customers need you most

Applying fundamental & technical analysis in stock investing

Topic-based vector space modeling of Twitter data with application in predictive analytics

Based on the audacious premise that a lot more can be done with a lot less.

ORIGINALLY APPEARED IN ACTIVE TRADER M AGAZINE

Point Zero Metatrader4 Indicators

WORKBOOK. The FX Trader s EDGE BLUEPRINT ENCORE EVENT. How to Capture Low Hanging Profits in the New Year Using 3 Simple Blueprints

Stock Market Forecast: Chaos Theory Revealing How the Market Works March 25, 2018 I Know First Research

undiscovered opportunities insurance analytics Advanced analytics for insurance

Better decision making under uncertain conditions using Monte Carlo Simulation

Mortgage Lender Sentiment Survey

Data Abundance and Asset Price Informativeness

Top Down Analysis Success Demands Singleness of Purpose

Applying fundamental & technical analysis in stock investing


Foxzard Trader MT4 Expert Advisor Manual Contents

WHITEPAPER

Stock Trading Following Stock Price Index Movement Classification Using Machine Learning Techniques

Using AI and Factor Testing to Find Multiple Sources of Alpha

Copyright 2008 Congressional Quarterly, Inc. All Rights Reserved. CQ Congressional Testimony SUBCOMMITTEE: DISABILITY ASSISTANCE AND MEMORIAL AFFAIRS

LOCTrailing Expert Advisor with Partial Close user s manual.

NorthPost Partners, LP

Building Winning Algorithmic Trading Systems, + Website: A Trader's Journey From Data Mining To Monte Carlo Simulation To Live Trading (Wiley

Short Selling Stocks For Large And Fast Profits. By Jack Carter

Predicting and Preventing Credit Card Default

HOW TO IMPROVE YOUR TRADING RESULTS STRAIGHT AWAY

March Annuities and the Investor Perspective

Can Twitter predict the stock market?

protecting yourself Money Management SESSION #6

Data Abundance and Asset Price Informativeness

How to Fix the Top 10 Fatal Errors of Trading One Flaw at a Time. April 14: #4 Unrealistic Expectations. From the Active Trend Trader

Instruction (Manual) Document

1st Seminar on Data Science & Analytics 21st July 2018 Changing Landscape of the Actuarial Profession

The Influence of News Articles on The Stock Market.

Statistical Data Mining for Computational Financial Modeling

WEALTHMAKERS PRECISE, PREDICTIVE, PROFITABLE SUMMER 2012 WEALTHMAKERS ONLINE PREDICTIVE RESEARCH

Those who cannot remember the past are condemned to repeat it.

PSYCHOLOGY OF FOREX TRADING EBOOK 05. GFtrade Inc

scalping / altcointrading.net

Bond Pricing AI. Liquidity Risk Management Analytics.

Identifying Market Bottoms: IBD Follow-Through Days

Predictive Claims Processing

STEALTH ORDERS. Page 1 of 12

DIY Trade Manager Plus

Background for Case Study Used in Workshop

A Combined Mining Approach and Application in Tax Administration.

I lavori dello statistico: Swiss Re Corporate Solutions Silvia Catalano (HR Manager Italy) Nicola Linguerri (Head Underwriting Center Marine) Il

Forex Lines Tutorial of Forex Lines 2014 indicators.

concerning the perceived abuse of commissionaire structures

Sentiment Extraction from Stock Message Boards The Das and

SIMPLE SCAN FOR STOCKS: FINDING BUY AND SELL SIGNALS

Real-Time Text Analytics for Event Detection in the Financial World

The FX-Agency Advisor III. User Manual

presented by Thomas Wood MicroQuant SM Divergence Trading Workshop Day One Black Gold

What Came First... Fundamentals or the Technicals? By Jared Martinez

Senior management and investor relations

Knowing When to Buy or Sell a Stock


Inside This Issue. Summer What Data Do You Have? Utilizing Data Analytics for Employee Benefit Plans

Gold and Gold Stocks Patterns, Cycles and Insider Activity, Part 1 December 27, 2017 Author Pater Tenebrarum

Predictive Analytics: The Key to Profitability

with the support of Everyday Banking An easy read guide March 2018

TABLE OF CONTENTS Abstract... page 01 Introduction.. page 02 Comparison Table... page 04 EVOAI Ecosystem.. page 05 EVABOT: Automatic Arbitrage Bot...

123MoneyMaker Guide. Trading Revolution. The Money Making Strategy Guide Presents: Seize your profits with a simple click!

Engaging Pension Plan Participants Using Text Mining to better Understand Participants Thomas Post (Maastricht University and Netspar)

We are not saying it s easy, we are just trying to make it simpler than before. An Online Platform for backtesting quantitative trading strategies.

Data Driven Decision Making

Are New Modeling Techniques Worth It?

Unit 8 - Math Review. Section 8: Real Estate Math Review. Reading Assignments (please note which version of the text you are using)

A Novel Method of Trend Lines Generation Using Hough Transform Method

Advanced Financial Analysis

WHS FutureStation - Guide LiveStatistics

Release Notes. November 2014

Chapter 4.3. Speculating with CFDs

Implementing the Expected Credit Loss model for receivables A case study for IFRS 9

DFAST Modeling and Solution

VantagePoint software

RESEARCHING A COMPANY. Quickstart lesson 2 Includes: Student lessons. Teacher notes & answers

White Paper. Not Just Knowledge, Know How! Artificial Intelligence for Finance!

FOREX TRADING STRATEGIES.

Level III Learning Objectives by chapter

Operational Excellence / Transformative Strategies for Insurers

ΟΜΙΛΙΑ ΔΙΟΙΚΗΣΗ ΑΑΔΕ, ΓΙΩΡΓΟΤ ΠΙΣΙΛΗ, ΣΟ 19 ο ANNUAL CAPITAL LINK INVEST IN GREECE FORUM

HOW TO PROTECT YOURSELF FROM RISKY FOREX SYSTEMS

ContractCoach, LLC. A Jeff Hastings Agency, Inc. Company A-Coach

Trust Through Transparency

Transcription:

Text Mining Part 2 Opinion Mining / Sentiment Analysis Combining Text procession with Machine Learning

Data Mining Data Mining is the non-trivial extraction of previously unknown and potentially useful information from (large collections of) data Real Predictive Analysis Data Mining is about explaining the past We want to find hidden patterns in the data to It does not tell us some magic answer(s) It only gives us more data (or information) which needs to be assessed to see if it is useful predict the future. Can help us understand what is going on in our data => Patterns Ideally suited to a company that was a mature(ish) BI environment

Sentiment Analysis Sentiment analysis or opinion mining Computational study of opinions, sentiments, evaluations, attitudes, appraisal, affects, views, emotions, subjectivity, etc., expressed in text. Reviews, blogs, discussions, news, comments, feedback, or any other documents. Terminology: Sentiment analysis is more widely used in industry. Opinion mining But they can be used interchangeably

Sentiment Analysis Determine the Sentiment of a Document Blog, forum, review, etc. Positive or Negative Sentiment Use machine learning techniques Using Previously labelled Done by human Easiest to get started with Data Mining is about explaining the past Various options for automatic machine learning needs NLP & ML experts and lots of coding to predict the future.

Typical Application Areas Twitter Social Media Product Reviews Call Centre Customer Interactions Discussion Forums Stock Market investment Allows us to do Sentiment Analysis on a large scale We need a tool(s) that can scale

Star Trek Into Darkness Sentiment Analysis Firstly, let me say that both the visual effects and sound track are both great, but it's all down hill from there. The opening scene, I completely agree with Scotty when he says "You know how completely ridiculous it is to hide a starship on the bottom of the ocean?" Yes this ridiculous, it is a spaceship not a submarine. The ship could have stayed in orbit and either beamed the cold fusion device directly into the volcano or beamed Spock with the device down and then beamed him up. This entire scene feels like it was an excuse to get the cast into 23rd century swimmers. Next, the effect when the ships go into warp has changed since the last film. Why do the ships leave a trail of shiny star dust at warp? When Star Trek was rebooted in the last film, the warp effect was updated, this was the time to add this (I still wouldn't have liked this effect). They should have kept this effect consistent for both films. Though this film comes after the Enterprise series, making it canon, the appearance of the Klingons, the design of the Bat'leth and the Bird of Prey have all been changed. These are all key Star Trek components are shouldn't be tampered with. Having Dr. Carol Marcus change uniforms in a shuttle while Kirk is asked to turn his back, is just a pathetic excuse to see Alice Eve in her underwear, and is completely unnecessary to the story. Many parts of

Several fields of computing merge Natural language processing It deals with the actual text element. It transforms it into a format that the machine can use. Artificial intelligence It uses the information given by the NLP and uses a lot of maths to determine whether something is negative or positive. Commercial tools allows you to easily perform Text Mining Using (typically) classification techniques Allows a Data Analysts to do this and concentrate on the task Isolated from the underlying complexity A lot of these (routine) tasks are automated for you

How is it done with Oracle Text & Oracle Advanced Analytics Product Review Human Labelling Tokenization Stop Word Punctuation Text Ready for DM Machine Learning Algorithms Evaluation Model New Product Reviews Sentiment Score Visualisation / Presentation Actionable Insights

What does the Text mining do? Tokenization Stop Word Punctuation Text Ready for DM

Tokenization Tokenization is the process of breaking a stream of text up into words, phrases, symbols, or other meaningful elements called tokens. The list of tokens becomes input for further processing such as parsing or text mining All contiguous strings of alphabetic characters are part of one token; likewise with numbers. Tokens are separated by whitespace characters, such as a space or line break, or by punctuation characters. Punctuation and whitespace may or may not be included in the resulting list of tokens.

Stop Words stop words are words which are filtered out prior to, or after, processing of natural language data (text)

Punctuations Characters that are defined as punctuations are removed from a token before text indexing., : ; @ ~ # { } [ ] + = - _ ( ) * & ^ % $! ` \ /? Product Review Human Labelling Tokenization Stop Word Punctuation Text Ready for DM

What does the machine learning do? Product Demo Product Review Human Labelling Tokenization Stop Word Punctuation Text Ready for DM Machine Learning Algorithms Evaluation Model

Scoring new data Machine Learning Algorithms Evaluation Model New Product Reviews Sentiment Score Visualisation / Presentation Actionable Insights

Other Applications Stock Market Automated buys and sells Stock Indexes Collapse in Minutes as the Computers Take Over May 6 2010 shares of blue-chipper defensive buy Proctor Gamble (PG), dropping over $22 (or 37%) almost instantly. Nobody really knows what happened, but it has been speculated that someone entered a trade that was an error. Too many zeros. Instead of 1,600 shares, they accidentally tried to sell 16 million or so. Oops!

Automatic Trading In what one trader described as "pure chaos," the three-minute plunge triggered by the tweet briefly wiped out $136.5 billion - approximately 105bn - of the S&P 500 index's value, according to Reuters data.

Trading Based on Sentiment Actively traded Fund based on sentiment trends on Tweeter Claim 86% accuracy within 3 days Online Trading System includes Tweeter Sentiment when viewing stocks

Customer Sentiment Tracking customer Sentiment Call Centre & Customer retention Part of Customer Churn management Combined with other Predictive Analytics methods Ensemble Data Mining/Predictive Analytics Can we predict what timeframe they might churn? Is this Big Data? Most of this processing is done on a Laptop/Desktop

Insurance Fraud Insurers discovered a total 118,500 false claims were made, equivalent to 2,279 a week. Using Predictive Analytics assess each Claim as it is received Identify possibility of it being a Claim Identify possible Claim Amount Measure of Risk Exposure : Used to manage work flow and priority Identify potential fraud Works in conjunction with other Fraud prevention measures Supports Claim Risk Exposure measures Various regulatory, group and share holder requirements on Risk Exposure