Statistical and Machine Learning Approach in Forex Prediction Based on Empirical Data

Similar documents
Understanding neural networks

Iran s Stock Market Prediction By Neural Networks and GA

The Use of Artificial Neural Network for Forecasting of FTSE Bursa Malaysia KLCI Stock Price Index

Development and Performance Evaluation of Three Novel Prediction Models for Mutual Fund NAV Prediction

Predicting Economic Recession using Data Mining Techniques

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

Stock Trading Following Stock Price Index Movement Classification Using Machine Learning Techniques

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

Forecasting stock market prices

Keywords Time series prediction, MSM30 prediction, Artificial Neural Networks, Single Layer Linear Counterpropagation network.

Bond Market Prediction using an Ensemble of Neural Networks

Two kinds of neural networks, a feed forward multi layer Perceptron (MLP)[1,3] and an Elman recurrent network[5], are used to predict a company's

Stock market price index return forecasting using ANN. Gunter Senyurt, Abdulhamit Subasi

Improving Stock Price Prediction with SVM by Simple Transformation: The Sample of Stock Exchange of Thailand (SET)

Journal of Internet Banking and Commerce

Role of soft computing techniques in predicting stock market direction

Price Pattern Detection using Finite State Machines with Fuzzy Transitions

Neuro-Genetic System for DAX Index Prediction

COGNITIVE LEARNING OF INTELLIGENCE SYSTEMS USING NEURAL NETWORKS: EVIDENCE FROM THE AUSTRALIAN CAPITAL MARKETS

Cognitive Pattern Analysis Employing Neural Networks: Evidence from the Australian Capital Markets

COMPARING NEURAL NETWORK AND REGRESSION MODELS IN ASSET PRICING MODEL WITH HETEROGENEOUS BELIEFS

AN ARTIFICIAL NEURAL NETWORK MODELING APPROACH TO PREDICT CRUDE OIL FUTURE. By Dr. PRASANT SARANGI Director (Research) ICSI-CCGRT, Navi Mumbai

Keywords: artificial neural network, backpropagtion algorithm, derived parameter.

ANN Robot Energy Modeling

Valencia. Keywords: Conditional volatility, backpropagation neural network, GARCH in Mean MSC 2000: 91G10, 91G70

$tock Forecasting using Machine Learning

Do Trading Volume and MACD Indicator Contains Information Content of Stock Price? Evidence from China

Introducing GEMS a Novel Technique for Ensemble Creation

Forecasting Foreign Exchange Rate during Crisis - A Neural Network Approach

Applications of Neural Networks in Stock Market Prediction

A Comparative Study of Ensemble-based Forecasting Models for Stock Index Prediction

Using artificial neural networks for forecasting per share earnings

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 2, Mar Apr 2017

An introduction to Machine learning methods and forecasting of time series in financial markets

STOCK MARKET PREDICTION AND ANALYSIS USING MACHINE LEARNING

Prediction Using Back Propagation and k- Nearest Neighbor (k-nn) Algorithm

Performance analysis of Neural Network Algorithms on Stock Market Forecasting

An enhanced artificial neural network for stock price predications

OPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODEL

2015, IJARCSSE All Rights Reserved Page 66

Prediction of Future Stock Close Price using Proposed Hybrid ANN Model of Functional Link Fuzzy Logic Neural Model

A Genetic Algorithm for the Calibration of a Micro- Simulation Model Omar Baqueiro Espinosa

Copy Right to GARPH Page 38

Prediction of Stock Closing Price by Hybrid Deep Neural Network

A Dynamic Hedging Strategy for Option Transaction Using Artificial Neural Networks

STOCK PRICE PREDICTION: KOHONEN VERSUS BACKPROPAGATION

Barapatre Omprakash et.al; International Journal of Advance Research, Ideas and Innovations in Technology

Stock Market Forecasting Using Artificial Neural Networks

The Use of Neural Networks in the Prediction of the Stock Exchange of Thailand (SET) Index

Oesterreichische Nationalbank. Eurosystem. Workshops. Proceedings of OeNB Workshops. Macroeconomic Models and Forecasts for Austria

A Comparative Study of Various Forecasting Techniques in Predicting. BSE S&P Sensex

Multi-factor Stock Selection Model Based on Kernel Support Vector Machine

A Novel Prediction Method for Stock Index Applying Grey Theory and Neural Networks

The use of artificial neural network in predicting bankruptcy and its comparison with genetic algorithm in firms accepted in Tehran Stock Exchange

Forecasting Foreign Exchange Rate by using ARIMA Model: A Case of VND/USD Exchange Rate

State Switching in US Equity Index Returns based on SETAR Model with Kalman Filter Tracking

ARTIFICIAL NEURAL NETWORK SYSTEM FOR PREDICTION OF US MARKET INDICES USING MISO AND MIMO APROACHES

Analyzing predictive performance of linear models on high-frequency currency exchange rates

An Improved Approach for Business & Market Intelligence using Artificial Neural Network

Risk Management in the Australian Stockmarket using Artificial Neural Networks

STOCK MARKET TRENDS PREDICTION USING NEURAL NETWORK BASED HYBRID MODEL

Studies in Computational Intelligence

Artificially Intelligent Forecasting of Stock Market Indexes

LITERATURE REVIEW. can mimic the brain. A neural network consists of an interconnected nnected group of

Evaluate the Ability of Artificial Neural Network to Predict the Stock Price of Non-Metallic Mineral Products Industry in Tehran's Stock Exchange

Genetic Algorithm Based Backpropagation Neural Network Performs better than Backpropagation Neural Network in Stock Rates Prediction

Keywords: artificial neural network, backpropagtion algorithm, capital asset pricing model

Chapter IV. Forecasting Daily and Weekly Stock Returns

APPLICATION OF ARTIFICIAL NEURAL NETWORK SUPPORTING THE PROCESS OF PORTFOLIO MANAGEMENT IN TERMS OF TIME INVESTMENT ON THE WARSAW STOCK EXCHANGE

Dr. P. O. Asagba Computer Science Department, Faculty of Science, University of Port Harcourt, Port Harcourt, PMB 5323, Choba, Nigeria

Alternate Models for Forecasting Hedge Fund Returns

Forecasting Currency Exchange Rates via Feedforward Backpropagation Neural Network

COMPARATIVE STUDY IN ESTIMATING VOLKSWAGEN S PRICE: ARIMA VERSUS ANN

PART II IT Methods in Finance

Stock Price and Index Forecasting by Arbitrage Pricing Theory-Based Gaussian TFA Learning

A Novel Iron Loss Reduction Technique for Distribution Transformers Based on a Combined Genetic Algorithm Neural Network Approach

Neural Network Prediction of Stock Price Trend Based on RS with Entropy Discretization

Abstract Making good predictions for stock prices is an important task for the financial industry. The way these predictions are carried out is often

Decision model, sentiment analysis, classification. DECISION SCIENCES INSTITUTE A Hybird Model for Stock Prediction

ALGORITHMIC TRADING STRATEGIES IN PYTHON

Foreign Exchange Rate Forecasting using Levenberg- Marquardt Learning Algorithm

Foreign Exchange Forecasting via Machine Learning

Stock Market Prediction using Artificial Neural Networks IME611 - Financial Engineering Indian Institute of Technology, Kanpur (208016), India

Application of Innovations Feedback Neural Networks in the Prediction of Ups and Downs Value of Stock Market *

Forecasting stock market return using ANFIS: the case of Tehran Stock Exchange

Application of Deep Learning to Algorithmic Trading

Institute of Actuaries of India Subject CT6 Statistical Methods

Compartmentalising Gold Prices

Method of predicting a change in an economy

Option Pricing Using Bayesian Neural Networks

Statistical Models and Methods for Financial Markets

Data Adaptive Stock Recommendation

Backpropagation and Recurrent Neural Networks in Financial Analysis of Multiple Stock Market Returns

Predicting the stock price companies using artificial neural networks (ANN) method (Case Study: National Iranian Copper Industries Company)

Data based stock portfolio construction using Computational Intelligence

Data Mining Driven Computational Analysis of Stock Markets, Methods and Strategies

PREDICTION OF CLOSING PRICES ON THE STOCK EXCHANGE WITH THE USE OF ARTIFICIAL NEURAL NETWORKS

Application of synthetic observations to develop an artificial neural network for mine dewatering

Forecasting the LFS. Fida Hussain Nick Misoulis Nigel Stuttard

Quant Trader. Market Forecasting and Optimization of Trading Models. Presented by Quant Trade Technologies, Inc.

Transcription:

Statistical and Machine Learning Approach in Forex Prediction Based on Empirical Data Sitti Wetenriajeng Sidehabi Department of Electrical Engineering Politeknik ATI Makassar Makassar, Indonesia tenri616@gmail.com Indrabayu 1, Sofyan Tandungan 2 Department of Informatics Engineering Universitas Hasanuddin Makassar, Indonesia indrabayu@unhas.ac.id, standungan@gmail.com Abstract This study proposed a new insight in comparing common methods used in predicting based on data series i.e statistical method and machine learning. The corresponding techniques are use in predicting Forex (Foreign Exchange) rates. The Statistical method used in this paper is Adaptive Spline Threshold Autoregression (), while for machine learning, Support Vector Machine () and hybrid form of Genetic Algorithm-Neural Network () are chosen. The comparison among the three methods accurate rate is measured in root mean squared error (). It is found that and method has advantages depend on the period time intervals. Keywords forex, prediction,.,, I. INTRODUCTION Forex (Foreign Exchange) is a type of transaction where a party obtains some units in one currency to buy proportion amount in another currency. This exchange is usually conducted in pair currency. The most popular pair and trade worldwide is Euro vs. US Dollar (EUR / USD). In Forex, there are two kinds of analysis, fundamental and technical analysis. Fundamental term refer to the movement of the market in association with news or factors that can affect a country's economy, while technical assessment is mainly observed the supply demand trend through market movements by reading charts and indicators of ongoing market price. In most cases, Forex rates technical prediction are based on statistical charts and machine learning. It is always interesting to measure up both of this procedures in data series prediction, which none of both scheme is likely better than other for each case [1]. A statistical modelling and forecasting using Auto- Regressive Integrated Moving Average (ARIMA) for Gold Bullion Coin has shown promising result with a MAPE (mean absolute percentage error) within 10% [2]. Artificial Intelligence has been researched as well as statistical and machine learning. With a novel approach for efficient weekly market price forecasting, has come to an outstanding result with 99.62% of accurate rate[3]. Recently, A hybrid methods of Artificial Intelligence also fulfill the 30 minutes time frame prediction [4]. This breakthrough allows a practical application for traders in gaining profit within the time frame with all the price indicators i.e. open, close, high and low are predicted as well. These previous research in price forecasting are conducted thoroughly on single method. This study aim to apply Adaptive Spline Threshold Autoregression (), combination of Genetic Algorithm-Neural Network () and Support Vector Machine () to Forex rates prediction and provide a computational comparison of the performance of these techniques. A. Adaptive Spline Threshold Autoregression () is a model obtained from modeling nonlinear time series threshold in Multivariate Adaptive Regression Spline (MARS) method where the predictor is the lagged value of time series data [5]. has the ability to generate continuous models with underlying limit cycles when the time series data indicate periodic behaviour. Similar to MARS, structured by two complementary algorithm. has two stepwise algorithm, which help to get basis functions for model and to get the best appropriate model. First step is forward stepwise algorithm, the model obtained has a very complex structure. Second step is backward stepwise algorithm, basis function in the model from the previous step is turn to reach optimum model. model example is as follows: = + + + + + where: c = constants = coefficient t 1, t 2 = threshold of each variable Z t-d1, and Z t-d2, d1, d2 = lagged predictor variable. B. Support Vector Machine () Support Vector Machine () is known as a machine learning that uses a pair of input and output data in the form of the desired target. The concept of can be explained simply as the search for the best hyper plane which serves as a separator of two classes in the input space [6]. was developed by Boser, Guyon, Vapnik, and was first presented in 1992 at the Annual Workshop on Computational Learning Theory. The basic concept of is actually a harmonious combination of computational theories that have existed decades earlier, such as margin hyperplane, (1) 978-1-5090-5548-7/16/$31.00 2016 IEEE

kernel and concepts supporting others. However, until 1992, there was no attempt to weave these components. In contrast to the neural network strategy that seeks hyperplane separation between classes, trying to find the best hyperplane in the input space. The basic principle of is a linear classifier, and further developed in order to work on a non-linear problem by incorporating the concept of the kernel trick on high-dimensional workspace. The example of linearly separated data is shown in Fig.1. The best hyper plane between two classes can be found by measuring the hyper plane margin and find out the maximum points. Margin is defined as the distance between hyper plane and the closest pattern of each class, which is called support vector. The best hyper plane is defined as the following equation. = + 2) where x refers to a training pattern, w is referred to as the weight vector and b as the bias term., =, = (5), where and are the parameters of sigmoid function. C. Genetic Algorithm-Neural Network (GANN) Genetic Algorithm (GA) are algorithms that seeks to apply a comprehension of the natural evolution in problem solving tasks. The approach taken by this algorithm is to merge a various solutions at random within a population and then evaluate them to obtain the best solution [4]. In Genetic Algorithm, procedure for finding the best solution is operated simultaneously on a few solutions known as population. Individuals in a population are specified as chromosomes. First population is randomly generated, then the next population is the result of the chromosomes evolution through generation. In every generation, chromosomes will be evaluated using fitness function. Fitness value determine chromosomes quality in the population. Artificial Neural Network (ANN) is computer science area that attempt to solve real world problems by proposing a powerful solution. ANN has the capability to learn and generate its own knowledge based no its environment. ANN could be used to model complex relation between inputs and outputs to find patterns in data. Artificial Neural Network (ANN) is a computation system that its architectural and operation based on the knowledge about biological neuron in human brain. ANN is an artificial representation based on human brain that try to copy the learning process of human brain. ANN models has the ability to analyse, predict and associate. ANN ability can be used to learn and generate rules or operation from a few example or given inputs and make a prediction about possible output or save the characteristic of given inputs. Fig. 1. The example of linearly separated data The types of kernel that is often used to establish the rules of decision, namely: 1. A polynomial machine, =, +1 (3) where d is the degree of polynomial kernel II. RESEARCH METHOD A. Input Historical data from 2007 to September is prepared for training data. ly data of Open, High, Low, and Close is obtained from Meta Trader software then divided according to the input data for the designed application. This data will be used to calculate the prediction diagram used as a reference in determining the value of the actual prediction. 2. A radial basis function machine, =exp (4) Fig. 2. The proposed scheme of forex prediction 3. A two-layer neural network machine

B. System design The proposed scheme used for, and GANN system shown in Fig. 2. Prediction for variable Open 1) Training : Training data consist of four records such as Open, High, Low, and Close. In, each records are trained in order to get best model. For case, training input data treated with Kernel calculation of Radial Basis Function (RBF). In GANN, training data input is the records of Open, High, Low, and Close price. 2) Prediction : The next stage is to find the predicted value based on model,, and GANN. Forex historical data in is used in this process. Prediction process is conducted to predict forex value for one day, one week and one month. The output from this process will be validated with actual data. 3) Validation : To validate the result from prediction stage, Predicted value will be compared with actual data in from Meta Trader. Validation is important part to evaluate the performance of the prediction. Root Mean Square Error () is used to determine accuracy of the prediction performance. The smaller the value of correspond to better accuracy. is defined by: = (6) Prediction for variable High Prediction for variable Low where is the number of data, represent actual value, and is the predicted value. C. Output The Output is the best from model,, and GANN. From a series of simulation, it is found that the best value is from validation process of one day, one week, and one month time frame. III. RESULTS AND ANALYSIS A. Adaptive Spline Threshold Autoregression () System is a technique to find the best model from a group of data, thus fully utilizing the data past and present to make accurate short-term prediction. Prediction in system divided into three periods that is prediction in 1 st, 1 st -5 th, and all data in. Figure 3. shown a comparison between actual and prediction result in charts in 1 st. Table 1 shown value with system in 1 st. Prediction for variable Close Fig. 3. Comparison of and Predicted value in 1 st for hourly data in system TABLE I. value for prediction in 1 st in system 0.001433 0.001209 0.001104 0.001318

TABLE II. value using system Oct 1st 0.001433 0.001209 0.001104 0.001318 Oct 1st- 5st 0.001113 0.000961 0.000978 0.001084 Oct 0.001104 0.001531 0.001043 0.001099 1st 1st-5st TABLE III. value using system 0.001433 0.001209 0.001104 0.001318 0.001113 0.000961 0.000978 0.001084 0.001104 0.001531 0.001043 0.001099 Prediction for variable Open Prediction for variable High Table I shown that the lowest value is variable Low. Table 2 shows value for prediction in 1 st, 1 st 5 th, and. Table II shows the difference of values for each variable in different prediction time period. Prediction interval from 1 st to 5 th shown a better performance of system than forecasting value in one month period of. Prediction for variable Low Table III shows the difference of values for each variable in different prediction time period. Prediction interval from 1 st to 5 th shown a better performance of system than forecasting value in one month period of. B. Support Vector Machine () System The prognosis computing in system comprise of three periods that is prediction in 1 st, 1 st - 5 th, and whole data in. Figure 4 shown a comparison between actual and prediction result in charts in 1 st. Table 3 shown value with system in 1 st. Prediction for variable Close Fig. 4. Comparison of and Predicted value in 1 st for hourly data in system

TABLE IV. value for prediction in 1 st in 1.3000 Prediction for variable High 0.001827 0.001562 0.001410 0.001781 1.2950 TABLE V. value for all prediction in system 1.2850 1st 0.001827 0.001562 0.001410 0.001781 1st- 5st 0.001382 0.001205 0.001170 0.001369 0.001322 0.001431 0.001215 0.001322 1.2750 1.2700 TABLE VI. value for prediction in 1 st in GA- NN system Prediction for variable Low 0.000559 0.001747 0.001046 0.001322 From Table IV, it is shown that the lowest value acquired from the Low indicator. Table 4 show value for prediction in 1 st, 1 st 5 th, and. Table V shows the difference values for each variable in different prediction time. Prediction time 1 st until 5 th and shown better performance of system when dealing with a lot of data. Prediction for variable Close C. Genetic Algorithm-Neural Network Prediction in system is divided into three intervals that is prediction in 1 st, 1 st -5 th, and all data in. Figure 5 shown a comparison between actual and prediction result in charts in 1 st. Table 5 shown value with system in 1 st. Table VI shown that the lowest value is variable Open. Table 6 show value for prediction in 1 st, 1 st 5 th, and. Figure 5. Comparison of and Predicted value in 1 st for hourly data in system Prediction for variable Open TABLE VII. value for all prediction in system 1st 1st-5st 0.000559 0.001747 0.001046 0.0013218 0.000307 0.001345 0.000854 0.0013220 0.000293 0.001040 0.001232 0.001295

TABLE VIII. Comparison of value for prediction in 1 0.001433 0.001209 0.001104 0.001318 0.001827 0.001562 0.001410 0.001781 0.000559 0.001747 0.001046 0.001322 TABLE IX. Comparison of value for prediction in 1-5 0.001113 0.000961 0.000978 0.001084 0.001382 0.001205 0.001170 0.001369 0.000307 0.001345 0.000854 0.001144 TABLE X. Comparison of value for prediction in 0.001104 0.001531 0.001043 0.001099 0.001322 0.001431 0.001215 0.001322 0.000293 0.001040 0.001232 0.001295 Table VI shows the difference values for each variable in different prediction time. Prediction time from 1 st to 5 th shown system best performance when dealing with a lot of data. D. Comparing,, and System From Table VIII to Table X, For 1 and 5 days intervals, shows better results in term of High and Close variable because its data show periodic behaviour. On the contrary gives an opposite result for the same variables but shows better results in term of Open and Low. When it comes to longer periods of observations, different results emerge where the Open and High prediction are better with and the rest variables is best from forecasting. From this point of view, traders would have wider option in the future trading especially in dealing with volatile currency pairs. IV. CONCLUSION Performance comparison of Statistical and Machine Learning approach has been shown in this paper. From three time periods of observation i.e. 1 day, 5 days, and 30 days, each methods has benefited outcomes depend on the time periods required. The only exception is that always gives average and normalize results compare to and. REFERENCES [1] Indrabayu, N. Harun, M. S. Pallu, and A. Achmad, Statistic Approach versus Artificial Intelligence for Rainfall Prediction Based on Data Series, ResearchGate, vol. 5, no. 2, pp. 1962 1969, Apr. 2013. [2] L. Abdullah, ARIMA Model for Gold Bullion Coin Selling Prices Forecasting, Int. J. Adv. Appl. Sci., vol. 1, no. 4, pp. 153 158, Dec.. [3] Z. H. U. Quan-yin, Y. I. N. Yong-hu, Y. a. N. Yun-yang, and G. U. Tianfeng, A Novel Efficient Adaptive Sliding Window Model for Weekahead Price Forecasting, Indones. J. Electr. Eng. Comput. Sci., vol. 12, no. 3, pp. 2219 2226, Mar. 2014. [4] A. Sespajayadi, Indrabayu, and I. Nurtanio, Technical data analysis for movement prediction of Euro to USD using Genetic Algorithm-Neural Network, in 2015 International Seminar on Intelligent Technology and Its Applications (ISITIA), 2015, pp. 23 26. [5] P. A. W. Lewis and J. G. Stevens, Nonlinear Modeling of Time Series Using Multivariate Adaptive Regression Splines (MARS), J. Am. Stat. Assoc., vol. 86, no. 416, pp. 864 877, Dec. 1991. [6] C. Cortes and V. Vapnik, Support-vector networks, Mach. Learn., vol. 20, no. 3, pp. 273 297, Sep. 1995.