An Improved Version of Kurtosis Measure and Their Application in ICA
|
|
- Ernest Briggs
- 6 years ago
- Views:
Transcription
1 International Journal of Wireless Communication and Information Systems (IJWCIS) Vol 1 No 1 April, An Improved Version of Kurtosis Measure and Their Application in ICA Md. Shamim Reza 1, Mohammed Nasser and Md. Shahjaman 1 Pabna Science & Technology University, Department of Mathematics, Rajapur, Pabna-6600, Bangladesh. Department of Statistics, Rajshahi University, Department of Statistics, Begum Rokeya University, mshamim_stat@yahoo.com, mnasser.ru@gmail.com, shahjaman_brur@yahoo.com Abstract: Kurtosis plays important role in defining shape characteristics of a probability distribution, and also in extracting as well as sorting independent components. From recent research on various versions of classical kurtosis we see that all the measures substantially underestimate kurtosis parameter and exhibit high variability when underlying population distribution is highly skewed or heavy tailed. This is unwanted for independent component analysis (ICA). In this paper, we propose a bootstrap bias corrected kurtosis estimator and compare performances of proposed bootstrap bias corrected estimator with two empirical bias corrected kurtosis measure that is found best in recent works. We use both simulated and real data and investigate the bias, standard error, MSE of each estimator under a variety of situations and also take help of various plots to judge their performances. We observe that our proposed bootstrap bias corrected kurtosis estimators perform better than the class of classical estimators for non-normal situations of univariate population. We then apply our measure in sorting independent components of both data sets and try to examine the capacity of PCA, ICA and ICA on PCA for finding groups. In both data sets ICA on PCA a new visualization technique shows the maximum discriminating power whereas PCA the least. We recommend using our proposed measure in both extracting and sorting independent components. Keywords: Kurtosis, Monte Carlo Simulation, Bootstrapping, PCA, ICA. 1. Introduction It is typically noted in introductory statistics courses that distributions can be characterized in terms of central tendency, variability, and shape. With respect to shape, virtually every textbook defines and illustrates skewness. On the other hand, another aspect of shape, which is kurtosis, is either not discussed or, worse yet, is often described or illustrated incorrectly ( DeCarlo, 1997 and Joanes et al., 1998). Kurtosis is also useful for IC s ordering (Scholz et al., 00; Scholz and Selbig, 007 etc). In principal component analysis, PC s are ordered by corresponding eigen values. But in independent component analysis, these components have no order. For practical reasons to define a criterion for sorting these components to our interest. One measurement which can match our interest very well, is kurtosis. In a recent work Lihua and Ahmed (008) proposed two unbiased sample measures of kurtosis and compared them with three sample measures of kurtosis adapted by various software packages (Minitab, SAS etc) for data from normal and nonnormal populations. Their proposed second estimator is the best performer in normal situations but in non-normal situations all estimators show unwanted large fluctuations. For this reason they put forward two new empirical bias corrected kurtosis estimator. In order to correct the bias, their empirical formulas are provided only for student-t and chisquared distributions. However, empirical estimates are subject to extra variation which results in inflated MSE. In this article we place a bootstrap bias corrected kurtosis estimator. It is worth mentioning that ICA is meaningful for non-normal situation. For purely Gaussian (Normal) distributed data, no unique independent components can be extracted (Hyvarinen and Oja. 000). In section we define the classical measure of kurtosis estimator that we consider in our study. Section we propose a bootstrap bias-corrected kurtosis estimator. In section we compare two empirical bias corrected kurtosis estimators with our propose bootstrap bias corrected kurtosis estimator and finds the overall best performer. We then apply our estimator in sorting independents components for finding data clustering. The final section gives conclusion.. Kurtosis Pearson (1905) introduced kurtosis as a measure of how flat the top of a symmetric distribution is when compared to a normal distribution of the same variance. Kurtosis can be formally defined as the standardized fourth population moment about the mean E X E X Where E is the expectation operator, is the mean, is the fourth moment about the mean, and is the standard deviation. The normal distribution has a kurtosis of, and is often used so that the reference normal distribution has a kurtosis zero. A sample counterpart to can be obtained by replacing population moments with the sample moments, which gives b X n X X i X i (1)
2 International Journal of Wireless Communication and Information Systems (IJWCIS) Vol 1 No 1 April, where b is the sample kurtosis, X bar is the sample mean, and n is the number of observations..1 Some Classical Measure of Kurtosis Estimator Let X 1, X,,X n be a random sample of size n, then a commonly used consistent estimator of is given by x xi x () x i The above estimator is not unbiased. Cramer (196) gave the amount of bias of the following results for normal distributions: 6 Bias () n 1 Another frequently used estimator of adopted by SAS is defined as U n 1 n 1 n n 6 It has been proved that is unbiased for normal distributions. We refer to Fisher (199), Joanes and Gill (1998) and others. U () The kurtosis measure adapted by MINITAB is defined by M n 1 n Joanes and Gill (1998) showed that for normal distributions M Bias n 1 (5) 1 (6) n n 1 n 1 Recently developed two Kurtosis estimator proposed by Lihua and Ahmed(008). They are correcting the bias given in () and (6) yields two new estimators as follows: N 1 6 n 1 (7) And N M n 1 1 (8) n n 1 Consequentially, for normal data, unbiased estimators of. N1 and N are both All five estimators are biased for non-normal populations, and bias is inflated in a range of the parameter space, For detailed description, we refer to Lihua and Ahmed (008) article. It seems to be an appealing idea to construct a biascorrected estimator. For student-t and Chi-square distribution, non-normal situations Lihua and Ahmed (008) suggest employing a bias-reduction technique based on the N best performing estimator. They proposed a new biascorrected estimator may be defined as N N 1 N Simulation experiment is conducted to inspect the bias and MSE of the estimators. The result shows that these estimators effectively reduced the bias to a negligible level; however, extremely large variance was introduced due to the quadratic form, resulting in inflated MSE. They also proposed, a simple linear regression model without independent variable N N 1 N for only small degrees of freedom. The variance of this fitted estimator is greater than the original biased estimator; however it is not inflated too much... Limitations of the empirical bias-correction estimators a) The main problem of the above empirical bias correction is that, these formulas are provided only for student-t and chi-squared distributions. But doesn t consider other distribution to correct the bias. b) The performance of this bias-corrected estimator depends on a table as well as specified sample size. c) The empirical bias correction estimators effectively reduced the bias but extremely large variance was introduced.. Propose Bootstrap Bias Corrected Estimator All the estimators substantially underestimate kurtosis parameter when underlying population distribution is highly skewed or heavy tailed. In order to correct the bias, empirical formulas are provided for student-t and chi-squared distributions. However, empirical estimates are subject to extra variation introduced which results in inflated MSE. Perhaps, some re-sampling methods such as bootstrap and Jackknife may be considered to reduce the bias as well as keeping a relatively lower variance. Thus we want to use a popular re-sampling method bootstrapping, to overcome the problem of empirical bias-correction. For correcting the bias using bootstrap, we use second estimator of Lihua and Ahmed N. Because of N estimator performs well for normal as well as non-normal populations in many situations. Finally, our propose bootstrap bias corrected estimator is given by Where N N Bia Bia s boot * t x tf t. tf * s boot EFn n n
3 International Journal of Wireless Communication and Information Systems (IJWCIS) Vol 1 No 1 April, t B * b. 1 t B x * b.1 Our Bootstrapping Method and Used Estimator In our research we use 5000 bootstrap samples for calculating bias and MSE of sizes n = 0, n = 0, n = 50, each number of replicated 1000 times. Each respective sample take from student-t and chi-squared distribution with d.f and 5. Then we have got the bootstrapped aggregated results and comparing bootstrap bias corrected MSE and empirical bias corrected MSE. We know that kurtosis of student-t and chi-squared distributions are, n n Kur t where n> 1 Kur n The bootstrap aggregated bias calculate as follows Bias 5000 N i MSE Var boot Kurtosis N Bias, then performs for any distributions, any sample size but empirical bias-corrected estimator performs only student-t and chisquared distributions for specified sample size.. Results To simulate skewed and heavy tailed data, 5000 samples of sizes 0, 0 and 50 are randomly taken from and studentt distribution with degrees of freedom and 5. Now we compare among our proposed bootstrap bias corrected estimator and two empirical bias-corrected estimators. We find bias, mean square error of bias correction estimators at different non-normal populations. The results are represented in different tables and plots. Table 1.MSE comparison of chi-square distribution Sample Size d.f Bootstrap log(mse) Emperical-1 Emperical Figure 1, Bootstrap algorithm for calculating bias corrected estimator. To simulate skewed and heavy tailed data, 5000 samples of sizes 0,0 and 50 are randomly taken from χ and student t distribution with degrees of freedom and 5. Now we compare among our proposed bootstrap bias-corrected estimator and two empirical bias-corrected estimators. Our proposed bootstrap bias-corrected estimator is more advantages than empirical estimators because our estimators Figure. MSE comparison for Chi-square distribution The Table-1 and fig.1 shows that the proposed bootstrap bias corrected measure gives the minimum MSE values for (skewed) distributions of sizes 0, 0 and 50 with df, 5. We found that our proposed estimators give greater discrepancy than first empirical correction but relatively lower difference than second empirical bias-corrected estimator based on MSE criterion. The table shows that the proposed bootstrap bias corrected measure gives the minimum MSE values than first empirical bias-corrected estimator for student-t (heavy tailed) distributions of sizes 0, 0 and 50 with df,5. We also found that second empirical estimator performs well than our
4 International Journal of Wireless Communication and Information Systems (IJWCIS) Vol 1 No 1 April, estimators for d.f, but results in favor of our estimators when df increases to 5. Sample Size Table.MSE comparison of t-distribution d.f Bootstrap log(mse) Emperical-1 Emperical Oja. 000), therefore, ICA should only be applied to data sets where we can find components that have a non-gaussian distribution. Examples of super-gaussian distributions (highly positive kurtosis) are speech signals, because these are predominantly close to zero. However, for molecular data sub-gaussian distributions (negative kurtosis) are more interesting. Negative kurtosis can indicate a cluster structure or at least a uniformly distributed factor. Thus the components with the most negative kurtosis can give us the most relevant information. Experiment-1 (Simulation Study) In our research, first we generate four known distribution Normal, Chi-square, t and Uniform of size 100 with taken their different mean, mixing this four distribution and finding out which visualization techniques gives better identification of distribution pattern from mixture. 5. Application in ICA Independent component analysis (ICA) is a statistical method used to discover hidden factors(sources or features) from a set of measurements or observed data such that the sources are maximally independent. The ICA algorithms are able to separate the sources according to the distribution of the data. Independent component analysis (ICA) (Hyvarinen et al., 001), and projection pursuit (PP)(Jones and Sibson, 1987), are closely related techniques, which try to look for interesting directions (projections) in the data. ICA assumes a model, x = AS where x is a vector of observed random variables, A is a d d mixing matrix, and S is a vector of independent latent variables. The task then is to find A to recover S. A key assumption is usually that the S have different kurtosises K j, in order to separate the different independent components. In practice ICA usually measures interestingness of a linear combination a T x in terms of the size of its absolute kurtosis or some related measures. Since for a Gaussian random variables the kurtosis is zero, this criterion measures to some extent, non-gaussianity. j Figure. Original pattern of simulated data (a) Normal (b) Chi square (c) t (d) Uniform distribution. 5.1 Role of Kurtosis in ICA In principal component analysis, pc s are ordered by eigen value where first eigen value is first pc, second eigen value second pc and so on. But in independent component analysis, These components have no order. For practical reasons to define a criterion for sorting these components to our interest. One measurement which can match our interest very well, is kurtosis. Kurtosis is a classical measure of non- Gaussianity, and is computationally and theoretically relatively simple. It indicates whether the data are peaked or flat, relative to a Gaussian (normal) distribution. A Gaussian distribution has a kurtosis of zero. Positive kurtosis indicates a peaked distribution (super-gaussian) and negative kurtosis indicates a flat distribution (sub-gaussian). Now mixing this four distribution (Original sources), and if we apply PCA, ICA and ICA on PCA on the mixture data experiment, to investigate what techniques gives better identification. From purely Gaussian distributed data, no unique independent components can be extracted (Hyvarinen and
5 International Journal of Wireless Communication and Information Systems (IJWCIS) Vol 1 No 1 April, Experiment- (Experiment of Breast cancer data) In breast cancer data, contains 10 variables and 107 observations. When we apply PCA we see that loadings of the first five PC s that explains 8 percent variability of the data set. Now we apply PCA and ICA for original data and ICA apply on 5 pc s and use our estimator to sorting IC s. Table (IC s ordering using kurtosis) Figure.. Mixed Sources of four distribution From the above table the largest negative value of kurtosis is -1.1 which is consider first IC s, second largest second IC s and so on. Since negative kurtosis can indicate a cluster structure or at least a uniformly distributed factor. Thus the components with the most negative kurtosis can give us the most relevant information. Figure.5. Performance of different visualization techniques for chi-square distribution. Fig. exhibits PCA and ICA could not detect the required distribution properly, but ICA on PCA as a new development of visualization technique for our experiment, we obtain that for the last case we get the maximum identification of the chi-square distribution, which is our required result for our experiment. Figure.7. On the left, by applying PCA to the total data, the result is worse than the result of ICA. However, by using PCA for preprocessing before applying ICA, a more strongly discriminating component can be extracted, as shown on the right. Figure.6. Performance of different visualization techniques for t and uniform distribution. Fig.5 shows the identification performance of t and uniform distribution, and we see the both cases PCA fails the proper identification, ICA performs well than PCA but ICA on PCA gives better discriminates of the two distributions. 6. Conclusion In this paper we describe five sample measures of kurtosis estimators and comparing the performances of three(empirical-1, empirical- and propose bootstrap) biascorrected kurtosis estimators. Their performances are investigated through simulation and bootstrapping. we consider χ and student-t distribution with three different sample sizes (0, 50 and 50). The estimators are compared with regard to bias and MSE, the bootstrap bias-corrected
6 International Journal of Wireless Communication and Information Systems (IJWCIS) Vol 1 No 1 April, estimators, especially non-normal population for small degrees of freedom performs better than the class of two empirical bias-corrected estimators. We recommend using as a measure of kurtosis especially when the degrees of freedom small as well as large and non-normal population. We then apply our measure in sorting independent components in simulated and Breast cancer data, and try to examine the capacity of PCA, ICA and ICA on PCA for finding groups. In both data sets ICA on PCA a new visualization technique shows the maximum discriminating power whereas PCA the least. References [1] Cramer, H., Mathematical Methods of Statistics, Princeton University Press, Princeton, p [] DeCarlo, L.T. On the meaning and use of kurtosis. Psychological Methods (), [] Fisher, R.A., Moments and product moments of sampling distributions. Proc. London Math. Soc. Ser. 0, [] Hyv arinen, A. and Oja, E.: Independent component analysis: Algorithms and applications. Neural Networks. -5(1): [5] Hyvarinen, A., Karhunen, J. and Oja, E. Independent Component Analysis, John Wiley and Sons, NewYork.001. [6] Jones,M. and Sibson, R. What is projection pursuit? J. of the Royal Statistical Society, Ser. A, 150: [7] Joanes, D.N., Gill, C.A., Comparing measures of sample skewness and kurtosis. Statist. 7, [8] Lihua An, S.Ejaz Ahmed. Improving the performance of kurtosis estimator. Computational Statistics and Data Analysis 5, [9] Matthias Scholz, Yves Gibon, Mark Stitt and Joachim Selbig, Independent component analysis of starch deficient pgm mutants. Proceedings of the German conference on Bioinformatics. Gesellschaft fur infomark, Bonn, pp.95-10,00. [10] Scholz, M., Gatzek, S., Sterling, A., Fiehn, O., and Selbig, J. Metabolite fingerprinting: detecting biological features by independent component analysis. Bioinformatics 0, 7-5, 00. [11] Shamim, M. Nasser, An improved version of kurtosis estimator and their application in ICA 1International conference on computer and information Technology, Program book, page-7, 010. [1] Scholz, M., and Selbig, J. Visualization and analysis of molecular data. Methods Mol Biol 58, 87-10, 007.
Multivariate Outlier Detection Using Independent Component Analysis
Science Journal of Applied Mathematics and Statistics 2015; 3(4): 171-176 Published online June 17, 2015 (http://www.sciencepublishinggroup.com/j/sjams) doi: 10.11648/j.sjams.20150304.11 ISSN: 2376-9491
More informationPARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS
PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS Melfi Alrasheedi School of Business, King Faisal University, Saudi
More informationRobust Critical Values for the Jarque-bera Test for Normality
Robust Critical Values for the Jarque-bera Test for Normality PANAGIOTIS MANTALOS Jönköping International Business School Jönköping University JIBS Working Papers No. 00-8 ROBUST CRITICAL VALUES FOR THE
More informationMEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL
MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL Isariya Suttakulpiboon MSc in Risk Management and Insurance Georgia State University, 30303 Atlanta, Georgia Email: suttakul.i@gmail.com,
More informationOn Some Statistics for Testing the Skewness in a Population: An. Empirical Study
Available at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 12, Issue 2 (December 2017), pp. 726-752 Applications and Applied Mathematics: An International Journal (AAM) On Some Statistics
More informationA Skewed Truncated Cauchy Logistic. Distribution and its Moments
International Mathematical Forum, Vol. 11, 2016, no. 20, 975-988 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/imf.2016.6791 A Skewed Truncated Cauchy Logistic Distribution and its Moments Zahra
More informationInternational Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN
Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL NETWORKS K. Jayanthi, Dr. K. Suresh 1 Department of Computer
More informationOn Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study
Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 8-26-2016 On Some Test Statistics for Testing the Population Skewness and Kurtosis:
More informationMarket Risk Analysis Volume IV. Value-at-Risk Models
Market Risk Analysis Volume IV Value-at-Risk Models Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume IV xiii xvi xxi xxv xxix IV.l Value
More informationInternational Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN
International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL
More informationChapter 7. Inferences about Population Variances
Chapter 7. Inferences about Population Variances Introduction () The variability of a population s values is as important as the population mean. Hypothetical distribution of E. coli concentrations from
More informationThe Two-Sample Independent Sample t Test
Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 The General Formula The Equal-n Formula 4 5 6 Independence Normality Homogeneity of Variances 7 Non-Normality Unequal
More informationGENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy
GENERATION OF STANDARD NORMAL RANDOM NUMBERS Naveen Kumar Boiroju and M. Krishna Reddy Department of Statistics, Osmania University, Hyderabad- 500 007, INDIA Email: nanibyrozu@gmail.com, reddymk54@gmail.com
More informationOn the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal
The Korean Communications in Statistics Vol. 13 No. 2, 2006, pp. 255-266 On the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal Hea-Jung Kim 1) Abstract This paper
More informationLecture 6: Non Normal Distributions
Lecture 6: Non Normal Distributions and their Uses in GARCH Modelling Prof. Massimo Guidolin 20192 Financial Econometrics Spring 2015 Overview Non-normalities in (standardized) residuals from asset return
More information14.1 Moments of a Distribution: Mean, Variance, Skewness, and So Forth. 604 Chapter 14. Statistical Description of Data
604 Chapter 14. Statistical Description of Data In the other category, model-dependent statistics, we lump the whole subject of fitting data to a theory, parameter estimation, least-squares fits, and so
More informationBusiness Statistics 41000: Probability 3
Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404
More informationCABARRUS COUNTY 2008 APPRAISAL MANUAL
STATISTICS AND THE APPRAISAL PROCESS PREFACE Like many of the technical aspects of appraising, such as income valuation, you have to work with and use statistics before you can really begin to understand
More informationSegmentation and Scattering of Fatigue Time Series Data by Kurtosis and Root Mean Square
Segmentation and Scattering of Fatigue Time Series Data by Kurtosis and Root Mean Square Z. M. NOPIAH 1, M. I. KHAIRIR AND S. ABDULLAH Department of Mechanical and Materials Engineering Universiti Kebangsaan
More informationEffects of skewness and kurtosis on model selection criteria
Economics Letters 59 (1998) 17 Effects of skewness and kurtosis on model selection criteria * Sıdıka Başçı, Asad Zaman Department of Economics, Bilkent University, 06533, Bilkent, Ankara, Turkey Received
More informationFitting financial time series returns distributions: a mixture normality approach
Fitting financial time series returns distributions: a mixture normality approach Riccardo Bramante and Diego Zappa * Abstract Value at Risk has emerged as a useful tool to risk management. A relevant
More informationAlternative VaR Models
Alternative VaR Models Neil Roeth, Senior Risk Developer, TFG Financial Systems. 15 th July 2015 Abstract We describe a variety of VaR models in terms of their key attributes and differences, e.g., parametric
More informationNumerical Measurements
El-Shorouk Academy Acad. Year : 2013 / 2014 Higher Institute for Computer & Information Technology Term : Second Year : Second Department of Computer Science Statistics & Probabilities Section # 3 umerical
More informationA New Test for Correlation on Bivariate Nonnormal Distributions
Journal of Modern Applied Statistical Methods Volume 5 Issue Article 8 --06 A New Test for Correlation on Bivariate Nonnormal Distributions Ping Wang Great Basin College, ping.wang@gbcnv.edu Ping Sa University
More informationConsistent estimators for multilevel generalised linear models using an iterated bootstrap
Multilevel Models Project Working Paper December, 98 Consistent estimators for multilevel generalised linear models using an iterated bootstrap by Harvey Goldstein hgoldstn@ioe.ac.uk Introduction Several
More informationSome Characteristics of Data
Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key
More informationUNIVERSITY OF VICTORIA Midterm June 2014 Solutions
UNIVERSITY OF VICTORIA Midterm June 04 Solutions NAME: STUDENT NUMBER: V00 Course Name & No. Inferential Statistics Economics 46 Section(s) A0 CRN: 375 Instructor: Betty Johnson Duration: hour 50 minutes
More informationStatistics for Business and Economics
Statistics for Business and Economics Chapter 7 Estimation: Single Population Copyright 010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-1 Confidence Intervals Contents of this chapter: Confidence
More informationSTRESS-STRENGTH RELIABILITY ESTIMATION
CHAPTER 5 STRESS-STRENGTH RELIABILITY ESTIMATION 5. Introduction There are appliances (every physical component possess an inherent strength) which survive due to their strength. These appliances receive
More informationMuch of what appears here comes from ideas presented in the book:
Chapter 11 Robust statistical methods Much of what appears here comes from ideas presented in the book: Huber, Peter J. (1981), Robust statistics, John Wiley & Sons (New York; Chichester). There are many
More informationModule Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION
Subject Paper No and Title Module No and Title Paper No.2: QUANTITATIVE METHODS Module No.7: NORMAL DISTRIBUTION Module Tag PSY_P2_M 7 TABLE OF CONTENTS 1. Learning Outcomes 2. Introduction 3. Properties
More informationA RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT
Fundamental Journal of Applied Sciences Vol. 1, Issue 1, 016, Pages 19-3 This paper is available online at http://www.frdint.com/ Published online February 18, 016 A RIDGE REGRESSION ESTIMATION APPROACH
More informationA Skewed Truncated Cauchy Uniform Distribution and Its Moments
Modern Applied Science; Vol. 0, No. 7; 206 ISSN 93-844 E-ISSN 93-852 Published by Canadian Center of Science and Education A Skewed Truncated Cauchy Uniform Distribution and Its Moments Zahra Nazemi Ashani,
More informationTHE USE OF THE LOGNORMAL DISTRIBUTION IN ANALYZING INCOMES
International Days of tatistics and Economics Prague eptember -3 011 THE UE OF THE LOGNORMAL DITRIBUTION IN ANALYZING INCOME Jakub Nedvěd Abstract Object of this paper is to examine the possibility of
More informationTechnical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions
Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions Pandu Tadikamalla, 1 Mihai Banciu, 1 Dana Popescu 2 1 Joseph M. Katz Graduate School of Business, University
More informationA Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution
A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution Debasis Kundu 1, Rameshwar D. Gupta 2 & Anubhav Manglick 1 Abstract In this paper we propose a very convenient
More informationGeneral structural model Part 2: Nonnormality. Psychology 588: Covariance structure and factor models
General structural model Part 2: Nonnormality Psychology 588: Covariance structure and factor models Conditions for efficient ML & GLS 2 F ML is derived with an assumption that all DVs are multivariate
More informationFinancial Data Mining Using Flexible ICA-GARCH Models
55 Chapter 11 Financial Data Mining Using Flexible ICA-GARCH Models Philip L.H. Yu The University of Hong Kong, Hong Kong Edmond H.C. Wu The Hong Kong Polytechnic University, Hong Kong W.K. Li The University
More informationStock Price and Index Forecasting by Arbitrage Pricing Theory-Based Gaussian TFA Learning
Stock Price and Index Forecasting by Arbitrage Pricing Theory-Based Gaussian TFA Learning Kai Chun Chiu and Lei Xu Department of Computer Science and Engineering The Chinese University of Hong Kong, Shatin,
More informationLecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1
Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Section 7.4-1 Chapter 7 Estimates and Sample Sizes 7-1 Review and Preview 7- Estimating a Population
More informationApplying Independent Component Analysis to Factor Model in Finance
In Intelligent Data Engineering and Automated Learning - IDEAL 2000, Data Mining, Financial Engineering, and Intelligent Agents, ed. K.S. Leung, L.W. Chan and H. Meng, Springer, Pages 538-544, 2000. Applying
More informationAn Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1
An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1 Guillermo Magnou 23 January 2016 Abstract Traditional methods for financial risk measures adopts normal
More informationStatistical Methodology. A note on a two-sample T test with one variance unknown
Statistical Methodology 8 (0) 58 534 Contents lists available at SciVerse ScienceDirect Statistical Methodology journal homepage: www.elsevier.com/locate/stamet A note on a two-sample T test with one variance
More informationLinda Allen, Jacob Boudoukh and Anthony Saunders, Understanding Market, Credit and Operational Risk: The Value at Risk Approach
P1.T4. Valuation & Risk Models Linda Allen, Jacob Boudoukh and Anthony Saunders, Understanding Market, Credit and Operational Risk: The Value at Risk Approach Bionic Turtle FRM Study Notes Reading 26 By
More informationCOMPARISON OF RATIO ESTIMATORS WITH TWO AUXILIARY VARIABLES K. RANGA RAO. College of Dairy Technology, SPVNR TSU VAFS, Kamareddy, Telangana, India
COMPARISON OF RATIO ESTIMATORS WITH TWO AUXILIARY VARIABLES K. RANGA RAO College of Dairy Technology, SPVNR TSU VAFS, Kamareddy, Telangana, India Email: rrkollu@yahoo.com Abstract: Many estimators of the
More informationStudy on Financial Market Risk Measurement Based on GJR-GARCH and FHS
Science Journal of Applied Mathematics and Statistics 05; 3(3): 70-74 Published online April 3, 05 (http://www.sciencepublishinggroup.com/j/sjams) doi: 0.648/j.sjams.050303. ISSN: 376-949 (Print); ISSN:
More informationCHAPTER 2 Describing Data: Numerical
CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of
More informationPoint Estimation. Some General Concepts of Point Estimation. Example. Estimator quality
Point Estimation Some General Concepts of Point Estimation Statistical inference = conclusions about parameters Parameters == population characteristics A point estimate of a parameter is a value (based
More informationMODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION
International Days of Statistics and Economics, Prague, September -3, MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION Diana Bílková Abstract Using L-moments
More informationWindow Width Selection for L 2 Adjusted Quantile Regression
Window Width Selection for L 2 Adjusted Quantile Regression Yoonsuh Jung, The Ohio State University Steven N. MacEachern, The Ohio State University Yoonkyung Lee, The Ohio State University Technical Report
More informationLearning Objectives for Ch. 7
Chapter 7: Point and Interval Estimation Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 7 Obtaining a point estimate of a population parameter
More informationKARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI
88 P a g e B S ( B B A ) S y l l a b u s KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI Course Title : STATISTICS Course Number : BA(BS) 532 Credit Hours : 03 Course 1. Statistical
More informationWhere s the Beef Does the Mack Method produce an undernourished range of possible outcomes?
Where s the Beef Does the Mack Method produce an undernourished range of possible outcomes? Daniel Murphy, FCAS, MAAA Trinostics LLC CLRS 2009 In the GIRO Working Party s simulation analysis, actual unpaid
More informationESTIMATION OF MODIFIED MEASURE OF SKEWNESS. Elsayed Ali Habib *
Electronic Journal of Applied Statistical Analysis EJASA, Electron. J. App. Stat. Anal. (2011), Vol. 4, Issue 1, 56 70 e-issn 2070-5948, DOI 10.1285/i20705948v4n1p56 2008 Università del Salento http://siba-ese.unile.it/index.php/ejasa/index
More informationدرس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی
یادگیري ماشین توزیع هاي نمونه و تخمین نقطه اي پارامترها Sampling Distributions and Point Estimation of Parameter (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی درس هفتم 1 Outline Introduction
More informationThe misleading nature of correlations
The misleading nature of correlations In this note we explain certain subtle features of calculating correlations between time-series. Correlation is a measure of linear co-movement, to be contrasted with
More informationProject Proposals for MS&E 444. Lisa Borland and Jeremy Evnine. Evnine and Associates, Inc. April 2008
Project Proposals for MS&E 444 Lisa Borland and Jeremy Evnine Evnine and Associates, Inc. April 2008 1 Portfolio Construction using Prospect Theory Single asset: -Maximize expected long run profit based
More informationComputational Statistics Handbook with MATLAB
«H Computer Science and Data Analysis Series Computational Statistics Handbook with MATLAB Second Edition Wendy L. Martinez The Office of Naval Research Arlington, Virginia, U.S.A. Angel R. Martinez Naval
More informationFinancial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR
Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Nelson Mark University of Notre Dame Fall 2017 September 11, 2017 Introduction
More informationChapter 6 Simple Correlation and
Contents Chapter 1 Introduction to Statistics Meaning of Statistics... 1 Definition of Statistics... 2 Importance and Scope of Statistics... 2 Application of Statistics... 3 Characteristics of Statistics...
More informationIntroduction. Tero Haahtela
Lecture Notes in Management Science (2012) Vol. 4: 145 153 4 th International Conference on Applied Operational Research, Proceedings Tadbir Operational Research Group Ltd. All rights reserved. www.tadbir.ca
More informationCan we use kernel smoothing to estimate Value at Risk and Tail Value at Risk?
Can we use kernel smoothing to estimate Value at Risk and Tail Value at Risk? Ramon Alemany, Catalina Bolancé and Montserrat Guillén Riskcenter - IREA Universitat de Barcelona http://www.ub.edu/riskcenter
More informationGeneralized Modified Ratio Type Estimator for Estimation of Population Variance
Sri Lankan Journal of Applied Statistics, Vol (16-1) Generalized Modified Ratio Type Estimator for Estimation of Population Variance J. Subramani* Department of Statistics, Pondicherry University, Puducherry,
More informationStochastic model of flow duration curves for selected rivers in Bangladesh
Climate Variability and Change Hydrological Impacts (Proceedings of the Fifth FRIEND World Conference held at Havana, Cuba, November 2006), IAHS Publ. 308, 2006. 99 Stochastic model of flow duration curves
More informationMaximum Likelihood Estimation
Maximum Likelihood Estimation The likelihood and log-likelihood functions are the basis for deriving estimators for parameters, given data. While the shapes of these two functions are different, they have
More informationFat Tailed Distributions For Cost And Schedule Risks. presented by:
Fat Tailed Distributions For Cost And Schedule Risks presented by: John Neatrour SCEA: January 19, 2011 jneatrour@mcri.com Introduction to a Problem Risk distributions are informally characterized as fat-tailed
More informationMarket Risk Analysis Volume I
Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii
More informationBias Reduction Using the Bootstrap
Bias Reduction Using the Bootstrap Find f t (i.e., t) so that or E(f t (P, P n ) P) = 0 E(T(P n ) θ(p) + t P) = 0. Change the problem to the sample: whose solution is so the bias-reduced estimate is E(T(P
More informationInt. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS048) p.5108
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS048) p.5108 Aggregate Properties of Two-Staged Price Indices Mehrhoff, Jens Deutsche Bundesbank, Statistics Department
More informationVolatility Clustering of Fine Wine Prices assuming Different Distributions
Volatility Clustering of Fine Wine Prices assuming Different Distributions Cynthia Royal Tori, PhD Valdosta State University Langdale College of Business 1500 N. Patterson Street, Valdosta, GA USA 31698
More informationSELECTION OF INDEPENDENT FACTOR MODEL IN FINANCE. Lai-Wan Chan and Siu-Ming Cha
In Proceedings of rd International Conference on Independent Component Analysis and Blind Signal Separation, SELECTION OF INDEPENDENT FACTOR MODEL IN FINANCE Lai-Wan Chan and Siu-Ming Cha Department of
More informationAN ARTIFICIAL NEURAL NETWORK MODELING APPROACH TO PREDICT CRUDE OIL FUTURE. By Dr. PRASANT SARANGI Director (Research) ICSI-CCGRT, Navi Mumbai
AN ARTIFICIAL NEURAL NETWORK MODELING APPROACH TO PREDICT CRUDE OIL FUTURE By Dr. PRASANT SARANGI Director (Research) ICSI-CCGRT, Navi Mumbai AN ARTIFICIAL NEURAL NETWORK MODELING APPROACH TO PREDICT CRUDE
More informationIncorporating Model Error into the Actuary s Estimate of Uncertainty
Incorporating Model Error into the Actuary s Estimate of Uncertainty Abstract Current approaches to measuring uncertainty in an unpaid claim estimate often focus on parameter risk and process risk but
More informationBackground. opportunities. the transformation. probability. at the lower. data come
The T Chart in Minitab Statisti cal Software Background The T chart is a control chart used to monitor the amount of time between adverse events, where time is measured on a continuous scale. The T chart
More informationShape Measures based on Mean Absolute Deviation with Graphical Display
International Journal of Business and Statistical Analysis ISSN (2384-4663) Int. J. Bus. Stat. Ana. 1, No. 1 (July-2014) Shape Measures based on Mean Absolute Deviation with Graphical Display E.A. Habib*
More informationA Demonstration of the Central Limit Theorem Using Java Program
A Demonstration of the Central Limit Theorem Using Java Program Lakshmi Varshini Damodaran Lynbrook High School San Jose, CA, 95129, USA luckylvd2003@gmail.com Abstract To students learning statistics,
More informationTwo-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models
Economic Review (Otaru University of Commerce), Vo.59, No.4, 4-48, March, 009 Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models Haruhiko
More informationAnalysis of truncated data with application to the operational risk estimation
Analysis of truncated data with application to the operational risk estimation Petr Volf 1 Abstract. Researchers interested in the estimation of operational risk often face problems arising from the structure
More informationMarket Risk Analysis Volume II. Practical Financial Econometrics
Market Risk Analysis Volume II Practical Financial Econometrics Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume II xiii xvii xx xxii xxvi
More informationQuality Digest Daily, March 2, 2015 Manuscript 279. Probability Limits. A long standing controversy. Donald J. Wheeler
Quality Digest Daily, March 2, 2015 Manuscript 279 A long standing controversy Donald J. Wheeler Shewhart explored many ways of detecting process changes. Along the way he considered the analysis of variance,
More informationThe Two Sample T-test with One Variance Unknown
The Two Sample T-test with One Variance Unknown Arnab Maity Department of Statistics, Texas A&M University, College Station TX 77843-343, U.S.A. amaity@stat.tamu.edu Michael Sherman Department of Statistics,
More information2015, IJARCSSE All Rights Reserved Page 66
Volume 5, Issue 1, January 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Financial Forecasting
More informationStrategies for Improving the Efficiency of Monte-Carlo Methods
Strategies for Improving the Efficiency of Monte-Carlo Methods Paul J. Atzberger General comments or corrections should be sent to: paulatz@cims.nyu.edu Introduction The Monte-Carlo method is a useful
More informationSubject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018
` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.
More informationPower of t-test for Simple Linear Regression Model with Non-normal Error Distribution: A Quantile Function Distribution Approach
Available Online Publications J. Sci. Res. 4 (3), 609-622 (2012) JOURNAL OF SCIENTIFIC RESEARCH www.banglajol.info/index.php/jsr of t-test for Simple Linear Regression Model with Non-normal Error Distribution:
More informationMEMBER CONTRIBUTION. 20 years of VIX: Implications for Alternative Investment Strategies
MEMBER CONTRIBUTION 20 years of VIX: Implications for Alternative Investment Strategies Mikhail Munenzon, CFA, CAIA, PRM Director of Asset Allocation and Risk, The Observatory mikhail@247lookout.com Copyright
More informationSTOCHASTIC COST ESTIMATION AND RISK ANALYSIS IN MANAGING SOFTWARE PROJECTS
Full citation: Connor, A.M., & MacDonell, S.G. (25) Stochastic cost estimation and risk analysis in managing software projects, in Proceedings of the ISCA 14th International Conference on Intelligent and
More informationELEMENTS OF MONTE CARLO SIMULATION
APPENDIX B ELEMENTS OF MONTE CARLO SIMULATION B. GENERAL CONCEPT The basic idea of Monte Carlo simulation is to create a series of experimental samples using a random number sequence. According to the
More informationChapter 4 Variability
Chapter 4 Variability PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter and Larry B. Wallnau Chapter 4 Learning Outcomes 1 2 3 4 5
More information[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright
Faculty and Institute of Actuaries Claims Reserving Manual v.2 (09/1997) Section D7 [D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright 1. Introduction
More informationRandom Variables and Probability Distributions
Chapter 3 Random Variables and Probability Distributions Chapter Three Random Variables and Probability Distributions 3. Introduction An event is defined as the possible outcome of an experiment. In engineering
More informationXLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING
XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING INTRODUCTION XLSTAT makes accessible to anyone a powerful, complete and user-friendly data analysis and statistical solution. Accessibility to
More informationA Comparison of Univariate Probit and Logit. Models Using Simulation
Applied Mathematical Sciences, Vol. 12, 2018, no. 4, 185-204 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2018.818 A Comparison of Univariate Probit and Logit Models Using Simulation Abeer
More informationSTAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.
STAT 509: Statistics for Engineers Dr. Dewei Wang Applied Statistics and Probability for Engineers Sixth Edition Douglas C. Montgomery George C. Runger 7 Point CHAPTER OUTLINE 7-1 Point Estimation 7-2
More informationSTAT Chapter 6: Sampling Distributions
STAT 515 -- Chapter 6: Sampling Distributions Definition: Parameter = a number that characterizes a population (example: population mean ) it s typically unknown. Statistic = a number that characterizes
More informationResampling Methods. Exercises.
Aula 5. Monte Carlo Method III. Exercises. 0 Resampling Methods. Exercises. Anatoli Iambartsev IME-USP Aula 5. Monte Carlo Method III. Exercises. 1 Bootstrap. The use of the term bootstrap derives from
More informationWeek 1 Quantitative Analysis of Financial Markets Distributions B
Week 1 Quantitative Analysis of Financial Markets Distributions B Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October
More informationDistribution analysis of the losses due to credit risk
Distribution analysis of the losses due to credit risk Kamil Łyko 1 Abstract The main purpose of this article is credit risk analysis by analyzing the distribution of losses on retail loans portfolio.
More informationA New Hybrid Estimation Method for the Generalized Pareto Distribution
A New Hybrid Estimation Method for the Generalized Pareto Distribution Chunlin Wang Department of Mathematics and Statistics University of Calgary May 18, 2011 A New Hybrid Estimation Method for the GPD
More informationTABLE OF CONTENTS - VOLUME 2
TABLE OF CONTENTS - VOLUME 2 CREDIBILITY SECTION 1 - LIMITED FLUCTUATION CREDIBILITY PROBLEM SET 1 SECTION 2 - BAYESIAN ESTIMATION, DISCRETE PRIOR PROBLEM SET 2 SECTION 3 - BAYESIAN CREDIBILITY, DISCRETE
More information