On the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal

Similar documents
A Skewed Truncated Cauchy Logistic. Distribution and its Moments

GENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy

ESTIMATION OF MODIFIED MEASURE OF SKEWNESS. Elsayed Ali Habib *

A Skewed Truncated Cauchy Uniform Distribution and Its Moments

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL

An Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications.

A New Test for Correlation on Bivariate Nonnormal Distributions

Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali

Some developments about a new nonparametric test based on Gini s mean difference

Fitting financial time series returns distributions: a mixture normality approach

Market Risk Analysis Volume I

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

A Comparison Between Skew-logistic and Skew-normal Distributions

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims

Estimating the Parameters of Closed Skew-Normal Distribution Under LINEX Loss Function

A Test of the Normality Assumption in the Ordered Probit Model *

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model

Chapter 3 Common Families of Distributions. Definition 3.4.1: A family of pmfs or pdfs is called exponential family if it can be expressed as

Edgeworth Binomial Trees

Contents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii)

Log-linear Modeling Under Generalized Inverse Sampling Scheme

A First Course in Probability

David R. Clark. Presented at the: 2013 Enterprise Risk Management Symposium April 22-24, 2013

KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI

On the Distribution of Kurtosis Test for Multivariate Normality

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution

2. Copula Methods Background

Robust Critical Values for the Jarque-bera Test for Normality

GENERATION OF APPROXIMATE GAMMA SAMPLES BY PARTIAL REJECTION

A New Multivariate Kurtosis and Its Asymptotic Distribution

Appendix A (Pornprasertmanit & Little, in press) Mathematical Proof

Financial Risk Forecasting Chapter 9 Extreme Value Theory

An Improved Skewness Measure

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

KURTOSIS OF THE LOGISTIC-EXPONENTIAL SURVIVAL DISTRIBUTION

Financial Models with Levy Processes and Volatility Clustering

Maximum likelihood estimation of skew-t copulas with its applications to stock returns

Statistics 431 Spring 2007 P. Shaman. Preliminaries

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

Multivariate skewness and kurtosis measures with an application in ICA

On Some Statistics for Testing the Skewness in a Population: An. Empirical Study

Chapter 7. Inferences about Population Variances

Business Statistics 41000: Probability 3

Financial Risk Management

Chapter 6 Simple Correlation and

discussion Papers Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models

INTERNATIONAL JOURNAL FOR INNOVATIVE RESEARCH IN MULTIDISCIPLINARY FIELD ISSN Volume - 3, Issue - 2, Feb

PROBABILITY. Wiley. With Applications and R ROBERT P. DOBROW. Department of Mathematics. Carleton College Northfield, MN

A lower bound on seller revenue in single buyer monopoly auctions

Probability and Statistics

PORTFOLIO THEORY. Master in Finance INVESTMENTS. Szabolcs Sebestyén

Inferences on Correlation Coefficients of Bivariate Log-normal Distributions

Distortion operator of uncertainty claim pricing using weibull distortion operator

Introduction Recently the importance of modelling dependent insurance and reinsurance risks has attracted the attention of actuarial practitioners and

Lecture 6: Non Normal Distributions

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Page 2 Vol. 10 Issue 7 (Ver 1.0) August 2010

A Family of Kurtosis Orderings for Multivariate Distributions

2.1 Random variable, density function, enumerative density function and distribution function

Discriminating between the log-normal and generalized exponential distributions

An Improved Saddlepoint Approximation Based on the Negative Binomial Distribution for the General Birth Process

Vine-copula Based Models for Farmland Portfolio Management

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

Lindner, Szimayer: A Limit Theorem for Copulas

Noureddine Kouaissah, Sergio Ortobelli, Tomas Tichy University of Bergamo, Italy and VŠB-Technical University of Ostrava, Czech Republic

PORTFOLIO MODELLING USING THE THEORY OF COPULA IN LATVIAN AND AMERICAN EQUITY MARKET

Confidence Intervals for the Median and Other Percentiles

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Comparative Analysis Of Normal And Logistic Distributions Modeling Of Stock Exchange Monthly Returns In Nigeria ( )

1 Residual life for gamma and Weibull distributions

THE USE OF THE LOGNORMAL DISTRIBUTION IN ANALYZING INCOMES

St. Xavier s College Autonomous Mumbai. Syllabus For 2 nd Semester Course in Statistics (June 2015 onwards)

Superiority by a Margin Tests for the Ratio of Two Proportions

Two Equivalent Conditions

Semimartingales and their Statistical Inference

DATA SUMMARIZATION AND VISUALIZATION

2018 AAPM: Normal and non normal distributions: Why understanding distributions are important when designing experiments and analyzing data

Key Words: emerging markets, copulas, tail dependence, Value-at-Risk JEL Classification: C51, C52, C14, G17

Confidence Intervals for Pearson s Correlation

The Economic and Social BOOTSTRAPPING Review, Vol. 31, No. THE 4, R/S October, STATISTIC 2000, pp

Weighted Half Exponential Power Distribution and Associated Inference

Multivariate longitudinal data analysis for actuarial applications

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Annual risk measures and related statistics

IMPERFECT MAINTENANCE. Mark Brown. City University of New York. and. Frank Proschan. Florida State University

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Volatility Models and Their Applications

An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1

STOR Lecture 15. Jointly distributed Random Variables - III

Calibration Estimation under Non-response and Missing Values in Auxiliary Information

Lecture Note 9 of Bus 41914, Spring Multivariate Volatility Models ChicagoBooth

Approximating a multifactor di usion on a tree.

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

34.S-[F] SU-02 June All Syllabus Science Faculty B.Sc. I Yr. Stat. [Opt.] [Sem.I & II] - 1 -

Correlation vs. Trends in Portfolio Management: A Common Misinterpretation

The normal distribution is a theoretical model derived mathematically and not empirically.

A NOTE ON FULL CREDIBILITY FOR ESTIMATING CLAIM FREQUENCY

St. Xavier s College Autonomous Mumbai STATISTICS. F.Y.B.Sc. Syllabus For 1 st Semester Courses in Statistics (June 2015 onwards)

ASSIGNMENT - 1, MAY M.Sc. (PREVIOUS) FIRST YEAR DEGREE STATISTICS. Maximum : 20 MARKS Answer ALL questions.

Transcription:

The Korean Communications in Statistics Vol. 13 No. 2, 2006, pp. 255-266 On the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal Hea-Jung Kim 1) Abstract This paper proposes a class of distributions which is useful in making inferences about the sum of values from a normal and a doubly truncated normal distribution. It is seen that the class is associated with the conditional distributions of truncated bivariate normal. The salient features of the class are mathematical tractability and strict inclusion of the normal and the skew-normal laws. Further it includes a shape parameter, to some extent, controls the index of skewness so that the class of distributions will prove useful in other contexts. Necessary theories involved in deriving the class of distributions are provided and some properties of the class are also studied. Keywords : Doubly truncated normal; Class of distributions; Skewness. 1. Introduction Azzalini (1985) and Henze (1986) worked on the skew-normal (written ) distribution, a class of distributions including the standard normal, but with an extra parameter ( ) to regulate skewness. A probabilistic representation of the distribution is given in terms of normal and half normal laws:, (1.1) where and be independent standard normal variables. The distribution is useful to modeling random phenomena which have heavier tails than the normal as well as some skewness that appears in screening problems and regression problems with skewed error structure. We refer to Arnold et al. (1993), Chen, Dey and Shao (1999), and Kim (2002) for the applications of the distribution to those problems. Various extensions of the distribution have been given in the literatures: In one direction, Diciccio and Monti (2004) and Ma and Genton (2004), 1) Professor, Department of Statistics, Dongguk University, Seoul 100-715, Korea. E-mail : kim3hj@dongguk.edu

256 Hea-Jung Kim among others, tried to find more flexible distributions that cope with prevalent deviations from normality. In the other direction, multivariate extensions of the distribution have been developed by Azzalini and Valle (1996) and Branco and Dey (2001), among others. The purpose of the present paper is to extend the distribution in terms of yet other direction. This proposes a class of distributions that accounts for the sum of a normal and a doubly truncated normal distributions, that is the distributions of the form, where and are any real values,, and, a truncated with respective lower and upper truncation points and. Such an extension is potentially relevant for practical applications, since in data analysis there are a few distributions available to dealing with the sum (or ratio) of values from a normal and a doubly truncated normal distribution. Moreover, the distribution of a variable of the form arises frequently in the theory of nonparametric statistics. We discuss examples of the nonparametric statistics in Section 4. Necessary theories involved in deriving the class of distributions is provided and some properties of the class is also studied. 2. The Class of Distributions This section proposes a class of distributions of the sum of a normal and a doubly truncated normal distributions by use of the conditional truncated bivariate normal distribution. Suppose is the density function of a bivariate normal random variable with mean vector covariance matrix,, with correlation. Suppose that ( ) has joint density, (2.1) where,, is the df of a standard normal variable, and and are real constants that are the lower and upper truncation points for, respectively. Clearly, ( ) has a doubly truncated bivariate normal distribution. By direct integration one obtains the density of given by (2.2) where,, and is the pdf of the standard normal variable. The following lemma gives a relation between the conditional distribution of a truncated bivariate normal and the sum of a normal

Sum of a Normal and a Truncated Normal 257 and a doubly truncated normal distributions. Lemma 1. Let and be independent random variables and ( ) is a bivariate normal random variable with mean vector and covariance, with,,,, and. Then for any real values and with, the distribution of is equivalent to the conditional distribution of given ; (2.3) where, a truncated with respective lower and upper truncation points and. Proof. Let and. Then ( ) is a bivariate normal random variable with mean vector and covariance matrix. Since and are independent, the distribution of conditionally on equals that of. From now on, for convenience, we shall denote,,,,,,,, and. Under the notations, Lemma 1 leads to the following definition. Definition 1. Suppose and are independent random variables. Then distribution of with the probability density function (2.2) is denoted by, where =(, ). The definition and Lemma 1 imply that the distribution function(df) of variable with distribution is the same as the conditional df of the form where is a bivariate normal with the mean vector and covariance matrix. Thus the df of is, (2.4) where is the orthant probability of the standard bivariate normal variable. Computing methods that evaluate

258 Hea-Jung Kim have been given by Donnelly (1973) and Joe (1995), among others. Definition 1 also reveals the structure of the class of distributions and indicates the kind of departure form the normality. Furthermore, it provides one-for-one method of generating a random variable with density (2.2). For generating the truncated normal variable, the one-for-one method by Devroye (1986) may be used. Now we will state some interesting properties for the distribution as well as the associate results. Property 1. Let, where. Then, (2.5) where with and and. and, Property 2. For, if and while and reduces to. Property 3. If, then, where. The pdf of is for. (2.6) Corollary 1. Let and be independent standard normal random variables, and let and. Then where., (2.7) Proof. Setting and, we have the result from Lemma 1. Property 4. distribution is equivalent to distribution. Property 5. If and is an independent random variable, then the distribution of is, where and.

Sum of a Normal and a Truncated Normal 259 Corollary 2. If is a random variable then, for any real and,, (2.8) where. Proof. Let independently of and let. Then, where random variable by Corollary 1. Thus the df in (2.4) gives the result. Corollary 2 immediately gives the following results for random variable: Upon setting and, we have, for any real and,. (2.9) The expectation was derived by a different method (see, for instance, Zacks 1981, pp.53-54). When and, (2.8) yields (2.10) for any real and, where,,, is the function which gives the integral of the standard bivariate density over the right side region bounded by lines,, and in the ( ) plane(see, Azzalini 1985, for the properties of the function). When and are independent standard normal variables, has a standard Cauchy distribution. From this fact, we see that the df of can be expressed as by (2.10), where. 3. Moments 3.1 Moment Generating Function To compute the moments of the distribution, it suffices to compute the moments of. From Property 1, we see that has the density, (3.1) where,, and.

260 Hea-Jung Kim Theorem 1. Let, then the moment generating function of is. where for, (3.2) Proof. Considering the pdf (3.1), we have written as. Using the transformation, one finds where. Thus we can derive the mgf (3.2) by applying (2.9). 3.2 Moments of the Distribution Naturally, the moments of can be obtained by using the moment generating function differentiation. For example:. Unfortunately, for higher moments this rapidly becomes tedious. An alternative procedure makes use of the fact that, for (3.3) yields the following result. Under the distribution (3.1), the relation (3.3) and integrating by parts gives the following moments. for, (3.4) where. By setting and applying (2.9), we obtain three expressions, which my be solved to yield the first three moments of. Higher

Sum of a Normal and a Truncated Normal 261 moments could be found similarly. One obtains,,. By using the Binomial expansion, one can see that the general formula for the moments of is. (3.5) When,. We recall the functions and studied by Sugiura and Gombi (1985). For real values of and ( ), and give respective skewness and kurtosis of doubly truncated standard normal distribution. Here and denote the left and right truncation points of the standard normal distribution. Using these functions and the above moments, we have the following result. Theorem 2. For, the skewness(the standardized central moment) and kurtosis of the distribution are and respectively, where (3.6) (3.7),,,,. Proof. From Property 1, we see that and of distribution is the same as those of distribution, where and are independent standard normal variables. From the above moments,. Therefore, some algebra using the fact that and (see, for example, Johnson et al. 1994, pp.156-158), gives the result.

262 Hea-Jung Kim Corollary 3. For, the skewness depends on the following cases: (i) If, the distribution of variable is skewed to the right(the left) when. (ii) If, the distribution is skewed to the right(the left) when. (iii) If for, the distribution is symmetric, where,, and. Proof. It is obvious that for, for, and for. This fact and (3.6) immediately gives the result. The class of distribution includes well-known skewnormal distribution,, by Azzalini (1985). distributions is equivalent to as given in Property 3. From (3.5) and (3.6), one finds the moments of,, and. These values of agree with those given in Arnold et al. (1993). See Azzalini (1985) and Henze (1986) for the other properties of the distribution. 4. Applications 4.1. Nonparametric Statistics The distribution of a variate arises frequently in the theory of nonparametric statistics, where and with. Let and. Then the distribution function of the ratio of normal and truncated normal can be directly obtained from that of variate, i.e.. Further assume that and are standard normals, and and. Then, (4.1) the distribution function of a standard Cauchy distribution(see, for instance, Johnson et al. 1994). In (4.1), if we set, (4.2)

Sum of a Normal and a Truncated Normal 263 by the df of variable in (5), where is the standard bivariate normal variable with correlation. Thus (4.2) gives a simple derivation of the joint probability (see Johnson and Kotz (1972) for the direct derivation that uses a complicated double integration). The probability of leads to the following applications to the theory of nonparametric statistics. For example, let ( ) and ( ) be independent identically distributed bivariate normal variables with variances and and correlation. Then Kendall's measure of association, which may be defined as, can be written as by (4.2), where with. Thus if and only if. That is, we have the important relationship that if and only if and are independent when sampling from a bivariate normal population. Another example occurs in the setting of the two-way layout. Hollander (1967) proposed a test statistic that is a sum of Wilcoxon signed rank statistics to detect, where at least one of these inequality is strict, when the null hypothesis is (unknown),. The variance of the asymptotic distribution of his statistic may be written as, where are independent random variables whose continuous df is. If represent a random sample from a univariate normal distribution with variance, then and are bivariate normally distributed with zero means, unit variances, and covariance 1/4. Consequently,, where with. These two examples show that, if one uses (4.2), the values of and can be directly obtained, avoiding the complicated double integration. 4.2. Sum of Values from a Normal and a Truncated Normal Suppose an item which one makes has, among others, two parts which are assembled additively with regard to length. The length of both parts are normally distributed but, before assembly, one of the parts is subject to an inspection which

264 Hea-Jung Kim removes all individuals below a specified length. As an example, suppose that comes from a normal distribution with a mean of 100 and standard deviation of 6, and comes from a normal distribution with a mean 50 with a standard deviation of 3, but with the restriction that. How can we find the chance that is equal to or less than a given value, say 138? This problem was originally answered by Weinstein(1964) and three correspondents regarding to Weinstein(1964) have suggested other methods to get the same answer. The method presented here uses the df (2.4) of distribution as a simple alternative method to this problem. From definition 1, we see that with, so that,,,, and. Thus using (2.4), we see that, and this probability is the same as the Weinstein's answer. When the parameters of and distributions are unknown, the probability can be approximately calculated by using the maximum likelihood estimates of the parameters. See Johnson et al. (1994) and references therein for the estimation of the truncated normal parameters. When the restriction of in the form of, then with. Thus is immediate from (2.4) if values of and are given. 5. Conclusion This paper has proposed a class of the sum of a normal and a doubly truncated normal, denoted by. The properties of the class is studied and two immediate applications of the class are given. As given by (2.3), the special feature of the class is that it gives rich family of parametric density functions that allow a continuous variation from normality to nonnormality. Therefore the class of distribution is potentially relevant for practical application, especially for the analysis of skewed data as implied by Corollary 3. This, in turn, raises the estimation problem of distribution based on a skewed data. A study pertaining to the application is an interesting research topic and it is left as a future study of interest. References [1] Arnold, B.C., Beaver, R.J., Groeneveld, R.A. and Meeker, W.Q. (1993). The nontruncated marginal of a truncated bivariate normal distribution. Psychometrica, Vol. 58, 471-478.

Sum of a Normal and a Truncated Normal 265 [2] Azzalini, A. (1985). A class of distributions which includes the normal ones. Scandinavian Journal of Statistics, Vol. 12, 171-178. [3] Azzalini, A. and Valle, A.D. (1996). The multivariate skew-normal distribution. Biometrika, Vol. 83, 715-726. [4] Branco, M.D. and Dey, D.K. (2001). A general class of multivariate skew-elliptical distributions. Journal of Multivariate Analysis, Vol. 79, 99-113. [5] Chen, M.H., Dey, D.K., and Shao, Q.M. (1999). A new skewed link model for dichotomous quantal response model. Journal of the American Statistical Association, Vol. 94, 1172-1186. [6] Devroye, L. (1986). Non-Uniform Random Variate Generaton. New York: Springer Verlag. [7] DiCiccio, T.J. and Monti, A.C. (2004). Inferential aspects of the skew exponential power distribution. Journal of the American Statistical Association, Vol. 99, 439-450. [8] Donnelly, T.G. (1973). Algorithm 426: Bivariate normal distribution. Communications of the Association for Computing Machinery, Vol. 16, 638. [9] Henze, N. (1986). A probabilistic representation of the 'Skewed-normal' distribution. Scandinavian Journal of Statistics, Vol. 13, 271-275. [10] Hollander, M. (1967). Rank tests for randomized blocks when the alternative have an prior ordering. The Annals of Mathematical Statistics, Vol. 38, 867-887. [11] Joe, H. (1995). Approximation to multivariate normal rectangle probabilities based on conditional expectations. Journal of the American Statistical Association, Vol. 90, 957-966. [12] Johnson, N.L. and Kotz, S. (1972). Distributions in Statistics: Continuous Multivariate Distributions. New York: John Wiley. [13] Johnson, N.L., Kotz, S., and Balakrishnan, N. (1994). Continuous Univariate Distributions, Vol. 1. New York: John Wiley & Sons. [14] Kim, H.J. (2002). Binary regression with a class of skewed link models. Communications in Statistics-Theory and Methods, Vol. 31, 1863-1886. [15] Ma, Y. and Genton, M.G. (2004). A flexible class of skew-symmetric distributions. Scandinavian Journal of Statistics, Vol. 31, 459-468. [16] Sugiura, N. and Gomi, A. (1985). Pearson diagrams for truncated normal and truncated Weibull distributions. Biometrika, Vol. 72, 219-222. [17] Weinstein, M.A. (1964). The sum of values from a normal and a truncated normal distribution (Answer to Query). Technometrics, Vol. 6, 104-105 and 469-471.

266 Hea-Jung Kim [18] Zacks, S. (1981). Parametric Statistical Inference, Pregamon Press, Oxford. [Received February 2006, Accepted April 2006]