Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions

Similar documents
SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

This paper studies the X control chart in the situation that the limits are estimated and the process distribution is not normal.

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods

Power functions of the Shewhart control chart

Background. opportunities. the transformation. probability. at the lower. data come

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

On the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal

Statistical Tables Compiled by Alan J. Terry

Probability. An intro for calculus students P= Figure 1: A normal integral

Fundamentals of Statistics

ESTIMATION OF MODIFIED MEASURE OF SKEWNESS. Elsayed Ali Habib *

SPC Binomial Q-Charts for Short or long Runs

A Skewed Truncated Cauchy Logistic. Distribution and its Moments

STRESS-STRENGTH RELIABILITY ESTIMATION

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

Random Variables and Probability Distributions

Chapter 7 1. Random Variables

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices

DESCRIPTIVE STATISTICS

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Statistical Intervals (One sample) (Chs )

Control Chart for Autocorrelated Processes with Heavy Tailed Distributions

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Robust X control chart for monitoring the skewed and contaminated process

Applications of Good s Generalized Diversity Index. A. J. Baczkowski Department of Statistics, University of Leeds Leeds LS2 9JT, UK

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Control Charts. A control chart consists of:

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

discussion Papers Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

8.1 Estimation of the Mean and Proportion

Economic statistical design for x-bar control charts under non-normal distributed data with Weibull in-control time

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL

Statistics for Business and Economics

MATH 264 Problem Homework I

Simultaneous Use of X and R Charts for Positively Correlated Data for Medium Sample Size

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, Abstract

Statistics 431 Spring 2007 P. Shaman. Preliminaries

χ 2 distributions and confidence intervals for population variance

Business Statistics 41000: Probability 3

ECON 214 Elements of Statistics for Economists 2016/2017

The Control Chart for Attributes

Module 3: Sampling Distributions and the CLT Statistics (OA3102)

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

Probability & Statistics

Chapter 4 Continuous Random Variables and Probability Distributions

Some Characteristics of Data

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Properties of Probability Models: Part Two. What they forgot to tell you about the Gammas

Chapter 8 Estimation

Strategies for Improving the Efficiency of Monte-Carlo Methods

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

KURTOSIS OF THE LOGISTIC-EXPONENTIAL SURVIVAL DISTRIBUTION

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

DATA ANALYSIS AND SOFTWARE

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution

Quality Digest Daily, March 2, 2015 Manuscript 279. Probability Limits. A long standing controversy. Donald J. Wheeler

Much of what appears here comes from ideas presented in the book:

The Two-Sample Independent Sample t Test

Unit 5: Sampling Distributions of Statistics

Two Hours. Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER. 22 January :00 16:00

Unit 5: Sampling Distributions of Statistics

Continuous Probability Distributions & Normal Distribution

Commonly Used Distributions

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Measures of Central tendency

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

2 DESCRIPTIVE STATISTICS

Chapter 4 Continuous Random Variables and Probability Distributions

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER

Monitoring Processes with Highly Censored Data

Chapter 7. Inferences about Population Variances

EX-POST VERIFICATION OF PREDICTION MODELS OF WAGE DISTRIBUTIONS

Lecture # 35. Prof. John W. Sutherland. Nov. 16, 2005

Value at Risk with Stable Distributions

MATH 3200 Exam 3 Dr. Syring

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

Computing and Graphing Probability Values of Pearson Distributions: A SAS/IML Macro

Stochastic model of flow duration curves for selected rivers in Bangladesh

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method

The Normal Distribution. (Ch 4.3)

An Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications.

Terms & Characteristics

Simulation of Moment, Cumulant, Kurtosis and the Characteristics Function of Dagum Distribution

CHAPTER-1 BASIC CONCEPTS OF PROCESS CAPABILITY ANALYSIS

Random variables. Contents

Information Processing and Limited Liability

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Descriptive Statistics

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Alexander Marianski August IFRS 9: Probably Weighted and Biased?

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

A Synthetic Scaled Weighted Variance Control Chart for Monitoring the Process Mean of Skewed Populations

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz

The "bell-shaped" curve, or normal curve, is a probability distribution that describes many real-life situations.

Modern Methods of Data Analysis - SS 2009

Transcription:

Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions Pandu Tadikamalla, 1 Mihai Banciu, 1 Dana Popescu 2 1 Joseph M. Katz Graduate School of Business, University of Pittsburgh, Pittsburgh, Pennsylvania 2 Stern School of Business, New York University, New York, New York Received 19 February 2007; revised 7 September 2007; accepted 7 October 2007 DOI 10.1002/nav.20265 Published online 26 November 2007 in Wiley InterScience (www.interscience.wiley.com). Abstract: The distribution of the range of a sample, even in the case of a normal distribution, is not symmetric. Shewhart s control chart for range and other approximations for range from skewed distributions and long-tailed (leptokurtic) symmetrical distributions assume the distribution of range as symmetric and provide ±3 sigma control limits. We provide accurate approximations for the R-chart control limits for the leptokurtic symmetrical distributions, using a range quantile approximation (RQA) method and illustrate the use of the RQA method with a numerical example. As special cases, we provide constants for the R-chart for the normal, logistic, and Laplace distributions. 2007 Wiley Periodicals, Inc. Naval Research Logistics 55: 91 99, 2008 Keywords: range control charts; long-tailed distributions; normal distribution; logistic distribution; Laplace distribution; Shewhart method; statistical quality control 1. INTRODUCTION Since Shewhart [8] introduced his control charts in 1920, they have become extremely popular among practitioners largely because of their ease of use. Although recent advances in statistics and computational mathematics make possible significantly more precise methods, traditional control charts remain the most widely taught and used tool in quality control. With a sufficiently large sample size, the Shewhart control charts are accurate [1]. Most of the time, however, using a large sample size is impractical and prohibitively expensive. When the sample size is small, on the other hand, it is more likely that the sample mean of the quality characteristic is not normally distributed, and thus the Shewhart charts become less reliable. Several authors have tried to address these issues and construct improved methods for skewed distributions [4] and for long-tailed (leptokurtic 1 ) symmetrical distributions of the quality characteristics [3, 12]. The Correspondence to: P.R. Tadikamalla (pandu@pitt.edu) 1 We define the kurtosis for a normal distribution as equal to zero and distributions with positive kurtosis are termed leptokurtic distributions. In some text books and research papers, the kurtosis for normal distribution is defined as three. restrictive normality assumption of the quality characteristic is not the only drawback in the case of small sample sizes. As shown in Montgomery [7], the use of the ±3 sigma limits for the range (R) chart produces a higher type I error than for the X chart. This means that even with a normally distributed quality characteristic the use of the ±3 sigma for the R-chart is not appropriate, as it does not guarantee the desired type I error of 0.27%. Another problem with the ±3 sigma limits is that the distribution of R is skewed and because of this, small sample sizes yield negative numbers for the lower control limit (LCL). Since the range is bounded below by zero, the LCL in those cases is truncated to zero. In the Shewhart control chart, for sample sizes less than 7, we always have the LCL for the R-chart equal to 0, although in reality, the exact LCL is higher. By treating very low sample ranges as in control when they are not, we are more likely to overlook improvements in the process as a result of reduced variability, and consequently fail to investigate the causes of these improvements. Khoo and Lim [6] address this shortcoming of the traditional range control charts and show a way to derive the exact upper control limit (UCL) and LCL for the range of a normally distributed quality characteristic. They do not, however, tabulate the constants needed by practitioners to construct these charts. In this study, we derive a range quantile 2007 Wiley Periodicals, Inc.

92 Naval Research Logistics, Vol. 55 (2008) approximation (RQA) method to compute more precise limits for the case of long-tailed symmetrically distributed quality characteristics. We then tabulate the constants needed to construct the range control charts. Finally, we include the exact control limits for the normal, logistic, and Laplace distributions. The rest of this study is organized as follows: Section 2 explains the rationale behind our approach and shows how to compute the control chart constants for the leptokurtic distributions. Section 2 also illustrates the use of these constants with a numerical example. Section 3 presents the range control chart constants for the normal distribution. Section 4 compares the performance of our charts against the traditional Shewhart charts for the case where the quality characteristic has a logistic distribution or a Laplace distribution. Finally, Section 5 presents our conclusions. 2. THE IMPROVED R-CHART FOR LEPTOKURTIC DISTRIBUTIONS 2.1. Description of the RQA Method Let (X1 i, Xi 2,..., Xi n ) i=1,2,...,r be r subgroups of size n from a process distribution with mean µ, standard deviation σ, skewness 0, and kurtosis k 4. Let f(x) be the probability density function (pdf) of the process distribution and let F(x) be the corresponding cumulative density function (cdf). Also, let X(1) i and Xi (n) be respectively the smallest and the largest observations in the subgroup i. We denote by R i = X(n) i Xi (1) the range of the subgroup i. IfF(x) is known, the exact distribution function of R i is given by [9]: G(R) = n [F(x + R) F(x)] n 1 df (x) (1) Sometimes the cdf cannot be expressed in closed form. In that case, we can rewrite (1) in terms of the pdf, f(x): G(R) = n x+r x f(x)dx n 1 f(x)dx (2) To find the UCL and the LCL of the R-chart, we need to solve the following two equations: G(UCL R ) = 1 α/2 (3) G(LCL R ) = α/2 (4) where UCL R and LCL R are the UCL and LCL for R and α is the desired type I error. We define d 2 as the mean of R/σ. The formula for d 2 is as follows (see, for example, [13]): d 2 = [1 (1 F(x)) n (F (x)) n ]dx (5) The control chart constants will be given by: D 3 = LCL R d 2 D 4 = UCL R d 2 and the control chart limits will be: UCL R = RD 4 CL R = R (7) LCL R = RD 3 where R is the average of sample range values. (6) Table 1. Values of LCL constant D3 for the RQA method. n Kurtosis 0.5 1 1.2 1.5 2 2.5 3 4 5 6 2 0.002 0.002 0.002 0.002 0.002 0.002 0.002 0.002 0.002 0.002 3 0.040 0.039 0.039 0.039 0.038 0.038 0.038 0.037 0.037 0.036 4 0.104 0.101 0.100 0.100 0.098 0.097 0.096 0.094 0.093 0.092 5 0.164 0.160 0.158 0.157 0.154 0.152 0.150 0.147 0.145 0.144 6 0.216 0.209 0.207 0.205 0.201 0.198 0.196 0.192 0.189 0.186 7 0.258 0.250 0.248 0.244 0.240 0.236 0.233 0.228 0.224 0.221 8 0.293 0.284 0.281 0.277 0.269 0.267 0.264 0.258 0.253 0.250 9 0.323 0.313 0.309 0.305 0.298 0.294 0.289 0.283 0.278 0.274 10 0.349 0.337 0.333 0.328 0.321 0.316 0.311 0.304 0.299 0.294 12 0.390 0.376 0.371 0.366 0.358 0.351 0.346 0.338 0.332 0.327 15 0.435 0.419 0.414 0.407 0.398 0.390 0.384 0.375 0.368 0.362 20 0.486 0.467 0.461 0.453 0.442 0.434 0.427 0.416 0.408 0.401

Tadikamalla, Banciu, and Popescu: Range Charts for Normal and Long-Tailed Symmetrical Distributions 93 Table 1a. Values of D 3 for Student t and Johnson S u distributions. n Kurtosis 0.5 1 1.2 1.5 2 2.5 3 4 5 6 2 0.0021 a 0.0021 0.0020 0.0020 0.0020 0.0019 0.0019 0.0019 0.0020 0.0020 0.0021 b 0.0021 0.0020 0.0020 0.0020 0.0019 0.0019 0.0020 0.0019 0.0019 3 0.0403 0.0396 0.0394 0.0390 0.0386 0.0383 0.0381 0.0377 0.0374 0.0372 0.0402 0.0394 0.0391 0.0387 0.0381 0.0376 0.0372 0.0365 0.0358 0.0354 4 0.1037 0.1015 0.1008 0.0999 0.0987 0.0978 0.0970 0.0959 0.0951 0.0945 0.1036 0.1009 0.1001 0.0992 0.0971 0.0957 0.0944 0.0923 0.0907 0.0892 5 0.1644 0.1604 0.1591 0.1576 0.1554 0.1538 0.1525 0.1505 0.1492 0.1482 0.1641 0.1594 0.1578 0.1562 0.1527 0.1501 0.1478 0.1438 0.1415 0.1389 6 0.2158 0.2101 0.2083 0.2060 0.2030 0.2007 0.1989 0.1962 0.1943 0.1928 0.2153 0.2086 0.2064 0.2035 0.1991 0.1956 0.1925 0.1874 0.1833 0.1800 7 0.2584 0.2511 0.2488 0.2460 0.2422 0.2393 0.2370 0.2336 0.2311 0.2294 0.2578 0.2493 0.2465 0.2429 0.2373 0.2328 0.2290 0.2233 0.2175 0.2133 8 0.2938 0.2851 0.2825 0.2790 0.2745 0.2711 0.2684 0.2643 0.2615 0.2594 0.2932 0.2830 0.2797 0.2756 0.2640 0.2636 0.2586 0.2513 0.2454 0.2410 9 0.3236 0.3137 0.3106 0.3068 0.3016 0.2977 0.2946 0.2900 0.2868 0.2844 0.3229 0.3114 0.3075 0.3024 0.2953 0.2893 0.2842 0.2757 0.2692 0.2637 10 0.3490 0.3380 0.3346 0.3303 0.3246 0.3203 0.3168 0.3118 0.3082 0.3055 0.3482 0.3355 0.3313 0.3257 0.3177 0.3111 0.3055 0.2969 0.2891 0.2827 12 0.3901 0.3772 0.3732 0.3682 0.3616 0.3565 0.3525 0.3466 0.3424 0.3393 0.3892 0.3744 0.3695 0.3634 0.3538 0.3462 0.3398 0.3291 0.3213 0.3142 15 0.4354 0.4202 0.4156 0.4097 0.4020 0.3960 0.3914 0.3845 0.3796 0.3760 0.4344 0.4172 0.4115 0.4040 0.3935 0.3847 0.3772 0.3657 0.3559 0.3481 20 0.4862 0.4684 0.4630 0.4561 0.4470 0.4400 0.4345 0.4264 0.4207 0.4165 0.4852 0.4652 0.4587 0.4499 0.4379 0.4279 0.4195 0.4058 0.3953 0.3863 a First entry in cell corresponds to the Student t distribution. b Second entry in cell corresponds to the S u distribution. For most known distributions, Eqs. (3) and (4) can be solved numerically. In general, however, the underlying process distribution is unknown and thus it is not possible to solve Eqs. (3) and (4) to get the exact UCL and LCL. Even for known f(x) and/or F(x), the use of the control charts can be enhanced with the readily available control chart constants. We propose a method where we assume that a symmetric system of distributions such as the Student t or Johnson s S u [5] can effectively approximate any leptokurtic symmetrical distribution by matching the mean, variance, and kurtosis. Although the degrees of freedom parameter of the t distribution is treated as an integer in practice, the t distribution is defined for all real values of the degrees of freedom. This gives the family of t distributions the flexibility to have any kurtosis greater than 0. 2.2. Derivation of the Control Charts Constants Using Eqs. (3) and (4), we calculated D 3 and D 4 values for two different families of symmetric, leptokurtic Table 2. Values of UCL constant D4 for the RQA method. n Kurtosis 0.5 1 1.2 1.5 2 2.5 3 4 5 6 2 4.340 4.619 4.719 4.854 5.063 5.242 5.312 5.663 5.874 6.069 3 3.207 3.446 3.530 3.648 3.820 3.968 4.099 4.314 4.488 4.637 4 2.790 3.012 3.091 3.199 3.358 3.494 3.613 3.808 3.972 4.111 5 2.566 2.778 2.853 2.956 3.107 3.237 3.350 3.536 3.687 3.814 6 2.423 2.629 2.701 2.801 2.947 3.072 3.180 3.361 3.505 3.625 7 2.322 2.523 2.594 2.691 2.833 2.955 3.061 3.231 3.376 3.494 8 2.247 2.444 2.513 2.609 2.748 2.867 2.969 3.141 3.280 3.394 9 2.188 2.382 2.449 2.544 2.681 2.798 2.900 3.067 3.205 3.314 10 2.140 2.332 2.399 2.491 2.626 2.742 2.842 3.008 3.139 3.250 12 2.067 2.255 2.321 2.410 2.543 2.656 2.753 2.914 3.044 3.151 15 1.990 2.174 2.238 2.327 2.455 2.565 2.659 2.817 2.940 3.047 20 1.908 2.087 2.150 2.236 2.361 2.468 2.560 2.711 2.832 2.846

94 Naval Research Logistics, Vol. 55 (2008) Table 2a. Values of D 4 for Student t and Johnson S u distributions. n Kurtosis 0.5 1 1.2 1.5 2 2.5 3 4 5 6 2 4.3407 a 4.6174 4.7154 4.8490 5.0407 5.2010 5.3360 5.5501 5.7105 5.8356 4.3391 b 4.6198 4.7222 4.8589 5.0860 5.2828 5.2884 5.7758 6.0379 6.3022 3 3.2082 3.4475 3.5312 3.6452 3.8085 3.9441 4.0585 4.2385 4.3737 4.4785 3.2050 3.4435 3.5295 3.6505 3.8311 3.9925 4.1396 4.3899 4.6018 4.7963 4 2.7921 3.0161 3.0943 3.2008 3.3529 3.4791 3.5855 3.7530 3.8786 3.9762 2.7877 3.0081 3.0869 3.1977 3.3622 3.5087 3.6411 3.8632 4.0650 4.2462 5 2.5685 2.7838 2.8591 2.9612 3.1074 3.2288 3.3307 3.4916 3.6122 3.7059 2.5631 2.7725 2.8472 2.9502 3.1069 3.2446 3.3693 3.5803 3.7616 3.9222 6 2.4259 2.6357 2.7088 2.8083 2.9507 3.0691 3.1683 3.3250 3.4425 3.5338 2.4196 2.6216 2.6934 2.7931 2.9424 3.0741 3.1915 3.3978 3.5671 3.7162 7 2.3258 2.5315 2.6032 2.7010 2.8405 2.9567 3.0542 3.2080 3.3235 3.4130 2.3186 2.5150 2.5847 2.6808 2.8257 2.9530 3.0669 3.2531 3.4292 3.5738 8 2.2506 2.4534 2.5241 2.6205 2.7580 2.8726 2.9687 3.1205 3.2344 3.3228 2.2429 2.4349 2.5029 2.5979 2.7376 2.8614 2.9695 3.1617 3.3261 3.4660 9 2.1918 2.3922 2.4622 2.5574 2.6935 2.8067 2.9018 3.0521 3.1648 3.2524 2.1836 2.3720 2.4349 2.5313 2.6682 2.7890 2.8980 3.0812 3.2452 3.3765 10 2.1443 2.3427 2.4121 2.5065 2.6412 2.7536 2.8479 2.9968 3.1087 3.1955 2.1355 2.3209 2.3863 2.4764 2.6117 2.7301 2.8355 3.0184 3.1695 3.3046 12 2.0714 2.2670 2.3352 2.4283 2.5616 2.6722 2.7654 2.9124 3.0229 3.1087 2.0617 2.2423 2.3058 2.3920 2.5243 2.6388 2.7413 2.9161 3.0657 3.1930 15 1.9955 2.1879 2.2552 2.3469 2.4783 2.5876 2.6795 2.8248 2.9340 3.0189 1.9846 2.1598 2.2214 2.3071 2.4323 2.5425 2.6391 2.8089 2.9456 3.0748 20 1.9143 2.1035 2.1697 2.2602 2.3896 2.4975 2.5884 2.7319 2.8398 2.9237 1.9019 2.0710 2.1302 2.2126 2.3323 2.4376 2.5318 2.6903 2.8235 2.7686 a First entry in cell corresponds to the Student t distribution. b Second entry in cell corresponds to the S u distribution. distributions: Student t and Johnson S u, for sample sizes of n = 2ton = 25, α = 0.0027, and kurtosis ranging from 0.5 to 6. As we can see from Tables 1a and 2a, the corresponding quantiles of these two distributions are very close to one another. We experimented with other symmetric leptokurtic distributions from the Tadikamalla-Johnson s L U family of distributions [11], the Burr [2] distribution, and the Exponential power distribution [10]. The D 3 and D 4 values from these distributions are very similar to the values in Tables 1a and 2a. This led us to infer that distributions with identical first four moments closely approximate one another. It is then reasonable to assume that an average of the values of D 3 for Student t and Johnson S u, would give a good estimate of the actual LCL for any symmetrical leptokurtic distribution with the same mean, standard deviation, and kurtosis. Similarly, averaging the values of D 4 for the two distributions would give a good approximation for the UCL. We define D 3 and D 4 as follows: D 3 = D 3t dist + D 3S u dist 2 D 4 = D 4t dist + D 4S u dist 2 (8) Tables 1 and 2 give the D3 and D 4 values for different kurtosis and sample size values. 2.3. Numerical Example Here we illustrate the use of the proposed method with a numerical example using the same data from Tadikamalla and Popescu [12]. The data comes from a technology company that engages in the development, manufacture, and marketing of materials and derivative products for precision use in industrial, medical, military, surgical, and aerospace applications. The quality characteristic is the center thickness of an optical lens. Table 3 shows the data for 40 subgroups of size n = 5 from a process that is known to be in control. A look at the summary statistics of the data revealed symmetry (skewness of 0.24) and a heavy tailed distribution (kurtosis = 2.9). Following the procedure described in Tadikamalla and Popescu [12], statistical tests indicate that the skewness is not statistically significant (P-value = 0.156), and the kurtosis is highly significant (P-value 0). Using the constants in Tables 1 and 2, we calculated the control chart limits for the R-chart using the proposed method

Tadikamalla, Banciu, and Popescu: Range Charts for Normal and Long-Tailed Symmetrical Distributions 95 Table 3. Center thickness of the optical lenses. Group 1 2 3 4 5 X R 1 66.98 60.21 58.10 58.30 58.26 60.37 8.88 2 60.18 60.42 55.96 59.04 60.06 59.13 4.46 3 60.50 60.10 60.67 64.08 55.26 60.12 8.82 4 59.93 64.72 60.77 60.60 58.03 60.81 6.69 5 58.59 60.38 63.23 61.27 64.14 61.52 5.55 6 60.25 59.81 60.86 61.50 58.05 60.09 3.45 7 62.57 64.27 60.14 63.34 58.67 61.80 5.60 8 60.88 60.91 58.66 60.07 60.06 60.12 2.25 9 58.21 61.92 58.44 61.70 59.90 60.04 3.72 10 62.70 57.70 49.03 62.08 61.75 58.65 13.67 11 63.04 61.53 63.09 62.64 60.38 62.14 2.71 12 60.91 60.15 56.50 62.58 59.15 59.86 6.07 13 58.59 56.63 56.02 56.91 62.00 58.03 5.98 14 58.89 60.44 61.74 62.36 59.88 60.66 3.47 15 65.18 59.72 59.88 60.04 58.90 60.74 6.28 16 55.18 61.50 60.77 61.79 58.66 59.58 6.60 17 60.22 61.47 58.27 59.30 59.50 59.75 3.20 18 67.13 63.26 58.16 58.72 60.60 61.57 8.97 19 60.58 63.24 60.14 58.49 60.24 60.54 4.76 20 61.30 59.22 59.34 58.47 60.38 59.74 2.84 21 57.35 59.52 60.05 60.00 59.78 59.34 2.71 22 63.89 62.15 61.45 61.07 60.89 61.89 3.00 23 60.05 58.39 59.54 57.66 58.55 58.84 2.39 24 58.26 62.11 62.25 59.47 58.81 60.18 3.99 25 60.07 57.43 56.89 59.25 63.54 59.43 6.65 26 62.32 59.34 62.99 59.64 61.67 61.19 3.65 27 60.53 57.56 58.96 61.16 59.60 59.56 3.60 28 64.75 59.92 58.63 57.38 57.38 59.61 7.37 29 59.95 59.76 58.62 59.86 58.87 59.41 1.33 30 59.76 59.87 61.41 64.19 60.79 61.21 4.43 31 58.27 60.37 60.57 61.62 54.63 59.09 7.00 32 61.62 59.34 59.96 58.81 58.81 59.71 2.81 33 58.16 56.06 59.06 61.50 62.17 59.39 6.11 34 59.60 61.93 62.02 63.42 60.98 61.59 3.82 35 58.83 58.28 61.28 58.40 59.75 59.31 3.00 36 61.58 63.01 63.98 64.78 63.68 63.41 3.20 37 60.33 60.43 59.13 59.09 63.73 60.54 4.64 38 60.86 59.46 59.71 58.69 57.54 59.25 3.32 39 59.28 58.15 60.62 60.60 59.23 59.58 2.47 40 62.69 60.73 55.28 60.32 60.93 59.99 7.40 X = 60.195, R = 4.921 RQA, as follows (we use the constants corresponding to kurtosis = 3.0) 2 : CL R = R = 4.921 UCL R = D4 R = (3.35) 4.921 = 16.485 LCL R = D3 R = (0.15) 4.921 = 0.738 2 In general, for intermediate kurtosis values, a linear interpolation seems to be quite satisfactory. For a given value of n, we ran simple linear regressions for the D3 and D 4 values as the dependent variables and the kurtosis values as the independent variable. The R 2 values for these regressions range from 0.923 to 0.976. Figure 1 shows the R-chart with the control limits calculated from our proposed RQA method, KC method (Tadikamalla and Popescu [12]), and Shewhart s method. Note that both the RQA and the KC method indicate that the process under investigation is in statistical control, as is determined by the engineers, while the traditional Shewhart s method sends an out of control signal. (Tadikamalla and Popescu [12]) give the corresponding X chart, which shows that the process as being out of control using the Shewhart chart and to be in control using the KC method). 3. THE NORMAL DISTRIBUTION 3.1. Exact Control Limits for a Normal Process Distribution Consider the case of a standard normal process distribution: f(x)= 1 e x2 2 (9) 2π Substituting (9) in (2), we can compute D3 and D 4 as follows: LCL R = G 1 (α/2) ( ) 1 n n 2π = α/2 UCL R = G 1 (1 α/2) ( ) 1 n n 2π = 1 α/2 x+lcl R x x+ucl R x e x2 2 dx e x2 2 dx n 1 n 1 Then the control chart constants will be given by: where d 2 = D 3 = LCL R d 2 D 4 = UCL R d 2 e x2 2 dx e x2 2 dx (10) (11) [1 (1 (x)) n ( (x)) n ]dx and (x) is the cumulative density function of the standard normal distribution. Equation (10) can be solved numerically for different values of n and α = 0.0027 to yield the corresponding control limits. Table 4 below presents the R-chart constants for a process with an underlying normal distribution computed using our

96 Naval Research Logistics, Vol. 55 (2008) Figure 1. R-chart for the quality characteristic showing the RQA, KC and Shewhart upper and lower control limits. [Color figure can be viewed in the online issue, which is available at www.interscience.wiley.com.] approach as compared with the traditional Shewhart constants derived using the 3-sigma limits. 3.2. The Importance of the LCL Simulated Example We have always wondered about the use of an accurate approximation to the LCL of the R-chart. The purpose of an R-chart, in general, is to monitor the process variance. A typical out of control signal (a point outside the 3 sigma UCL) warns that the process variation may have increased, which in turn warrants an investigation for an assignable cause. In Shewhart control charts, the LCL is zero for n<7, and an out of control signal on the LCL side is impossible for small sample sizes. As in the case of the p-chart (proportion defective), an out of control signal on the LCL side of the R-chart could be a good thing. Such a signal may lead us to an assignable cause, which could result in an unusually low variance in the process quality characteristic. An accurate (nonzero) approximation to the LCL of the R-chart may provide such an opportunity. We simulated several data sets from a normal distribution (µ = 50, σ = 2, n = 5, 6, and r = 40). During the Figure 2. R-chart for the simulated quality characteristic showing the RQA, KC and Shewhart upper and lower control limits. [Color figure can be viewed in the online issue, which is available at www.interscience.wiley.com.]

Tadikamalla, Banciu, and Popescu: Range Charts for Normal and Long-Tailed Symmetrical Distributions 97 Table 4. R-chart constants for a normal process distribution. Exact method Shewhart s method n D 3 D 4 D 3 D 4 2 0.002 4.017 0 3.267 3 0.041 2.925 0 2.575 4 0.107 2.525 0 2.282 5 0.170 2.312 0 2.114 6 0.224 2.176 0 2.004 7 0.269 2.081 0.076 1.924 8 0.307 2.009 0.136 1.864 9 0.339 1.954 0.184 1.816 10 0.366 1.910 0.223 1.777 12 0.410 1.840 0.283 1.717 15 0.459 1.768 0.347 1.653 20 0.513 1.691 0.415 1.585 simulation (in between the simulation of the subgroups), we randomly (with a probability of 0.03) induced an abnormal variation to the process by reducing the variance by half (σ = 1). Table 5 gives one such simulated data set. Figure 2 shows the corresponding range chart with control limits both from the Shewhart method and the proposed method (RQA). Note that the Shewhart method (and the KC method, not shown here) would have an LCL of zero and thus would not have detected an out of control situation where as the proposed method detected an out of control situation on the LCL side. 4. EVALUATION OF THE PERFORMANCE OF THE PROPOSED R-CHARTS 4.1. The Logistic Distribution Case The pdf and the cdf of the standard logistic distribution (with zero mean and unit variance) are given as: and f(x)= π 3 e πx 3 (1 + e πx 3 ) 2 (12) F(x) = (1 + e πx 3 ) 1 (13) Note that the logistic distribution is symmetric and has a kurtosis of 1.2. The control chart constants for the logistic distribution are calculated very similarly to the normal case described above. The constants D 3 and D 4 for the logistic distribution are given in Table 6. Table 6 also gives the corresponding constants for the Shewhart s method, the KC method [12], and the proposed RQA method. Figure 3 shows the exact LCL and UCL values for the R-chart for the standard logistic case and compares them to different approximations. We find that the LCL constant (D 3 ) is very closely approximated by RQA. The KC and the RQA approximations are comparable to the UCL constant (D 4 ). The performance of the Shewhart control limits is significantly worse. 4.2. The Laplace Distribution Case The Laplace distribution is a symmetric distribution with kurtosis of 3.0. The pdf and cdf of the standard Table 5. The importance of a nonzero LCL in an R-chart (Simulated data). Group 1 2 3 4 5 6 R 1 80.49 85.00 82.01 81.28 81.63 81.26 4.51 2 80.07 82.25 78.30 78.90 79.66 78.87 3.95 3 77.93 74.35 79.28 79.62 78.47 81.35 7.01 4 80.69 81.18 77.60 78.12 76.84 82.61 5.77 5 78.86 79.26 81.69 80.20 79.52 78.53 3.16 6 82.31 80.28 81.78 79.21 79.09 79.70 3.22 7 80.21 80.99 77.54 79.90 83.21 80.87 5.67 8 78.40 78.98 81.90 79.67 82.43 76.99 5.44 9 78.02 76.75 78.62 83.08 79.74 79.64 6.33 10 79.61 81.33 81.47 78.49 79.72 77.64 3.84 11 79.17 79.69 77.50 84.02 80.01 81.22 6.52 12 81.08 77.04 80.62 82.02 77.89 79.16 4.98 13 81.74 80.86 77.14 80.75 81.27 78.66 4.60 14 83.35 81.69 80.56 81.01 81.86 81.65 2.79 15 80.76 80.77 81.63 78.50 81.23 81.18 3.14 16 76.92 80.20 80.47 78.63 81.35 80.31 4.43 17 80.91 79.45 83.56 82.73 80.13 78.03 5.53 18 79.08 79.75 82.34 80.09 81.08 80.70 3.26 19 77.58 80.35 79.52 78.07 78.65 78.94 2.77 20 80.39 80.96 82.02 82.39 77.28 77.27 5.12 21 79.17 78.44 79.34 79.64 75.77 77.87 3.87 22 80.12 77.16 80.16 76.04 79.90 77.27 4.12 23 78.74 80.16 81.88 78.80 79.88 79.08 3.14 24 78.88 82.57 79.67 83.79 81.62 82.50 4.91 25 83.43 81.72 79.74 76.57 81.08 81.39 6.86 26 81.65 80.21 76.21 80.14 82.48 82.29 6.26 27 81.29 80.83 76.76 80.08 79.66 78.54 4.52 28 80.20 75.61 79.00 80.09 78.57 84.00 8.39 29 84.25 81.73 81.49 79.13 77.38 77.37 6.88 30 81.61 82.84 82.23 79.44 83.14 80.63 3.70 31 81.10 81.55 80.54 80.20 77.95 78.08 3.60 32 78.30 78.68 78.52 78.31 78.84 79.21 0.90 33 82.26 80.70 78.01 78.22 77.16 78.38 5.09 34 79.64 82.33 80.45 83.94 78.34 82.19 5.60 35 81.85 83.38 77.61 81.31 79.83 81.57 5.76 36 82.25 78.50 80.50 82.89 80.39 81.60 4.39 37 79.72 79.36 79.65 83.74 82.17 81.14 4.38 38 81.83 78.42 79.32 80.60 77.89 82.93 5.03 39 76.51 81.09 82.56 80.93 80.72 79.52 6.05 40 79.97 81.54 80.14 82.32 82.13 81.40 2.35 R = 4.7 Shewhart Limits: UCL = 9.4188; LCL = 0 Exact Limits: UCL = 10.2272; LCL = 1.0528

98 Naval Research Logistics, Vol. 55 (2008) Table 6. R-chart control limits for a logistic process distribution. Table 7. R-chart control limits for a Laplace process distribution. Exact Shewhart s KC RQA method method method method n D3 D4 D3 D4 D3 D4 D3 D4 2 0.002 4.717 0 3.267 0 4.481 0.002 4.719 3 0.039 3.515 0 2.575 0 3.363 0.039 3.530 4 0.099 3.066 0 2.282 0 2.957 0.100 3.091 5 0.156 2.821 0 2.114 0 2.738 0.158 2.853 6 0.204 2.663 0 2.004 0 2.598 0.207 2.701 7 0.243 2.550 0.076 1.924 0 2.498 0.248 2.594 8 0.276 2.465 0.136 1.864 0 2.422 0.281 2.513 9 0.303 2.398 0.184 1.816 0 2.363 0.309 2.449 10 0.326 2.343 0.223 1.777 0 2.314 0.333 2.399 12 0.364 2.259 0.283 1.717 0 2.239 0.371 2.321 15 0.406 2.169 0.347 1.653 0 2.159 0.414 2.238 20 0.453 2.072 0.415 1.585 0 2.071 0.461 2.150 Exact Shewhart s KC RQA method method method method n D3 D4 D3 D4 D3 D4 D3 D4 2 0.002 5.493 0 3.267 0 5.138 0.002 5.312 3 0.033 4.125 0 2.575 0 3.956 0.038 4.099 4 0.080 3.588 0 2.282 0 3.509 0.096 3.613 5 0.124 3.283 0 2.114 0 3.266 0.150 3.350 6 0.161 3.082 0 2.004 0 3.109 0.196 3.180 7 0.193 2.937 0.076 1.924 0 2.997 0.233 3.061 8 0.219 2.823 0.136 1.864 0 2.913 0.264 2.969 9 0.242 2.738 0.184 1.816 0 2.847 0.289 2.900 10 0.262 2.661 0.223 1.777 0 2.793 0.311 2.842 12 0.295 2.547 0.283 1.717 0 2.709 0.346 2.753 15 0.334 2.427 0.347 1.653 0 2.621 0.384 2.659 20 0.381 2.295 0.415 1.585 0 2.525 0.427 2.560 Laplace distribution (mean zero and unit variance) are given below: f(x)= 1 e 2 x (14) 2 and 1 F(x) = 2 e 2x, x 0 1 1 2 e 2x, x>0 (15) The control chart constants (D 3 and D 4 ) for the Laplace distribution can be calculated very similarly to the normal and logistic cases. The constants D 3 and D 4 for the Laplace distribution are given in Table 7. Table 7 also gives the corresponding constants for Shewhart s method, the KC method [12], and the RQA method. Exact values for the LCL and UCL of the Laplace distribution are shown in Figure 4 along with the other approximations. Once again, RQA outperforms the other approximations to the LCL and is comparable to the KC method for the UCL. 5. CONCLUSIONS This study presents a method to calculate more accurate limits for the R-chart under the assumption of a symmetric, leptokurtic distribution of the quality characteristic. Our Figure 3. Upper and lower control limits for the R-chart for the logistic distribution.

Tadikamalla, Banciu, and Popescu: Range Charts for Normal and Long-Tailed Symmetrical Distributions 99 Figure 4. Upper and lower control limits for the R-chart for the Laplace distribution. method computes the true quantiles directly for the distribution of R and thus overcomes the disadvantage of the Shewhart R-chart and other approximations that use ±3 sigma limits which result in a LCL of zero for smaller sample sizes. We tabulated the D3 and D 4 constants for different values of n and of the kurtosis and illustrate the implementation of our method with a numerical example. By providing these constants, the implementation of our method is identical with Shewhart s method and is just as simple. As a special case, we tabulated the exact constants of the range chart for the normal distribution. Finally, we compared the performance of our method for the case of the logistic and the Laplace distributions and found increased accuracy for the LCL relative to the Shewhart method and the KC method. REFERENCES [1] W. Albers and W.C.M. Kallenberg, Estimation in Shewhart control charts: Effects and corrections, Metrika 59 (2004), 207 234. [2] I.W. Burr, Cumulative frequency functions, Ann Math Stat 13 (1942), 215 232. [3] P. Castagliola, Control charts for data having a symmetrical distribution with a positive kurtosis, Recent Advances in Reliability and Quality Engineering, Hoang Pham (editor), World Scientific Publishers, 2001, pp. 1 16. [4] L.K. Chan and H.J. Cui, Skewness correction x-bar and R-charts for skewed distributions, Nav Res Logistics 50 (2003), 555 573. [5] N.L. Johnson, Systems of frequency curves generated by methods of translation, Biometrika 36 (1949), 149 176. [6] M.B.C. Khoo and E.G. Lim, An improved R (range) control chart for monitoring the process variance, Qual Reliab Eng Int 21 (2005), 43 50. [7] D.C. Montgomery, Introduction to statistical quality control, Wiley, New York, 2001. [8] W.A. Shewhart, Economic control of quality of manufacturing processes, American Society for Quality Control, Milwaukee, 1931. [9] A. Stuart and J.K. Ord, Advanced theory of statistics, Oxford University Press, New York, 1987. [10] P. Tadikamalla, Random sampling from the exponential power distributions, J Am Stat Assoc 75 (1980), 683 686. [11] P. Tadikamalla and N.L. Johnson, Tables to facilitate fitting Lu distributions, Commun Stat Simulation Computation 11 (1982), 249 271. [12] P. Tadikamalla and D. Popescu, Kurtosis correction method for x-bar and R control charts for long-tailed symmetrical distributions, Nav Res Logistics 54 (2007), 371 383. [13] L.H.C. Tippett, On the extreme individuals and the range of sample taken from a normal population, Biometrika 17 (1925), 364 387.