Simulation of probability distributions commonly used in hydrological frequency analysis

Size: px
Start display at page:

Download "Simulation of probability distributions commonly used in hydrological frequency analysis"

Transcription

1 HYDROLOGICAL PROCESSES Hydrol. Process. 2, 5 6 (27) Published online May 26 in Wiley InterScience ( DOI: 2/hyp.676 Simulation of probability distributions commonly used in hydrological frequency analysis Ke-Sheng Cheng,* Jie-Lun Chiang and Chieh-Wei Hsu Department of Bioenvironmental Systems Engineering, National Taiwan University, Taipei, Taiwan, ROC Abstract: Random variable simulation has been applied to many applications in hydrological modelling, flood risk analysis, environmental impact assessment, etc. However, computer codes for simulation of distributions commonly used in hydrological frequency analysis are not available in most software libraries. This paper presents a frequency-factor-based method for random number generation of five distributions (normal, log normal, extreme-value type I, Pearson type III and log-pearson type III) commonly used in hydrological frequency analysis. The proposed method is shown to produce random numbers of desired distributions through three means of validation: () graphical comparison of cumulative distribution functions (CDFs) and empirical CDFs derived from generated data; (2) properties of estimated parameters; (3) type I error of goodness-of-fit test. An advantage of the method is that it does not require CDF inversion, and frequency factors of the five commonly used distributions involves only the standard normal deviate. Copyright 26 John Wiley & Sons, Ltd. KEY WORDS random number generation; random variable simulation; hydrological frequency analysis; goodness-of-fit test Received 23 August 24; Accepted August 25 INTRODUCTION In various statistical applications, particularly in simulation studies, it is often desired to generate random samples of specified random variables. Recent developments in hydrological modelling, flood risk analysis, environmental impact assessment, etc. have demonstrated the usefulness of random variable simulation (National Research Council, 2). Although computer codes for simulation of random variables with uniform and normal (or Gaussian) distributions are widely available, simulation of other non-gaussian continuous random variables frequently used in hydrological study are less common. Computer simulation of random variables is the task of using computers to generate many random numbers that are independent and identically distributed. It is also known as random number generation (RNG). In fact, these computer-generated random numbers form a deterministic sequence, and the same list of numbers will be cycled repeatedly. This cycle can be made to be so long that the lack of true independence is unimportant (Larget, 22). Therefore, such computer codes are often termed pseudo-random number generators (PRNGs). There exist mathematical transformation methods to obtain other distributions from uniform variates (Devroye, 986; Hellekalek, 997). For this reason, most PRNGs found in software libraries produce uniform random numbers in the unit interval (, ). However, transformations from a uniform distribution to the several distributions commonly used in hydrological frequency * Correspondence to: Ke-Sheng Cheng, Department of Bioenvironmental Systems Engineering, National Taiwan University, Taipei, Taiwan, ROC. rslab@ntu.edu.tw analysis is not easy, and computer codes for simulation of these distributions are not available in most software libraries. Therefore, the purpose of this paper is to present a method for RNG of five distributions (normal, log normal, extreme-value type I (EV), Pearson type III (PT3) and log-pearson type III (LPT3)) commonly used in hydrological frequency analysis. The method utilizes the frequency factor, which is familiar to hydrologists for transformation from uniform variates to desired distributions. GENERATING RANDOM NUMBERS: PROBABILITY INTEGRAL TRANSFORMATION AND REJECTION METHODS Suppose that we are interested in generating n values of a random variable X that has a continuous cumulative distribution function (CDF) F X Ð. A commonly used method for such a purpose is the probability integral transform (PIT) method or the inversion method (Mood et al., 974). The PIT method is based on the property that a random variable X with CDF F X Ð can be transformed into a random variable U with uniform distribution over the interval (, ) by defining U D F X X Conversely, if U is uniformly distributed over the interval (, ), then X D F X U has CDF F X Ð. Thus, to generate a value, say x, of a random variable X having a continuous CDF F X Ð, it suffices to generate a value, say u, of a random variable U that is uniformly distributed Copyright 26 John Wiley & Sons, Ltd.

2 52 K.-S. CHENG, J.-L. CHIANG AND C.-W. HSU over the interval (, ). The value x is then obtained by x D F X u 2 Another method for generating random deviates is the rejection method (Press et al., 993), which does not require that the CDF be readily computable, much less the inverse of that function. The rejection method utilizes a comparison function g x that lies everywhere above the desired probability distribution f x. The method first generates a uniform deviate between zero and A, where A is the total area under the comparison function, and uses it to get a corresponding x. Then a second uniform deviate y between zero and g x is generated and used to decide whether to accept (if f x ½ y) or reject (if f x < y) thatx. The non-rejected x values would have the desired distribution f x. GENERAL EQUATION FOR HYDROLOGICAL FREQUENCY ANALYSIS Hydrological frequency analysis is the work of determining the magnitudes of hydrological variables corresponding to a given frequency or recurrence interval. The recurrence interval, also called the return period, is defined as the average interval over a long period of time during which a corresponding magnitude of some hydrological variable is met or exceeded. Rainfall and streamflow are two major types of hydrological variable that are used in frequency analysis. For example, annual maximum flow records are used to estimate the magnitude of flood of 5- year return period Q 5, i.e. on average, year of every 5-year sequence is expected to experience a flood of at least Q 5. Two methods of hydrological frequency analysis are commonly applied: the plotting position method and the frequency factor method. The former is a straightforward plotting technique to obtain the CDF by use of certain plotting position formulas (Chow et al., 988). The frequency factor method is described below. A random variable X has CDF F X Ð with mean and standard deviation. The magnitude of X corresponding to return period T, denoted by x T,isdefinedas P X ½ x T D /T 3 Chow (95) proposed the following general equation for hydrological frequency analysis: x T D C K T 4 where K T, the frequency factor, is a function of T and is distribution specific. Apparently, if X is normally distributed, then the frequency factor K T corresponds to the standard normal deviate with exceedance probability /T. Frequency factors of commonly used distributions in hydrological frequency analysis have been developed (Kite, 988). Table I shows the probability density functions and frequency factors of five distributions commonly used for hydrological frequency analysis. Table I. Probability density functions and frequency factors of distributions commonly used for hydrological frequency analysis Distribution, X Probability density function f X x [ Normal f X x D p exp 2 2 <x<c Log normal f X x D ( ) ] x 2, Frequency factor K T Standard normal deviate z with exceedance probability /T [ p exp ( ) y y 2 ] 2xy 2, K T D expf[ln C C2 V ]/2 Z [ln C C 2 V ]/2g y C V <x<c C V D /, coefficient of variation of X D e y C2 y /2, 2 D e 2 y 2 z: standard normal deviate with exceedance y and y are respectively the mean and probability /T standard deviation of Y D ln X p [ ( )]} EV f X x D exp[ x ˇ e x ˇ ], K T D 6 {Ð5772 C ln ln T T <x<c D / p 6, ˇ D Ð5772/ and are respectively the mean and standard deviation of X PT3 f X x D ˇ LPT3 ε x<c ( x ε )ˇ e [ x ε / ], K T ³ z C z 2 6 C 3 ( ) 2 z3 6z ( ) 6 z 2 3 ( ) 4 ( ) 6 C z z: standard normal deviate with exceedance D / pˇ, ˇ D 2/ 2,εD pˇ, and are respectively the mean, standard probability /T deviation and skewness coefficient of X ) ( f X x D ln x ε )ˇ ( ln x ε x ˇ e, Same as K T of the PT3 distribution (K T is to be ε ln x<c substituted into y T D ln x T D y C K T y ) D y / pˇ, ˇ D 2/ y 2,εD y y pˇ y, y and y are respectively the mean, standard deviation and skewness coefficient of Y D ln X Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

3 HYDROLOGICAL FREQUENCY ANALYSIS 53 Table II. Distribution parameters designated for generation of random numbers Parameter Distribution Normal Log normal EV PT3 LPT3 Mean a a Standard deviation a a Coefficient of skewness a Ð396 Ð5 Ð5 a a Parameters are assigned for Y D ln X when X is a random variable of log normal or LPT3 distribution. Normal (µ=. σ=); n= Normal (µ=. σ=); n= LN (µ y =. σ y =); n= LN (µ y =. σ y =); n= EV (µ=. σ=); n= EV (µ=. σ=); n= PT3 (µ=. σ=, γ=.5); n=5.6 PT3 (µ=. σ=, γ=.5); n= LPT3 (µ y =. σ y =, γ y =.5); n= X LPT3 (µ y =. σ y =, γ y =.5); n= X Figure. Graphical comparison of ECDF (dots) and CDF (solid curves). (For log normal and log-pearson distributions, values of Y D ln X are used in X-axis) Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

4 54 K.-S. CHENG, J.-L. CHIANG AND C.-W. HSU Table III. Sample mean and standard deviation of the estimated parameters (mean, standard deviation and coefficient of skewness) with respect to sample size n ranging from 5 to 5 for normal distribution ( D, D ) Estimated Summary n (sample size) parameter statistic Number of random samples N D x Mean Ð25 Ð53 Ð6 Ð Ð3 Ð Ð8 Ð2 Ð2 Ð SD Ð346 Ð9 Ð82 Ð696 Ð655 Ð63 Ð525 Ð493 Ð479 Ð457 s Mean Ð9993 Ð9955 Ð9994 Ð9986 Ð Ð999 Ð9993 Ð9989 Ð9985 Ð9996 SD Ð9 Ð683 Ð587 Ð56 Ð449 Ð423 Ð395 Ð354 Ð324 Ð33 O Mean Ð77 Ð2 Ð66 Ð8 Ð98 Ð9 Ð3 Ð9 Ð Ð3 SD Ð494 Ð2637 Ð294 Ð785 Ð6 Ð495 Ð363 Ð252 Ð87 Ð4 Number of random samples N D x Mean Ð2 Ð3 Ð Ð6 Ð4 Ð5 Ð7 Ð9 Ð Ð4 SD Ð422 Ð2 Ð82 Ð74 Ð64 Ð574 Ð536 Ð5 Ð474 Ð443 s Mean Ð9944 Ð998 Ð9987 Ð9986 Ð9986 Ð9982 Ð9994 Ð Ð9989 Ð9994 SD Ð5 Ð7 Ð577 Ð55 Ð448 Ð46 Ð375 Ð353 Ð33 Ð34 O Mean Ð57 Ð2 Ð22 Ð Ð24 Ð5 Ð4 Ð25 Ð8 Ð5 SD Ð436 Ð2677 Ð233 Ð84 Ð62 Ð453 Ð364 Ð239 Ð78 Ð3 Table IV. Sample mean and standard deviation of the estimated parameters (mean, standard deviation and coefficient of skewness) with respect to sample size n ranging from 5 to 5 for log normal distribution ( y D, y D ) Estimated Summary n (sample size) parameter statistic Number of random samples N D y Mean Ð2 Ð22 Ð28 Ð47 Ð Ð6 Ð4 Ð7 Ð3 Ð4 SD Ð43 Ð999 Ð8 Ð7 Ð67 Ð574 Ð544 Ð484 Ð482 Ð457 s y Mean Ð9947 Ð999 Ð9976 Ð998 Ð9976 Ð999 Ð9993 Ð998 Ð9979 Ð9975 SD Ð48 Ð7 Ð587 Ð494 Ð44 Ð49 Ð379 Ð36 Ð337 Ð33 O y Mean Ð9 Ð Ð56 Ð54 Ð46 Ð5 Ð62 Ð54 Ð37 Ð42 SD Ð3987 Ð2672 Ð252 Ð849 Ð637 Ð464 Ð322 Ð229 Ð98 Ð2 Number of random samples N D y Mean Ð Ð Ð4 Ð Ð2 Ð Ð2 Ð9 Ð5 Ð SD Ð43 Ð999 Ð83 Ð79 Ð635 Ð574 Ð532 Ð496 Ð474 Ð448 s y Mean Ð996 Ð9959 Ð9983 Ð9985 Ð9986 Ð9993 Ð9989 Ð9994 Ð9994 Ð9988 SD Ð Ð78 Ð58 Ð496 Ð447 Ð47 Ð378 Ð356 Ð334 Ð37 O y Mean Ð4 Ð2 Ð6 Ð6 Ð9 Ð5 Ð Ð3 Ð Ð8 SD Ð4 Ð2664 Ð233 Ð82 Ð599 Ð448 Ð35 Ð247 Ð8 Ð2 y, s y and O y are respectively the sample estimates of mean, standard deviation and skewness coefficient of Y D ln X respectively. Table V. Sample mean and standard deviation of the estimated parameters (mean, standard deviation and coefficient of skewness) with respect to sample size n ranging from 5 to 5 for EV distribution ( D, D, D Ð396) Estimated Summary n (sample size) parameter statistic Number of random samples N D x Mean Ð2 Ð2 Ð7 Ð7 Ð44 Ð3 Ð4 Ð34 Ð9 Ð28 SD Ð47 Ð976 Ð85 Ð73 Ð63 Ð565 Ð55 Ð5 Ð463 Ð46 s Mean Ð9897 Ð2 Ð9956 Ð9966 Ð9952 Ð9985 Ð997 Ð9958 Ð9994 Ð7 SD Ð45 Ð Ð863 Ð734 Ð667 Ð595 Ð57 Ð54 Ð487 Ð47 O Mean Ð48 Ð729 Ð53 Ð394 Ð425 Ð393 Ð27 Ð344 Ð49 Ð42 SD Ð6397 Ð4846 Ð4279 Ð3654 Ð336 Ð34 Ð2584 Ð2569 Ð242 Ð2364 Number of random samples N D x Mean Ð25 Ð8 Ð5 Ð2 Ð5 Ð5 Ð6 Ð Ð Ð SD Ð49 Ð99 Ð82 Ð74 Ð63 Ð576 Ð532 Ð5 Ð472 Ð445 s Mean Ð9937 Ð9936 Ð9976 Ð9972 Ð9977 Ð9984 Ð9985 Ð9983 Ð9993 Ð999 SD Ð455 Ð35 Ð85 Ð733 Ð665 Ð64 Ð554 Ð58 Ð49 Ð47 O Mean Ð766 Ð42 Ð48 Ð39 Ð434 Ð398 Ð345 Ð39 Ð429 Ð397 SD Ð6342 Ð4778 Ð44 Ð3479 Ð326 Ð2998 Ð2687 Ð2575 Ð2557 Ð2353 Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

5 HYDROLOGICAL FREQUENCY ANALYSIS 55 Table VI. Sample mean and standard deviation of the estimated parameters (mean, standard deviation and coefficient of skewness) with respect to sample size n ranging from 5 to 5 for PT3 distribution ( D, D, D Ð5) Estimated Summary n (sample size) parameter statistic Number of random samples N D x Mean Ð25 Ð3 Ð22 Ð23 Ð Ð9 Ð2 Ð24 Ð8 Ð8 SD Ð392 Ð976 Ð77 Ð694 Ð62 Ð582 Ð54 Ð498 Ð475 Ð448 s Mean Ð983 Ð9942 Ð9927 Ð9945 Ð9952 Ð9954 Ð9989 Ð9994 Ð9978 Ð2 SD Ð666 Ð4 Ð947 Ð825 Ð758 Ð72 Ð637 Ð568 Ð568 Ð529 O Mean Ð5675 Ð5277 Ð537 Ð5526 Ð524 Ð56 Ð5284 Ð5293 Ð598 Ð539 SD Ð6938 Ð557 Ð439 Ð425 Ð3647 Ð347 Ð3283 Ð332 Ð2732 Ð2738 Number of random samples N D x Mean Ð6 Ð Ð4 Ð5 Ð5 Ð9 Ð8 Ð Ð8 Ð3 SD Ð42 Ð7 Ð86 Ð74 Ð626 Ð576 Ð54 Ð497 Ð47 Ð444 s Mean Ð9876 Ð992 Ð9947 Ð9977 Ð996 Ð9969 Ð997 Ð9982 Ð9986 Ð9976 SD Ð62 Ð62 Ð957 Ð83 Ð743 Ð682 Ð636 Ð59 Ð56 Ð526 O Mean Ð569 Ð545 Ð535 Ð5364 Ð539 Ð5249 Ð5265 Ð5279 Ð5248 Ð525 SD Ð656 Ð5257 Ð45 Ð45 Ð3699 Ð3382 Ð322 Ð333 Ð299 Ð2745 Table VII. Sample mean and standard deviation of the estimated parameters (mean, standard deviation and coefficient of skewness) with respect to sample size n ranging from 5 to 5 for LPT3 distribution ( y D, y D, y D Ð5) Estimated Summary n (sample size) parameter statistic Number of random samples N D y Mean Ð5 Ð32 Ð8 Ð Ð5 Ð2 Ð8 Ð4 Ð7 Ð SD Ð358 Ð37 Ð8 Ð688 Ð624 Ð572 Ð53 Ð5 Ð46 Ð452 s y Mean Ð982 Ð993 Ð9956 Ð9929 Ð9956 Ð9973 Ð9935 Ð9947 Ð998 Ð9993 SD Ð68 Ð93 Ð965 Ð844 Ð74 Ð67 Ð632 Ð588 Ð575 Ð522 O y mean Ð5947 Ð5533 Ð5283 Ð525 Ð5289 Ð5222 Ð56 Ð5269 Ð5279 Ð5324 SD Ð676 Ð4998 Ð4629 Ð395 Ð3899 Ð3265 Ð34 Ð2964 Ð294 Ð266 Number of random samples N D y Mean Ð8 Ð Ð Ð7 Ð9 Ð3 Ð Ð8 Ð Ð7 SD Ð4 Ð993 Ð85 Ð7 Ð64 Ð584 Ð534 Ð52 Ð475 Ð45 s y Mean Ð9883 Ð9928 Ð9944 Ð996 Ð9968 Ð997 Ð9967 Ð9979 Ð9975 Ð9979 SD Ð648 Ð68 Ð955 Ð83 Ð75 Ð685 Ð627 Ð593 Ð56 Ð535 O y Mean Ð5849 Ð549 Ð5326 Ð5262 Ð5283 Ð5296 Ð532 Ð5285 Ð5299 Ð528 SD Ð686 Ð56 Ð4423 Ð392 Ð3644 Ð3393 Ð322 Ð346 Ð2857 Ð2827 y, s y and O y are respectively the sample estimates of mean, standard deviation and skewness coefficient of Y D ln X respectively. Suppose that a random sample fx,x 2,...,x n g of a hydrological variable X of known distribution type is available. The magnitude of X corresponding to return period T, x T, can be estimated by. Calculating the sample mean x, sample variance s 2 and skewness coefficient O from the random sample fx,x 2,...,x n g. 2. Determining the value of frequency factor K T using the appropriate distribution or equation in Table I. Readers are reminded that frequency factors of random variables of normal, log normal and PT3 distributions do not explicitly relate to return period T. The relation between K T and T is embedded in the value of the standard normal deviate z, which satisfies P Z ½ z D /T. 3. x T is estimated by Ox T D x C K T s. GENERATING RANDOM NUMBERS USING FREQUENCY FACTORS u is a one-to-one single-value relation. After a random number, say u, ofa uniform distribution over the interval (, ) is generated, let us set As shown in Equation (2), generation of random numbers by the PIT method requires inversion of the CDF F X Ð. It is not always easy to determine the inverse function of F X Ð. As for the rejection method, one needs to choose a comparison function whose indefinite integral is known analytically, and which is analytically invertible. In this section we propose an alternative method that can avoid CDF inversion by using the general equation of frequency analysis. The CDF of a continuous random variable is a nondecreasing function and x D F X T D u 5 Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

6 56 K.-S. CHENG, J.-L. CHIANG AND C.-W. HSU Normal (N=) Log-Normal (N=) EV (N=) Normal (N=) Log-Normal (N=) EV (N=) PT3 (N=).5 PT3 (N=) LPT3 (N=) LPT3 (N=) Figure 2. Uncertainty in estimation of mean reduces as sample size increases. (The centre line represents mean of x and upper and lower lines are one standard deviation away from the centre line) This yields P X ½ x T D T D u D P U ½ u 6 approach is that, even though there are five types of random variable in Table I, determination of K T involves only the standard normal deviate z. with T determined by Equation (5). Frequency factor K T can be calculated using the appropriate distribution or equation in Table I. Finally, the magnitude of x T is calculated by x T D C K T. Similarly, a set of random numbers fu,u 2,...,u n g of a uniform distribution over interval (, ) can be transformed to random numbers fx,x 2,...,x n g of the desired distribution. Unlike the PIT method, no CDF inversion is involved in the above calculation, and the proposed method is hereafter referred to as the frequency factor transformation (FQFT) method. Another advantage of the FQFT TEST AND VALIDATION In order to demonstrate the applicability of the FQFT approach, random numbers of normal, log normal, EV, PT3 and LPT3 distributions are generated and tested. Specific distribution parameters designated for generating random numbers are shown in Table II. For each type of distribution N, random samples (each of size n) were generated and used in subsequent analysis. In this study, the sample size n was set to vary from 5 to 5 in Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

7 HYDROLOGICAL FREQUENCY ANALYSIS Normal (N=) Normal (N=) Log-Normal (N=) Log-Normal (N=) EV (N=) EV (N=) PT3 (N=) PT3 (N=).2.5 LPT3 (N=) LPT3 (N=) Figure 3. Uncertainty in estimation of standard deviation reduces as sample size n increases. (The centre line represents mean of s and upper and lower lines are one standard deviation away from the centre line) increments of 5 and the number of random samples N was set to and. We adopt three means to test the validity of the random numbers generated: () graphical comparison of the CDF and the empirical CDF (ECDF) derived from generated data; (2) properties of estimated parameters; (3) type I error of goodness-of-fit (GOF) test. Graphical comparison of CDF and ECDF Figure graphically illustrates the closeness of the CDF and ECDF with regard to sample sizes of 5 and 5. Each ECDF in Figure is based on one single random sample of size 5 or 5 and it may change when another random sample is used. It can be seen that even at sample size of 5 the ECDF is fairly close to the CDF of the designated distribution. At a sample size of 5, all ECDFs become almost indistinguishable from their corresponding CDFs. Properties of parameter estimators From each of the N random samples generated, the distribution parameters mean, standard deviation and coefficient of skewness can be estimated by x D n n id x i 7 Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

8 58 K.-S. CHENG, J.-L. CHIANG AND C.-W. HSU.5.3 Normal (N=).5.3 Normal (N=) Log-Normal (N=).5.3 Log-Normal (N=) EV (N=).7 EV (N=) PT3 (N=) PT3 (N=) LPT3 (N=) LPT3 (N=) Figure 4. Uncertainty in estimation of skewness coefficient reduces as sample size n increases. (The centre line represents mean of O and upper and lower lines are one standard deviation away from the centre line) n s D p x i x 2 n id n x i x 3 id O D n n n 2 s The coefficient of skewness is very sensitive to sample size n. Bobee and Robitaille (977) suggested using the following sample-size-adjusted coefficient of skewness for PT3 and LPT3 distributions ( O [n n ]/2 DO C 8Ð5 ) n 2 n Furthermore, from a total of N random samples, the sample mean and standard deviation of the above estimated parameters were calculated, with respect to sample size n ranging from 5 to 5, and listed in Tables III VII. Figures 2 4 demonstrate that, with and random samples (N D or ), sample means (the centre line) of the estimated parameters (including mean, standard deviation and coefficient of skewness) are very close to the theoretical values designated for RNG. It is also seen clearly that standard deviations of all parameter estimators decrease with increase of the sample size n, indicating the unbiased nature of the estimator and reduction of uncertainty in parameter estimation. Such characteristics of parameter estimators suggest the random Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

9 HYDROLOGICAL FREQUENCY ANALYSIS α= α=.3.2. Normal (µ=, σ=); N= α= α=.3.2 Normal (µ=, σ=); N= α= α=.3.2. LN (µy=,σy=); N= α= α=.3.2 LN (µy=, σy=); N= α= α=.3.2 EV (µ=, σ=); N= α= α=.3.2 EV (µ=, σ=); N= α= α= PT3 (µ=, σ=, γ=.5); N= α= α=.3.2 PT3 (µ=, σ=, γ=.5); N= α= α= LPT3 (µy=, σy=, γy=.5); N= sample size (n).2..9 α= α=.3.2. LPT3 (µy=, σy=, γy=.5); N= sample size (n) Figure 5. Type I error Ǫ of chi-square GOF test with respect to sample size n samples generated are indeed from the desired distributions. Type I error of GOF test Each random sample of size n is generated from a theoretical distribution with designated parameters and the GOF test can be applied to test whether the random sample is drawn from the theoretical distribution. The widely applied chi-square GOF test is adopted in this study. A random sample x,x 2,...,x n consists of n observed values of a hypothesized distribution. These observed values fall into k mutually exclusive categories, and the following statistic T D k O i E i 2 id E i has a chi-square distribution, for large n, with k degrees of freedom. In Equation (), O i and E i respectively represent the observed and theoretical expected Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

10 6 K.-S. CHENG, J.-L. CHIANG AND C.-W. HSU frequencies falling in the ith category. There are various criteria for determination of sample size n and number of categories k. It is usually felt that n should be large enough that no expected frequency is less than unity and no more than 2% of the expected frequencies are less than five (Milton and Arnold, 23). Mann and Wald (942) initiated a study of choice of categories and recommended that the categories be chosen to have equal probabilities under the hypothesized distribution. They found that, for a sample of size n (large) and significance level, the number of equiprobable categories should be approximately [ ] 2n k Ł 2 /5 D 4 c 2 2 where c is the standard normal deviate with exceedance probability. D Agostino and Stephens (986) further recommended that the number of equiprobable categories should fall between the k Ł value determined by Equation (2) for D Ð5 and half that value. Since the value of k Ł in Equation (2) increases slowly with and it overstates the number of categories required, the value of D Ð5 for the c calculation can be used for various levels of significance (D Agostino and Stephens, 986). To be more specific, c Ð5 D Ð645 and half the value of k Ł in Equation (2) is [ ] 2n k D Ð5k Ł 2 /5 D 2 c 2 D Ð88n 2/5 3 Therefore, k ³ 2n 2/5 is a convenient choice for the number of mutually exclusive, equiprobable categories for the chi-square GOF test and is adopted in this study. The null hypothesis of the chi-square GOF test assumes that the observed sample is drawn from the hypothesized distribution. The null hypothesis is rejected, at level of significance, if the value of the test statistic T calculated from the random sample x,x 2,...,x n exceeds,k 2, the ( )th quantile of the chi-square distribution with k degrees of freedom. Although all random samples are generated based on a theoretical distribution, there is no guarantee that all random samples will not be rejected by the GOF test, since the level of significance is imposed. If a random sample is rejected by the GOF test using the theoretical distribution as the hypothesized distribution, then a type I error is conducted. Theoretically, at level of significance, there will be % of the total number of random samples rejected by the GOF test. In practice, N r samples out of the totality are rejected and the probability of conducting a type I error is estimated as Ǫ D N r /N 4 As N increases, Ǫ should become increasingly close to the level of significance. Figure 5 demonstrates type I error Ǫ of the chi-square GOF test with respect to sample size n. With random samples, most values of Ǫ fall between Ð4 and Ð6 for D Ð5, and between Ð9 and Ð for D Ð. As the number of random samples N increases to, almost all values of Ǫ are very close to the level of significance. Such results indicate that random samples generated by the FQFT method do comply with the desired distributions. It can also be seen in Figure 5 that Ǫ fluctuates steadily with respect to sample size n, indicating that an increase in sample size does not help to stabilize the type I error Ǫ. CONCLUSIONS The proposed FQFT method is capable of generating random numbers of five distributions commonly used in hydrological frequency analysis. The ECDFs and the distribution parameters estimated from random samples are very close to, or indistinguishable from, theoretical values. Sample estimates of distribution parameters are unbiased, and estimation uncertainties reduce with increasing sample size. The type I error of the chi-square GOF test on the random samples generated fluctuates slightly around the level of significance. An advantage of the FQFT method is that it does not require CDF inversion and the frequency factor of the five commonly used distributions involves only the standard normal deviate. ACKNOWLEDGEMENTS We are grateful to the Council of Agriculture (Taiwan, ROC) for funding a project that led to the initiation of this study. REFERENCES Bobee B, Robitaille R The use of the Pearson type III and log Pearson type III distribution revisited. Water Resources Research 3(2): Chow VT. 95. A general formula for hydrologic frequency analysis. Transactions, American Geophysical Union 32: Chow VT, Maidment DR, Mays LW Applied Hydrology. McGraw-Hill: New York. D Agostino RB, Stephens MA Goodness-of-Fit Techniques. Marcel Dekker: New York. Devroye L Non-Uniform Random Variate Generation. Springer- Verlag: New York. Hellekalek P A note on pseudorandom number generators. EUROSIM Simulation News Europe 2: 6 8. Kite GW Frequency and Risk Analysis in Hydrology. Water Resources Publications. Larget B. 22. Random number generation. larget/math496/random.html (accessed 3 March 26). Mann HB, Wald A On the choice of the number of class intervals in the application of the chi-squared test. Annals of Mathematical Statistics 3: Milton JS, Arnold JC. 23. Introduction to Probability and Statistics. McGraw-Hill: New York. Mood AM, Graybill FA, Boes DC Introduction to the Theory of Statistics. McGraw-Hill: New York. National Research Council. 2. Risk Analysis and Uncertainty in Flood Damage Reduction Studies. National Academy Press. Press WH, Flannery BP, Teukolsky SA, Vetterling WT Numerical Recipes in C. The Art of Scientific Computing. Cambridge University Press: Copyright 26 John Wiley & Sons, Ltd. Hydrol. Process. 2, 5 6 (27) DOI: 2/hyp

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. Summary of the previous lecture Hydrologic data series for frequency

More information

ELEMENTS OF MONTE CARLO SIMULATION

ELEMENTS OF MONTE CARLO SIMULATION APPENDIX B ELEMENTS OF MONTE CARLO SIMULATION B. GENERAL CONCEPT The basic idea of Monte Carlo simulation is to create a series of experimental samples using a random number sequence. According to the

More information

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method Meng-Jie Lu 1 / Wei-Hua Zhong 1 / Yu-Xiu Liu 1 / Hua-Zhang Miao 1 / Yong-Chang Li 1 / Mu-Huo Ji 2 Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method Abstract:

More information

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :

More information

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 8-26-2016 On Some Test Statistics for Testing the Population Skewness and Kurtosis:

More information

GENERATION OF APPROXIMATE GAMMA SAMPLES BY PARTIAL REJECTION

GENERATION OF APPROXIMATE GAMMA SAMPLES BY PARTIAL REJECTION IASC8: December 5-8, 8, Yokohama, Japan GEERATIO OF APPROXIMATE GAMMA SAMPLES BY PARTIAL REJECTIO S.H. Ong 1 Wen Jau Lee 1 Institute of Mathematical Sciences, University of Malaya, 563 Kuala Lumpur, MALAYSIA

More information

Random Variables and Probability Distributions

Random Variables and Probability Distributions Chapter 3 Random Variables and Probability Distributions Chapter Three Random Variables and Probability Distributions 3. Introduction An event is defined as the possible outcome of an experiment. In engineering

More information

GENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy

GENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy GENERATION OF STANDARD NORMAL RANDOM NUMBERS Naveen Kumar Boiroju and M. Krishna Reddy Department of Statistics, Osmania University, Hyderabad- 500 007, INDIA Email: nanibyrozu@gmail.com, reddymk54@gmail.com

More information

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions ELE 525: Random Processes in Information Systems Hisashi Kobayashi Department of Electrical Engineering

More information

Strategies for Improving the Efficiency of Monte-Carlo Methods

Strategies for Improving the Efficiency of Monte-Carlo Methods Strategies for Improving the Efficiency of Monte-Carlo Methods Paul J. Atzberger General comments or corrections should be sent to: paulatz@cims.nyu.edu Introduction The Monte-Carlo method is a useful

More information

Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design

Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design Chapter 545 Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design Introduction This procedure calculates power and sample size of statistical tests of equivalence of two means

More information

Chapter 2 Uncertainty Analysis and Sampling Techniques

Chapter 2 Uncertainty Analysis and Sampling Techniques Chapter 2 Uncertainty Analysis and Sampling Techniques The probabilistic or stochastic modeling (Fig. 2.) iterative loop in the stochastic optimization procedure (Fig..4 in Chap. ) involves:. Specifying

More information

Robust Critical Values for the Jarque-bera Test for Normality

Robust Critical Values for the Jarque-bera Test for Normality Robust Critical Values for the Jarque-bera Test for Normality PANAGIOTIS MANTALOS Jönköping International Business School Jönköping University JIBS Working Papers No. 00-8 ROBUST CRITICAL VALUES FOR THE

More information

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS Science SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS Kalpesh S Tailor * * Assistant Professor, Department of Statistics, M K Bhavnagar University,

More information

Chapter 7. Inferences about Population Variances

Chapter 7. Inferences about Population Variances Chapter 7. Inferences about Population Variances Introduction () The variability of a population s values is as important as the population mean. Hypothetical distribution of E. coli concentrations from

More information

Approximate Variance-Stabilizing Transformations for Gene-Expression Microarray Data

Approximate Variance-Stabilizing Transformations for Gene-Expression Microarray Data Approximate Variance-Stabilizing Transformations for Gene-Expression Microarray Data David M. Rocke Department of Applied Science University of California, Davis Davis, CA 95616 dmrocke@ucdavis.edu Blythe

More information

A NEW POINT ESTIMATOR FOR THE MEDIAN OF GAMMA DISTRIBUTION

A NEW POINT ESTIMATOR FOR THE MEDIAN OF GAMMA DISTRIBUTION Banneheka, B.M.S.G., Ekanayake, G.E.M.U.P.D. Viyodaya Journal of Science, 009. Vol 4. pp. 95-03 A NEW POINT ESTIMATOR FOR THE MEDIAN OF GAMMA DISTRIBUTION B.M.S.G. Banneheka Department of Statistics and

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 7 Estimation: Single Population Copyright 010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-1 Confidence Intervals Contents of this chapter: Confidence

More information

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Statistics 431 Spring 2007 P. Shaman. Preliminaries Statistics 4 Spring 007 P. Shaman The Binomial Distribution Preliminaries A binomial experiment is defined by the following conditions: A sequence of n trials is conducted, with each trial having two possible

More information

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018 ` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.

More information

Continuous random variables

Continuous random variables Continuous random variables probability density function (f(x)) the probability distribution function of a continuous random variable (analogous to the probability mass function for a discrete random variable),

More information

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu

More information

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution Debasis Kundu 1, Rameshwar D. Gupta 2 & Anubhav Manglick 1 Abstract In this paper we propose a very convenient

More information

Stochastic model of flow duration curves for selected rivers in Bangladesh

Stochastic model of flow duration curves for selected rivers in Bangladesh Climate Variability and Change Hydrological Impacts (Proceedings of the Fifth FRIEND World Conference held at Havana, Cuba, November 2006), IAHS Publ. 308, 2006. 99 Stochastic model of flow duration curves

More information

On Some Statistics for Testing the Skewness in a Population: An. Empirical Study

On Some Statistics for Testing the Skewness in a Population: An. Empirical Study Available at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 12, Issue 2 (December 2017), pp. 726-752 Applications and Applied Mathematics: An International Journal (AAM) On Some Statistics

More information

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions SGSB Workshop: Using Statistical Data to Make Decisions Module 2: The Logic of Statistical Inference Dr. Tom Ilvento January 2006 Dr. Mugdim Pašić Key Objectives Understand the logic of statistical inference

More information

Specific Objectives. Be able to: Apply graphical frequency analysis for data that fit the Log- Pearson Type 3 Distribution

Specific Objectives. Be able to: Apply graphical frequency analysis for data that fit the Log- Pearson Type 3 Distribution CVEEN 4410: Engineering Hydrology (continued) : Topic and Goal: Use frequency analysis of historical data to forecast hydrologic events Specific Be able to: Apply graphical frequency analysis for data

More information

Tests for Two Independent Sensitivities

Tests for Two Independent Sensitivities Chapter 75 Tests for Two Independent Sensitivities Introduction This procedure gives power or required sample size for comparing two diagnostic tests when the outcome is sensitivity (or specificity). In

More information

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali Part I Descriptive Statistics 1 Introduction and Framework... 3 1.1 Population, Sample, and Observations... 3 1.2 Variables.... 4 1.2.1 Qualitative and Quantitative Variables.... 5 1.2.2 Discrete and Continuous

More information

Probability & Statistics

Probability & Statistics Probability & Statistics BITS Pilani K K Birla Goa Campus Dr. Jajati Keshari Sahoo Department of Mathematics Statistics Descriptive statistics Inferential statistics /38 Inferential Statistics 1. Involves:

More information

Much of what appears here comes from ideas presented in the book:

Much of what appears here comes from ideas presented in the book: Chapter 11 Robust statistical methods Much of what appears here comes from ideas presented in the book: Huber, Peter J. (1981), Robust statistics, John Wiley & Sons (New York; Chichester). There are many

More information

Business Statistics 41000: Probability 3

Business Statistics 41000: Probability 3 Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404

More information

2.1 Mathematical Basis: Risk-Neutral Pricing

2.1 Mathematical Basis: Risk-Neutral Pricing Chapter Monte-Carlo Simulation.1 Mathematical Basis: Risk-Neutral Pricing Suppose that F T is the payoff at T for a European-type derivative f. Then the price at times t before T is given by f t = e r(t

More information

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. Summary of the previous lecture Moments of a distribubon Measures of

More information

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz 1 EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS Rick Katz Institute for Mathematics Applied to Geosciences National Center for Atmospheric Research Boulder, CO USA email: rwk@ucar.edu

More information

A New Test for Correlation on Bivariate Nonnormal Distributions

A New Test for Correlation on Bivariate Nonnormal Distributions Journal of Modern Applied Statistical Methods Volume 5 Issue Article 8 --06 A New Test for Correlation on Bivariate Nonnormal Distributions Ping Wang Great Basin College, ping.wang@gbcnv.edu Ping Sa University

More information

Non-Inferiority Tests for the Ratio of Two Means

Non-Inferiority Tests for the Ratio of Two Means Chapter 455 Non-Inferiority Tests for the Ratio of Two Means Introduction This procedure calculates power and sample size for non-inferiority t-tests from a parallel-groups design in which the logarithm

More information

Copyright 2005 Pearson Education, Inc. Slide 6-1

Copyright 2005 Pearson Education, Inc. Slide 6-1 Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is

More information

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION International Days of Statistics and Economics, Prague, September -3, MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION Diana Bílková Abstract Using L-moments

More information

Diploma in Business Administration Part 2. Quantitative Methods. Examiner s Suggested Answers

Diploma in Business Administration Part 2. Quantitative Methods. Examiner s Suggested Answers Cumulative frequency Diploma in Business Administration Part Quantitative Methods Examiner s Suggested Answers Question 1 Cumulative Frequency Curve 1 9 8 7 6 5 4 3 1 5 1 15 5 3 35 4 45 Weeks 1 (b) x f

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information

M249 Diagnostic Quiz

M249 Diagnostic Quiz THE OPEN UNIVERSITY Faculty of Mathematics and Computing M249 Diagnostic Quiz Prepared by the Course Team [Press to begin] c 2005, 2006 The Open University Last Revision Date: May 19, 2006 Version 4.2

More information

Modelling catastrophic risk in international equity markets: An extreme value approach. JOHN COTTER University College Dublin

Modelling catastrophic risk in international equity markets: An extreme value approach. JOHN COTTER University College Dublin Modelling catastrophic risk in international equity markets: An extreme value approach JOHN COTTER University College Dublin Abstract: This letter uses the Block Maxima Extreme Value approach to quantify

More information

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates A point estimate is a single number, a confidence interval provides additional information about the variability of the estimate Lower

More information

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Chapter 7 Sampling Distributions and Point Estimation of Parameters Chapter 7 Sampling Distributions and Point Estimation of Parameters Part 1: Sampling Distributions, the Central Limit Theorem, Point Estimation & Estimators Sections 7-1 to 7-2 1 / 25 Statistical Inferences

More information

Using Monte Carlo Integration and Control Variates to Estimate π

Using Monte Carlo Integration and Control Variates to Estimate π Using Monte Carlo Integration and Control Variates to Estimate π N. Cannady, P. Faciane, D. Miksa LSU July 9, 2009 Abstract We will demonstrate the utility of Monte Carlo integration by using this algorithm

More information

may be of interest. That is, the average difference between the estimator and the truth. Estimators with Bias(ˆθ) = 0 are called unbiased.

may be of interest. That is, the average difference between the estimator and the truth. Estimators with Bias(ˆθ) = 0 are called unbiased. 1 Evaluating estimators Suppose you observe data X 1,..., X n that are iid observations with distribution F θ indexed by some parameter θ. When trying to estimate θ, one may be interested in determining

More information

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods ANZIAM J. 49 (EMAC2007) pp.c642 C665, 2008 C642 Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods S. Ahmad 1 M. Abdollahian 2 P. Zeephongsekul

More information

Probability and distributions

Probability and distributions 2 Probability and distributions The concepts of randomness and probability are central to statistics. It is an empirical fact that most experiments and investigations are not perfectly reproducible. The

More information

CABARRUS COUNTY 2008 APPRAISAL MANUAL

CABARRUS COUNTY 2008 APPRAISAL MANUAL STATISTICS AND THE APPRAISAL PROCESS PREFACE Like many of the technical aspects of appraising, such as income valuation, you have to work with and use statistics before you can really begin to understand

More information

Section 3.1: Discrete Event Simulation

Section 3.1: Discrete Event Simulation Section 3.1: Discrete Event Simulation Discrete-Event Simulation: A First Course c 2006 Pearson Ed., Inc. 0-13-142917-5 Discrete-Event Simulation: A First Course Section 3.1: Discrete Event Simulation

More information

ANALYSIS OF THE DISTRIBUTION OF INCOME IN RECENT YEARS IN THE CZECH REPUBLIC BY REGION

ANALYSIS OF THE DISTRIBUTION OF INCOME IN RECENT YEARS IN THE CZECH REPUBLIC BY REGION International Days of Statistics and Economics, Prague, September -3, 11 ANALYSIS OF THE DISTRIBUTION OF INCOME IN RECENT YEARS IN THE CZECH REPUBLIC BY REGION Jana Langhamrová Diana Bílková Abstract This

More information

ESTIMATION OF MODIFIED MEASURE OF SKEWNESS. Elsayed Ali Habib *

ESTIMATION OF MODIFIED MEASURE OF SKEWNESS. Elsayed Ali Habib * Electronic Journal of Applied Statistical Analysis EJASA, Electron. J. App. Stat. Anal. (2011), Vol. 4, Issue 1, 56 70 e-issn 2070-5948, DOI 10.1285/i20705948v4n1p56 2008 Università del Salento http://siba-ese.unile.it/index.php/ejasa/index

More information

Normal Probability Distributions

Normal Probability Distributions Normal Probability Distributions Properties of Normal Distributions The most important probability distribution in statistics is the normal distribution. Normal curve A normal distribution is a continuous

More information

Characteristics of measures of directional dependence - Monte Carlo studies

Characteristics of measures of directional dependence - Monte Carlo studies Characteristics of measures of directional dependence - Monte Carlo studies Alexander von Eye Richard P. DeShon Michigan State University Characteristics of measures of directional dependence - Monte Carlo

More information

Lecture 2. Probability Distributions Theophanis Tsandilas

Lecture 2. Probability Distributions Theophanis Tsandilas Lecture 2 Probability Distributions Theophanis Tsandilas Comment on measures of dispersion Why do common measures of dispersion (variance and standard deviation) use sums of squares: nx (x i ˆµ) 2 i=1

More information

Describing Uncertain Variables

Describing Uncertain Variables Describing Uncertain Variables L7 Uncertainty in Variables Uncertainty in concepts and models Uncertainty in variables Lack of precision Lack of knowledge Variability in space/time Describing Uncertainty

More information

Tests for One Variance

Tests for One Variance Chapter 65 Introduction Occasionally, researchers are interested in the estimation of the variance (or standard deviation) rather than the mean. This module calculates the sample size and performs power

More information

An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1

An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1 An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1 Guillermo Magnou 23 January 2016 Abstract Traditional methods for financial risk measures adopts normal

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrich Alfons Vasicek he amount of capital necessary to support a portfolio of debt securities depends on the probability distribution of the portfolio loss. Consider

More information

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD MAJOR POINTS Sampling distribution of the mean revisited Testing hypotheses: sigma known An example Testing hypotheses:

More information

Estimating the Natural Rate of Unemployment in Hong Kong

Estimating the Natural Rate of Unemployment in Hong Kong Estimating the Natural Rate of Unemployment in Hong Kong Petra Gerlach-Kristen Hong Kong Institute of Economics and Business Strategy May, Abstract This paper uses unobserved components analysis to estimate

More information

Two-Sample Z-Tests Assuming Equal Variance

Two-Sample Z-Tests Assuming Equal Variance Chapter 426 Two-Sample Z-Tests Assuming Equal Variance Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample z-tests when the variances of the two groups

More information

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Confidence Intervals for the Difference Between Two Means with Tolerance Probability Chapter 47 Confidence Intervals for the Difference Between Two Means with Tolerance Probability Introduction This procedure calculates the sample size necessary to achieve a specified distance from the

More information

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay Solutions to Final Exam Problem A: (42 pts) Answer briefly the following questions. 1. Questions

More information

STATISTICS and PROBABILITY

STATISTICS and PROBABILITY Introduction to Statistics Atatürk University STATISTICS and PROBABILITY LECTURE: PROBABILITY DISTRIBUTIONS Prof. Dr. İrfan KAYMAZ Atatürk University Engineering Faculty Department of Mechanical Engineering

More information

The histogram should resemble the uniform density, the mean should be close to 0.5, and the standard deviation should be close to 1/ 12 =

The histogram should resemble the uniform density, the mean should be close to 0.5, and the standard deviation should be close to 1/ 12 = Chapter 19 Monte Carlo Valuation Question 19.1 The histogram should resemble the uniform density, the mean should be close to.5, and the standard deviation should be close to 1/ 1 =.887. Question 19. The

More information

Simulation Wrap-up, Statistics COS 323

Simulation Wrap-up, Statistics COS 323 Simulation Wrap-up, Statistics COS 323 Today Simulation Re-cap Statistics Variance and confidence intervals for simulations Simulation wrap-up FYI: No class or office hours Thursday Simulation wrap-up

More information

Bayesian Inference for Volatility of Stock Prices

Bayesian Inference for Volatility of Stock Prices Journal of Modern Applied Statistical Methods Volume 3 Issue Article 9-04 Bayesian Inference for Volatility of Stock Prices Juliet G. D'Cunha Mangalore University, Mangalagangorthri, Karnataka, India,

More information

Characterization of the Optimum

Characterization of the Optimum ECO 317 Economics of Uncertainty Fall Term 2009 Notes for lectures 5. Portfolio Allocation with One Riskless, One Risky Asset Characterization of the Optimum Consider a risk-averse, expected-utility-maximizing

More information

Monte Carlo Methods for Uncertainty Quantification

Monte Carlo Methods for Uncertainty Quantification Monte Carlo Methods for Uncertainty Quantification Abdul-Lateef Haji-Ali Based on slides by: Mike Giles Mathematical Institute, University of Oxford Contemporary Numerical Techniques Haji-Ali (Oxford)

More information

Frequency Distribution Models 1- Probability Density Function (PDF)

Frequency Distribution Models 1- Probability Density Function (PDF) Models 1- Probability Density Function (PDF) What is a PDF model? A mathematical equation that describes the frequency curve or probability distribution of a data set. Why modeling? It represents and summarizes

More information

Choice Probabilities. Logit Choice Probabilities Derivation. Choice Probabilities. Basic Econometrics in Transportation.

Choice Probabilities. Logit Choice Probabilities Derivation. Choice Probabilities. Basic Econometrics in Transportation. 1/31 Choice Probabilities Basic Econometrics in Transportation Logit Models Amir Samimi Civil Engineering Department Sharif University of Technology Primary Source: Discrete Choice Methods with Simulation

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

Probability distributions relevant to radiowave propagation modelling

Probability distributions relevant to radiowave propagation modelling Rec. ITU-R P.57 RECOMMENDATION ITU-R P.57 PROBABILITY DISTRIBUTIONS RELEVANT TO RADIOWAVE PROPAGATION MODELLING (994) Rec. ITU-R P.57 The ITU Radiocommunication Assembly, considering a) that the propagation

More information

Statistical Methodology. A note on a two-sample T test with one variance unknown

Statistical Methodology. A note on a two-sample T test with one variance unknown Statistical Methodology 8 (0) 58 534 Contents lists available at SciVerse ScienceDirect Statistical Methodology journal homepage: www.elsevier.com/locate/stamet A note on a two-sample T test with one variance

More information

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is: **BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,

More information

Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions

Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions Pandu Tadikamalla, 1 Mihai Banciu, 1 Dana Popescu 2 1 Joseph M. Katz Graduate School of Business, University

More information

Confidence Intervals for One-Sample Specificity

Confidence Intervals for One-Sample Specificity Chapter 7 Confidence Intervals for One-Sample Specificity Introduction This procedures calculates the (whole table) sample size necessary for a single-sample specificity confidence interval, based on a

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Chapter 14 : Statistical Inference 1 Chapter 14 : Introduction to Statistical Inference Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Data x

More information

A Comparison of Univariate Probit and Logit. Models Using Simulation

A Comparison of Univariate Probit and Logit. Models Using Simulation Applied Mathematical Sciences, Vol. 12, 2018, no. 4, 185-204 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2018.818 A Comparison of Univariate Probit and Logit Models Using Simulation Abeer

More information

Non-Inferiority Tests for Two Means in a 2x2 Cross-Over Design using Differences

Non-Inferiority Tests for Two Means in a 2x2 Cross-Over Design using Differences Chapter 510 Non-Inferiority Tests for Two Means in a 2x2 Cross-Over Design using Differences Introduction This procedure computes power and sample size for non-inferiority tests in 2x2 cross-over designs

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Lecture 6: Non Normal Distributions

Lecture 6: Non Normal Distributions Lecture 6: Non Normal Distributions and their Uses in GARCH Modelling Prof. Massimo Guidolin 20192 Financial Econometrics Spring 2015 Overview Non-normalities in (standardized) residuals from asset return

More information

The Fallacy of Large Numbers

The Fallacy of Large Numbers The Fallacy of Large umbers Philip H. Dybvig Washington University in Saint Louis First Draft: March 0, 2003 This Draft: ovember 6, 2003 ABSTRACT Traditional mean-variance calculations tell us that the

More information

DRAFT. Half-Mack Stochastic Reserving. Frank Cuypers, Simone Dalessi. July 2013

DRAFT. Half-Mack Stochastic Reserving. Frank Cuypers, Simone Dalessi. July 2013 Abstract Half-Mack Stochastic Reserving Frank Cuypers, Simone Dalessi July 2013 We suggest a stochastic reserving method, which uses the information gained from statistical reserving methods (such as the

More information

2018 AAPM: Normal and non normal distributions: Why understanding distributions are important when designing experiments and analyzing data

2018 AAPM: Normal and non normal distributions: Why understanding distributions are important when designing experiments and analyzing data Statistical Failings that Keep Us All in the Dark Normal and non normal distributions: Why understanding distributions are important when designing experiments and Conflict of Interest Disclosure I have

More information

Value at Risk and Self Similarity

Value at Risk and Self Similarity Value at Risk and Self Similarity by Olaf Menkens School of Mathematical Sciences Dublin City University (DCU) St. Andrews, March 17 th, 2009 Value at Risk and Self Similarity 1 1 Introduction The concept

More information

Confidence Intervals for Paired Means with Tolerance Probability

Confidence Intervals for Paired Means with Tolerance Probability Chapter 497 Confidence Intervals for Paired Means with Tolerance Probability Introduction This routine calculates the sample size necessary to achieve a specified distance from the paired sample mean difference

More information

A Scenario Based Method for Cost Risk Analysis

A Scenario Based Method for Cost Risk Analysis A Scenario Based Method for Cost Risk Analysis Paul R. Garvey The MITRE Corporation MP 05B000003, September 005 Abstract This paper presents an approach for performing an analysis of a program s cost risk.

More information

Operational Risk Aggregation

Operational Risk Aggregation Operational Risk Aggregation Professor Carol Alexander Chair of Risk Management and Director of Research, ISMA Centre, University of Reading, UK. Loss model approaches are currently a focus of operational

More information

Power of t-test for Simple Linear Regression Model with Non-normal Error Distribution: A Quantile Function Distribution Approach

Power of t-test for Simple Linear Regression Model with Non-normal Error Distribution: A Quantile Function Distribution Approach Available Online Publications J. Sci. Res. 4 (3), 609-622 (2012) JOURNAL OF SCIENTIFIC RESEARCH www.banglajol.info/index.php/jsr of t-test for Simple Linear Regression Model with Non-normal Error Distribution:

More information

Web Extension: Continuous Distributions and Estimating Beta with a Calculator

Web Extension: Continuous Distributions and Estimating Beta with a Calculator 19878_02W_p001-008.qxd 3/10/06 9:51 AM Page 1 C H A P T E R 2 Web Extension: Continuous Distributions and Estimating Beta with a Calculator This extension explains continuous probability distributions

More information

Statistics for Managers Using Microsoft Excel 7 th Edition

Statistics for Managers Using Microsoft Excel 7 th Edition Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 7 Sampling Distributions Statistics for Managers Using Microsoft Excel 7e Copyright 2014 Pearson Education, Inc. Chap 7-1 Learning Objectives

More information

EMH vs. Phenomenological models. Enrico Scalas (DISTA East-Piedmont University)

EMH vs. Phenomenological models. Enrico Scalas (DISTA East-Piedmont University) EMH vs. Phenomenological models Enrico Scalas (DISTA East-Piedmont University) www.econophysics.org Summary Efficient market hypothesis (EMH) - Rational bubbles - Limits and alternatives Phenomenological

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

Fat Tailed Distributions For Cost And Schedule Risks. presented by:

Fat Tailed Distributions For Cost And Schedule Risks. presented by: Fat Tailed Distributions For Cost And Schedule Risks presented by: John Neatrour SCEA: January 19, 2011 jneatrour@mcri.com Introduction to a Problem Risk distributions are informally characterized as fat-tailed

More information

EMPIRICAL EVIDENCE ON ARBITRAGE BY CHANGING THE STOCK EXCHANGE

EMPIRICAL EVIDENCE ON ARBITRAGE BY CHANGING THE STOCK EXCHANGE Advances and Applications in Statistics Volume, Number, This paper is available online at http://www.pphmj.com 9 Pushpa Publishing House EMPIRICAL EVIDENCE ON ARBITRAGE BY CHANGING THE STOCK EXCHANGE JOSÉ

More information