INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Similar documents
INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Continuous random variables

Hydrology 4410 Class 29. In Class Notes & Exercises Mar 27, 2013

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz

Describing Uncertain Variables

Frequency Distribution Models 1- Probability Density Function (PDF)

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

Statistics & Flood Frequency Chapter 3. Dr. Philip B. Bedient

Simulation of probability distributions commonly used in hydrological frequency analysis

ELEMENTS OF MONTE CARLO SIMULATION

Specific Objectives. Be able to: Apply graphical frequency analysis for data that fit the Log- Pearson Type 3 Distribution

Probability and Statistics

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Commonly Used Distributions

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 10: Continuous RV Families. Prof. Vince Calhoun

Stochastic model of flow duration curves for selected rivers in Bangladesh

CHAPTERS 5 & 6: CONTINUOUS RANDOM VARIABLES

Chapter 7 1. Random Variables

Bivariate Birnbaum-Saunders Distribution

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

On Stochastic Evaluation of S N Models. Based on Lifetime Distribution

MAS187/AEF258. University of Newcastle upon Tyne

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

Probability analysis of return period of daily maximum rainfall in annual data set of Ludhiana, Punjab

Introduction to Population Modeling

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making

Some Characteristics of Data

Chapter 4 Continuous Random Variables and Probability Distributions

Chapter 2 Uncertainty Analysis and Sampling Techniques

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Practice Exam 1. Loss Amount Number of Losses

Risk Reduction Potential

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

Business Statistics 41000: Probability 3

1. You are given the following information about a stationary AR(2) model:

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Slides for Risk Management

2. The sum of all the probabilities in the sample space must add up to 1

Impact of Weekdays on the Return Rate of Stock Price Index: Evidence from the Stock Exchange of Thailand

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

Statistics for Managers Using Microsoft Excel 7 th Edition

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x)

IEOR 3106: Introduction to OR: Stochastic Models. Fall 2013, Professor Whitt. Class Lecture Notes: Tuesday, September 10.

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

Point Estimation. Copyright Cengage Learning. All rights reserved.

Definition 9.1 A point estimate is any function T (X 1,..., X n ) of a random sample. We often write an estimator of the parameter θ as ˆθ.

Modelling Environmental Extremes

Discrete Random Variables

Appendix A: Introduction to Probabilistic Simulation

Fundamentals of Statistics

Random Variables Handout. Xavier Vilà

STATISTICS and PROBABILITY

Modelling Environmental Extremes

Statistics 6 th Edition

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION

4-1. Chapter 4. Commonly Used Distributions by The McGraw-Hill Companies, Inc. All rights reserved.

Continuous Distributions

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

STOR Lecture 15. Jointly distributed Random Variables - III

LAST SECTION!!! 1 / 36

Applications of Good s Generalized Diversity Index. A. J. Baczkowski Department of Statistics, University of Leeds Leeds LS2 9JT, UK

6. Continous Distributions

VI. Continuous Probability Distributions

Chapter 3 Common Families of Distributions. Definition 3.4.1: A family of pmfs or pdfs is called exponential family if it can be expressed as

Chapter 2. Random variables. 2.3 Expectation

Contents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii)

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Lecture 3: Probability Distributions (cont d)

Statistical Modeling Techniques for Reserve Ranges: A Simulation Approach

Prepared By. Handaru Jati, Ph.D. Universitas Negeri Yogyakarta.

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

Monte Carlo Methods for Uncertainty Quantification

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

ANALYSIS OF THE DISTRIBUTION OF INCOME IN RECENT YEARS IN THE CZECH REPUBLIC BY REGION

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

MLC at Boise State Logarithms Activity 6 Week #8

Chapter 4 Continuous Random Variables and Probability Distributions

Statistics and Their Distributions

Calculating VaR. There are several approaches for calculating the Value at Risk figure. The most popular are the

What was in the last lecture?

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Chapter 6 Simple Correlation and

Statistical Tables Compiled by Alan J. Terry

Chapter 4 Random Variables & Probability. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

CS 237: Probability in Computing

Logarithmic and Exponential Functions

MAS3904/MAS8904 Stochastic Financial Modelling

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

M249 Diagnostic Quiz

Transcription:

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Summary of the previous lecture Hydrologic data series for frequency analysis Complete duration series Partial duration series Annual exceedence series Extreme value series Extreme value distributions Frequency factors 2

Frequency Analysis Frequency factor for Normal Distribution: x T K T = µ + K σ = x T T µ σ K T 2 2.515517 + 0.802853w+ 0.010328w = w 1 + 1.432788w + 0.189269w + 0.001308w 2 3 w 1 = ln 2 p 1 2 0 < p < 0.5 3

Example 1 Consider the annual maximum discharge Q in cumec, of a river for 45 years : Year Q Year Q Year Q Year Q 1950 804 1961 507 1972 1651 1983 1254 1951 1090 1962 1303 1973 716 1984 430 1952 1580 1963 197 1974 286 1985 260 1953 487 1964 583 1975 671 1986 276 1954 719 1965 377 1976 3069 1987 1657 1955 140 1966 348 1977 306 1988 937 1956 1583 1967 804 1978 116 1989 714 1957 1642 1968 328 1979 162 1990 855 1958 1586 1969 245 1980 425 1991 399 1959 218 1970 140 1981 1982 1992 1543 1960 623 1971 49 1982 277 1993 360 1994 348 4

Example 1 (Contd.) Mean, x = 756.6 cumec Standard deviation, s = 639.5 cumec Determine the frequency factor and obtain the maximum annual discharge value corresponding to 20 year return period using Normal distribution. 5

Example 1 (Contd.) T = 20 p = 1/20 = 0.05 w 1 = ln 2 p 1 2 1 ln 0.05 = 2.45 = 2 1 2 6

Example 1 (Contd.) K 2 2.515517 + 0.802853w+ 0.01032w = w 1 + 1.432788w + 0.189269w + 0.001308w 2 2.515517 + 0.802853 2.45 + 0.01032 2.45 = 2.45 1+ 1.432788 2.45 + 0.189269 2.45 + 0.001308 2.45 20 2 3 = 1.648 2 3 x = x+ K s 20 20 = 756.6 + 1.648 639.5 = 1810.5 cumec 7

Frequency factor for Extreme Value Type I (EV I) Distribution: To express T in terms of K T, 8 Frequency Analysis 6 0.5772 ln ln 1 T T K T π = + 1 1 exp exp 0.5772 6 T T π K = + Ref: Applied Hydrology by V.T.Chow, D.R.Maidment, L.W.Mays, McGraw-Hill 1998

When x T = µ in the equation ; K T = 0 Substituting K T = 0, i.e., the return period of mean of a EV I is 2.33 years 9 Frequency Analysis T T x K µ σ = 1 0 1 exp exp 0.5772 6 2.33 T π = + = years T T x K µ σ = +

Example 2 Consider the annual maximum discharge of a river for 45 years, given in the previous example. Mean, x = 756.6 cumec Standard deviation, s = 639.5 cumec Determine the frequency factor and obtain the maximum annual discharge value corresponding to 20 year return period using Extreme Value Type I (EV I) distribution. 10

Example 2 (Contd.) T T = 20 years K T 6 T = 0.5772 + ln ln π T 1 6 20 = 0.5772 + ln ln π 20 1 = 1.866 x = x+ K s T = 756.6 + 1.866 639.5 = 1949.9 cumec 11

Frequency Analysis Frequency factor for Log Pearson Type III Distribution: The PDF is f ( x) = β λ where y = log x ( y ε) xγ( β ) ( ) β 1 λ y ε e log x ε The data is converted to the logarithmic series by {y} = log {x}. 12

Frequency Analysis The mean y, standard deviation s y, and the coefficient of skewness C s are calculated for the converted logarithmic series {y} The frequency factor for the log Pearson Type III distribution depends on the return period and coefficient of skewness 13

Frequency Analysis When C s = 0, the frequency factor is equal to the standard normal deviate z and is calculated as in case of Normal distribution. When C s 0, K T is calculated by (Kite, 1977) ( ) 1 K ( ) ( ) T = z+ z 1 k+ z 6z k z 1 k 3 4 1 5 + zk + k 3 where k = C s / 6 2 3 2 2 3 Ref: Kite, G. W., Frequency and Risk Analysis in Hydrology, Water Resources Publications, Fort Collins, Colorado, 1977 14

Example 3 Consider the annual maximum discharge of a river for 45 years given in the previous example. Calculate the frequency factor and obtain the maximum annual discharge value corresponding to 20 year return period using Log Person Type III distribution. The logarithmic data series is first obtained. 15

Example 3 (Contd.) Logarithmic values of the data given in the previous example: Year Log Q Year Log Q Year Log Q Year Log Q 1950 2.905 1961 2.705 1972 3.218 1983 3.098 1951 3.037 1962 3.115 1973 2.855 1984 2.633 1952 3.199 1963 2.294 1974 2.456 1985 2.415 1953 2.688 1964 2.766 1975 2.827 1986 2.441 1954 2.857 1965 2.576 1976 3.487 1987 3.219 1955 2.146 1966 2.542 1977 2.486 1988 2.972 1956 3.199 1967 2.905 1978 2.064 1989 2.854 1957 3.215 1968 2.516 1979 2.210 1990 2.932 1958 3.200 1969 2.389 1980 2.628 1991 2.601 1959 2.338 1970 2.146 1981 3.297 1992 3.188 1960 2.794 1971 1.690 1982 2.442 1993 2.556 1994 2.542 16

Example 3 (Contd.) The mean, y = 2.725 cumec Standard deviation, s = 0.388 cumec Coefficient of skewness C s = -0.2664 T = 20 years w 1 = ln 2 p 1 2 1 ln 0.05 = 2.45 = 2 1 2 17

Example 3 (Contd.) z 2 2.515517 + 0.802853w+ 0.01032w = w 2 3 1 + 1.432788w + 0.189269w + 0.001308w 2 2.515517 + 0.802853 2.45 + 0.01032 2.45 = 2.45 1+ 1.432788 2.45 + 0.189269 2.45 + 0.001308 2.45 2 3 = 1.648 k = C s / 6 = -0.2664/6 = -0.0377 18

Example 3 (Contd.) 1 K T = 1.648 + 1.648 1 0.0377 + 1.648 6 1.648 0.0377 3 2 3 4 1 5 ( 1.648 1)( 0.0377) + 1.648 ( 0.0377) + ( 0.0377) 3 = 1.581 x = x + K s T 2 3 ( )( ) ( )( ) T = 756.6 + 1.581 639.5 = 1767.6 cumec 2 19

PROBABILITY PLOTTING

Probability Plotting Probability plotting is a method to check whether a probability distribution fits a set of data or not. The data is plotted on specially designed probability paper. When the cumulative distribution function (CDF) F (x), is plotted on arithmetic paper versus the value of RV X, usually a straight line does not result. To obtain a straight line on arithmetic paper, F(x) would have to be given by expression F(x) = ax+b or f(x) = a; which is an uniform distribution 21

Probability Plotting i.e., if a CDF of a set of data plots a straight line on arithmetic paper, the data follows uniform distribution. The probability paper for a given distribution can be developed so that the cumulative distribution plots as a straight line on the paper. 22

Probability Plotting Constructing probability paper is a process of transforming the arithmetic scale to the probability scale so that the resulting cumulative distribution plot is a straight line The plot is prepared with exceedence probability or the return period T on abscissa and the magnitude of the event on ordinate. 23

Probability Plotting Construction of Probability paper: Mathematical construction. Graphical construction 24

Probability Plotting Mathematical construction: For some probability distributions, probability paper can be constructed analytically so that the cumulative distribution function plots a straight line, on the paper. This can be achieved by transforming the cumulative distribution function to the form Y = az + b where Y is a function of parameters and F(x), Z is a function of parameters and x, a and b are functions of parameters. 25

Probability Plotting Exponential distribution: F x = e λx ( ) 1 which can be written as, { ( )} ln 1 F x =λx Y = az + b Comparing with Y = ln 1 F x, Z = x, a= λ and b= 0 { ( )} Y is plotted against Z and the corresponding values of F(x) and x are used to label the axes 26

Example 4 Construct probability paper for exponential distribution with λ = 1/3 Soln: 1. The values of F(x) are assumed and corresponding x, Y and Z values are calculated. 2. Y is plotted against Z and the Y axis is labeled with the corresponding value of F(x) and the Z axis with corresponding value of x. 27

Example 4 (Contd.) F(x) Y x = Z 0.01 0.010 0.030 0.05 0.051 0.154 0.1 0.105 0.316 0.2 0.223 0.669 0.3 0.357 1.070 0.4 0.511 1.532 0.5 0.693 2.079 0.6 0.916 2.749 0.7 1.204 3.612 0.8 1.609 4.828 0.9 2.303 6.908 0.95 2.996 8.987 0.99 4.605 13.816 28

Example 4 (Contd.) Y is plotted against Z : 5 4 Y 3 2 1 0 0 3 6 9 12 15 Z 29

Example 4 (Contd.) Y axis labeled with F(x) and Z with x : 0.99 0.95 0.9 F(x) 0.8 0.7 0.5 0.1 X = 30

Example 4 (Contd.) Probability paper for exponential distribution: 0.99 F(x) 0.95 0.9 0.8 0.7 0.5 0.1 0 3 6 9 12 15 X 31

Example 4 (Contd.) Probability paper for exponential distribution: 0.99 F(x) 0.95 λ 2 λ 1 0.9 0.8 0.7 0.5 0.1 0 3 6 9 12 15 X 32

Probability Plotting Any exponential distribution data will plot as a straight line. The slope of the line will change as λ changes. Slope of the line gives the λ value. For many probability distributions, the same graph paper may be used for all values of the parameters of the distribution. For some distributions like gamma, a separate graph paper is required for different values of the parameters. Many types of probability papers are commercially available. 33

Probability Plotting Graphical construction: Graphical construction is done by transforming the arithmetic scale to probability scale so that a straight line is obtained when cumulative distribution function is plotted The transformation technique is explained with the normal distribution. Consider the coordinates from the standardized normal distribution table. 34

Normal Distribution Tables z 0 2 4 6 8 0 0 0.008 0.016 0.0239 0.0319 0.1 0.0398 0.0478 0.0557 0.0636 0.0714 0.2 0.0793 0.0871 0.0948 0.1026 0.1103 0.3 0.1179 0.1255 0.1331 0.1406 0.148 0.4 0.1554 0.1628 0.17 0.1772 0.1844 0.5 0.1915 0.1985 0.2054 0.2123 0.219 0.6 0.2257 0.2324 0.2389 0.2454 0.2517 0.7 0.258 0.2642 0.2704 0.2764 0.2823 0.8 0.2881 0.2939 0.2995 0.3051 0.3106 0.9 0.3159 0.3212 0.3264 0.3315 0.3365 1 0.3413 0.3461 0.3508 0.3554 0.3599 z 35

Probability Plotting Normal distribution table: Z F(z) -3 0.0013-2.5 0.0062-2 0.0227-1.5 0.0668-1 0.1587-0.5 0.3085 Z F(z) 0 0.5 0.5 0.6915 1 0.8413 1.5 0.9332 2 0.9772 2.5 0.9938 3 0.9987 36

Probability Plotting Arithmetic scale plot: 3 2 1 Z 0-1 - 2-3 0 0.1 0.2 0.3 0.4 F(Z) 0.5 0.6 0.7 0.8 0.9 1 37

Probability Plotting Transformation plot: F(Z) 0.01 0.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 38

Probability Plotting Normal probability paper (probabilities in percentage) : 99.9 99.0 95.0 90.0 80.0 70.0 50.0 30.0 10.0 5.0 1.0 1.0 Redrawn from source: http://www.weibull.com/gpaper/ 50.0 99.9 39

Probability Plotting The purpose of using the probability paper is to linearize the probability relationship The plot can be used for interpolation, extrapolation and comparison purposes. The plot can also be used for estimating magnitudes with other return periods. If the plot is used for extrapolation, the effect of various errors is often magnified. 40

Plotting Position Plotting position is a simple empirical technique Relation between the magnitude of an event verses its probability of exceedence. Plotting position refers to the probability value assigned to each of the data to be plotted Several empirical methods to determine the plotting positions. Arrange the given series of data in descending order Assign a order number to each of the data (termed as rank of the data) 41

Plotting Position First entry as 1, second as 2 etc. Let n is the total no. of values to be plotted and m is the rank of a value, the exceedence probability (p) of the m th largest value is obtained by various formulae. The return period (T) of the event is calculated by T = 1/p Compute T for all the events Plot T verses the magnitude of event on semi log or log log paper 42

Plotting Position Formulae for exceedence probability: California Method: P X x = ( ) m m n Limitations Produces a probability of 100% for m = n 43

Plotting Position Modification to California Method: P X x = ( ) m m 1 n Limitations Formula does not produce 100% probability If m = 1, probability is zero 44

Plotting Position Hazen s formula: P X x = ( ) m m 0.5 n Chegodayev s formula: P X m 0.3 x = m n + 0.4 ( ) Widely used in U.S.S.R and Eastern European countries 45

Plotting Position Weibull s formula: Most commonly used method If n values are distributed uniformly between 0 and 100 percent probability, then there must be n+1 intervals, n 1 between the data points and 2 at the ends. P X m xm = n + 1 ( ) Indicates a return period T one year longer than the period of record for the largest value 46

Plotting Position Most plotting position formulae are represented by: P X m b x = m n + 1 2b ( ) Where b is a parameter E.g., b = 0.5 for Hazen s formula, b = 0.5 for Chegodayev s formula, b = 0 for Weibull s formula b = 3/8 0.5 for Blom s formula b = 1/3 0.5 for Tukey s formula b = 0.44 0.5 for Gringorten s formula 47

Plotting Position Cunnane (1978) studied the various available plotting position methods based on unbiasedness and minimum variance criteria. If large number of equally sized samples are plotted, the average of the plotted points foe each value of m lie on the theoretical distribution line. Minimum variance plotting minimizes the variance of the plotted points about the theoretical line. Cunnane concluded that the Weibull s formula is biased and plots the largest values of a sample at too small a return period. 48

Plotting Position For normally distributed data, the best formula is Blom s plotting position formula (b = 3/8). For Extreme Value Type I distribution, the Gringorten formula (b = 0.44) is the best. 49

Example 3 Consider the annual maximum discharge of a river for 45years, plot the data using Weibull s formula Year Data Year Data Year Data Year Data 1950 804 1961 507 1972 1651 1983 1254 1951 1090 1962 1303 1973 716 1984 430 1952 1580 1963 197 1974 286 1985 260 1953 487 1964 583 1975 671 1986 276 1954 719 1965 377 1976 3069 1987 1657 1955 140 1966 348 1977 306 1988 937 1956 1583 1967 804 1978 116 1989 714 1957 1642 1968 328 1979 162 1990 855 1958 1586 1969 245 1980 425 1991 399 1959 218 1970 140 1981 1982 1992 1543 1960 623 1971 49 1982 277 1993 360 1994 348 50

Example 3 (Contd.) The data is arranged in descending order Rank is assigned to the arranged data The probability is obtained using P X m xm = n + 1 ( ) Return period is calculated The maximum annual discharge verses the return period is plotted 51

Example 3 (Contd.) Year Annual Max. Q Arranged data Rank (m) P(X > x m ) T 1950 804 3069 1 0.021739 46 1951 1090 1982 2 0.043478 23 1952 1580 1657 3 0.065217 15.33333 1953 487 1651 4 0.086957 11.5 1954 719 1642 5 0.108696 9.2 1955 140 1586 6 0.130435 7.666667 1956 1583 1583 7 0.152174 6.571429 1957 1642 1580 8 0.173913 5.75 1958 1586 1543 9 0.195652 5.111111 1959 218 1303 10 0.217391 4.6 1960 623 1254 11 0.23913 4.181818 52

Example 3 (Contd.) Annual Maximum discharge (Q) 1500 150 15 1 10 100 Return period (T) 53