Statistics & Flood Frequency Chapter 3. Dr. Philip B. Bedient

Similar documents
Describing Uncertain Variables

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

4.3 Normal distribution

Chapter 7 1. Random Variables

Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

Statistics 6 th Edition

Some Characteristics of Data

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Basic Procedure for Histograms

Chapter 6 Continuous Probability Distributions. Learning objectives

Commonly Used Distributions

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

Chapter 2. Random variables. 2.3 Expectation

Chapter 8: The Binomial and Geometric Distributions

Hydrology 4410 Class 29. In Class Notes & Exercises Mar 27, 2013

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

Topic 6 - Continuous Distributions I. Discrete RVs. Probability Density. Continuous RVs. Background Reading. Recall the discrete distributions

Frequency Distribution Models 1- Probability Density Function (PDF)

2011 Pearson Education, Inc

4-2 Probability Distributions and Probability Density Functions. Figure 4-2 Probability determined from the area under f(x).

1/2 2. Mean & variance. Mean & standard deviation

Continuous random variables

AP Statistics Ch 8 The Binomial and Geometric Distributions

Statistics for Managers Using Microsoft Excel 7 th Edition

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

4 Random Variables and Distributions

8.1 Binomial Distributions

Chapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Chapter 4 Continuous Random Variables and Probability Distributions

Random Variables and Probability Distributions

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

MA : Introductory Probability

Probability Models.S2 Discrete Random Variables

Random Variables Handout. Xavier Vilà

Why Pooling Works. CAJPA Spring Mujtaba Datoo Actuarial Practice Leader, Public Entities Aon Global Risk Consulting

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Statistical Tables Compiled by Alan J. Terry

Favorite Distributions

Examples of continuous probability distributions: The normal and standard normal

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

Discrete Random Variables and Probability Distributions

Continuous Distributions

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

Theoretical Foundations

Chapter 8 Homework Solutions Compiled by Joe Kahlig. speed(x) freq 25 x < x < x < x < x < x < 55 5

The normal distribution is a theoretical model derived mathematically and not empirically.

INF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9

5.2 Random Variables, Probability Histograms and Probability Distributions

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

MAS187/AEF258. University of Newcastle upon Tyne

Lecture 3: Probability Distributions (cont d)

Business Statistics 41000: Probability 3

Definition 9.1 A point estimate is any function T (X 1,..., X n ) of a random sample. We often write an estimator of the parameter θ as ˆθ.

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 10: Continuous RV Families. Prof. Vince Calhoun

Chapter 6: Random Variables. Ch. 6-3: Binomial and Geometric Random Variables

Converting to the Standard Normal rv: Exponential PDF and CDF for x 0 Chapter 7: expected value of x

Engineering Statistics ECIV 2305

CVE SOME DISCRETE PROBABILITY DISTRIBUTIONS

Normal populations. Lab 9: Normal approximations for means STT 421: Summer, 2004 Vince Melfi

INF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9

Review. Binomial random variable

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Exam M Fall 2005 PRELIMINARY ANSWER KEY

Session Window. Variable Name Row. Worksheet Window. Double click on MINITAB icon. You will see a split screen: Getting Started with MINITAB

PROBABILITY DISTRIBUTIONS

2.1 Properties of PDFs

NORMAL APPROXIMATION. In the last chapter we discovered that, when sampling from almost any distribution, e r2 2 rdrdϕ = 2π e u du =2π.

x is a random variable which is a numerical description of the outcome of an experiment.

Model Paper Statistics Objective. Paper Code Time Allowed: 20 minutes

Random variables. Contents

Engineering Mathematics III. Moments

. (i) What is the probability that X is at most 8.75? =.875

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Welcome to Stat 410!

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Lecture 2. Probability Distributions Theophanis Tsandilas

Central Limit Theorem, Joint Distributions Spring 2018

Appendix A. Selecting and Using Probability Distributions. In this appendix

Review of the Topics for Midterm I

The Bernoulli distribution

Chapter 2: Random Variables (Cont d)

VI. Continuous Probability Distributions

Specific Objectives. Be able to: Apply graphical frequency analysis for data that fit the Log- Pearson Type 3 Distribution

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

Two Hours. Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER. 22 January :00 16:00

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Chapter 4 Continuous Random Variables and Probability Distributions

A useful modeling tricks.

Probability. An intro for calculus students P= Figure 1: A normal integral

Lecture Data Science

Random Variable: Definition

Transcription:

Statistics & Flood Frequency Chapter 3 Dr. Philip B. Bedient

Predicting FLOODS

Flood Frequency Analysis n Statistical Methods to evaluate probability exceeding a particular outcome - P (X >20,000 cfs) = 10% n Used to determine return periods of rainfall or flows n Used to determine specific frequency flows for floodplain mapping purposes (10, 25, 50, 100 yr) n Used for datasets that have no obvious trends n Used to statistically extend data sets

Random Variables n Parameter that cannot be predicted with certainty n Outcome of a random or uncertain process - flipping a coin or picking out a card from deck n Can be discrete or continuous n Data are usually discrete or quantized n Usually easier to apply continuous distribution to discrete data that has been organized into bins

Typical CDF F(x 1 ) - F(x 2 ) Continuous Discrete F(x 1 ) = P(x < x 1 )

Freq Histogram of Flows 36 27 1.3 17.3 9 8 1.3 Probability that Q is 10,000 to 15, 000 = 17.3% Prob that Q < 20,000 = 1.3 + 17.3 + 36 = 54.6%

Probability Distributions CDF is the most useful form for analysis i F(x) = P(X x) = P(x i ) F(x 1 ) = P( x x 1 ) = x 1 f (x) dx P(x 1 x x 2 ) = F(x 2 ) F(x 1 )

Moments of a Distribution Used to characterize a distribution or set of data! Moments taken about the origin (1 st ) or the mean (2 nd, 3 rd, etc)" Discrete P (x i ) µ N ' = x i N P( x ) i Continuous f (x) µ N ' = x N f x ( )dx

Moments of a Distribution First Moment about the Origin - Mean" E(x) = µ = x i P(x i ) Discrete E(x) = µ = xf (x) dx Continuous

Var(x) = Variance " Second moment about mean Var(x) = σ 2 = (x i µ) 2 P(x i ) Var(x) = (x µ) 2 f (x) dx Var(x) = E(x 2 ) (E(x)) 2 cv = σ µ = Coeff. of Variation

Estimates of Moments from a Dataset x = 1 n n x Mean of Data i i s x 2 = 1 n 1 (x x ) 2 Variance i Std Dev. S x = (S x2 ) 1/2

Skewness Coefficient" Used to evaluate high or low data points - flood or drought data Skewness µ 3 third central moment 3 σ C s = n (n 1)(n 2) (x i x ) 3 s x 3 skewness coeff. Coeff of Var = σ µ

Mean, Median, Mode Positive Skew moves mean to right Negative Skew moves mean to left Normal Dist n has mean = median = mode Median has highest prob. of occurrence

Skewed PDF - Long Right Tail

76 83 '98 01

Skewed Data

Climate Change Data

Siletz River Data Stationary Data Showing No Obvious Trends

Data with Trends

Frequency Histogram 36 27 1.3 17.3 9 8 1.3 Probability that Q is 10,000 to 15, 000 = 17.3% Prob that Q < 20,000 = 1.3 + 17.3 + 36 = 54.6%

Cumulative Histogram Probability that Q < 20,000 is 54.6 % Probability that Q > 25,000 is 19 %

PDF - Gamma Dist

Major Distributions n Binomial - P (x successes in n trials) n Exponential - decays rapidly to low probability - event arrival times n Normal - Symmetric based on µ and σ n Lognormal - Log data are normally dist d n Gamma - skewed distribution - hydro data n Log Pearson III -skewed logs -recommended by the IAC on water data - most often used

Binomial Distribution The probability of getting x successes followed by n-x failures is the product of prob of n independent events: p x (1-p) n-x This could be used to represent the case of flooding a success is exceeding a certain level while a failure is falling below that level in any given year. Thus, over a 25 year period, one would just add up the number of successes and the number of failures by year.

Binomial Distribution The probability of getting x successes followed by n-x failures is the product of prob of n independent events: p x (1-p) n-x This represents only one possible outcome. The number of ways of choosing x successes out of n events is the binomial coeff. The resulting distribution is the Binomial or B(n,p). n! P(x) = x!(n x)! px (1 p) n x x = 0, 1, 2, 3,..., n Bin. Coeff for single success in 3 years = 3(2)(1) / 2(1) = 3 For 3 success in 3 years = 6 / (3) (2)(1) = 1

Binomial Dist n B(n,p)

Risk and Reliability The probability of at least one success in n years, where the probability of success in any year is 1/T, is called the RISK. Prob success = p = 1/T and Prob failure = 1-p RISK = 1 - P(0) = 1 - Prob(no success in n years) = 1 - (1-p) n = 1 - (1-1/T) n Reliability = (1-1/T) n

Design Periods vs RISK and Design Life Expected Design Life (Years) Risk % 5 10 25 50 100 75 4.1 7.7 18.5 36.6 72.6 50 7.7 14.9 36.6 72.6 144.8 20 22.9 45.3 112.5 224.6 448.6 x 2 x 3 10 48 95.4 237.8 475.1 949.6

Risk Example What is the probability of at least one 50 yr flood in a 30 year mortgage period, where the probability of success in any year is 1/T = 1.50 = 0.02 RISK = 1 - (1-1/T) n = 1 - (1-0.02) 30 = 1 - (0.98) 30 = 0.455 or 46% If this is too large a risk, then increase design level to the 100 year where p = 0.01 RISK = 1 - (0.99) 30 = 0.26 or 26%

Important Probability Distributions Normal mean and std dev. zero skew Log Normal (Log data ) same as normal Gamma skewed data Exponential- constant skew

Normal, LogN, LPIII Data in bins Normal

Normal Prob Paper Normal Prob Paper converts the Normal CDF S curve into a straight line on a prob scale

Normal Prob Paper Std Dev = +1000 cfs Mean = 5200 cfs Std Dev = 1000 cfs Place mean at F = 50% Place one S x at 15.9 and 84.1% Connect points with st. line Plot data with plotting position formula P = m/n+1

Normal Dist n Fit Mean

Exponential Dist n Poisson Process where k is average no. of events per time and 1/k is the average time between arrivals f(t) = k e - kt for t > 0 Traffic flow Flood arrivals Telephone calls

Exponential Dist n f(t) = k e - kt for t > 0 F(t) = 1 - e - kt E(t) = 0 (tk)e kt dt Letting u = kt Mean or E(t) = 1 k Avg Time Between Events 0 ue u du = 1 k Var = 1 k 2

Q n = Gamma Dist n 1 # t % & n 1 ( e t / K KΓ(n) $ k ' Mean or E(t) = nk Var = nk 2 whereγ(n) = (n 1)! n =1 n =2 Unit Hydrographs n =3

Parameters of Dist n Distribution Normal x LogN Y =logx Gamma x Exp t Mean µ x µ y nk 1/k Variance σ x 2 σ y 2 nk 2 1/k 2 Skewness zero zero 2/n 0.5 2

Frequency Analysis of Peak Flow Data Year Rank Ordered cfs 1940 1 42,700 1925 2 31,100 1932 3 20,700 1966 4 19,300 1969 5 14,200 1982 6 14,200 1988 7 12,100 1995 8 10,300 2000.

Frequency Analysis of Peak Flow Data n Take Mean and Variance (S.D.) of ranked data n Take Skewness C s of data (3rd moment about mean) n If C s near zero, assume normal dist n n If C s large, convert Y = Log x - (Mean and Var of Y) n Take Skewness of Log data - C s (Y) n If C s near zero, then fits Lognormal n If C s not zero, fit data to Log Pearson III

Siletz River Example " 75 data points - Excel Tools Original Q Y = Log Q Mean 20,452 4.2921 Std Dev 6089 0.129 Skew 0.7889-0.1565 Coef of Variation 0.298 0.03

Siletz River Example - Fit Normal and LogN Normal Distribution Q = Q m + z S Q Q 100 = 20452 + 2.326(6089) = 34,620 cfs Mean + z (S.D.) Where z = std normal variate - tables Log N Distribution Y = Y m + k S Y Y 100 = 4.29209 + 2.326(0.129) = 4.5923 k = freq factor and Q = 10 Y = 39,100 cfs

Log Pearson Type III Log Pearson Type III Y = Y m + k S Y K is a function of Cs and Recurrence Interval Table 3.4 lists values for pos and neg skews For Cs = -0.15, thus K = 2.15 from Table 3.4 Y 100 = 4.29209 + 2.15(0.129) = 4.567 Q = 10 Y = 36,927 cfs for LP III Plot several points on Log Prob paper

LogN Prob Paper for CDF What is the prob that flow exceeds some given value - 100 yr value Plot data with plotting position formula P = m/n+1, m = rank, n = # Log N dist n plots as straight line

LogN Plot of Siletz R. Mean Straight Line Fits Data Well

Siletz River Flow Data Various Fits of CDFs LP3 has curvature LN is straight line

Flow Duration Curves

Trends in data have to be removed before any Frequency Analysis White Oak at Houston (1936-2002) 35000 01 Peak Flow (cfs) 30000 25000 20000 15000 10000 92 '98 5000 0 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 59 61 63 65 67 Years