Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

Similar documents
4.3 Normal distribution

Tutorial 11: Limit Theorems. Baoxiang Wang & Yihan Zhang bxwang, April 10, 2017

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Central Limit Theorem, Joint Distributions Spring 2018

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Statistics for Business and Economics

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance

The normal distribution is a theoretical model derived mathematically and not empirically.

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

Elementary Statistics Lecture 5

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

Probability. An intro for calculus students P= Figure 1: A normal integral

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

4 Random Variables and Distributions

Chapter 7 1. Random Variables

4.2 Bernoulli Trials and Binomial Distributions

The Bernoulli distribution

Part V - Chance Variability

Homework Assignments

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Topic 6 - Continuous Distributions I. Discrete RVs. Probability Density. Continuous RVs. Background Reading. Recall the discrete distributions

Statistics & Flood Frequency Chapter 3. Dr. Philip B. Bedient

MA : Introductory Probability

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

Business Statistics 41000: Probability 4

Chapter 7: Point Estimation and Sampling Distributions

Continuous Distributions

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

MATH MW Elementary Probability Course Notes Part IV: Binomial/Normal distributions Mean and Variance

Engineering Statistics ECIV 2305

2011 Pearson Education, Inc

5.4 Normal Approximation of the Binomial Distribution

Statistics, Measures of Central Tendency I

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics

INF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9

Introduction to Statistics I

Statistics 6 th Edition

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

STATISTICAL LABORATORY, May 18th, 2010 CENTRAL LIMIT THEOREM ILLUSTRATION

MATH 3200 Exam 3 Dr. Syring

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

Midterm Exam III Review

The Binomial Distribution

AP Statistics Ch 8 The Binomial and Geometric Distributions

INF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41

What was in the last lecture?

Business Statistics 41000: Probability 3

IEOR 3106: Introduction to OR: Stochastic Models. Fall 2013, Professor Whitt. Class Lecture Notes: Tuesday, September 10.

Consider the following examples: ex: let X = tossing a coin three times and counting the number of heads

The Normal Probability Distribution

Chapter 8: The Binomial and Geometric Distributions

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Review. Binomial random variable

2017 Fall QMS102 Tip Sheet 2

Commonly Used Distributions

Binomial Distribution and Discrete Random Variables

The Normal Distribution

Binomial and Normal Distributions

Favorite Distributions

The Binomial Probability Distribution

Theoretical Foundations

Chapter 5: Statistical Inference (in General)

STAT Chapter 7: Central Limit Theorem

3.2 Hypergeometric Distribution 3.5, 3.9 Mean and Variance

SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data

PROBABILITY DISTRIBUTIONS. Chapter 6

MAKING SENSE OF DATA Essentials series

Standard Normal, Inverse Normal and Sampling Distributions

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

Lecture 2. Probability Distributions Theophanis Tsandilas

Central Limit Theorem (cont d) 7/28/2006

Contents. The Binomial Distribution. The Binomial Distribution The Normal Approximation to the Binomial Left hander example

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Lecture 3: Probability Distributions (cont d)

Chapter 4 Continuous Random Variables and Probability Distributions

The Normal Distribution

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

NORMAL APPROXIMATION. In the last chapter we discovered that, when sampling from almost any distribution, e r2 2 rdrdϕ = 2π e u du =2π.

Statistical Tables Compiled by Alan J. Terry

Probability Theory. Mohamed I. Riffi. Islamic University of Gaza

Random Variables Handout. Xavier Vilà

Chapter 5. Statistical inference for Parametric Models

Introduction to Business Statistics QM 120 Chapter 6

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr.

AMS7: WEEK 4. CLASS 3

4-1. Chapter 4. Commonly Used Distributions by The McGraw-Hill Companies, Inc. All rights reserved.

18.05 Problem Set 3, Spring 2014 Solutions

5.3 Statistics and Their Distributions

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MA 1125 Lecture 12 - Mean and Standard Deviation for the Binomial Distribution. Objectives: Mean and standard deviation for the binomial distribution.

Some Discrete Distribution Families

Probability Distributions II

4: Probability. What is probability? Random variables (RVs)

Transcription:

1.1.2 Normal distribution 1.1.3 Approimating binomial distribution by normal 2.1 Central Limit Theorem Prof. Tesler Math 283 Fall 216 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 1 / 38

Normal distribution a.k.a. Bell curve and Gaussian distribution The normal distribution is a continuous distribution. Parameters: µ = mean (center) σ = standard deviation (width) ( ) PDF: f X () = 1 σ ep ( µ)2 for < <. 2π 2σ 2 Normal distribution N(2, 5): µ = 2, σ = 5..4.8 Normal µ µ ± σ 1 2 3 4 The normal distribution is symmetric about = µ, so median = mean = µ. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 2 / 38

Applications of normal distribution Applications Many natural quantities are modelled by it: e.g., a histogram of the heights or weights of everyone in a large population often follows a normal distribution. Many distributions such as binomial, Poisson,... are closely approimated by it when the parameters are large enough. Sums and averages of huge quantities of data are often modelled by it. Coverage in DNA sequencing Illumina GA II sequencing of E. coli at 6 coverage. Chitsaz et al. (211), Nature Biotechnology % of positions with coverage..2.4.6.8 1. Empirical distribution of coverage 2 4 6 8 1 Coverage Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 3 / 38

Cumulative distribution function It turns out that the integral for total probability does equal 1: ) 1 ( σ 2π ep ( µ)2 2σ 2 d = 1 However, it s a tricky integral; see a Calculus tetbook in the section on double integrals in polar coordinates. The cumulative distribution function is the integral F X () = P(X ) = ) 1 ( σ 2π ep (t µ)2 2σ 2 dt This integral cannot be done symbollically in terms of the usual functions (polynomials, eponentials, logs, trig functions, etc.). It can be done via numerical integration or Taylor series. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 4 / 38

Standard normal distribution Standard normal distribution N(, 1): µ =, σ = 1 CDF of standard normal distribution..2.4 Normal µ µ ± σ cdf..4.8 Normal µ µ ± σ 4 2 2 4 4 2 2 4 z The standard normal distribution is the normal distribution for µ =, σ = 1. Use the variable name Z: PDF: φ(z) = f Z (z) = e z2 /2 2π for < z < CDF: Φ(z) = F Z (z) = P(Z z) = 1 z e t2 /2 dt 2π The integral can t be done in terms of ordinary functions, but it can be done using numerical methods. In the past, people used lookup tables. We ll use functions for it in Matlab and R. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 5 / 38 z

Matlab and R commands For the standard normal: Φ(1.96).975 Φ 1 (.975) 1.96 Matlab: normcdf(1.96) norminv(.975) R: pnorm(1.96) qnorm(.975) We will see shortly how to convert between an arbitrary normal distribution (any µ, σ) and the standard normal distribution. The commands above allow additional arguments to specify µ and σ, e.g., normcdf(1.96,,1). R also can work with the right tail directly: pnorm(1.96, lower.tail = FALSE).975 qnorm(.975, lower.tail = FALSE) 1.96 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 6 / 38

Standard normal distribution areas.4 Standard Normal Curve.3.2.1!5 a b 5 z The area between z = a and z = b is P(a Z b) = 1 2π b a e t2 /2 dt = Φ(b) Φ(a) P(1.51 Z 1.62) = Φ(1.62) Φ(1.51) =.9474.9345 =.129 Matlab: normcdf(1.62) normcdf(1.51) R: pnorm(1.62) pnorm(1.51) Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 7 / 38

Standard normal distribution symmetries of areas.4 Area! on the right "#$ &'()*!*+,*-.(*/(-.2 12 "#! z!!2 2 z "!%!!! " %! Area right of z is P(Z > z) = 1 Φ(z). By symmetry, the area left of z and the area right of z are equal: Φ( z) = 1 Φ(z) Φ( 1.51) = 1 Φ(1.51) = 1.9345 =.655 Area between z = ±1.51 is Φ(1.51) Φ( 1.51) = 2Φ(1.51) 1.869 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 8 / 38

Central area Area between z = ±1 is 68.27%. Area between z = ±2 is 95.45%. Area between z = ±3 is 99.73%..4.2 Area! split half on each tail!z!/2 z!/2!2 z 2 Find the center part containing 95% of the area Put 2.5% of the area at the upper tail, 2.5% at the lower tail, and 95% in the middle. The value of z putting 2.5% at the top gives Φ(z) = 1.25 =.975. Notation: z.25 = 1.96. The area between z = ±1.96 is about 95%. For 99% in the middle,.5% on each side, use z.5 2.58. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 9 / 38

Areas on normal curve for arbitrary µ, σ P(a X b) = b a ) 1 ( σ 2π ep ( µ)2 2σ 2 d Substitute z = µ σ (or = σz + µ) into the integral to turn it into the standard normal integral: ( a µ P X µ b µ ) ( a µ = P Z b µ ) σ σ σ σ σ ( ) ( ) b µ a µ = Φ Φ σ σ The z-score of is z = µ σ. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 1 / 38

Binomial distribution Compute P(43 X 51) when n = 6, p = 3/4 Binomial: n = 6, p = 3/4 k P(X = k) = ( ) 6 k (.75) k (.25) 6 k 43.9562 44.1183 45.11822 46.11565 47.1335 48.8397 49.6169 5.471 51.2395 Total.7544 Mean µ = np = 6(3/4) = 45 Standard deviation σ= np(1 p) = 6(3/4)(1/4) = 11.25 3.35411966 Mode (k with ma ) np + p = 6(3/4) + (3/4) = 45 3 4 = 45 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 11 / 38

Mode of a distribution The mode of random variable X is the value k at which the is maimum. Mode of binomial distribution when < p < 1 The mode is (n + 1)p. Eception: If (n + 1)p is an integer then (n + 1)p and (n + 1)p 1 are tied as the mode. The mode is within 1 of the mean np. When np is an integer, the mode equals the mean. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 12 / 38

Binomial and normal distributions Binomial Normal approimation to binomial k P(X = k).15 Binomial: n=6, p=3/4 Binomial P(43! X! 51) 43.9562 44.1183 Normal: µ=45,"=3.35 45.11822.1 46.11565 47.1335 48.8397 49.6169.5 5.471 51.2395 Total.7544 41 43 45 47 49 51 53 55 P(X = k) shown as a rectangle: height P(X = k), etent k ± 1/2. The binomial distribution is only defined at the integers, and is very close to the normal distribution shown. We will approimate the probability P(43 X 51) we had above by the corresponding one for the normal distribution. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 13 / 38

Normal approimation to binomial, step 1 Compute corresponding parameters We want to approimate P(a X b) in a binomial distribution. We ll use n = 6, p = 3/4 and approimate P(43 X 51). Determine µ, σ: µ = np = 6(3/4) = 45 σ = np(1 p) = 11.25 3.354 The normal distribution with those same values of µ, σ is a good approimation to the binomial distribution provided µ ± 3σ are both between and n. Check: µ 3σ 45 3(3.354) = 34.938 µ + 3σ 45 + 3(3.354) = 55.62 are both between and 6, so we may proceed. Note: Some applications are more strict and may require µ ± 5σ or more to be between and n. Since µ + 5σ 61.771, this would fail at that level of strictness. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 14 / 38

Normal approimation to binomial, step 2 Continuity correction Normal approimation to binomial.15.1.5 Binomial: n=6, p=3/4 Binomial P(43! X! 51) Normal: µ=45,"=3.35 The binomial distribution is discrete (X = integers) but the normal distribution is continuous. 41 43 45 47 49 51 53 55 The sum P(X = 43) + + P(X = 51) has 9 terms, corresponding to the area of the 9 rectangles in the picture. The area under the normal distribution curve from 42.5 X 51.5 approimates the area of those rectangles. Change P(43 X 51) to P(42.5 X 51.5). Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 15 / 38

Normal approimation to binomial, steps 3 4 3. Convert to z-scores 4. Use the normal distribution to approimately evaluate it For random variable X with mean µ and standard deviation σ, The z-score of a value is z = E(X) SD(X) The random variable Z is Z = X E(X) SD(X) Convert to z-scores: P(42.5 X 51.5) = P = µ σ. = X µ σ. ( 42.5 45 11.25 X 45 11.25 ) 51.5 45 11.25 = P(.7453559926 Z 1.937925581) Approimate this by the standard normal distribution cdf: Φ(1.937925581) Φ(.7453559926).7456555785 This is close to the true answer (apart from rounding errors) P(43 X 51) =.7544 we got from the binomial distribution. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 16 / 38

Estimating fraction of successes instead of number of successes What is the value of p in the binomial distribution? Estimate it: flip a coin n times and divide the # heads by n. Let X = binomial distribution for n flips, probability p of heads. Let X = X/n be the fraction of flips that are heads. X is discrete, with possible values n, 1 n, 2 n,..., n n {(. n ) P(X = k n ) = P(X = k) = k p k (1 p) n k for k =, 1,..., n; otherwise. Mean E(X) = E(X/n) = E(X)/n = np/n = p. Variance Var(X) = Var ( ) X n = Var(X) n 2 Standard deviation SD(X) = p(1 p)/n. = np(1 p) n 2 = p(1 p) n. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 17 / 38

Normal approimation for fraction of successes n flips, probability p of heads, X=observed fraction of heads Mean E(X) = p Variance Var(X) = p(1 p)/n Standard deviation SD(X) = p(1 p)/n The Z transformation of X is Z = X E(X) SD(X) = X p p(1 p)/n and value X = has z-score z = p p(1 p)/n. For k heads in n flips, The z-score of X = k is z 1 = k np. np(1 p) The z-score of X = k/n is z 2 = (k/n) p p(1 p)/n. These are equal! Divide the numerator and denominator of z 1 by n to get z 2. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 18 / 38

Normal approimation for fraction of successes For n = 6 flips of a coin with p = 3 4, we ll estimate P ( 43 6 The eact answer equals P(43 X 51).7544. X 51 6). Step 1: Determine mean and SD E(X) = p =.75 SD(X) = p(1 p)/n = (.75)(.25)/6 =.3125.559 Verify approimation is valid: Mean ± 3 SD between and 1 Mean 3 SD =.58229 Mean + 3 SD =.9177 Both are between and 1. Step 2: Continuity correction P ( ) ( 43 51 6 X 6 = P 42.5 6 Step 3: z-scores X 51.5 6 Step 4: Evaluate approimate answer using normal distribution Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 19 / 38 )

Normal approimation for fraction of successes P ( 43 6 ) ( 51 X 6 = P 42.5 6 X 51.5 6 = P(.7833 X.85833) (.7833 E(X) = P SD(X) ) X E(X) SD(X) = P (.7833.75.559 Z.85833.75.559 = P(.74535 Z 1.93792).85833 E(X) SD(X) ) ) = Φ(1.93792) Φ(.74535).74565 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 2 / 38

Mean and SD of sums and averages of i.i.d. random variables Let X 1,..., X n be n i.i.d. (independent identically distributed) random variables, each with mean µ and standard deviation σ. Let S n = X 1 + + X n be their sum and X n = (X 1 + + X n )/n = S n /n be their average. Means: Sum: E(S n ) = E(X 1 ) + + E(X n ) = n E(X 1 ) = nµ Avg: E(X n ) = E(S n /n) = nµ/n = µ Variance: Sum: Var(S n ) = Var(X 1 ) + + Var(X n ) = n Var(X 1 ) = nσ 2 Avg: Var(X n ) = Var(S n )/n 2 = nσ 2 /n 2 = σ 2 /n Standard deviation: Sum: SD(S n ) = σ n Avg: SD(X n ) = σ/ n Terminology for different types of standard deviation The standard deviation (SD) of a trial (each X i ) is σ The standard error (SE) of the sum is σ n The standard error (SE) of the average is σ/ n Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 21 / 38

Z-scores of sums and averages For sum S n For average X n Mean: E(S n ) = nµ E(X n ) = µ Variance: Var(S n ) = nσ 2 Var(X n ) = σ 2 /n Standard Deviation: SD(S n ) = σ n SD(X n ) = σ/ n Z-scores: Z = S n E(S n ) SD(S n ) = S n nµ σ n Z = X n E(X n ) SD(X n ) = X n µ σ/ n Z-scores of sum and average are equal! Divide the numerator and denominator of Z of the sum by n to get Z of the average. Z sum = (S n nµ)/n (σ n)/n = X n µ σ/ n = Z avg Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 22 / 38

Theorem (Central Limit Theorem abbreviated CLT) For n i.i.d. random variables X 1,..., X n with sum S n = X 1 + + X n and average X n = S n /n, and any real numbers a < b, P ( a S n nµ σ n ) b = P ( a X ) n µ σ/ n b Φ(b) Φ(a) if n is large enough. As n, the approimation becomes eact equality. 1.5 Binomial n=1,p=.75; µ=.75,!=.43.5 1 Binomial n=6,p=.75; µ=45.,!=3.35.15.1.5 2 4 6 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 23 / 38

Interpretation of Central Limit Theorem As n increases, the more and more closely resembles a normal distribution. However, the is defined as in-between the red points shown, if it s a discrete distribution. The cdfs are approimately equal everywhere on the continuum. Probabilities of intervals for sums or averages of enough i.i.d. variables can be approimately evaluated using the normal distribution. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 24 / 38

Repeated rolls of a die One roll: µ = 3.5, σ = 35/12 1.71.3 Average of 1 roll of die; µ=3.5,!=1.71.2 Average of 2 rolls of die; µ=3.5,!=1.21.2.15.1.1.5 2 4 6 2 4 6.15.1.5 Average of 3 rolls of die; µ=3.5,!=.99.3.2.1 Average of 1 rolls of die; µ=3.5,!=.17 Die average Normal dist. µ µ±! 2 4 6 2 4 6 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 25 / 38

Repeated rolls of a die Find n so that at least 95% of the time, the average of n rolls of a die is between 3 and 4. ( ) P(3 X 4) = P 3 µ σ/ X µ n σ/ 4 µ n σ/ n Plug in µ = 3.5 and σ = 35/12. ( P(3 X 4) = P 1/2 Z 35/(12n) ) 1/2 35/(12n) Recall the center 95% of the area on the standard normal curve is between z = ±1.96. 1/2 1.96 n (1.96) 2 35/12 44.81 35/(12n) (1/2) 2 n is an integer so n 45 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 26 / 38

Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 27 / 38

Sawtooth distribution (made up as demo) One trial: µ = 4, σ 2.24 Average of 1 trial; µ=4.,!=2.24 Average of 2 trials; µ=4.,!=1.58.25.2.15.1.5 2 4 6 8 Average of 3 trials; µ=4.,!=1.29.15.1.5 2 4 6 8 Average of 1 trials; µ=4.,!=.22.1.5 2 4 6 8.2.15.1.5 2 4 6 8 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 28 / 38

Binomial distribution (n, p) A Bernoulli trial is to flip a coin once and count the number of heads, { 1 probability p; X 1 = probability 1 p. Mean E(X 1 ) = p, standard deviation SD(X 1 ) = p(1 p). The binomial distribution is the sum of n i.i.d. Bernoulli trials. Mean µ = np, standard deviation σ = np(1 p). The binomial distribution is approimated pretty well by the normal distribution when µ ± 3σ are between and n. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 29 / 38

Binomial distribution (n, p) One flip: µ = p =.75, σ = p(1 p) =.1875.433 Binomial n=1,p=.75; µ=.75,!=.43 1.5 Binomial n=6,p=.75; µ=4.5,!=1.6.4.2.5 1 Binomial n=3,p=.75; µ=22.5,!=2.37.2.1 1 2 3 2 4 6 Binomial n=6,p=.75; µ=45.,!=3.35.15.1.5 2 4 6 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 3 / 38

Poisson distribution (µ or µ = λ d) Mean: µ (same as the Poisson parameter) Standard deviation: σ = µ. It is approimated pretty well by the normal distribution when µ 5. The reason the Central Limit Theorem applies is that a Poisson distribution with parameter µ equals the sum of n i.i.d. Poissons with parameter µ/n. The Poisson distribution has infinite range =, 1, 2,... and the normal distribution has infinite range < < (reals). Both are truncated in the plots. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 31 / 38

Poisson distribution (µ) Poisson µ=1;!=1..4.2.2.1 Poisson µ=6;!=2.45 5 1.8.6.4.2 Poisson µ=3;!=5.48 2 4 6 8 5 1 15.6.4.2 Poisson µ=6;!=7.75 5 1 15 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 32 / 38

Geometric and negative binomial distributions Geometric distribution (p) X is the number of flips { until the first heads, (1 p) 1 p if = 1, 2, 3,... ; p X () = otherwise. The plot doesn t resemble the normal distribution at all. Mean: µ = 1/p Negative binomial distribution (r, p) Standard deviation: σ = 1 p/p r = 1 is same as geometric distribution. r > 2: The has a bell -like shape, but is not close to the normal distribution unless r is very large. Mean: µ = r/p Standard deviation: σ = r(1 p)/p Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 33 / 38

Geometric and negative binomial distributions Heads with probability p =.1 Geometric p=.1; µ=1.,!=9.49.1.5 Neg. bin. r=6,p=.1; µ=6.,!=23.24.2.1.5 1 2 3 Neg. bin. r=3,p=.1; µ=3.,!=51.96.1 1 2 3 4 4 2 5 1 Neg. bin. r=6,p=.1; µ=6.,!=73.48 6 1!3 2 4 6 8 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 34 / 38

Eponential and gamma distributions Eponential distribution (λ) The eponential distribution doesn t resemble the normal distribution at all. Mean: µ = 1/λ Standard deviation: σ = 1/λ Gamma distribution (r, λ) The gamma distribution for r = 1 is the eponential distribution. The gamma distribution for r > 1 does have a bell -like shape, but it is not close to the normal distribution until r is very large. There is a generalization to allow r to be real numbers, not just integers. Mean: µ = r/λ Standard deviation: σ = r/λ Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 35 / 38

Eponential and gamma distributions Rate λ =.1 Eponential!=.1; µ=1.,"=1..1.5 Gamma r=6,p=.1; µ=6.,!=24.49.2.1 6 4 2 1 2 3 4 Gamma r=3,p=.1; µ=3.,!=54.77 8 1!3 1 2 3 4 4 2 5 1 Gamma r=6,p=.1; µ=6.,!=77.46 6 1!3 2 4 6 8 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 36 / 38

Geometric/Negative binomial vs. Eponential/Gamma p = λ gives same means for geometric and eponential. p = 1 e λ gives same eponential decay rate for both geometric and eponential distributions. 1 e λ λ when λ is small. This corespondence carries over to the gamma and negative binomial distributions. Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 37 / 38

Geometric/negative binomial vs. Eponential/gamma This is for p =.1 vs. λ =.1; a better fit for λ =.1 would be p = 1 e λ.95 Geometric p=.1; µ=1.,!=9.49 Eponential!=.1; µ=1.,"=1..1.5.1.5.5 1 2 3 Neg. bin. r=3,p=.1; µ=3.,!=51.96.1 1 2 3 4 6 4 2 1 2 3 4 Gamma r=3,p=.1; µ=3.,!=54.77 8 1!3 1 2 3 4 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 38 / 38