continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

Similar documents
The Normal Distribution

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

What was in the last lecture?

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Random Variables Handout. Xavier Vilà

Statistical Tables Compiled by Alan J. Terry

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 10: Continuous RV Families. Prof. Vince Calhoun

The Normal Distribution. (Ch 4.3)

Statistics for Business and Economics

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

Density curves. (James Madison University) February 4, / 20

STATISTICS and PROBABILITY

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Commonly Used Distributions

Topic 6 - Continuous Distributions I. Discrete RVs. Probability Density. Continuous RVs. Background Reading. Recall the discrete distributions

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

Central Limit Theorem, Joint Distributions Spring 2018

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

Chapter 8. Variables. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc.

Business Statistics 41000: Probability 3

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Chapter 4 Continuous Random Variables and Probability Distributions

Random variables. Contents

2. The sum of all the probabilities in the sample space must add up to 1

Continuous Distributions

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

IEOR 165 Lecture 1 Probability Review

Chapter 4 Continuous Random Variables and Probability Distributions

ECON 214 Elements of Statistics for Economists 2016/2017

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Lecture Stat 302 Introduction to Probability - Slides 15

Sampling Distribution

4-1. Chapter 4. Commonly Used Distributions by The McGraw-Hill Companies, Inc. All rights reserved.

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

6. Continous Distributions

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

Chapter 3 Common Families of Distributions. Definition 3.4.1: A family of pmfs or pdfs is called exponential family if it can be expressed as

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr.

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

χ 2 distributions and confidence intervals for population variance

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

MATH 3200 Exam 3 Dr. Syring

AMS7: WEEK 4. CLASS 3

Chapter 6 Continuous Probability Distributions. Learning objectives

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

2011 Pearson Education, Inc

Tutorial 11: Limit Theorems. Baoxiang Wang & Yihan Zhang bxwang, April 10, 2017

4.3 Normal distribution

Chapter 7 1. Random Variables

14.30 Introduction to Statistical Methods in Economics Spring 2009

Chapter 6: Random Variables and Probability Distributions

Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

Discrete Random Variables

Statistics and Their Distributions

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

MAS187/AEF258. University of Newcastle upon Tyne

Review. Binomial random variable

Chapter 6. The Normal Probability Distributions

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Homework Assignments

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

Expectations. Definition Let X be a discrete rv with set of possible values D and pmf p(x). The expected value or mean value of X, denoted by E(X ) or

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41

Continuous random variables

Probability Distributions for Discrete RV

X = x p(x) 1 / 6 1 / 6 1 / 6 1 / 6 1 / 6 1 / 6. x = 1 x = 2 x = 3 x = 4 x = 5 x = 6 values for the random variable X

Normal Probability Distributions

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics

Chapter 2. Random variables. 2.3 Expectation

Lecture 6: Chapter 6

SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data

Bus 701: Advanced Statistics. Harald Schmidbauer

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Section Introduction to Normal Distributions

Central limit theorems

Introduction to Business Statistics QM 120 Chapter 6

Lecture 10: Point Estimation

King Saud University Academic Year (G) College of Sciences Academic Year (H) Solutions of Homework 1 : Selected problems P exam

Random Variable: Definition

Statistics and Probability

Engineering Statistics ECIV 2305

STAT 111 Recitation 4

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

Probability Distributions II

Statistics for Business and Economics: Random Variables:Continuous

Business Statistics 41000: Probability 4

VI. Continuous Probability Distributions

CHAPTERS 5 & 6: CONTINUOUS RANDOM VARIABLES

Reliability and Risk Analysis. Survival and Reliability Function

The Bernoulli distribution

Transcription:

continuous rv Let X be a continuous rv. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that for any two numbers a and b with a b, P(a X b) = b a f (x)dx. Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c c f (x)dx = 0, hence P(a X b) = P(a X < b) = P(a < X b) = P(a < X < b).

example: The waiting time X (in minutes) for bus route 4 has the pdf given by { 1 f (x) = 10, 0 x 10 0, otherwise Then the probability of waiting between 2 and 5 minutes is P(2 X 5) = 5 1 2 10 dx = x 10 x=5 x=2 = 0.3 A continuous rv is said to have a uniform distribution on the interval [A, B] if the pdf of X is f (x; A, B) = { 1 B A, A x B 0, otherwise.

The cdf F(x) of a continuous rv X is defined for any number x by F (x) = P(X x) = x f (y)dy. use cdf to compute probabilities: P(X < a) = F (a), P(X > a) = 1 F (a), P(a X b) = F (b) F (a) Obtain f(x) from F(x): proposition: If X is a continuous rv with pdf f(x) and cdf F(x), then at every x at which the derivative F (x) exists, F (x) = f (x).

example For X with a uniform density f (x) = 1 F (x) = x f (y)dy = x 1 A B A dy = Hence F (x) = Obtain f (x) from F (x): F (x) = d dx ( x A B A ) = 1 f (x) = 0, otherwise. B A B A, A x B, we have y B A y=x y=a = x A B A, A x B. 0, x < A x A B A, A x B 1, x B = f (x), fora < x < B, and

exercise Suppose a continuous rv X has pdf f (x) = 2x, 0 x 1. Find F (x), P(X 0.8), P(0.5 X 1).

percentiles Let p be a number between 0 and 1. The (100p)th percentile of the distribution of a continuous rv, denoted by η(p), is defined by p = F [η(p)] = η(p) f (y)dy. The median of a continuous distribution, denoted by µ, is the 50th percentile.

expected value The expected value or the mean of a continuous rv X with pdf f (x) is µ x = E(X ) = xf (x)dx. Expected value of E(h(X )) : E(h(X )) = h(x)f (x)dx. The variance of a continuous rv X with pdf f(x) and mean µ is σx 2 = V (X ) = E(X µ)2 = (x µ)2 f (x)dx = E(X 2 ) µ 2 = x 2 f (x)dx µ 2. The standard deviation of X is σ x = V (X ).

example The pdf of weekly gravel sales X was { 3 f (x) = 2 (1 x 2 ), 0 x 1 0, otherwise. E(X ) = xf (x)dx = 1 0 x 3 2 (1 x 2 )dx = 3 2 3 2 ( x2 2 x4 4 ) x=1 x=0 = 3 8. 1 0 (x x 3 )dx = E(X 2 ) = x 2 f (x)dx = 1 0 x 2 3 2 (1 x 2 )dx = 1 0 3 2 (x 2 x 4 )dx = 1 5. so V (X ) = 1 5 ( 3 8 )2 = 0.059 and σ X = 0.059 = 0.244.

The normal distribution A continuous rv is said to have a normal distribution with parameters µ and σ where < µ < and σ > 0, if the pdf of X is f (x; µ, σ) = 1 2πσ e (x µ)2 /2σ 2, < x <. Often abbreviated as X N(µ, σ 2 ). X has a standard normal distribution if µ = 0 and σ = 1, often denoted Z N(0, 1). f (z) = 1 2π e z2 /2. its cdf is Φ(z) = P(Z z) = z f (t)dt

Find probabilities How to find P(a X b) if X N(µ, σ 2 ). First, find P(a Z b). examples: P(Z 1.25) = Φ(1.25) = 0.8944. P(Z > 1.25) = 1 P(z 1.25) = 1 0.8944 = 0.1056. P( 0.38 Z 1.25) = Φ(1.25) Φ( 0.38) = 0.8944 0.3520 = 0.5424. percentiles of the standard normal distribution: 99th percentile, z=2.33 95th percentile, z=1.64 or 1.65 or 1.645. 90th percentile, z=1.28 z α notation: z 0.01 = 2.33, z 0.05 = 1.645, z 0.025 = 1.96. α : right tail probability

Nonstandard normal distribution X N(µ, σ 2 ). Z = X µ σ. P(a X b) = P( a µ σ Z b µ σ ). e.g. adult female heights in North America have approximately a normal distribution with a µ = 65 inches and σ = 3.5 inches. The probability that X falls below 70 inches: P(X < 70) = P(Z 70 65 3.5 ) = P(Z < 1.43) = 0.9236. The probability that X falls between 60 and 70 inches is 0.9236-0.0764=0.8472. i.e., about 85% of the heights are between 60 and 70 inches. P(60 < X < 70) = P( 60 65 3.5 Z 70 65 3.6 ) = P( 1.43 < Z < 1.43) = P(Z < 1.43) P(Z < 1.43) = 0.9236 0.0764 = 0.8472.

exercise Suppose the test scores follow a normal distribution with µ = 82 and σ = 4. Find the probabilities that a randomly selected test score a). falls below 88, b). falls below 75, c). falls between 75 and 88.

answer: P(X < 88) = P(Z < 88 82 4 ) = P(Z < 1.50) = 0.9332, P(X < 75) = P(Z < 1.75) = 0.0401. P(75 < X < 88) = 0.9332 0.0401 = 0.8931.

empirical rule If X N(µ, σ 2 ), then P(X is within one standard deviation of its mean) = P(µ σ X µ + σ) = P( µ σ µ σ Z µ+σ µ σ ) = P( 1.00 Z 1.00) = Φ(1.00) Φ( 1.00) = 0.6826, similarly, we can find P(X is within 2 standard deviations of its mean) = 0.9544 P(X is within 3 standard deviations of its mean) = 0.9974 Empirical rule: If the population distribution of a random variable is (approximately) normal, then 1. Roughly 68% of the values are within 1 SD of its mean. 2. Roughly 95% of the values are within 2 SDs of its mean. 3. Roughly 99.7% of the values are within 3 SDs of its mean.

example Adult female heights in North America have approximately a normal distribution with a µ = 65 inches and σ = 3.5 inches, then About 68% of the heights fall between [65 1 3.5, 65 + 1 3.5] = [61.5, 68.5] inches. About 95% of the heights fall between [65 2 3.5, 65 + 2 3.5] = [58, 72] inches. And almost all the heights fall between [65 3 3.5, 65 + 3 3.5] = [54.5, 75.5] inches.

percentiles of normal distribution the 2.5th percentile of Z is z=-1.96, i.e., 2.5% of the values are below z = 1.96. Example continued: find a height such that 2.5% of the heights are below this values, i.e., find the 2.5th percentile. x = µ + zσ = 65 1.96 3.5 = 58.1 inches. Final exam scores have approximately normal distribution with mean 76 and standard deviation 8. Find the 75th percentile of test scores, i.e., 75% of the test scores are below this value.

note P(Z < 0.67) = 0.75, so x = µ + zσ = 76 + 0.67 8 = 81.36.

Approximate the binomial distribution X binomial(n, p), if np 10, n(1 p) 10, then approximately X N(np, np(1 p)). So P(X a) P(Z a np ) np(1 p) better approximated by P(Z a+0.5 np ). np(1 p) Example: X binomial(50, 0.25). P(X 10) P(Z 10 + 0.5 50 0.25 50 0.25 0.75 ) = P(Z 0.65) = 0.2578. The exact probability is 0.2622.

The exponential distribution X is said to have an exponential distribution with parameter (λ) > 0 if the pdf of X is { λe f (x; λ) = λx, x 0 0, otherwise E(X ) = 1 λ, V (X ) = 1, F (x) = 1 e λx, x 0. λ 2 example: The response time X (in seconds) at a computer terminal has an exponential distribution with λ = 0.2. The probability that the response time is at most 10 seconds is P(X 10) = F (10) = 1 e 0.2 10 = 0.865. The probability that the response time is between 5 and 10 sec is P(5 X 10) = 10 5 0.2e 0.2x dx = e 0.2x x=10 x=5 = 0.233.

gamma distribution A continuous rv X is said to have a gamma distribution if its pdf is given by f (x; α, β) = 1 β α Γ(α) x α 1 e x/β, x 0, α > 0, β > 0. Gamma function: For α > 0, Γ(α) = 0 x α 1 e x dx. Note for α > 1, Γ(α) = (α 1)Γ(α 1). For any positive integer, Γ(n) = (n 1)! E(X ) = αβ, V (X ) = αβ 2.

the chi-squared distribution The chi-squared distribution is a gamma distribution with α = ν 2 and β = 2. 1 f (x; ν) = ν 2 ν/2 Γ( ν )x 2 1 e x/2, x 0. 2 ν : degrees of freedom. X χ 2 (ν).

1. Suppose the proportion of gas sold at a gas station in a week, X, has pdf given by f (x) = 6x(1 x), 0 x 1. a. Find P(X 0.5), the probability that it sells at least half of its gas in a week. b. Find E(X 2 ). 2. If adult female heights are normally distributed, find the probability that the height of a randomly selected women is within 0.67 SDs of the mean. 3. It is known that 30% of vehicles on I 81 are trucks. If you take a random sample of 50 vehicles, use the normal approximation to find the probability that at most 15 of them are trucks.

Solutions 1. a. P(X > 0.5) = 1 0.5 6x(1 x)dx = 1 0.5 (6x 6x 2 )dx = (3x 2 2x 3 ) 1 0.5 = 0.5. b. E(X 2 ) = 1 0 x 2 6x(1 x)dx = ( 6x4 4 6x5 5 ) 1 0 = 0.3. 2. P(µ 0.67σ X µ + 0.67σ) = P( 0.67 Z 0.67) = 0.7486 0.2514 = 0.4972 0.50. 3. P(X 15) = P(Z 15+0.5 50 0.3 50 0.3 0.7 ) = P(Z < 0.1543) = 0.56. The exact binomial probability is 0.569. R code to get the cumulative binomial probability P(X 15): > pbinom(15,50,0.3)