UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

Similar documents
Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

IEOR 165 Lecture 1 Probability Review

Random Variables Handout. Xavier Vilà

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

The Normal Distribution

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 10: Continuous RV Families. Prof. Vince Calhoun

Chapter 3 Discrete Random Variables and Probability Distributions

Chapter 4 Continuous Random Variables and Probability Distributions

Chapter 4 Continuous Random Variables and Probability Distributions

Central Limit Theorem, Joint Distributions Spring 2018

6. Continous Distributions

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

Chapter 3 Common Families of Distributions. Definition 3.4.1: A family of pmfs or pdfs is called exponential family if it can be expressed as

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

Chapter 4 Probability Distributions

Statistics for Business and Economics

Commonly Used Distributions

Discrete Random Variables

Statistics 6 th Edition

Probability Theory. Mohamed I. Riffi. Islamic University of Gaza

Random variables. Contents

Some Discrete Distribution Families

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

4 Random Variables and Distributions

What was in the last lecture?

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

2. The sum of all the probabilities in the sample space must add up to 1

Tutorial 11: Limit Theorems. Baoxiang Wang & Yihan Zhang bxwang, April 10, 2017

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

STATISTICS and PROBABILITY

Business Statistics 41000: Probability 3

4-1. Chapter 4. Commonly Used Distributions by The McGraw-Hill Companies, Inc. All rights reserved.

Continuous random variables

Frequency and Severity with Coverage Modifications

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

5. In fact, any function of a random variable is also a random variable

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Binomial Random Variables. Binomial Random Variables

4.3 Normal distribution

The Normal Distribution

Statistical Methods in Practice STAT/MATH 3379

Statistical Tables Compiled by Alan J. Terry

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Sampling Distribution

STAT/MATH 395 PROBABILITY II

Statistics for Managers Using Microsoft Excel 7 th Edition

CS 237: Probability in Computing

Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

Central limit theorems

Chapter 3 Discrete Random Variables and Probability Distributions

Lecture Notes 6. Assume F belongs to a family of distributions, (e.g. F is Normal), indexed by some parameter θ.

Lecture 10: Point Estimation

Homework Assignments

Random Samples. Mathematics 47: Lecture 6. Dan Sloughter. Furman University. March 13, 2006

Central Limit Theorem (CLT) RLS

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Much of what appears here comes from ideas presented in the book:

IEOR E4703: Monte-Carlo Simulation

Probability Models.S2 Discrete Random Variables

5.3 Statistics and Their Distributions

2011 Pearson Education, Inc

Probability Theory. Probability and Statistics for Data Science CSE594 - Spring 2016

Random Variable: Definition

Probability Distributions II

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

Basic notions of probability theory: continuous probability distributions. Piero Baraldi

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Reliability and Risk Analysis. Survival and Reliability Function

MAS187/AEF258. University of Newcastle upon Tyne

Chapter 7 1. Random Variables

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

Probability Distributions for Discrete RV

MATH 3200 Exam 3 Dr. Syring

. (i) What is the probability that X is at most 8.75? =.875

Engineering Statistics ECIV 2305

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

The Binomial Distribution

4-2 Probability Distributions and Probability Density Functions. Figure 4-2 Probability determined from the area under f(x).

Central Limit Theorem (cont d) 7/28/2006

Chapter 8. Variables. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc.

STOR Lecture 7. Random Variables - I

CS145: Probability & Computing

Chapter 7: Point Estimation and Sampling Distributions

Overview. Definitions. Definitions. Graphs. Chapter 5 Probability Distributions. probability distributions

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

LECTURE CHAPTER 3 DESCRETE RANDOM VARIABLE

Point Estimators. STATISTICS Lecture no. 10. Department of Econometrics FEM UO Brno office 69a, tel

Useful Probability Distributions

Write legibly. Unreadable answers are worthless.

MA : Introductory Probability

Chapter 5. Statistical inference for Parametric Models

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

Transcription:

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

Random Variables 2

A random variable X is a numerical (integer, real, complex, vector etc.) summary of the outcome of the random experiment. The range or support of the random variable is the set of possible values that it may take. Random variables are usually denoted by capital letters.

A discrete random variable is an integer/real-valued random variable with a finite (or countably infinite) range. A continuous random variable is a real valued random variable with an interval (either finite or infinite) of real numbers for its range.

5 Experiment Outcome, ω, from the sample space. X (ω) Random Variable (function of the outcome). P ( X U ) = P ( {ω X (ω) U} ).

Example: Dig a hole searching for gold. Ω all possible outcomes (many ways to define this). X Weight of gold found in grams. P ( X > 20) = P ( {ω X (ω) U} ) with U = {x : x > 20}.

Probability Distributions 7

The probability distribution of a random variable X is a description of the probabilities associated with the possible values of X. There are several common alternative ways to describe the probability distribution, with some differences between discrete and continuous random variables.

While not the most popular in practice, a unified way to describe the distribution of any scalar valued random variable X (real or integer) is the cumulative distribution function, It holds that F (x) = P(X x). (1) 0 F (x) 1. (2) lim x F (x) = 0. (3) lim x F (x) = 1. (3) If x y, then F (x) F (y). That is, F ( ) is non-decreasing.

Examples to understand: 0, x < 1, F (x) = 0.3, 1 x < 1, 1, 1 x. 0, x < 0, F (x) = x, 0 x 1, 1, 1 x.

Distributions are often summarised by numbers such as the mean, µ, variance, σ 2, or moments. These numbers, in general do not identify the distribution, but hint at the general location, spread and shape. The standard deviation of X is σ = σ 2 and is particularly useful when working with the Normal distribution. More on these soon.

Discrete Random Variables 12

Given a discrete random variable X with possible values x 1, x 2,..., x n, the probability mass function of X is, p(x) = P(X = x). Note: In [MonRun2014] and many other sources, the notation used is f (x) (as a pdf of a continuous random variable).

14 A probability mass function, p(x) satisfies: (1) p(x i ) 0. n (2) p(x i ) = 1. i=1 The cumulative distribution function of a discrete random variable X, denoted as F (x), is F (x) = x i x p(x i ).

P(X = x i ) can be determined from the jump at the value of x. More specifically p(x i ) = P(X = x i ) = F (x i ) lim x xi F (x i ).

Back to the example: 0, x < 1, F (x) = 0.3, 1 x < 1, 1, 1 x. What is the pmf?

17 The mean or expected value of a discrete random variable X, is µ = E(X ) = x x p(x).

18 The expected value of h(x ) for some function h( ) is: [ ] E h(x ) = h(x) p(x). x

19 The k th moment of X is, E(X k ) = x x k p(x).

20 The variance of X, is σ 2 = V (X ) = E(X µ) 2 = x (x µ) 2 p(x) = x x 2 p(x) µ 2.

The Discrete Uniform Distribution 21

22 A random variable X has a discrete uniform distribution if each of the n values in its range, x 1, x 2,..., x n, has equal probability. I.e. p(x i ) = 1/n.

23 Suppose that X is a discrete uniform random variable on the consecutive integers a, a + 1, a + 2,..., b, for a b. The mean and variance of X are E(X ) = b + a 2 and V (X ) = (b a + 1)2 1. 12

24 To compute the mean and variance of the discrete uniform, use: n k = k=1 n(n + 1), 2 n k 2 = k=1 n(n + 1)(2n + 1) 6

E(X ) = b k=a k 1 b a+1 =

E(X 2 ) = b k=a k2 1 b a+1 =

The Binomial Distribution 27

The setting of n independent and identical Bernoulli trials is as follows: (1) There are n trials. (1) The trials are independent. (2) Each trial results in only two possible outcomes, labelled as success and failure. (3) The probability of a success in each trial denoted as p is the same for all trials.

29 Binomial Example: Number of digs finding gold. n = 5 digs in different spots. p = 0.1 chance of finding gold in each spot.

The random variable X that equals the number of trials that result in a success is a binomial random variable with parameters 0 p 1 and n = 1, 2,.... The probability mass function of X is ( ) n p(x) = p x (1 p) n x, x = 0, 1,..., n. x

Useful to remember from algebra: the binomial expansion for constants a and b is n ( ) n (a + b) n = a k b n k. k k=0

32 If X is a binomial random variable with parameters p and n, then, E(X ) = n p and V (X ) = n p (1 p).

Example (cont.): Number of digs finding gold (n = 5, p = 0.1): 33

Continuous Random Variables 34

Given a continuous random variable X, the probability density function (pdf) is a function, f (x) such that, (1) f (x) 0. (2) f (x) = 0 for x not in the range. (3) f (x) dx = 1. (4) For small x, f (x) x P(X [x, x + x)). b (5) P(a X b) = f (x)dx = area under f (x) from a to b. a

36 Given the pdf, f (x) we can get the cdf as follows: F (x) = P(X x) = x f (u)du for < x <.

37 Given the cdf, F (x) we can get the pdf: f (x) = d dx F (x).

The mean or expected value of a continous random variable X, is µ = E(X ) = x f (x)dx. The expected value of h(x ) for some function h( ) is: [ ] E h(x ) = The k th moment of X is, The variance of X, is σ 2 = V (X ) = E(X k ) = h(x)f (x) dx. x k f (x) dx. (x µ) 2 f (x)dx = x 2 f (x) dx µ 2.

Continuous Uniform Distribution 39

40 A continuous random variable X with probability density function f (x) = 1 b a, a x b. is a continuous uniform random variable or uniform random variable for short.

41 If X is a continuous uniform random variable over a x b, the mean and variance are: µ = E(X ) = a + b 2 and σ 2 = V (X ) = (b a)2. 12

The Normal Distribution 42

43 A random variable X with probability density function f (x) = 1 σ 2π e (x µ) 2 2σ 2, < x <, is a normal random variable with parameters µ where < µ <, and σ > 0. For this distribution, the parameters map directly to the mean and variance, E(X ) = µ and V (X ) = σ 2. The notation N(µ, σ 2 ) is used to denote the distribution. Note that some authors and software packages use σ for the second parameter and not σ 2.

A normal random variable with a mean and variance of: µ = 0 and σ 2 = 1 is called a standard normal random variable and is denoted as Z. The cumulative distribution function of a standard normal random variable is denoted as and is tabulated in a table. Φ(z) = F Z (z) = P(Z z),

45 It is very common to compute P(a < X < b) for X N(µ, σ 2 ). This is the typical way: We get: P(a < X < b) = P(a µ < X µ < b µ) ( a µ = P < X µ < b µ ) σ σ σ ( a µ = P < Z < b µ ) σ σ ( b µ ) ( a µ ) = Φ Φ. σ σ ( b µ ) F X (b) F X (a) = F Z σ ( a µ ) F Z. σ

The Exponential Distribution 46

47 The exponential distribution with parameter λ > 0 is given by the survival function, F (x) = 1 F (x) = P(X > x) = e λx. The random variable X represents the distance between successive events from a Poisson process with mean number of events per unit interval λ > 0.

The probability density function of X is f (x) = λe λx for 0 x <. Note that sometimes a different parameterisation, θ = 1/λ is used (e.g. in the Julia Distributions package).

The mean and variance are: µ = E(X ) = 1 λ and σ 2 = V (X ) = 1 λ 2

The exponential distribution is the only continuous distribution with range [0, ) exhibiting the lack of memory property. For an exponential random variable X, P(X > t + s X > t) = P(X > s).

Monte Carlo Random Variable Generation 51

52 Monte Carlo simulation makes use of methods to transform a uniform random variable in a manner where it follows an arbitrary given given distribution. One example of this is if U Uniform(0, 1) then X = 1 λ log(u) is exponentially distributed with parameter λ.