Bus 701: Advanced Statistics. Harald Schmidbauer

Similar documents
continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

MATH 3200 Exam 3 Dr. Syring

Continuous random variables

Random variables. Contents

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Random Variables Handout. Xavier Vilà

Chapter 6 Continuous Probability Distributions. Learning objectives

Sampling and sampling distribution

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

Introduction to Statistics I

Chapter 7: Point Estimation and Sampling Distributions

Homework Assignments

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Discrete Random Variables

Chapter 2: Random Variables (Cont d)

Central Limit Theorem, Joint Distributions Spring 2018

Simulation Wrap-up, Statistics COS 323

Statistics and Their Distributions

STAT Chapter 7: Central Limit Theorem

Random Variable: Definition

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

STAT 241/251 - Chapter 7: Central Limit Theorem

Exercise Questions. Q7. The random variable X is known to be uniformly distributed between 10 and

Sampling Distribution

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

Chapter 7 - Lecture 1 General concepts and criteria

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

BIOL The Normal Distribution and the Central Limit Theorem

Chapter 4 Continuous Random Variables and Probability Distributions

Basic notions of probability theory: continuous probability distributions. Piero Baraldi

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

MAS187/AEF258. University of Newcastle upon Tyne

Section The Sampling Distribution of a Sample Mean

VI. Continuous Probability Distributions

ECON 214 Elements of Statistics for Economists 2016/2017

Chapter 8. Introduction to Statistical Inference

Chapter 5: Statistical Inference (in General)

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Topic 6 - Continuous Distributions I. Discrete RVs. Probability Density. Continuous RVs. Background Reading. Recall the discrete distributions

Chapter 7 Sampling Distributions and Point Estimation of Parameters

The Normal Distribution

5.1 Mean, Median, & Mode

Central Limit Theorem (CLT) RLS

Confidence Intervals Introduction

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

2.) What is the set of outcomes that describes the event that at least one of the items selected is defective? {AD, DA, DD}

University of California, Los Angeles Department of Statistics. The central limit theorem The distribution of the sample mean

LECTURE CHAPTER 3 DESCRETE RANDOM VARIABLE

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x)

Martingales, Part II, with Exercise Due 9/21

ECON 214 Elements of Statistics for Economists 2016/2017

4-2 Probability Distributions and Probability Density Functions. Figure 4-2 Probability determined from the area under f(x).

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

6 Central Limit Theorem. (Chs 6.4, 6.5)

Chapter 4 Continuous Random Variables and Probability Distributions

Density curves. (James Madison University) February 4, / 20

X = x p(x) 1 / 6 1 / 6 1 / 6 1 / 6 1 / 6 1 / 6. x = 1 x = 2 x = 3 x = 4 x = 5 x = 6 values for the random variable X

CH 5 Normal Probability Distributions Properties of the Normal Distribution

Statistics for Business and Economics

STUDY SET 2. Continuous Probability Distributions. ANSWER: Without continuity correction P(X>10) = P(Z>-0.66) =

Continuous Distributions

NORMAL APPROXIMATION. In the last chapter we discovered that, when sampling from almost any distribution, e r2 2 rdrdϕ = 2π e u du =2π.

Homework Problems Stat 479

Lecture III. 1. common parametric models 2. model fitting 2a. moment matching 2b. maximum likelihood 3. hypothesis testing 3a. p-values 3b.

Commonly Used Distributions

CHAPTERS 5 & 6: CONTINUOUS RANDOM VARIABLES

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

6. Continous Distributions

Statistics 6 th Edition

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

Part V - Chance Variability

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

(Practice Version) Midterm Exam 1

Chapter 5. Sampling Distributions

Statistics 251: Statistical Methods Sampling Distributions Module

What was in the last lecture?

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

Central limit theorems

Chapter 5. Statistical inference for Parametric Models

Continuous Probability Distributions & Normal Distribution

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

ECON 214 Elements of Statistics for Economists

4 Random Variables and Distributions

University of California, Los Angeles Department of Statistics

Engineering Statistics ECIV 2305

A.REPRESENTATION OF DATA

Chapter 7 Study Guide: The Central Limit Theorem

Central Limit Theorem (cont d) 7/28/2006

PROBABILITY DISTRIBUTIONS

Favorite Distributions

Lecture 10: Point Estimation

Reliability and Risk Analysis. Survival and Reliability Function

Statistics for Business and Economics: Random Variables:Continuous

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

Transcription:

Bus 701: Advanced Statistics Harald Schmidbauer c Harald Schmidbauer & Angi Rösch, 2008

About These Slides The present slides are not self-contained; they need to be explained and discussed. They contain only a small part of the course Bus 701. Even though being a work in progress and subject to revision, the slides constitute copyrighted material. If you want to reproduce or copy anything from the slides, please ask: Harald Schmidbauer Angi Rösch harald at hs-stat dot com angi.r at t-online dot de The slides were produced using L A TEX and R (the R project; www.rproject.org) on a Linux system. R files used for this course are available upon request. c Harald Schmidbauer & Angi Rösch, 2008 About these slides 2/38

Chapter 8: Continuous Probability Distributions c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 3/38

8.1 Basics Continuous random variables and continuous distributions. A random variable is called continuous if it can take on any value in a certain interval. The distribution of a continuous random variable is called a continuous distribution. Examples: X = daily return on DAX X = waiting time of a customer at a call center until an incoming call is answered X = impurity of a chemical produced by a company... c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 4/38

8.1 Basics Computing probabilities. For a continuous random variable X, P (X = x) = 0 for all x R. For example, let X = body-height of a randomly selected person. Then, P (X = 172) = P (X = 172.0000000) = 0! Therefore, the concept of probability function doesn t work for a continuous random variable. A density is needed! c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 5/38

8.1 Basics The density. f(x)........ a b x. Probabilities are areas below the density. For example, P (a X b) = b a f(x)dx A continuous probability distribution is given by these terms. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 6/38

8.1 Basics The distribution function. The distribution function of X is defined as x F (x) = P (X x) = x f(ξ)dξ. With it, P (a < X b) = F (b) F (a). As before, we can display the distribution (here: the density), compute location and variation measures. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 7/38

8.1 Basics The expectation: a location measure. For a continuous random variable X, E(X) = x f(x)dx. Same principle as in Chapter 7, with f(x)dx instead of p i = P (X = i), instead of. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 8/38

8.1 Basics The variance: a variation measure. For a discrete random variable X, var(x) = E [ (X E(X)) 2] = E(X 2 ) E 2 (X) = (x E(X)) 2 f(x)dx. Same principle as in Chapter 7, with f(x)dx instead of p i = P (X = i), instead of. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 9/38

8.2 The Normal Distribution Definition. A random variable X with density f(x) = 1 σ (x µ) 2 2π e 2σ 2, x R, µ R, σ 2 > 0, is said to be normally distributed with parameters µ and σ 2. In symbols: X N(µ, σ 2 ) c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 10/38

8.2 The Normal Distribution This picture shows the density. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 11/38

8.2 The Normal Distribution Meaning of the parameters µ and σ 2. Analytical meaning of µ and σ 2 : The density has its maximum at x = µ. The inflection points are at x = µ ± σ. Statistical meaning of µ and σ 2 : If X N(µ, σ 2 ), then E(X) = µ, var(x) = σ 2. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 12/38

8.2 The Normal Distribution This picture shows the influence of σ 2. 0.0 0.2 0.4 0.6 0.8 σ = 1 σ = 2 σ = 0.5 6 4 2 0 2 4 6 x c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 13/38

8.2 The Normal Distribution The standard normal distribution. If X N(0, 1), we say: X has a standard normal distribution. Let X N(0, 1). Using the table (and using geometry): P (X < 0) = 0.5 P (X < 1) = 0.8413 P (X > 1) = 1 0.8413 = 0.1587 P ( 1 < X < 1) = 1 2 (1 0.8413) = 0.6826 and: P ( 1.96 < X < 1.96) = 0.95 c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 14/38

8.2 The Normal Distribution The normal distribution standardization. There is a table ONLY for the standard normal distribution. If X N(µ, σ 2 ) with any µ R and σ 2 > 0, we can standardize X: Z := X µ σ N(0, 1) For the new random variable Z, we can again use the standard normal distribution table. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 15/38

8.2 The Normal Distribution Example 1: IQ tests. Let X = IQ of a randomly selected adult. IQ tests are designed such that X N(100, 15 2 ). What is the probability that a randomly selected person s IQ is... higher than 130? less than 90? between 90 and 110? c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 16/38

8.2 The Normal Distribution Example 1: IQ tests. Now let X = IQ of a randomly selected student of Bilgi University. Assuming X N(105, 15 2 ): What is the probability that a randomly selected student s IQ is higher than 150? How many of the students of Bilgi University would we expect to have an IQ above 150? c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 17/38

8.2 The Normal Distribution Example 2: Viscosity. Viscosity measurements from a batch chemical process are assumed to be a normal random variable with mean 15 and standard deviation 1. What is the probability that a batch has viscosity... above 15? above 15.5? above 16? c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 18/38

8.2 The Normal Distribution Example 3: Body-height. Assume that the body-height (in cm) of a randomly selected male person is a random variable X N(178, σ 2 ). Which of the following could be a possible value of σ 2? 1 / 25 / 50 / 100 / 250 / 1000??? c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 19/38

8.2 The Normal Distribution The sigma rules again. If X N(µ, σ 2 ): P (µ 1 σ X µ + 1 σ) 0.68 P (µ 2 σ X µ + 2 σ) 0.95 P (µ 3 σ X µ + 3 σ) 0.997 P (µ 6 σ X µ + 6 σ) 0.9999999980 The last rule the six-σ rule gave rise to the six-sigma concept in quality management (Motorola, mid-1980s). c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 20/38

8.2 The Normal Distribution The sigma rules again. 6σ 5σ 4σ 3σ 2σ σ 0 σ 2σ 3σ 4σ 5σ 6σ 68.3% 95.4% 99.7% 99.99999980% c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 21/38

8.2 The Normal Distribution Further important properties of the normal distribution. Let X N(µ, σ 2 ). Then, for a, b R: ax + b N(aµ + b, a 2 σ 2 ). Let X 1 N(µ 1, σ 2 1), X 2 N(µ 2, σ 2 2); X 1, X 2 independent. Then: X 1 + X 2 N(µ 1 + µ 2, σ 2 1 + σ 2 2). c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 22/38

8.2 The Normal Distribution Further important properties of the normal distribution. Consequences are: Let X 1,..., X n N(µ, σ 2 ) and independent. Then: X = 1 n n i=1 n i=1 X i N(nµ, nσ 2 ), X i N(µ, σ2 n ) c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 23/38

8.2 The Normal Distribution Densities of X i and X (with n = 9). density of X i density of X µ σ µ σ µ µ + σ µ + σ 9 9 c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 24/38

8.2 The Normal Distribution Example 4: Airline passenger weight. An airline assumes that the weight (in kg) of a passenger, including carry-on baggage weight, is a random variable X N(84, 400). An airplane with 10 seats has a capacity of 1000 kg. Compute the probability that this limit is exceeded, i.e. that 10 passengers weigh more than 1000 kg. The normality assumption is not really needed. (CLT!) c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 25/38

8.2 The Normal Distribution Example 5: Average IQ. Süleyman Hoca is offering an elective course at Bilgi University. Seven students have registered. What is the probability that the average IQ of these students is above 115? Assume again that student IQ is normally distributed with mean 105 and standard deviation 15. Also assume that the IQs of the students in this course are independent. (Is this assumption plausible?) c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 26/38

8.3 The Lognormal Distribution Derivation of the lognormal distribution. If Y N(µ, σ 2 ), then X := e Y is said to have a lognormal distribution with parameters µ and σ 2. Its density is: f(x) = 1 σx (ln x µ) 2 2π e 2σ 2, x > 0, µ R, σ > 0. In symbols: X LN(µ, σ 2 ) c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 27/38

8.3 The Lognormal Distribution This picture shows the density. (Each expectation equals 1.) 0 1 2 3 4 σ = 1 σ = 0.5 σ = 0.2 σ = 0.1 0.0 0.5 1.0 1.5 2.0 2.5 3.0 c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 28/38

8.3 The Lognormal Distribution Moments of the lognormal distribution. Let X LN(µ, σ 2 ). Then, E(X) = e µ+σ2 /2, var(x) = e 2µ+σ2 (e σ2 1). What is the median of X? c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 29/38

8.3 The Lognormal Distribution A financial application. Define stock prices: V 0 : price today (known), V t : price at time t > 0 (unknown) Some celebrated models (e.g., the Black-Scholes model for option pricing) assume that V t = V 0 X, where X LN(µ, σ 2 ) We shall see a justification of this assumption in Chapter 9! Under the lognormality assumption: E(V t V 0 ) = V 0 e µ+σ2 2. (If µ = σ 2 /2, the process (V t ) will be a martingale.) c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 30/38

8.3 The Lognormal Distribution Example: Stock prices. Consider a certain stock, and define stock prices V 0 (today), V 1 (one year ahead of now). Assume: V 1 = X V 0, where X LN(0, 0.16), V 0 = e 100. With this model: What is... the probability that the stock price is at least e 130 the expected stock price after one year? c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 31/38

8.4 The Exponential Distribution Definition, some properties. A continuous random variable X is said to have an exponential distribution with parameter λ > 0 if it has the density f(x) = λe λx, x > 0. In symbols: X EXPO(λ). Expectation and variance are: E(X) = 1 λ, var(x) = 1 λ 2. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 32/38

8.4 The Exponential Distribution This picture shows the density. 0.0 0.5 1.0 1.5 2.0 λ = 0.5 λ = 1 λ = 2 λ = 3 0.0 0.5 1.0 1.5 2.0 2.5 3.0 c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 33/38

8.4 The Exponential Distribution Example: Interarrival times of customers in a copy-shop. Interarrival times...... before 2 p.m.:... after 2 p.m.: (#) (11) 0 00001122334 (6) 0 678889 (6) 1 033344 (2) 1 57 (2) 2 44 (1) 2 7 (1) 3 3 3 4 (1) 4 5 (1) 5 2 5 (31) (#) (24) 0 000011111112222223334444 (12) 0 566667778899 (4) 1 0113 (1) 1 8 (3) 2 002 2 3 (1) 3 5 4 4 5 5 1 0=10 minutes (45) c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 34/38

8.4 The Exponential Distribution Forgetfulness. Forgetfulness or memorylessness of the distribution of a random variable X: P (X t + s X s) = P (X t) for all s, t > 0 The following two statements are equivalent: A: The distribution of the continuous random variable X is forgetful. B: The random variable X is exponentially distributed. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 35/38

8.4 The Exponential Distribution The Poisson process. Suppose events happen according to the following assumptions: The number of events occurring in [s, s + t] has a Poisson distribution with parameter λt. The numbers of events occurring in non-overlapping intervals are independent. Let N t = # events in [0, t]. The process (N t ) t 0 is called Poisson process with intensity λ. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 36/38

8.4 The Exponential Distribution Interarrival times. Consider a Poisson process with intensity λ. Let X = length of the time interval between two successive events. Then, X EXPO(λ). c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 37/38

8.4 The Exponential Distribution Example: Occurrence of power failures. Power failures occur in a certain area of Istanbul according to a Poisson process with intensity λ = 0.7 per week. What is the probability that there is... a power failure during the next 12 hours? no power failure during the next 3 days? Hint: If X EXPO(ν), then the distribution function of X is x P (X x) = x 0 νe νξ dξ = 1 e νx. c Harald Schmidbauer & Angi Rösch, 2008 8. Continuous Probability Distributions 38/38