Introduction to Statistics I

Similar documents
The Normal Probability Distribution

Lecture 6: Chapter 6

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

ECON 214 Elements of Statistics for Economists 2016/2017

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x)

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

ECON 214 Elements of Statistics for Economists

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Lecture 9. Probability Distributions. Outline. Outline

Theoretical Foundations

Lecture 9. Probability Distributions

Introduction to Business Statistics QM 120 Chapter 6

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Continuous Probability Distributions & Normal Distribution

Section Introduction to Normal Distributions

CH 5 Normal Probability Distributions Properties of the Normal Distribution

STAT 201 Chapter 6. Distribution

2011 Pearson Education, Inc

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Business Statistics 41000: Probability 4

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Business Statistics 41000: Probability 3

The graph of a normal curve is symmetric with respect to the line x = µ, and has points of

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

Statistical Methods in Practice STAT/MATH 3379

The Normal Distribution

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Lecture 5 - Continuous Distributions

STA Module 3B Discrete Random Variables

Chapter ! Bell Shaped

Prob and Stats, Nov 7

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Sampling Distribution

Chapter 6. The Normal Probability Distributions

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

Expected Value of a Random Variable

Chapter Seven. The Normal Distribution

Chapter 6 Continuous Probability Distributions. Learning objectives

Math 227 Elementary Statistics. Bluman 5 th edition

The normal distribution is a theoretical model derived mathematically and not empirically.

MTH 245: Mathematics for Management, Life, and Social Sciences

Probability. An intro for calculus students P= Figure 1: A normal integral

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr.

Section Random Variables and Histograms

Section Distributions of Random Variables

Chapter 7: Point Estimation and Sampling Distributions

Continuous Distributions

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Statistics for Business and Economics

AMS7: WEEK 4. CLASS 3

4.3 Normal distribution

Normal Probability Distributions

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

Review of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet...

What type of distribution is this? tml

VI. Continuous Probability Distributions

Density curves. (James Madison University) February 4, / 20

Topic 6 - Continuous Distributions I. Discrete RVs. Probability Density. Continuous RVs. Background Reading. Recall the discrete distributions

MAKING SENSE OF DATA Essentials series

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Unit2: Probabilityanddistributions. 3. Normal distribution

Standard Normal, Inverse Normal and Sampling Distributions

Statistics 511 Supplemental Materials

15.063: Communicating with Data Summer Recitation 4 Probability III

11.5: Normal Distributions

Chapter 5: Statistical Inference (in General)

The Binomial Distribution

Data Analysis and Statistical Methods Statistics 651

Midterm Exam III Review

MTH 245: Mathematics for Management, Life, and Social Sciences

Sampling and sampling distribution

Continuous random variables

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

Chapter 6: Random Variables

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Shifting our focus. We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?

CHAPTER TOPICS STATISTIK & PROBABILITAS. Copyright 2017 By. Ir. Arthur Daniel Limantara, MM, MT.

Statistics and Probability

Normal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem

Probability Distributions II

CHAPTER 8 PROBABILITY DISTRIBUTIONS AND STATISTICS

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 4. The Normal Distribution

Discrete Random Variables

Chapter 5: Probability models

Data Analysis and Statistical Methods Statistics 651

Continuous Random Variables and Probability Distributions

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Chapter 8 Homework Solutions Compiled by Joe Kahlig. speed(x) freq 25 x < x < x < x < x < x < 55 5

Chapter 7. Random Variables

IOP 201-Q (Industrial Psychological Research) Tutorial 5

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

Class 11. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Chapter 3. Density Curves. Density Curves. Basic Practice of Statistics - 3rd Edition. Chapter 3 1. The Normal Distributions

What was in the last lecture?

Transcription:

Introduction to Statistics I Keio University, Faculty of Economics Continuous random variables Simon Clinet (Keio University) Intro to Stats November 1, 2018 1 / 18

Definition (Continuous random variable) A random variable is continuous if it can take any value in an interval [a, b]. (We can have a = and/or b = +.) Example The following random variables are continuous: Let X be the random time (in hours) someone spends watching TV everyday. The possible values for X are all the numbers in [0, 24]. Let X be the price of a stock in $ on a random day. The possible values for X are all the numbers in [0, + [. Simon Clinet (Keio University) Intro to Stats November 1, 2018 2 / 18

Probability distribution for a continuous random variable Unlike discrete random variables, continuous ones can take any value in an interval [a, b]. Therefore, it does not make sense to talk about P(X = c) for each possible c [a, b]. Definition (probability distribution for a continuous random variable) A probability distribution for a continuous random variable X lists all the probabilities P(x 1 X x 2 ) for all possible values x 1, x 2 such that a x 1 < x 2 b. We require that: Conditions on the probability distribution 0 P(x 1 X x 2 ) 1. P(a X b) = 1. Simon Clinet (Keio University) Intro to Stats November 1, 2018 3 / 18

Probability distribution for a continuous random variable Remark For a continuous random variable X, the probabilities P(x 1 X x 2 ), P(x 1 < X x 2 ), P(x 1 < X < x 2 ), P(x 1 X < x 2 ) are all equal. In other words, the symbols < and can be used indifferently. Simon Clinet (Keio University) Intro to Stats November 1, 2018 4 / 18

Density function, density curve Definition (Density curve, density function) We associate to a random variable X a density function f, such that the probabilities P(x 1 X x 2) are equal to the area under the curve x f (x) between x = x 1 and x = x 2. Simon Clinet (Keio University) Intro to Stats November 1, 2018 5 / 18

Density function, density curve We thus require that Conditions on the density function f (x) 0. The area under x f (x) between x = a and x = b is 1. Interpretation of the density We can interpret the density function as follows. X is likely to take a value where the curve is high. If we repeat the experiment a large number of times, and collect the measured values X 1, X 2,..., then the histogram of those values will look like the density curve. Simon Clinet (Keio University) Intro to Stats November 1, 2018 6 / 18

Interpretation of the density Example Let X be the random variable corresponding to the queuing time in minutes in a fast food restaurant. Statistical research has proven that X is distributed according to the red curve below. We repeat the experiment 1, 000 times and get the following histogram: Simon Clinet (Keio University) Intro to Stats November 1, 2018 7 / 18

Expected value and Variance Definition (Expected value and variance) If X is a continuous random variable, we associate to it an expected (or mean) value E[X ] and a variance Var[X ] which play the same role as for discrete random variables. They are also often denoted by µ and σ 2. Moreover, we also call σ the standard deviation of X. Remark We do not give definitions of the mean and the variance because they require more advanced mathematics. However they can be interpreted exactly in the same way as we did for discrete random variables. The mean E[X ] is the center of symmetry of the density curve. The more dispersed the density curve, the higher the variance Var[X ]. Simon Clinet (Keio University) Intro to Stats November 1, 2018 8 / 18

Example We consider three random variables with three distinct density curves (blue, green, red). They all are such that µ = 0. Blue : Var[X ] = 1, Green : Var[X ] = 3, Red : Var[X ] = 10. Simon Clinet (Keio University) Intro to Stats November 1, 2018 9 / 18

Fundamental example: The standard normal distribution Definition (Standard normal distribution) A continuous random variable X which is described by the following symmetric bell-shaped (see below) density curve is a standard normal random variable. We also say that it follows a standard normal distribution, and we write X N (0, 1). In particular, E[X ] = 0, and Var[X ] = 1, and X can take any value between and +. Simon Clinet (Keio University) Intro to Stats November 1, 2018 10 / 18

Standard normal variable Remarks The standard normal distribution is the most important one in statistics, because it approximates very well the distributions of many variables in practice. For example, the heights or weights of people, the total annual sales of a firm, exam scores of a given population are often approximately normally distributed. There are other symmetric bell-shaped curves which are not the normal distribution. In fact, the standard normal distribution can be defined more precisely by specifying the density with an equation like f (x) =..., but, again, this would require more advanced mathematics. Simon Clinet (Keio University) Intro to Stats November 1, 2018 11 / 18

general Normal distribution Definition (Normal distribution) For a number µ and a positive number σ 2, we say that a random variable X is normally distributed with parameters µ and σ 2, if (X µ)/σ N (0, 1). We write X N (µ, σ 2 ), and we have E[X ] = µ, Var[X ] = σ 2. Simon Clinet (Keio University) Intro to Stats November 1, 2018 12 / 18

Distribution table In practice, for a continuous random variable X, we use a distribution table to calculate the probabilities P(x 1 X x 2 ). Example: distribution table for the standard normal distribution A distribution table for N (0, 1) reports all the probabilities of the form P(X x 1 ) for x 1 0. By symmetry of the normal curve, this is sufficient to deduce any probability related to X (see the exercise on the next slide). Simon Clinet (Keio University) Intro to Stats November 1, 2018 13 / 18

Exercise Exercise Calculate for X N (0, 1) : P(X 1.02) P(X 2.36) = 1 P(X 2.36) P(0 X 0.9) = P(X 0.9) P(X < 0). P( 1.36 X ) = P(X 1.36) (by symmetry of the curve). P( 1.36 X 0) P( 1.36 X 0.03) Answers : 0.8461, 0.0091, 0.3159, 0.9131, 0.4131, 0.4251. Simon Clinet (Keio University) Intro to Stats November 1, 2018 14 / 18

Normal distribution table - Example When X N (µ, σ 2 ), we use the fact that the transformed variable Z = X µ σ N (0, 1). Example Assume that X N (1, 4). Let s calculate P(X 2). With Z defined as above, this is the same probability as P(Z (2 1)/ 4) = P(Z 0.5). From the distribution table, we get that it is approximately equal to 0.69. Exercise We know that a certain stock s dividend yield has a mean of µ = 6% and a standard deviation of σ = 2%. Assume the dividends follow a normal distribution. Compute the probability that the dividend yield will be: Less than 2 % Greater than 10 % Between 4 % and 8 % Answers : 0.0228, 0.0228, 0.6826. Simon Clinet (Keio University) Intro to Stats November 1, 2018 15 / 18

Quantile of the standard normal distribution Definition (Quantile of the standard normal distribution) We call quantile of the standard normal distribution of level a the number z a such that P(Z z a ) = a, where Z N (0, 1). Simon Clinet (Keio University) Intro to Stats November 1, 2018 16 / 18

Calculating quantiles in practice In practice, we also use the distribution table to calculate quantiles. Example Let us calculate z 0.75. In the distribution table, we look for the value such that P(Z value) 0.75. We find that P(Z 0.68) = 0.7517 which is the closest probability to 0.75 and so z 0.75 0.68. Exercise If we pick a student at random in a class, we assume that her grade X (a score between 0 and 100) approximately follows a N (µ, σ 2 ) with µ = 73 and σ 2 = 225. 1 What is the distribution of the variable Z = X 73 15? 2 Find the score x 10% such that P(X x 10 ) = 10%. What is the proportion of students who got less than this score? Simon Clinet (Keio University) Intro to Stats November 1, 2018 17 / 18

Conclusion summary A continuous random variable can take any value in a given interval. We associate to the variable a density function such that the probability to get a number between to values x 1 and x 2 is the area under the curve of the density function between those two points. We also associate an expected value, a variance and a standard deviation which play the same role as for discrete random variables. A fundamental continuous random variable is the standard normal variable, whose density is bell-shaped and symmetric. We can use a distribution table to calculate probabilities and quantiles. Simon Clinet (Keio University) Intro to Stats November 1, 2018 18 / 18