II. Random Variables

Size: px
Start display at page:

Download "II. Random Variables"

Transcription

1 II. Random Variables Random variables operate in much the same way as the outcomes or events in some arbitrary sample space the distinction is that random variables are simply outcomes that are represented numerically. A ith t d t d ibl itlltt As with events, we denote a random variable as a capital letter, such as, Y, or Z. Realizations of a random variable are denoted using lower case. For instance, x may be used to represent a possible value of the variable.

2 Variable Types Random Variables Discrete/Categorical (e.g., ethnic group, gender) Continuous (e.g., age, height, weight)

3 Variables Map Outcomes to Numbers Experimental Outcomes Possible Numeric Values of a Random Variable O 1 x 1 O x O x 3 3 O 4 x 4

4 Example II.A Consider an experiment where we flip a coin: the sample space is S = {H, T}. We can define a random variable such that = 1 if the flip turns up heads and = 0 if the flip is tails. Example II.B Consider an experiment where we sample an adult and ask for his/her age in years: we might view the sample space as something like S = {18, 19, 0,,10}. Suppose we let Y = the person s age in years. Then in this case the mapping of outcomes in S to numeric values is straightforward.

5 Categorical Random Variables Possible values of a categorical random variable have probabilities biliti determined by a probability mass function, or pmf. Suppose that x 1,x,,x N are the N possible outcomes of a random variable. The pmf for, denoted by f(x) or P( = x), must have the following properties: i. 0 P( xi ) 1, for i 1,, N, and N ii. P( ) 1. i1 x i

6 Example II.C Let be the sum of two fair six-sided dice. Then the pmf for is given by x i P( = x i ) P( <= x i ) 1/36 1/36 3 /36 3/36 4 3/36 6/36 5 4/36 10/36 6 5/36 15/36 7 6/36 1/36 8 5/36 6/36 9 4/36 30/ /36 33/36 11 /36 35/36 1 1/36 1

7 Example II.C (cont d) The third column in the previous table contains the cumulative distribution function (cdf), or P( x i ). Note that the pmf determines the cdf, and vice versa. The pmf and cdf are plotted below: CDF PDF P( = x) P 0.0 ( <= x) x x

8 Continuous Random Variables As opposed to a probability mass function, a continuous random variable has a probability density function, or pdf. Suppose a continuous random variable can take any value between l and u. We denote the pdf of by f(x), for l < x < u. However, note that f(x) P( = x). Why must this be so? How then do we interpret the pdf for a continuous random variable?

9 Interpreting the Probability Density Function It doesn t make sense with a continuous variable to talk about P( = x). However, we can use the pdf f(x), for l < x < u, to compute the probability that we observe a value for that is within an interval, for example P(a b). Note: the probability P(a b) is given by the area under the curve f(x) between the points a and b, where a b and a and b are possible values of. Question: how does one find the area under a continuous curve between two points?

10 Properties of the Probability Density Function Given a pdf f(x), ) we can compute the probability bilit over an interval by P ( a b ) b a f ( x ) dx. This probability has two interpretations: 1. It represents the probability that a randomly selected individual from the underlying gpopulation p will have a value of that falls within the interval (a, b).. It represents the proportion of individuals in the underlying population who have values for that fall within the interval (a, b).

11 Properties of the Probability Density Function Hence, a pdf dff( f(x) ) must thave the following properties: i. f(x) 0, for l x u. u l ii. f ( x) dx 1. In other words, a pdf f(x) must be positive over its domain (l, u), and the area under the curve over that domain must be equal to 1.

12 Example II.D Stat 3000 Statistics for Scientists and Engineers Victims of a particular type of cancer have post-diagnosis survival times that follow the pdf plotted on the following slide. If represents the survival time (in months) of a given cancer patient, then the pdf for is f ( x) 1 15 Verify that this is a legitimate pdf. e x /15 On the plot, diagram the area that represents the probability that a given cancer patient survives no longer than 10 months. Diagram the area that represents the proportion who survive between 1 and 18 months. Diagram the area representing the proportion who survive longer than 18 months., x 0.

13 Example II.D, cont d Probability density function for survival time of cancer patients. 0.0 PDF P.Df(x).F. of of Chip Surviv Lifeti val me Time -- f(x) Survival Time After Diagnosis (months) Lifetime of Chip -- x

14 Example II.D, cont d Compute the probabilities that you diagrammed on the plot of the pdf. Find and plot the cumulative distribution function. Use the cdf to compute the probabilities that you diagrammed on the plot of the pdf. What is the median survival time? What is the 75 th percentile of survival?

15 The Mean or Expectation of a Random Variable Based on Example II.C, suppose that t I propose the following game: you pay me $7.50, then roll the dice. I pay you the sum of the dice in dollars. From a purely economic standpoint, should you play? In order to answer that question you need to know something In order to answer that question, you need to know something about the mean of, or the average sum of the dice.

16 Definition of Expectation By definition, iti the average or expected tdvalue of a categorical random variable with pmf at a value x i given by P(= x i ) for i = 1,,N is computed as E( ) N i1 x P( i x i The expected value of a continuous random variable with pdf given by f(x), for l x u, is computed as ). E( ) u l xf ( x) dx. I ith i th t ti i ht d In either case, we can view the expectation as a weighted average.

17 Example II.E The expected value of the distribution in Example II.C is given by E ( ) (1/ 36) 3( / 36) 1(1/ 36) 7. Playing the game described on the previous slide probably wouldn t be your smartest move that is, what would be your average net gain (or loss)? In general, how do we interpret expectation? How would you describe expectation to the average person? (Hint: recall our discussion of long-term relative frequency.)

18 Example II.F What is the average survival time for the cancer patients in Example II.D? The expectation in this case is computed by x /15 ( ) ( ) /15 /15 x x x E xf x dx e dx xe 15e x /15 x /15 lim xe 15e (0 15) x. Hence, the average life expectancy after diagnosis is 15 months. How do the mean and median compare? Why are they different?

19 The Variance of a Random Variable The mean is used to describe the center of a distribution. The variance is used to describe the spread of a distribution about its mean. As shown in the plot below, two distributions with the same mean can have very different degrees of variability about the mean:

20 Interpreting the Variance The variance is actually all the average squared deviation of the possible values of a random variable about its mean. In some sense, you can think of variance probabilistically: (i) If you sample one observation at random from a distribution with high variance, then there is a relatively high probability that the observation will lie far away from the mean. (ii) If you sample an observation from a distribution with low variance, then there is a relatively small probability that the observation will lie far from the mean.

21 Definition of the Variance By definition, the variance of a categorical random variable y, g with pmf at a value x i given by P(= x i ) for i = 1,,N is computed as N N Th i f ti d i bl ith df i b. )] ( [ ) ( ) ( )] ( [ ) ( 1 1 E x P x x P E x Var N i i i N i i i The variance of a continuous random variable with pdf given by f(x), for l x u, is computed as )} ( { ) ( ) ( )} ( { ) ( E d f d f E V u u. )} ( { ) ( ) ( )} ( { ) ( E dx x f x dx x f E x Var l l Note that the last part of the equations given above result from the fact that Var() = E( ) {E()}. This is typically much easier to compute by hand.

22 Example II.G For the distribution in Example II.C, Var ( ) (1/ 36) 3 ( / 36) 1 (1/ 36) Hence, the standard deviation is.4.

23 Example II.H For the distribution of survival times in Example II.D, x /15 ( ) ( ) x E x f x dx e dx x lim x e x /15 15xe x /15 ()15 e x /15 x /15 x /15 x /15 x e 15xe ()15 e {0 0 ()15 } ()15 ()15 The variance then is Var() = E( ) {E()} = (15) 15 = 15 (months). Hence, the standard deviation is 15 months.. 0

24 Some Additional Properties of the Mean and Variance In practical research settings, we are almost always interested t in the distribution of sums of random variables. In particular, as we shall see within a few weeks, we need to know something about the mean and variance of the sample mean 1 n n i 1 where the values 1,, n represent a sample of observations from some population lti (e.g., we sample 0USU students td t and measure their individual heights). Question: if each subject in the sample has a mean E( i ) = μ, and a variance Var( i ) = σ, then what are the mean and variance of? i,

25 Mean and Variance of a Linear Function Let be a random variable such that E() ( ) = μ and Var() ( ) = σ. Suppose we re interested in a random variable Y defined as Y = a + b, where a and b are constants. In other words, Y is a linear function of. Then E(Y) = E(a + b) = E(a) + be() = a + bμ, and Var(Y) = Var(a + b) = Var(a) + b Var() = 0 + b σ = b σ.

26 Example II.I The average high temperature during the month of September in Logan, UT, is 67.6 F, with a standard deviation of 4. F. That is, if is a variable that represents the high temperature in Logan on a randomly selected September day, then E() = 67.6 and Var() = 4.. What is the average daily temperature in degrees Celsius? What is the standard deviation of daily temperature in degrees Celsius?

27 Mean and Variance of a Linear Combination of Independent Random Variables Let 1 be a random variable such that E( 1 ) = μ 1 and Var( 1 ) = σ 1, and be a random variable such that E( ) = μ and Var( ) = σ, where 1 and are independent. Suppose we re interested in a random variable Y defined as Y = c c, where c 1 and c are constants. In other words, Y is a linear combination of 1 and. Then E(Y) = E(c c ) = c 1 E( 1 ) + c E( ) = c 1 μ 1 + c μ, and Var(Y) = Var(c c ) = c 1 Var( 1 ) + c Var( ) = c 1 σ 1 + c σ. Note that the expectation of Y given above holds regardless of the dependence of 1 and. However, 1 and must be independent for the variance formula to hold.

28 Example II.J Consider the cancer patients whose survival follows the distribution in Example II.D. Suppose that we randomly sample individuals with this cancer, and measure their survival after diagnosis. What is the expected combined survival of these individuals, id in months? In years? What is the standard deviation of their combined survival, in months? In years?

29 The Mean and Variance of a Sample Mean It might sound odd to talk about the mean and variance of a mean, but recall how a sample mean (which is just a simple average) is computed: we randomly sample n subjects 1,, n from a population with an underlying expected value of μ and variance of σ. That is, E( i ) = μ and Var( i ) = σ, for i = 1,,n. The sample mean is defined as 1 n n i 1 In other words, the sample mean is just a linear combination of the independent random variables 1,, n. Hence, the sample mean is also random variable, and so ith has a distribution ib ti of fits own, with a mean and and a variance. i,

30 The Mean and Variance of a Sample Mean Note the distinction between (i) a sample mean and (ii) the expectation and variance of the underlying distribution (we can t emphasize this enough): Sample Mean Random Variable and Expectation and Variance Fixed Parameters (Constants)

31 The Mean and Variance of a Sample Mean Having sampled the n subjects 1 1,,,, n at random from a population with an underlying expected value of μ and variance of σ, we can assume that these subjects are independent. Then using the rules given four slides previous the expectation of the distribution of the sample mean is μ, and variance of is σ /n. Can you demonstrate why?

32 Example II.K Consider again the cancer patients whose survival follows the distribution in Example II.D. Suppose that we randomly select 0 individuals with this cancer, and compute the average survival time for the sample. What is the expected value of? What is the variance of? By what factor do we need to increase our sample size to divide the standard deviation of in half?

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers II. Random Varables Random varables operate n much the same way as the outcomes or events n some arbtrary sample space the dstncton s that random varables are smply outcomes that are represented numercally.

More information

The Binomial Distribution

The Binomial Distribution Patrick Breheny September 13 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 16 Outcomes and summary statistics Random variables Distributions So far, we have discussed the

More information

Central Limit Theorem, Joint Distributions Spring 2018

Central Limit Theorem, Joint Distributions Spring 2018 Central Limit Theorem, Joint Distributions 18.5 Spring 218.5.4.3.2.1-4 -3-2 -1 1 2 3 4 Exam next Wednesday Exam 1 on Wednesday March 7, regular room and time. Designed for 1 hour. You will have the full

More information

Chapter 4 Continuous Random Variables and Probability Distributions

Chapter 4 Continuous Random Variables and Probability Distributions Chapter 4 Continuous Random Variables and Probability Distributions Part 2: More on Continuous Random Variables Section 4.5 Continuous Uniform Distribution Section 4.6 Normal Distribution 1 / 27 Continuous

More information

Chapter 7: Random Variables

Chapter 7: Random Variables Chapter 7: Random Variables 7.1 Discrete and Continuous Random Variables 7.2 Means and Variances of Random Variables 1 Introduction A random variable is a function that associates a unique numerical value

More information

Chapter 4 Continuous Random Variables and Probability Distributions

Chapter 4 Continuous Random Variables and Probability Distributions Chapter 4 Continuous Random Variables and Probability Distributions Part 2: More on Continuous Random Variables Section 4.5 Continuous Uniform Distribution Section 4.6 Normal Distribution 1 / 28 One more

More information

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial Lecture 23 STAT 225 Introduction to Probability Models April 4, 2014 approximation Whitney Huang Purdue University 23.1 Agenda 1 approximation 2 approximation 23.2 Characteristics of the random variable:

More information

Random Variables and Probability Distributions

Random Variables and Probability Distributions Chapter 3 Random Variables and Probability Distributions Chapter Three Random Variables and Probability Distributions 3. Introduction An event is defined as the possible outcome of an experiment. In engineering

More information

4.2 Probability Distributions

4.2 Probability Distributions 4.2 Probability Distributions Definition. A random variable is a variable whose value is a numerical outcome of a random phenomenon. The probability distribution of a random variable tells us what the

More information

Week 1 Quantitative Analysis of Financial Markets Basic Statistics A

Week 1 Quantitative Analysis of Financial Markets Basic Statistics A Week 1 Quantitative Analysis of Financial Markets Basic Statistics A Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions. UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions. Random Variables 2 A random variable X is a numerical (integer, real, complex, vector etc.) summary of the outcome of the random experiment.

More information

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise. Math 224 Q Exam 3A Fall 217 Tues Dec 12 Version A Problem 1. Let X be the continuous random variable defined by the following pdf: { 1 x/2 when x 2, f(x) otherwise. (a) Compute the mean µ E[X]. E[X] x

More information

Business Statistics 41000: Probability 3

Business Statistics 41000: Probability 3 Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404

More information

5.2 Random Variables, Probability Histograms and Probability Distributions

5.2 Random Variables, Probability Histograms and Probability Distributions Chapter 5 5.2 Random Variables, Probability Histograms and Probability Distributions A random variable (r.v.) can be either continuous or discrete. It takes on the possible values of an experiment. It

More information

Discrete Random Variables

Discrete Random Variables Discrete Random Variables ST 370 A random variable is a numerical value associated with the outcome of an experiment. Discrete random variable When we can enumerate the possible values of the variable

More information

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability. An intro for calculus students P= Figure 1: A normal integral Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided

More information

4 Random Variables and Distributions

4 Random Variables and Distributions 4 Random Variables and Distributions Random variables A random variable assigns each outcome in a sample space. e.g. called a realization of that variable to Note: We ll usually denote a random variable

More information

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial. Lecture 21,22, 23 Text: A Course in Probability by Weiss 8.5 STAT 225 Introduction to Probability Models March 31, 2014 Standard Sums of Whitney Huang Purdue University 21,22, 23.1 Agenda 1 2 Standard

More information

Random Variables Handout. Xavier Vilà

Random Variables Handout. Xavier Vilà Random Variables Handout Xavier Vilà Course 2004-2005 1 Discrete Random Variables. 1.1 Introduction 1.1.1 Definition of Random Variable A random variable X is a function that maps each possible outcome

More information

. (i) What is the probability that X is at most 8.75? =.875

. (i) What is the probability that X is at most 8.75? =.875 Worksheet 1 Prep-Work (Distributions) 1)Let X be the random variable whose c.d.f. is given below. F X 0 0.3 ( x) 0.5 0.8 1.0 if if if if if x 5 5 x 10 10 x 15 15 x 0 0 x Compute the mean, X. (Hint: First

More information

Chapter 7 1. Random Variables

Chapter 7 1. Random Variables Chapter 7 1 Random Variables random variable numerical variable whose value depends on the outcome of a chance experiment - discrete if its possible values are isolated points on a number line - continuous

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Week 2: Probability and Distributions) GY Zou gzou@robarts.ca Reporting of continuous data If approximately symmetric, use mean (SD), e.g., Antibody titers ranged from

More information

AP Statistics Chapter 6 - Random Variables

AP Statistics Chapter 6 - Random Variables AP Statistics Chapter 6 - Random 6.1 Discrete and Continuous Random Objective: Recognize and define discrete random variables, and construct a probability distribution table and a probability histogram

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 5 Continuous Random Variables and Probability Distributions Ch. 5-1 Probability Distributions Probability Distributions Ch. 4 Discrete Continuous Ch. 5 Probability

More information

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence continuous rv Let X be a continuous rv. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that for any two numbers a and b with a b, P(a X b) = b a f (x)dx.

More information

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :

More information

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Announcements: There are some office hour changes for Nov 5, 8, 9 on website Week 5 quiz begins after class today and ends at

More information

Introduction to Business Statistics QM 120 Chapter 6

Introduction to Business Statistics QM 120 Chapter 6 DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 6 Spring 2008 Chapter 6: Continuous Probability Distribution 2 When a RV x is discrete, we can

More information

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Math 224 Fall 207 Homework 5 Drew Armstrong Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Section 3., Exercises 3, 0. Section 3.3, Exercises 2, 3, 0,.

More information

Statistics and Probability

Statistics and Probability Statistics and Probability Continuous RVs (Normal); Confidence Intervals Outline Continuous random variables Normal distribution CLT Point estimation Confidence intervals http://www.isrec.isb-sib.ch/~darlene/geneve/

More information

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82 Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections

More information

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x

More information

Basic notions of probability theory: continuous probability distributions. Piero Baraldi

Basic notions of probability theory: continuous probability distributions. Piero Baraldi Basic notions of probability theory: continuous probability distributions Piero Baraldi Probability distributions for reliability, safety and risk analysis: discrete probability distributions continuous

More information

STAT 201 Chapter 6. Distribution

STAT 201 Chapter 6. Distribution STAT 201 Chapter 6 Distribution 1 Random Variable We know variable Random Variable: a numerical measurement of the outcome of a random phenomena Capital letter refer to the random variable Lower case letters

More information

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic

More information

CS 237: Probability in Computing

CS 237: Probability in Computing CS 237: Probability in Computing Wayne Snyder Computer Science Department Boston University Lecture 12: Continuous Distributions Uniform Distribution Normal Distribution (motivation) Discrete vs Continuous

More information

MATH 264 Problem Homework I

MATH 264 Problem Homework I MATH Problem Homework I Due to December 9, 00@:0 PROBLEMS & SOLUTIONS. A student answers a multiple-choice examination question that offers four possible answers. Suppose that the probability that the

More information

STA Module 3B Discrete Random Variables

STA Module 3B Discrete Random Variables STA 2023 Module 3B Discrete Random Variables Learning Objectives Upon completing this module, you should be able to 1. Determine the probability distribution of a discrete random variable. 2. Construct

More information

Business Statistics 41000: Probability 4

Business Statistics 41000: Probability 4 Business Statistics 41000: Probability 4 Drew D. Creal University of Chicago, Booth School of Business February 14 and 15, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office:

More information

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS A random variable is the description of the outcome of an experiment in words. The verbal description of a random variable tells you how to find or calculate

More information

variance risk Alice & Bob are gambling (again). X = Alice s gain per flip: E[X] = Time passes... Alice (yawning) says let s raise the stakes

variance risk Alice & Bob are gambling (again). X = Alice s gain per flip: E[X] = Time passes... Alice (yawning) says let s raise the stakes Alice & Bob are gambling (again). X = Alice s gain per flip: risk E[X] = 0... Time passes... Alice (yawning) says let s raise the stakes E[Y] = 0, as before. Are you (Bob) equally happy to play the new

More information

Probability Theory. Probability and Statistics for Data Science CSE594 - Spring 2016

Probability Theory. Probability and Statistics for Data Science CSE594 - Spring 2016 Probability Theory Probability and Statistics for Data Science CSE594 - Spring 2016 What is Probability? 2 What is Probability? Examples outcome of flipping a coin (seminal example) amount of snowfall

More information

Module 3: Sampling Distributions and the CLT Statistics (OA3102)

Module 3: Sampling Distributions and the CLT Statistics (OA3102) Module 3: Sampling Distributions and the CLT Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chpt 7.1-7.3, 7.5 Revision: 1-12 1 Goals for

More information

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, Abstract

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, Abstract Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, 2013 Abstract Introduct the normal distribution. Introduce basic notions of uncertainty, probability, events,

More information

LECTURE CHAPTER 3 DESCRETE RANDOM VARIABLE

LECTURE CHAPTER 3 DESCRETE RANDOM VARIABLE LECTURE CHAPTER 3 DESCRETE RANDOM VARIABLE MSc Đào Việt Hùng Email: hungdv@tlu.edu.vn Random Variable A random variable is a function that assigns a real number to each outcome in the sample space of a

More information

Chapter 8. Variables. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc.

Chapter 8. Variables. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Random Variables Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 8.1 What is a Random Variable? Random Variable: assigns a number to each outcome of a random circumstance, or,

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Slides for Risk Management

Slides for Risk Management Slides for Risk Management Introduction to the modeling of assets Groll Seminar für Finanzökonometrie Prof. Mittnik, PhD Groll (Seminar für Finanzökonometrie) Slides for Risk Management Prof. Mittnik,

More information

AMS7: WEEK 4. CLASS 3

AMS7: WEEK 4. CLASS 3 AMS7: WEEK 4. CLASS 3 Sampling distributions and estimators. Central Limit Theorem Normal Approximation to the Binomial Distribution Friday April 24th, 2015 Sampling distributions and estimators REMEMBER:

More information

Chapter ! Bell Shaped

Chapter ! Bell Shaped Chapter 6 6-1 Business Statistics: A First Course 5 th Edition Chapter 7 Continuous Probability Distributions Learning Objectives In this chapter, you learn:! To compute probabilities from the normal distribution!

More information

Lecture Data Science

Lecture Data Science Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Statistics Foundations JProf. Dr. Claudia Wagner Learning Goals How to describe sample data? What is mode/median/mean?

More information

The Normal Distribution

The Normal Distribution 5.1 Introduction to Normal Distributions and the Standard Normal Distribution Section Learning objectives: 1. How to interpret graphs of normal probability distributions 2. How to find areas under the

More information

Random Variables and Applications OPRE 6301

Random Variables and Applications OPRE 6301 Random Variables and Applications OPRE 6301 Random Variables... As noted earlier, variability is omnipresent in the business world. To model variability probabilistically, we need the concept of a random

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

Theoretical Foundations

Theoretical Foundations Theoretical Foundations Probabilities Monia Ranalli monia.ranalli@uniroma2.it Ranalli M. Theoretical Foundations - Probabilities 1 / 27 Objectives understand the probability basics quantify random phenomena

More information

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations MLLunsford 1 Activity: Central Limit Theorem Theory and Computations Concepts: The Central Limit Theorem; computations using the Central Limit Theorem. Prerequisites: The student should be familiar with

More information

Chapter 3: Probability Distributions and Statistics

Chapter 3: Probability Distributions and Statistics Chapter 3: Probability Distributions and Statistics Section 3.-3.3 3. Random Variables and Histograms A is a rule that assigns precisely one real number to each outcome of an experiment. We usually denote

More information

Chapter 4 and 5 Note Guide: Probability Distributions

Chapter 4 and 5 Note Guide: Probability Distributions Chapter 4 and 5 Note Guide: Probability Distributions Probability Distributions for a Discrete Random Variable A discrete probability distribution function has two characteristics: Each probability is

More information

A.REPRESENTATION OF DATA

A.REPRESENTATION OF DATA A.REPRESENTATION OF DATA (a) GRAPHS : PART I Q: Why do we need a graph paper? Ans: You need graph paper to draw: (i) Histogram (ii) Cumulative Frequency Curve (iii) Frequency Polygon (iv) Box-and-Whisker

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

Chapter 16. Random Variables. Copyright 2010 Pearson Education, Inc.

Chapter 16. Random Variables. Copyright 2010 Pearson Education, Inc. Chapter 16 Random Variables Copyright 2010 Pearson Education, Inc. Expected Value: Center A random variable assumes a value based on the outcome of a random event. We use a capital letter, like X, to denote

More information

Chapter 7. Random Variables

Chapter 7. Random Variables Chapter 7 Random Variables Making quantifiable meaning out of categorical data Toss three coins. What does the sample space consist of? HHH, HHT, HTH, HTT, TTT, TTH, THT, THH In statistics, we are most

More information

X = x p(x) 1 / 6 1 / 6 1 / 6 1 / 6 1 / 6 1 / 6. x = 1 x = 2 x = 3 x = 4 x = 5 x = 6 values for the random variable X

X = x p(x) 1 / 6 1 / 6 1 / 6 1 / 6 1 / 6 1 / 6. x = 1 x = 2 x = 3 x = 4 x = 5 x = 6 values for the random variable X Calculus II MAT 146 Integration Applications: Probability Calculating probabilities for discrete cases typically involves comparing the number of ways a chosen event can occur to the number of ways all

More information

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x)

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x) Section 6-2 I. Continuous Probability Distributions A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x) to represent a probability density

More information

MAS187/AEF258. University of Newcastle upon Tyne

MAS187/AEF258. University of Newcastle upon Tyne MAS187/AEF258 University of Newcastle upon Tyne 2005-6 Contents 1 Collecting and Presenting Data 5 1.1 Introduction...................................... 5 1.1.1 Examples...................................

More information

6. Continous Distributions

6. Continous Distributions 6. Continous Distributions Chris Piech and Mehran Sahami May 17 So far, all random variables we have seen have been discrete. In all the cases we have seen in CS19 this meant that our RVs could only take

More information

Chapter Learning Objectives. Discrete Random Variables. Chapter 3: Discrete Random Variables and Probability Distributions.

Chapter Learning Objectives. Discrete Random Variables. Chapter 3: Discrete Random Variables and Probability Distributions. Chapter 3: Discrete Random Variables and Probability Distributions 3-1Discrete Random Variables ibl 3-2 Probability Distributions and Probability Mass Functions 3-33 Cumulative Distribution ib ti Functions

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables STA 2023 Module 5 Discrete Random Variables Learning Objectives Upon completing this module, you should be able to: 1. Determine the probability distribution of a discrete random variable. 2. Construct

More information

If the distribution of a random variable x is approximately normal, then

If the distribution of a random variable x is approximately normal, then Confidence Intervals for the Mean (σ unknown) In many real life situations, the standard deviation is unknown. In order to construct a confidence interval for a random variable that is normally distributed

More information

4: Probability. What is probability? Random variables (RVs)

4: Probability. What is probability? Random variables (RVs) 4: Probability b binomial µ expected value [parameter] n number of trials [parameter] N normal p probability of success [parameter] pdf probability density function pmf probability mass function RV random

More information

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment

More information

TOPIC: PROBABILITY DISTRIBUTIONS

TOPIC: PROBABILITY DISTRIBUTIONS TOPIC: PROBABILITY DISTRIBUTIONS There are two types of random variables: A Discrete random variable can take on only specified, distinct values. A Continuous random variable can take on any value within

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic Probability Distributions: Binomial and Poisson Distributions Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College

More information

Probability: Week 4. Kwonsang Lee. University of Pennsylvania February 13, 2015

Probability: Week 4. Kwonsang Lee. University of Pennsylvania February 13, 2015 Probability: Week 4 Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu February 13, 2015 Kwonsang Lee STAT111 February 13, 2015 1 / 21 Probability Sample space S: the set of all possible

More information

Introduction to Statistics I

Introduction to Statistics I Introduction to Statistics I Keio University, Faculty of Economics Continuous random variables Simon Clinet (Keio University) Intro to Stats November 1, 2018 1 / 18 Definition (Continuous random variable)

More information

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8) 3 Discrete Random Variables and Probability Distributions Stat 4570/5570 Based on Devore s book (Ed 8) Random Variables We can associate each single outcome of an experiment with a real number: We refer

More information

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2 Determining Sample Size Slide 1 E = z α / 2 ˆ ˆ p q n (solve for n by algebra) n = ( zα α / 2) 2 p ˆ qˆ E 2 Sample Size for Estimating Proportion p When an estimate of ˆp is known: Slide 2 n = ˆ ˆ ( )

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions

More information

χ 2 distributions and confidence intervals for population variance

χ 2 distributions and confidence intervals for population variance χ 2 distributions and confidence intervals for population variance Let Z be a standard Normal random variable, i.e., Z N(0, 1). Define Y = Z 2. Y is a non-negative random variable. Its distribution is

More information

Module 4: Probability

Module 4: Probability Module 4: Probability 1 / 22 Probability concepts in statistical inference Probability is a way of quantifying uncertainty associated with random events and is the basis for statistical inference. Inference

More information

INF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9

INF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9 INF5830 015 FALL NATURAL LANGUAGE PROCESSING Jan Tore Lønning, Lecture 3, 1.9 Today: More statistics Binomial distribution Continuous random variables/distributions Normal distribution Sampling and sampling

More information

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom Review for Final Exam 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom THANK YOU!!!! JON!! PETER!! RUTHI!! ERIKA!! ALL OF YOU!!!! Probability Counting Sets Inclusion-exclusion principle Rule of product

More information

Chapter 5. Discrete Probability Distributions. Random Variables

Chapter 5. Discrete Probability Distributions. Random Variables Chapter 5 Discrete Probability Distributions Random Variables x is a random variable which is a numerical description of the outcome of an experiment. Discrete: If the possible values change by steps or

More information

Statistics (This summary is for chapters 18, 29 and section H of chapter 19)

Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x n =

More information

Lecture 6: Chapter 6

Lecture 6: Chapter 6 Lecture 6: Chapter 6 C C Moxley UAB Mathematics 3 October 16 6.1 Continuous Probability Distributions Last week, we discussed the binomial probability distribution, which was discrete. 6.1 Continuous Probability

More information

INF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9

INF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9 1 INF5830 2015 FALL NATURAL LANGUAGE PROCESSING Jan Tore Lønning, Lecture 3, 1.9 Today: More statistics 2 Recap Probability distributions Categorical distributions Bernoulli trial Binomial distribution

More information

Review of the Topics for Midterm I

Review of the Topics for Midterm I Review of the Topics for Midterm I STA 100 Lecture 9 I. Introduction The objective of statistics is to make inferences about a population based on information contained in a sample. A population is the

More information

Law of Large Numbers, Central Limit Theorem

Law of Large Numbers, Central Limit Theorem November 14, 2017 November 15 18 Ribet in Providence on AMS business. No SLC office hour tomorrow. Thursday s class conducted by Teddy Zhu. November 21 Class on hypothesis testing and p-values December

More information

Data Science Essentials

Data Science Essentials Data Science Essentials Probability and Random Variables As data scientists, we re often concerned with understanding the qualities and relationships of a set of data points. For example, you may need

More information

INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS. 20 th May Subject CT3 Probability & Mathematical Statistics

INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS. 20 th May Subject CT3 Probability & Mathematical Statistics INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS 20 th May 2013 Subject CT3 Probability & Mathematical Statistics Time allowed: Three Hours (10.00 13.00) Total Marks: 100 INSTRUCTIONS TO THE CANDIDATES 1.

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

Chapter 3 Discrete Random Variables and Probability Distributions

Chapter 3 Discrete Random Variables and Probability Distributions Chapter 3 Discrete Random Variables and Probability Distributions Part 2: Mean and Variance of a Discrete Random Variable Section 3.4 1 / 16 Discrete Random Variable - Expected Value In a random experiment,

More information

Commonly Used Distributions

Commonly Used Distributions Chapter 4: Commonly Used Distributions 1 Introduction Statistical inference involves drawing a sample from a population and analyzing the sample data to learn about the population. We often have some knowledge

More information

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is Normal Distribution Normal Distribution Definition A continuous rv X is said to have a normal distribution with parameter µ and σ (µ and σ 2 ), where < µ < and σ > 0, if the pdf of X is f (x; µ, σ) = 1

More information

MANAGEMENT PRINCIPLES AND STATISTICS (252 BE)

MANAGEMENT PRINCIPLES AND STATISTICS (252 BE) MANAGEMENT PRINCIPLES AND STATISTICS (252 BE) Normal and Binomial Distribution Applied to Construction Management Sampling and Confidence Intervals Sr Tan Liat Choon Email: tanliatchoon@gmail.com Mobile:

More information

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION INSTITUTE AND FACULTY OF ACTUARIES Curriculum 2019 SPECIMEN EXAMINATION Subject CS1A Actuarial Statistics Time allowed: Three hours and fifteen minutes INSTRUCTIONS TO THE CANDIDATE 1. Enter all the candidate

More information

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

Probability is the tool used for anticipating what the distribution of data should look like under a given model. AP Statistics NAME: Exam Review: Strand 3: Anticipating Patterns Date: Block: III. Anticipating Patterns: Exploring random phenomena using probability and simulation (20%-30%) Probability is the tool used

More information

Statistics & Flood Frequency Chapter 3. Dr. Philip B. Bedient

Statistics & Flood Frequency Chapter 3. Dr. Philip B. Bedient Statistics & Flood Frequency Chapter 3 Dr. Philip B. Bedient Predicting FLOODS Flood Frequency Analysis n Statistical Methods to evaluate probability exceeding a particular outcome - P (X >20,000 cfs)

More information

Chapter 8: The Binomial and Geometric Distributions

Chapter 8: The Binomial and Geometric Distributions Chapter 8: The Binomial and Geometric Distributions 8.1 Binomial Distributions 8.2 Geometric Distributions 1 Let me begin with an example My best friends from Kent School had three daughters. What is the

More information