M3S1 - Binomial Distribution

Similar documents
Random variables. Discrete random variables. Continuous random variables.

5. In fact, any function of a random variable is also a random variable

Chapter 6: Random Variables and Probability Distributions

Lecture 9: Plinko Probabilities, Part III Random Variables, Expected Values and Variances

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Random Variables Handout. Xavier Vilà

MA : Introductory Probability

15.063: Communicating with Data Summer Recitation 3 Probability II

Binomial Random Variables. Binomial Random Variables

Simple Random Sample

Chapter 16. Random Variables. Copyright 2010, 2007, 2004 Pearson Education, Inc.

variance risk Alice & Bob are gambling (again). X = Alice s gain per flip: E[X] = Time passes... Alice (yawning) says let s raise the stakes

STOR Lecture 7. Random Variables - I

Chapter 3 Discrete Random Variables and Probability Distributions

Binomial Random Variables

Statistics 6 th Edition

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

Probability Distributions for Discrete RV

Chapter 3 - Lecture 3 Expected Values of Discrete Random Va

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

30 Wyner Statistics Fall 2013

4 Random Variables and Distributions

Mathematics of Randomness

Discrete Random Variables

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

VIDEO 1. A random variable is a quantity whose value depends on chance, for example, the outcome when a die is rolled.

Chapter 16. Random Variables. Copyright 2010 Pearson Education, Inc.

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

4.2 Bernoulli Trials and Binomial Distributions

Bernoulli and Binomial Distributions

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

Central Limit Theorem 11/08/2005

Section Distributions of Random Variables

5.4 Normal Approximation of the Binomial Distribution

Chapter 3: Probability Distributions and Statistics

Chapter 3 Discrete Random Variables and Probability Distributions

5.2 Random Variables, Probability Histograms and Probability Distributions

AMS7: WEEK 4. CLASS 3

The normal distribution is a theoretical model derived mathematically and not empirically.

Part 1 In which we meet the law of averages. The Law of Averages. The Expected Value & The Standard Error. Where Are We Going?

MATH 264 Problem Homework I

Binomial and Normal Distributions

Statistical Methods in Practice STAT/MATH 3379

Chapter 6: Random Variables. Ch. 6-3: Binomial and Geometric Random Variables

Lecture Data Science

Central Limit Theorem (cont d) 7/28/2006

E509A: Principle of Biostatistics. GY Zou

The Binomial Probability Distribution

Chapter 7. Sampling Distributions and the Central Limit Theorem

6.3: The Binomial Model

Discrete Random Variables

Lecture 7 Random Variables

Chapter 7. Sampling Distributions and the Central Limit Theorem

MAS187/AEF258. University of Newcastle upon Tyne

Chapter 5. Sampling Distributions

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

STA 6166 Fall 2007 Web-based Course. Notes 10: Probability Models

Expectations. Definition Let X be a discrete rv with set of possible values D and pmf p(x). The expected value or mean value of X, denoted by E(X ) or

SECTION 4.4: Expected Value

STA Module 3B Discrete Random Variables

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41

ECON 214 Elements of Statistics for Economists 2016/2017

Chapter 3 Discrete Random Variables and Probability Distributions

Binomal and Geometric Distributions

MAKING SENSE OF DATA Essentials series

Chapter 5 Discrete Probability Distributions. Random Variables Discrete Probability Distributions Expected Value and Variance

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

Chapter 7: Point Estimation and Sampling Distributions

Review of the Topics for Midterm I

4.2 Probability Distributions

Statistics Class 15 3/21/2012

CS145: Probability & Computing

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

Chapter 6: Discrete Probability Distributions

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability Distributions

The Binomial Distribution

Binomial and Geometric Distributions

Continuous random variables

Section Random Variables and Histograms

Probability and Statistics

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

Section Distributions of Random Variables

Econ 250 Fall Due at November 16. Assignment 2: Binomial Distribution, Continuous Random Variables and Sampling

Statistics for Managers Using Microsoft Excel 7 th Edition

The Binomial distribution

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Normal Approximation to Binomial Distributions

ECON 214 Elements of Statistics for Economists 2016/2017

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Metropolis-Hastings algorithm

STAT Mathematical Statistics

STAT Lab#5 Binomial Distribution & Midterm Review

Statistic Midterm. Spring This is a closed-book, closed-notes exam. You may use any calculator.

Chapter 5: Statistical Inference (in General)

Statistics for Business and Economics

Business Statistics 41000: Probability 4

1 PMF and CDF Random Variable PMF and CDF... 4

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Transcription:

M3S1 - Binomial Distribution Professor Jarad Niemi STAT 226 - Iowa State University September 28, 2018 Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 1 / 28

Outline Random variables Probability distribution function Expectation (mean) Variance Discrete random variables Bernoulli Binomial Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 2 / 28

Probability Probability Definition A probability is a mathematical function, P (E), that describes how likely an event E is to occur. This function adheres to two basic rules: 1. 0 P (E) 1 2. For mutually exclusive events E 1,..., E K, P (E 1 or E 2 or or E K ) = P (E 1 ) + P (E 2 ) + + P (E K ). Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 3 / 28

Probability Flipping a coin Suppose we are flipping an unbiased coin that has two sides: heads (H) and tails (T ). Then which adheres to rule 1) and P (H) = 0.5 P (T ) = 0.5. P (H or T ) = P (H) + P (T ) = 0.5 + 0.5 = 1 which adheres to rule 2). So this is a valid probability. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 4 / 28

Probability Rolling a 6-sided die Suppose we are rolling an unbiased 6-sided die. If we count the number of pips on the upturned face, then the possible events are 1, 2, 3, 4, 5, and 6. Then P (1) = P (2) = P (3) = P (4) = P (5) = P (6) = 1/6 which adheres to 1). What is P (1 or 2 or 3 or 4 or 5 or 6) = 1. To verify 2), we would need to calculate the probability of the 2 6 possible colections of mutually exclusive events and find that their probability is the sum of the individual probabilities. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 5 / 28

Probability Random variable Random variable Definition A random variable is the uncertain, numeric outcome of a random process. A discrete random variable takes on one of a list of possible values. A continuous random variable takes on any value in an interval. A random variable is denoted by a capital letter, e.g. X or Y. Discrete random variables: result of a coin flip the number of pips on the upturned face of a 6-sided die roll whether or not a company beats its earnings forecast the number of HR incidents next month Continuous random variables: my height how far away a 6-sided die lands a company s next quarterly earnings a company s closing stock price tomorrow Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 6 / 28

Probability Probability distribution function Probability distribution function Definition A probability distribution function describes all possible outcomes for a random variable and the probability of those outcomes. For example, Coin flipping: Unbiased 6-sided die rolling P (H) = P (T ) = 1. P (1) = P (2) = P (3) = P (4) = P (5) = P (6) = 1/6. Company earnings compared to forecasts P (Earnings within 5% of forecast) = 0.6 P (Earnings less than 5% of forecast) = 0.1 P (Earnings greater than 5% of forecast) = 0.3 Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 7 / 28

Probability Events Events Definition An event is a set of possible outcomes of a random variable. Discrete random variables: a coin flipping heads is heads the number of pips on the upturned face of a 6-sided die roll is less than 3 a company beats its earnings forecast the number of HR incidents next month is less between 5 and 10 Continuous random variables: my height is greater than 6 feet how far away a 6-sided die lands is less than 3 feet a company s next quarterly earnings is within 5% of forecast a company s closing stock price tomorrow is less than today s Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 8 / 28

Probability Events Die rolling Suppose we roll an unbiased 6-sided die. Determine the probabilities of the following events. The number of pips is exactly 3 less than 3 is greater than or equal to 3 is odd is even and less than 5 Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 9 / 28

Bernoulli Bernoulli random variable Definition A Bernoulli random variable has two possible outcomes: 1 (success) 0 (failure) A Bernoulli random variable is completey characterized by a single probability p, the probability of success (1). We write X Ber(p) to indicate that X is a random variable that has a Bernoulli distribution with probability of success p. If X Ber(p), then we know P (X = 1) = p and P (X = 0) = 1 p. Examples: a coin flip landing heads a 6-sided die landing on 1 a 6-sided die landing on 1 or 2 a company beating its earnings forecast a company s stock price closing higher tomorrow Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 10 / 28

Bernoulli Coin flipping Suppose we are flipping an unbiased coin and we let { 0 if coin flip lands on tails X = 1 if coin flip lands on heads Then X Ber(0.5) which means p = 0.5 is the probability of success (heads) and P (X = 1) = 0.5 and P (X = 0) = 0.5. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 11 / 28

Bernoulli Die rolling Suppose we are rolling an unbiased 6-sided die and we let { 0 if die lands on 3, 4, 5, or 6 X = 1 if die lands on 1 or 2 Then X Ber(1/3) which means p = 1/3 is the probability of success (a 1 or 2) and P (X = 1) = 1/3 and P (X = 0) = 2/3. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 12 / 28

Bernoulli Mean of a random variable Mean of a random variable Definition The mean of a random variable is a probability weighted average of the outcomes of that random variable. This mean is also called the expectation of the random variable and for a random variable X is denoted E[X] (or E(X)). For a Bernoulli random variable X Ber(p), we have E[X] = (1 p) 0 + p 1 = p. The mean of a random variable is analogous to the physics concept of center of mass. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 13 / 28

Bernoulli Mean of a random variable Expectation is the center of mass Ber(0.9) P(X=x) mean Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 14 / 28 x

Bernoulli Mean of a random variable Variance of a random variable Definition The variance of a random variable is the probability-weighted average of the squared difference from the mean. The variance of a random variable X is denoted V ar[x] (or V ar(x)) and V ar[x] = E[(X µ) 2 ] where µ = E[X] is the mean. The standard deviation of a random variable is the square root of the variance of the random variable, i.e. SE[X] = V ar[x]. For a Bernoulli random variable X Ber(p), we have V ar[x] = (1 p) (0 p) 2 + p (1 p) 2 = (1 p) p 2 + p (1 2p + p 2 ) = p 2 p 3 + p 2p 2 + p 3 ) = p p 2 = p(1 p). Variance is analogous to the physics concept of moment of inertia. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 15 / 28

Bernoulli Mean of a random variable Coin flipping If X Ber(0.5), then E[X] = 1/2 V ar[x] = 1/2 (1 1/2) = 1/2 1/2 = 1/4. If X Ber(1/3), then E[X] = 1/3 V ar[x] = 1/3 (1 1/3) = 1/3 2/3 = 2/9. If X Ber(2/9), then E[X] = 2/9 V ar[x] = 2/9 (1 2/9) = 2/9 7/9 = 14/81. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 16 / 28

Bernoulli Mean of a random variable Die rolling Let X be the number of pips on the upturned face of an unbiased 6-sided die. Find the probability distribution function, the expected value (mean), and the variance. Then the probability distribution function is P (X = 1) = P (X = 2) = P (X = 3) = P (X = 4) = P (X = 5) = P (X = 6) = 1/6. The expected value, E[X], is E[X] = 1/6 1 + 1/6 2 + 1/6 3 + 1/6 4 + 1/6 5 + 1/6 6 = 3.5. The variance, V ar[x], is V ar[x] = 1/6 (1 3.5) 2 + 1/6 (2 3.5) 2 + 1/6 (3 3.5) 2 +1/6 (4 3.5) 2 + 1/6 (5 3.5) 2 + 1/6 (6 3.5) 2 = 2.916. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 17 / 28

Bernoulli Mean of a random variable Expectation is the center of mass Probabilities for 6 sided die roll P(X=x) mean Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 18 / 28 x

Bernoulli Independence Independence Definition Two random variables are independent if the outcome of one random variable does not affect the probabilities of the outcomes of the other random variable. For independent random variables X and Y and constants a, b, and c, we have the following properties E[aX + by + c] = ae[x] + be[y ] + c and V ar[ax + by + c] = a 2 V ar[x] + b 2 V ar[y ]. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 19 / 28

Binomial Sum of independent Bernoulli random variables Let X i, for i = 1,..., n be independent Bernoulli random variable with a common probability of success p. We write X i ind Ber(p). Then the sum is a binomial random variable. Y = n i=1 X i Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 20 / 28

Binomial Binomial Definition A binomial random variable with n attempts and probability of success p has a probability distribution function ( ) n P (Y = y) = p y (1 p) n y y for 0 p 1 and y = 0, 1,..., n where ( ) n n! = y (n y)!y!. We write Y Bin(n, p). Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 21 / 28

Binomial Bin(10,0.3) P(Y=y) 0.00 0.05 0.10 0.15 0.20 0.25 0 2 4 6 8 10 y Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 22 / 28

Binomial Expected values Binomial expected value and variance The expected value (mean) is E[Y ] = E[X 1 + X 2 + + X n ] = E[X 1 ] + E[X 2 ] + + E[X n ] = p + p + + p = np. The variance is V ar[y ] = V ar[x 1 + X 2 + + X n ] = V ar[x 1 ] + V ar[x 2 ] + + V ar[x n ] = p(1 p) + p(1 p) + + p(1 p) = np(1 p). Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 23 / 28

Binomial Expected values Examples If Y Bin(10,.3), then E[Y ] = 10 0.3 = 3 and V ar[y ] = 10 0.3 (1 0.3) = 10 0.3 0.7 = 2.1. If Y Bin(65, 1/4), then E[Y ] = 65 1/4 = 16.25 and V ar[y ] = 65 1/4 (1 1/4) = 65 1/4 3/4 = 12.1875. Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 24 / 28

Binomial AVP Example AVP Example In the 2018 AVP Gold Series Championships in Chicago, IL, Alex Klineman and April Ross beat Sara Hughes and Summer Ross in 2 sets with scores 25-23, 21-16. Suppose that these scores actually determine the probability that Klineman/Ross will score a point against Hughes/Ross, i.e. p = (25 + 21)/(25 + 23 + 21 + 16) = 0.54 and that each point is independent. Let Y be the number of points Klineman/Ross will win (against Hughes/Ross) over the next 20 points. Based on our assumptions Y Bin(20, 0.54). Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 25 / 28

Binomial AVP Example AVP Example (cont.) Bin(20,0.54) P(Y=y) 0.00 0.05 0.10 0.15 0 5 10 15 20 Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 26 / 28 y

Binomial AVP Example AVP Example (cont.) Here are some questions we can answer: How many points do we expect Klineman/Ross to score? E[Y ] = 20.54 = 10.8 points What is the variance around this number? V ar[y ] = 20.54 (1.54) = 4.966 points 2 What is the standard deviation around this number? SD[Y ] = V ar[y ] = 4.966 = 2.23 points What is the probability that Klineman/Ross will win at least 10 points? P (Y >= 10) = P (Y = 10) + P (Y = 11) + + P (Y = 20) = 0.72 Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 27 / 28

Binomial AVP Example AVP Example (cont.) Bin(20,0.54) P(Y=y) 0.00 0.05 0.10 0.15 0 5 10 15 20 Professor Jarad Niemi (STAT226@ISU) M3S1 - Binomial Distribution September 28, 2018 28 / 28 y