Stat511 Additional Materials

Similar documents
The Binomial Probability Distribution

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES

Binomial Random Variables. Binomial Random Variables

Statistics 511 Additional Materials

CHAPTER 6 Random Variables

Math 14 Lecture Notes Ch The Normal Approximation to the Binomial Distribution. P (X ) = nc X p X q n X =

MATH 264 Problem Homework I

MA 1125 Lecture 12 - Mean and Standard Deviation for the Binomial Distribution. Objectives: Mean and standard deviation for the binomial distribution.

Statistics 6 th Edition

chapter 13: Binomial Distribution Exercises (binomial)13.6, 13.12, 13.22, 13.43

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Discrete Probability Distribution

Elementary Statistics Lecture 5

Discrete Random Variables and Probability Distributions

Chapter 9: Sampling Distributions

Probability Distributions for Discrete RV

Discrete Random Variables

Probability & Statistics Chapter 5: Binomial Distribution

Chapter 6: Random Variables

AP Statistics Ch 8 The Binomial and Geometric Distributions

Lecture 9: Plinko Probabilities, Part III Random Variables, Expected Values and Variances

MA : Introductory Probability

Chapter 6: Random Variables. Ch. 6-3: Binomial and Geometric Random Variables

15.063: Communicating with Data Summer Recitation 3 Probability II

5.2 Random Variables, Probability Histograms and Probability Distributions

Probability Theory. Mohamed I. Riffi. Islamic University of Gaza

Part 10: The Binomial Distribution

Section 6.3 Binomial and Geometric Random Variables

MA 1125 Lecture 18 - Normal Approximations to Binomial Distributions. Objectives: Compute probabilities for a binomial as a normal distribution.

Chapter 8: Binomial and Geometric Distributions

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

Chapter 3 Discrete Random Variables and Probability Distributions

Stat 20: Intro to Probability and Statistics

Section 5 3 The Mean and Standard Deviation of a Binomial Distribution!

30 Wyner Statistics Fall 2013

A random variable is a quantitative variable that represents a certain

x is a random variable which is a numerical description of the outcome of an experiment.

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Binomial and multinomial distribution

Binomial Probability

Law of Large Numbers, Central Limit Theorem

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

The Binomial Distribution

The Binomial Distribution

Module 4: Probability

VIDEO 1. A random variable is a quantity whose value depends on chance, for example, the outcome when a die is rolled.

Chapter 8 Additional Probability Topics

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr.

Chapter 11. Data Descriptions and Probability Distributions. Section 4 Bernoulli Trials and Binomial Distribution

Statistical Methods in Practice STAT/MATH 3379

We use probability distributions to represent the distribution of a discrete random variable.

guessing Bluman, Chapter 5 2

Section Random Variables

CHAPTER 6 Random Variables

Stat 211 Week Five. The Binomial Distribution

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

STOR 155 Introductory Statistics (Chap 5) Lecture 14: Sampling Distributions for Counts and Proportions

4.1 Probability Distributions

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

II - Probability. Counting Techniques. three rules of counting. 1multiplication rules. 2permutations. 3combinations

S = 1,2,3, 4,5,6 occurs

4 Random Variables and Distributions

23.1 Probability Distributions

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

MAKING SENSE OF DATA Essentials series

What is the probability of success? Failure? How could we do this simulation using a random number table?

Random Variables. Chapter 6: Random Variables 2/2/2014. Discrete and Continuous Random Variables. Transforming and Combining Random Variables

Chapter 4 and 5 Note Guide: Probability Distributions

5.4 Normal Approximation of the Binomial Distribution

The Binomial distribution

Consider the following examples: ex: let X = tossing a coin three times and counting the number of heads

Chapter 5. Sampling Distributions

Chapter 5. Discrete Probability Distributions. Random Variables

6.3: The Binomial Model

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics

The Binomial and Geometric Distributions. Chapter 8

SECTION 4.4: Expected Value

MATH 112 Section 7.3: Understanding Chance

X P(X) (c) Express the event performing at least two tests in terms of X and find its probability.

The Normal Probability Distribution

The Binomial Distribution

Objective: To understand similarities and differences between geometric and binomial scenarios and to solve problems related to these scenarios.

Midterm Exam III Review

Random Variables and Probability Functions

Class 13. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Theoretical Foundations

Class 12. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Math 14 Lecture Notes Ch. 4.3

The normal distribution is a theoretical model derived mathematically and not empirically.

4.2 Bernoulli Trials and Binomial Distributions

Statistics for Managers Using Microsoft Excel 7 th Edition

Section Distributions of Random Variables

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes

Transcription:

Binomial Random Variable Stat511 Additional Materials The first discrete RV that we will discuss is the binomial random variable. The binomial random variable is a result of observing the outcomes from more than one trial. It arises from a binomial experiment. A binomial experiment is defined as having the following properties. Each trial has only two possible outcomes that we refer to as successes and failures. These trials are independent. The random variable is then the number of successes. It is similar to the idea of flipping a coin and recording the number of heads that you see. The binomial RV is like this but there are numerous processes that we can model using a binomial RV. Definition: A binomial experiment has the following characteristics: 1. There must be a fixed number of trials, n. 2. Each trial can result in one and only one of two possible outcomes, labeled success and failure. 3. The probability of success in a single trial, p, is the same for each trial. 4. The trials are independent, so that the probability of success is unaffected by the result of a previous trial. The binomial random variable X is the number of successes observed from the n trials. It should be noted that the terms success and failure are arbitrary labels and do not connote good or bad outcomes. We describe the probability distribution for a binomial RV with the following formula If we know p and n, P(X=x) = n x px (1 p) n x for x = 0, 1, 2,, n. The theoretical mean of a binomial RV X is µ x = np. The theoretical mean is the number of trials multiplied by the probability of success on each trial. The theoretical standard deviation of a binomial RV X is σ x = np(1 p) The classic example of a binomial distribution is flipping fair coins. Suppose that we toss a penny 10 times and we assume that the penny is fair, meaning that the probability of heads is the same as the probability of tails, one-half or 50%. Let random variable X be the number of heads from 10 tosses of a fair penny. Find the probability of getting 4 heads.

The first question we must ask is the Are 4 conditions for this to be a binomial random variable met? 1. Are there a fixed number of trials? Yes, n =10. 2. Does each trial result in one of two possible outcomes? Yes since each trial results in either a head or a tail. Define a success as a fair penny coming up heads. 3. Is the probability of success the same for each trial? Since the penny is the same one tossed 10 times, it is very likely that this holds. Note that that probability is 0.5, so p=0.5. 4. Are the trials independent? This is often the hardest assumption to verify. However it is hard to imagine in this case that getting a head on one trial would affect the outcome of the next. So yes, they would seem to be independent. We first note that n = 10 and p = 0.5, since the total number of tosses is 10 and the probability of success (getting a head) on a single trial is 0.5. Thus n P(X=4) = x px (1 p) n x = 10 4 0.54 (0.5) 6 =210*0.0625*(0.0156)=0.205 Thus the probability of getting 4 heads from flipping a penny 10 times is 20.5%. Another way to think of this is that if we repeated this experiment a large number of times, flipping 10 pennies each repetition of the experiment, on 20.5% of those experiments we would get exactly 4 heads out of 10 pennies. A psychologist is doing a study of spatial relationships. Suppose that 15 people will be given a sequence of mazes to complete in a limited amount of time. Each person will be given the mazes separately. Past experience suggests that 20% of all people finish all the mazes correctly in the limited amount of time they are allotted. Find the probability that 5 of the 15 people will complete the mazes correctly in the limited amount of time they are given. Again, we must ask whether or not the four conditions for a binomial RV are met. 1. Are there a fixed number of trials? Yes, since there are 15 people who will attempt to complete the task. 2. Does each trial result in one of two possible outcomes? Yes since each trial results in either correctly finishing the mazes in the time allotted or not. Define success as an individual correctly finishing the mazes in the time allotted. 3. Is the probability of success the same for each trial? This may not be the case since each individual has different abilities. 4. Are the trials independent? Yes, since each individual does his/her mazes separately. There is some reason for concern that the third condition is violated. However, prior to conducting the experiment we can assume that the probability, p, of success is the same for each individual. With that assumption made, then let X be the number of people who successfully complete all the mazes on time. Then the probability that 5 of the 15 people will complete the mazes correctly is

P(X=5) = n x px (1 p) n x = 15 5 0.25 (0.8) 10 = 3003*(0.00032)*(0.10737)=0.1032. Hence the probability of having 5 people out of twenty complete the mazes in the allotted time is about 10%. Calculating Cumulative probabilities We are often interested in calculating probabilities other than those of the simple events of the two preceding sections. There we focused on calculating the probability that our binomial RV has a single specific value. We might be interested in the probability of getting less than 7 successes in a binomial experiment section. There are two possibilities. First for something like P(X<3) we could add up the simple events that make up this probability. P(X<3) = P(X=0)+P(X=1)+P(X=2). Each one of these would have to be calculated using the formula that was appropriate. It gets worse for something like P(X 15) = P(X=0) + P(X=1) + P(X=2) + + P(X=15). Our second option is to have a set of tables that will allow us to simplify these calculations. Unfortunately our textbook does not have cumulative binomial probability tables. Your instructor will provide these tables to the students from another source. The drawback to this is that we need to learn how to use the tables. The first piece of vital information is that the tables for binomial random variables are cumulative tables. That is, instead of giving P(X=4) or P(Y<3), they give P(X 3) and P(Y 7). As a consequence of this we need to use formulas to determine the other probabilities that we are interested in. Thus the table below. In this table the letter r represents the value that we need for the probability we want. For the examples in the last column, what is on the right hand side of the equal sign Probability we want Calculation we need to perform Example P(Y r) P(Y r) P(Y 4) P(Y < r) P(Y (r-1)) P(Y<4) = P(Y 3) P(Y r) 1-P(Y (r-1)) P(Y 4) = 1-P(Y 3) P(Y > r) 1-P(Y r) P(Y>4) = 1-P(Y 4) *P(Y=r) P(Y r)-p(y (r-1)) P(Y=4) = P(Y 4)-P(Y 3)

* The fifth probability above could also be calculated using the formula for the Binomial random variable. The above rules are the result of two basic ideas. The first is one that we saw in the previous chapter. This it the probability of a complement. Hence P(X 4) = 1-P(X>4). The second is an observation that P(X < 4 ) = P(X 3 ), since both events contain the same outcomes. The cumulative Binomial probability tables give us probabilities that a binomial RV X is less than or equal to a value r. Consequently, we must use a formula for the probability that we want that includes probabilities with less than or equal to values in them. Having mastered these rules, let s consider the tables. We want to find P(X r) for some binomial random variable X. In order to use these tables we need to know the values of n and p. Recall that we needed to know the values of those two quantities before we could do any calculations of binomial probabilities. The first step in the process is to find the value of p across the top row of tables. Having found p, we then go down the first column on the left until we find the correct n. That will place us in a sub-table of the entire table. Sub-tables are separated by a blank row. Having found the right sub-table, we then find the appropriate value of r that we need and use that row. We then go across that row until we hit the column for our value of p. Suppose that n = 10 and p = 0.15. And we want P(X 2) The third value at the top of the page p = 0.15 is the column we need. Go down the first column on the left until you reach n = 10. The r that we want is 2, so find the row that corresponds to r =2 in the sub-table. Then go across that row to the second column (it s the one underneath p = 0.15). The value you find there is P(X 2) = 0.8202. Suppose that n = 5 and p = 0.30. We want P(X 4) The sixth column at the top of the page corresponds to p = 0.30. Go down the first column on the left until you come to n = 5. We will use this sub-table to get the probability we need. Find the row corresponding to r = 4. Go across that row until you reach the column for p = 0.30. The value at that row and column is 0.9976. Consequently P(X 4) = 0.9976. Find P(X 13) with n = 20 and p = 0.80.

P(X 13) = 0.0867 Find P(X 6) with n = 20 and p = 0.25. P(X 6) = 0.7858 Find P(X>3) with n = 10 and p = 0.4. Since we want P(X>3) = 1-P(X 3), we need to look up P(X 3) in the table. P(X 2) = 0.3823 So P(X>3) = 1-P(X 3) = 1-0.3823 = 0.6177 Find P(X<2) with n = 5 and p = 0.20. Since we want P(X<2) =P(X 1), so we need to look up P(X 1) in the table. P(X 1) = 0.7373. So P(X<2) = P(X 1) = 0.7373 Find P(X 5) with n = 10 and p = 0.35. Since we want P(X 5) = 1- P(X 4), we need to look up P(X 4) in the table. P(X 4) = 0.7515 So P(X 5) = 1- P(X 4) = 1 0.7515 = 0.2485. Find P(X=4) with n = 10 and p = 0.15. Since we want P(X=4) = P(X 4) P(X 3), we need to look up both P(X 4) and P(X 3) in the table. P(X 4) = 0.9901 and P(X 3) = 0.9500 So P(X=4) = P(X 4)-P(X 3) = 0.9901-0.9500 = 0.0401.