The Binomial Distribution

Similar documents
The Binomial Distribution

The Binomial Distribution

Random variables The binomial distribution The normal distribution Other distributions. Distributions. Patrick Breheny.

Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny.

The Central Limit Theorem

Lab #7. In previous lectures, we discussed factorials and binomial coefficients. Factorials can be calculated with:

Probability & Statistics Chapter 5: Binomial Distribution

chapter 13: Binomial Distribution Exercises (binomial)13.6, 13.12, 13.22, 13.43

Stat 20: Intro to Probability and Statistics

Chapter 8: Binomial and Geometric Distributions

Binomial and Normal Distributions

Binomial distribution

***SECTION 8.1*** The Binomial Distributions

Binomial Distributions

MATH 446/546 Homework 1:

AP Statistics Ch 8 The Binomial and Geometric Distributions

Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES

Section 6.3 Binomial and Geometric Random Variables

Chapter 6: Random Variables

The Binomial distribution

Binomial Random Variable - The count X of successes in a binomial setting

INTRODUCTION TO MATHEMATICAL MODELLING LECTURES 3-4: BASIC PROBABILITY THEORY

Unit 6 Bernoulli and Binomial Distributions Homework SOLUTIONS

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions

The Binomial Probability Distribution

Binomial Random Variables

Lesson 97 - Binomial Distributions IBHL2 - SANTOWSKI

Opening Exercise: Lesson 91 - Binomial Distributions IBHL2 - SANTOWSKI

STAT 111 Recitation 2

4.2 Bernoulli Trials and Binomial Distributions

Module 4: Probability

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions

Probability Distributions: Discrete

CHAPTER 6 Random Variables

Sampling Distributions For Counts and Proportions

Binomial Distributions

Binomial and Geometric Distributions

Part 1 In which we meet the law of averages. The Law of Averages. The Expected Value & The Standard Error. Where Are We Going?

Chapter 3 - Lecture 5 The Binomial Probability Distribution

14.30 Introduction to Statistical Methods in Economics Spring 2009

Binomal and Geometric Distributions

Binomial Distributions

Lecture 6 Probability

6.3: The Binomial Model

STOR 155 Introductory Statistics (Chap 5) Lecture 14: Sampling Distributions for Counts and Proportions

(# of die rolls that satisfy the criteria) (# of possible die rolls)

Chapter 8 Additional Probability Topics

5.2 Random Variables, Probability Histograms and Probability Distributions

Unit 4 Bernoulli and Binomial Distributions Week #6 - Practice Problems. SOLUTIONS Revised (enhanced for q4)

Event p351 An event is an outcome or a set of outcomes of a random phenomenon. That is, an event is a subset of the sample space.

MATH 112 Section 7.3: Understanding Chance

Probability mass function; cumulative distribution function

Marquette University MATH 1700 Class 8 Copyright 2018 by D.B. Rowe

The normal distribution is a theoretical model derived mathematically and not empirically.

ASSIGNMENT 14 section 10 in the probability and statistics module

The Binomial Distribution

MATH 264 Problem Homework I

3.2 Hypergeometric Distribution 3.5, 3.9 Mean and Variance

Statistical Methods in Practice STAT/MATH 3379

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

The Binomial Distribution

Chapter 5 Probability Distributions. Section 5-2 Random Variables. Random Variable Probability Distribution. Discrete and Continuous Random Variables

6.1 Discrete and Continuous Random Variables. 6.1A Discrete random Variables, Mean (Expected Value) of a Discrete Random Variable

Chapter 4. Section 4.1 Objectives. Random Variables. Random Variables. Chapter 4: Probability Distributions

10 5 The Binomial Theorem

CS 237: Probability in Computing

Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7

Chapter 4 Discrete Random variables

Math 14 Lecture Notes Ch. 4.3

Stat511 Additional Materials

EXERCISES ACTIVITY 6.7

x is a random variable which is a numerical description of the outcome of an experiment.

Chapter 4 Probability Distributions

2. Modeling Uncertainty

Probability and Sample space

CSSS/SOC/STAT 321 Case-Based Statistics I. Random Variables & Probability Distributions I: Discrete Distributions

CHAPTER 6 Random Variables

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

CHAPTER 6 Random Variables

The Binomial Distribution

ECON 214 Elements of Statistics for Economists 2016/2017

Lecture 9: Plinko Probabilities, Part III Random Variables, Expected Values and Variances

Math 160 Professor Busken Chapter 5 Worksheets

Chapter 2: Probability

The following content is provided under a Creative Commons license. Your support

Chapter 4 Discrete Random variables

Ex 1) Suppose a license plate can have any three letters followed by any four digits.

Class 13. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

GOALS. Discrete Probability Distributions. A Distribution. What is a Probability Distribution? Probability for Dice Toss. A Probability Distribution

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

Discrete Probability Distributions Chapter 6 Dr. Richard Jerz

4.1 Probability Distributions

3.2 Binomial and Hypergeometric Probabilities

BIOS 4120: Introduction to Biostatistics Breheny. Lab #7. I. Binomial Distribution. RCode: dbinom(x, size, prob) binom.test(x, n, p = 0.

Part V - Chance Variability

Binomial Distribution

Chapter 3 Discrete Random Variables and Probability Distributions

MAKING SENSE OF DATA Essentials series

Probability and distributions

Transcription:

Patrick Breheny February 21 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 1 / 16

So far, we have discussed the probability of single events In research, however, the data we collect consists of many events (for each subject, does he/she contract polio?) We then summarize those events with a number (out of the 200,000 people who got the vaccine, how many contracted polio?) Such a number is an example of a random variable Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 2 / 16

Distributions In our sample, we observe a certain value of a random variable In order to assess the variability of that value, we need to know the chances that our random variable could have taken on different values depending on the true values of the population parameters This is called a distribution A distribution describes the probability that a random variable will take on a specific value or fall within a specific range of values Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 3 / 16

Examples Random variable Possible outcomes # of copies of a genetic mutation 0,1,2 # of children a woman will have in her lifetime 0,1,2,... # of people in a sample who contract polio 0,1,2,...,n Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 4 / 16

Listing the ways When trying to figure out the probability of something, it is sometimes very helpful to list all the different ways that the random process can turn out If all the ways are equally likely, then each one has probability, where n is the total number of ways 1 n Thus, the probability of the event is the number of ways it can happen divided by n Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 5 / 16

Genetics example For example, the possible outcomes of an individual inheriting cystic fibrosis genes are CC Cc cc cc If all these possibilities are equally likely (as they would be if the individual s parents had one copy of each version of the gene), then the probability of having one copy of each version is 2/4 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 6 / 16

Coin example Another example where the outcomes are equally likely is flips of a coin Suppose we flip a coin three times; what is the probability that exactly one of the flips was heads? Possible outcomes: HHH HHT HT H HT T T HH T HT T T H T T T The probability is therefore 3/8 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 7 / 16

Counting the number of ways something can happen quickly becomes a hassle (imagine listing the outcomes involved in flipping a coin 100 times) Luckily, mathematicians long ago discovered that when there are two possible outcomes that occur/don t occur n times, the number of ways of one event occurring k times is n! k!(n k)! The notation n! means to multiply n by all the positive numbers that come before it (e.g. 3! = 3 2 1) Note: 0! = 1 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 8 / 16

Calculating the binomial coefficients For the coin example, we could have used the binomial coefficients instead of listing all the ways the flips could happen: 3! 1!(3 1)! = 3 2 1 2 1(1) = 3 Many calculators and computer programs (including R) have specific functions for calculating binomial coefficients: > choose(3,1) [1] 3 > choose(10,2) [1] 45 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 9 / 16

When sequences are not equally likely Suppose we draw 3 balls, with replacement, from an urn that contains 10 balls: 2 red balls and 8 green balls What is the probability that we will draw two red balls? As before, there are three possible sequences: RRG, RGR, and GRR, but the sequences no longer have probability 1 8 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 10 / 16

When sequences are not equally likely (cont d) The probability of each sequence is 2 10 2 10 8 10 = 2 10 8 10 2 10 = 8 10 2 10 Thus, the probability of drawing two red balls is 3 2 10 2 10 8 10 = 9.6% 2 10.03 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 11 / 16

The binomial formula This line of reasoning can be summarized in the following formula: the probability that an event will occur k times out of n is n! k!(n k)! pk (1 p) n k In this formula, n is the number of trials, p is the probability that the event will occur on any particular trial We can then use the above formula to figure out the probability that the event will occur k times Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 12 / 16

Example According to the CDC, 22% of the adults in the United States smoke Suppose we sample 10 people; what is the probability that 5 of them will smoke? We can use the binomial formula, with 10! 5!(10 5)!.225 (1.22) 10 5 = 3.7% There is also a shortcut formula in R for this: > dbinom(5, size=10, prob=.22) [1] 0.03749617 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 13 / 16

Example (cont d) What is the probability that our sample will contain two or fewer smokers? We can add up probabilities from the binomial distribution: Or, in R: P (X 2) = P (X = 0) + P (X = 1) + P (X = 2) =.083 +.235 +.298 = 61.7% > dbinom(0:2, size=10, prob=.22) [1] 0.08335776 0.23511163 0.29841091 > pbinom(2, size=10, prob=.22) [1] 0.6168803 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 14 / 16

The binomial formula when to use This formula works for any random variable that counts the number of times an event occurs out of n trials, provided that the following assumptions are met: The number of trials n must be fixed in advance The probability that the event occurs, p, must be the same from trial to trial The trials must be independent If these assumptions are met, the random variable is said to follow a binomial distribution, or to be binomially distributed Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 15 / 16

A random variable is a number that can equal different values depending on the outcome of a random process The distribution of a random variable describes the probability that the random variable will take on those different values The number of ways to choose k things out of n possibilities is: n! k!(n k)! (Binomial distribution) The probability that an event will occur k times out of n is n! k!(n k)! pk (1 p) n k, where p is the probability that the event will occur on any particular trial Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 16 / 16