Chapter 3 Class Notes Intro to Probability

Similar documents
PROBABILITY AND THE BINOMIAL DISTRIBUTION

Section Distributions of Random Variables

ECO220Y Sampling Distributions of Sample Statistics: Sample Proportion Readings: Chapter 10, section

Statistical Methods in Practice STAT/MATH 3379

Section Distributions of Random Variables

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Part V - Chance Variability

MANAGEMENT PRINCIPLES AND STATISTICS (252 BE)

4.2 Bernoulli Trials and Binomial Distributions

4: Probability. Notes: Range of possible probabilities: Probabilities can be no less than 0% and no more than 100% (of course).

Lecture 6 Probability

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Bernoulli and Binomial Distributions

STA 6166 Fall 2007 Web-based Course. Notes 10: Probability Models

PROBABILITY DISTRIBUTIONS

Probability Distributions

Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes

***SECTION 8.1*** The Binomial Distributions

Probability Models. Grab a copy of the notes on the table by the door

Section Random Variables and Histograms

Stat 20: Intro to Probability and Statistics

4 Random Variables and Distributions

6 If and then. (a) 0.6 (b) 0.9 (c) 2 (d) Which of these numbers can be a value of probability distribution of a discrete random variable

Theoretical Foundations

Section 8.1 Distributions of Random Variables

Lesson 97 - Binomial Distributions IBHL2 - SANTOWSKI

Opening Exercise: Lesson 91 - Binomial Distributions IBHL2 - SANTOWSKI

Consider the following examples: ex: let X = tossing a coin three times and counting the number of heads

Statistics and Probability

VIDEO 1. A random variable is a quantity whose value depends on chance, for example, the outcome when a die is rolled.

12 Math Chapter Review April 16 th, Multiple Choice Identify the choice that best completes the statement or answers the question.

List of Online Quizzes: Quiz7: Basic Probability Quiz 8: Expectation and sigma. Quiz 9: Binomial Introduction Quiz 10: Binomial Probability

MATH 264 Problem Homework I

Section 3.1 Distributions of Random Variables

Binomial Random Variables. Binomial Random Variables

Chapter 4 Discrete Random variables

What do you think "Binomial" involves?

Introduction to Business Statistics QM 120 Chapter 6

What is the probability of success? Failure? How could we do this simulation using a random number table?

Chapter 5. Sampling Distributions

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

MAS187/AEF258. University of Newcastle upon Tyne

Chapter 7. Sampling Distributions and the Central Limit Theorem

MA : Introductory Probability

Chapter 4 Discrete Random variables

LESSON 9: BINOMIAL DISTRIBUTION

Part 1 In which we meet the law of averages. The Law of Averages. The Expected Value & The Standard Error. Where Are We Going?

ECON 214 Elements of Statistics for Economists 2016/2017

Binomial and multinomial distribution

Binomial Distributions

Lean Six Sigma: Training/Certification Books and Resources

The normal distribution is a theoretical model derived mathematically and not empirically.

A useful modeling tricks.

Binomial Distributions

Chapter 3: Probability Distributions and Statistics

MATH 118 Class Notes For Chapter 5 By: Maan Omran

STOR 155 Introductory Statistics (Chap 5) Lecture 14: Sampling Distributions for Counts and Proportions

MATH1215: Mathematical Thinking Sec. 08 Spring Worksheet 9: Solution. x P(x)

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

The Binomial Probability Distribution

Chapter 7. Sampling Distributions and the Central Limit Theorem

30 Wyner Statistics Fall 2013

Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7

Probability Binomial Distributions. SOCY601 Alan Neustadtl

EXERCISES FOR PRACTICE SESSION 2 OF STAT CAMP

Random Variables. 6.1 Discrete and Continuous Random Variables. Probability Distribution. Discrete Random Variables. Chapter 6, Section 1

2. Modeling Uncertainty

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

4.1 Probability Distributions

MA 1125 Lecture 12 - Mean and Standard Deviation for the Binomial Distribution. Objectives: Mean and standard deviation for the binomial distribution.

Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

Module 3: Sampling Distributions and the CLT Statistics (OA3102)

2011 Pearson Education, Inc

Chapter 3 Discrete Random Variables and Probability Distributions

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

Business Statistics. Chapter 5 Discrete Probability Distributions QMIS 120. Dr. Mohammad Zainal

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 6: Random Variables. Ch. 6-3: Binomial and Geometric Random Variables

STAT 201 Chapter 6. Distribution

II - Probability. Counting Techniques. three rules of counting. 1multiplication rules. 2permutations. 3combinations

Probability and Statistics for Engineers

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Probability & Statistics Chapter 5: Binomial Distribution

5.2 Random Variables, Probability Histograms and Probability Distributions

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Chapter 8. Binomial and Geometric Distributions

Binomial distribution

Math 160 Professor Busken Chapter 5 Worksheets

Chapter 6: Random Variables

(Practice Version) Midterm Exam 1

5.1 Personal Probability

Binomial Random Variable - The count X of successes in a binomial setting

Probability. An intro for calculus students P= Figure 1: A normal integral

The Binomial distribution

ECON 214 Elements of Statistics for Economists 2016/2017

Transcription:

Chapter 3 Class Notes Intro to Probability Concept: role a fair die, then: what is the probability of getting a 3? Getting a 3 in one roll of a fair die is called an Event and denoted E. In general, Number of successful outcomes Pr{E} = Total number of outcomes This is the frequency interpretation of probability (and gives the answer 1/6 for the above); another interpretation is the mathematical definition. Another example throw a pair of dice and let E be the event that neither die shows a 6 (i.e., both are non 6 s). By counting below, we see that Pr{E} = 25 / 36 = 69.44%. First Die Second Die 1 2 3 4 5 6 1 2 3 4 5 6 Here that the tosses are independent and that we can also find the answer 25 / 36 by multiplication: Pr{E} = 5 / 6 5 / 6 = 0.6944. Page 1 of 11

In contrast, suppose that we have a box containing six pieces of paper (called tickets ) your name is on one of the tickets. Your friend randomly selects two tickets without replacement, and let E be the event that your name is not selected (in the two draws). Draws here are dependent. Then, Pr{E} = 5 / 6 4 / 5 = 2 / 3 = 0.6667, and this differs from the above answer. Illustration of Probability Trees (p.94 #3.2.5a) An EPT (early pregnancy test) is such that if a woman is Pregnant, then the probability she tests Positive (+) is 98%. Further, if she is Not Pregnant, there is a 99% chance that she will test Negative ( ). By p.92, we then say that the sensitivity of this EPT is 98% and the specificity of the test is 99%. Additionally, of every group of 100 women in a certain village, suppose 10 are indeed Pregnant. If one woman from this village is randomly selected, what is the probability that she tests Positive (+)? Product 0.98 Tests + 0.098 0.10 Pregnant 0.02 Tests 0.002 0.01 Tests + 0.009 0.90 Not Pregnant 0.99 Tests 0.891 Answer = 0.098 + 0.009 = 0.107 = 10.7%. [Aside: note that here the false negative rate is 0.2%, the false positive rate is 0.9%, & so this test is in error 1.1% of the time.] Page 2 of 11

Rules of Probability (Section 3.3) 1. The Addition Rule for any two events E 1 and E 2, Pr{E 1 or E 2 } = Pr{E 1 } + Pr{E 2 } Pr{E 1 and E 2 }. If the events are disjoint, then just add the probabilities. 2. Conditional probability given that an event E 1 has occurred (i.e., it is certain), then the probability that E 2 will occur is Pr{E 1 and E 2 } Pr{E 2 / E 1 } = Pr{E 1 } provided Pr{E 1 } 0. For example, if a woman is Pregnant, then the probability that she tests + is Pr{Tests + / Pregnant} = 0.098 / 0.10 = 0.98 (this is the given sensitivity of the EPT). 3. The Multiplication Rule for any two events E 1 and E 2, Pr{E 1 and E 2 } = Pr{E 1 } Pr{E 2 / E 1 } We used the multiplication rule above for the example regarding drawing two tickets from the box. We also used the multiplication rule with the above pair of dice example since for independent events E 1 and E 2, Pr{E 2 / E 1 } = Pr{E 2 } and Pr{E 1 / E 2 } = Pr{E 1 } Page 3 of 11

4. Bayes Rule relates prior and posterior probabilities by: Pr{E 2 / E 1 } Pr{E 1 } Pr{E 1 / E 2 } = Pr{E 2 } For example, given that a woman tests positive on the EPT (E 2 ), what is the probability that she really is pregnant (E 1 )? 0.98 0.10 Pr{E 1 / E 2 } = = 0.9159 = 91.59% 0.107 Note how different this conditional probability is from the unconditional probability that a woman is pregnant (10%). Density Curves (Section 3.4) These curves correspond to the population (of a continuous RV) and are thus theoretical ; the progression of histograms on p. 99 (as the sample size grows) is helpful to understand the connection between histograms and density curves. These curves help us connect Area on the one hand with {Percentages, Probabilities, or Proportions} on the other hand. Thus, the area under the density curve is one (i.e., 100%). To illustrate, for a randomly selected individual from the population generating Figure 3.4.4 on p.100, the probability that his/her blood glucose is greater than 150 mg/dli is Pr{Y > 150} = 8%. Also, referring to Figure 3.4.5 on p.101, the probability that a randomly selected Douglas fir tree has diameter of between 2" and 8" is Pr{2 < Y < 8} = 0.20 + 0.33 + 0.25 = 0.78 (78%); the Page 4 of 11

probability that one such tree s diameter is exactly 2" is Pr{Y = 2} = 0 (zero) since Y here is continuous. Thus, the area under the density curve between a and b equals the proportion of Y values between a and b and equals the probability that one such individual has a Y between a and b. Random Variables (Section 3.5) A random variable (RV) Y is a variable whose value depends upon the outcome of a chance operation. For example, randomly choose a card from a deck of cards, then Pr{face card} = 12/52 = 3/13 (face card is J or Q or K). Throw a pair of dice, Pr{sum exceeds 4} = 1 6/36 = 30/36 = 5/6 = 83.33% SUM First Die Second Die 1 2 3 4 5 6 1 2 3 4 5 6 7 2 3 4 5 6 7 8 3 4 5 6 7 8 9 4 5 6 7 8 9 10 5 6 7 8 9 10 11 6 7 8 9 10 11 12 Page 5 of 11

In Ex. 3.5.5 on pp.103 4, we are given the distribution for Y = the number of tail vertebrae of the freshwater sculpin Cottus rotheus. Y Percent of Fish 20 3 21 51 22 40 23 6 Note that this RV is discrete, and note also that in the following frequency histogram that the bars extend from 19.5 to 20.5 for Y = 20, 20.5 to 21.5 for Y= 21, etc. Histogram of No. of vertebae 50 40 Percent 30 20 10 0 20 21 22 No. of vertebae 23 If we randomly select one such sculpin and let Y be the number of its tail vertebrae, then (a) Pr{Y = 21} = 0.51 and (b) Pr{Y 22} = 0.94. As pointed out above, the probability that a randomly selected individual has a certain characteristic is equal to the proportion of the population with that characteristic. The tree diameter illustration (p.101) is an example of a continuous RV; the vertebrae example (p.104) is a discrete RV. Page 6 of 11

Definitions: The mean (or expected value) of a discrete random variable is E(Y) = Y = y k Pr{Y = y k } The variance of such a RV is VAR(Y) = 2 Y = (y k Y ) 2 Pr{Y = y k } Notes: For continuous RVs, substitute (integral) signs above instead of signs E(aY + bz) = ae(y) + be(z) VAR(cX + d) = c 2 VAR(X) The variance short cut formula is Y 2 = [y k 2 Pr{Y = y k }] Y 2 For the sculpins, = 20 0.03 + 21 0.51 +22 0.40 +23 0.06 = 21.49 (vertebrae) 2 = (20 21.49) 2 0.03 + (21 21.49) 2 0.51 + (22 21.49) 2 0.40 + (23 21.49) 2 0.06 = 0.4299 so = 0.6557. For the sum of 2 dice, verify that = 7 and 2 = 1974/36 7 2 = 5.8333 so = 2.4152. Page 7 of 11

Now draw n = 3 cards from a fair deck of cards with replacement and note whether the card is a face card (F) or not (N); remember, the probability of F is 3/13 and the probability of N is 10/13. We can list the possibilities using a tree (Y = # of face cards): First Second Third Seq. Y Probability N NNN 0 (10/13) 3 N F NNF 1 (3/13) (10/13) 2 N N NFN 1 (3/13) (10/13) 2 F F NFF 2 (3/13) 2 (10/13) N FNN 1 (3/13) (10/13) 2 N F F FNF 2 (3/13) 2 (10/13) N FFN 2 (3/13) 2 (10/13) F F FFF 3 (3/13) 3 Page 8 of 11

The Binomial Distribution (Section 3.6) In the above example, Y = the number of picture cards in n draws with replacement is an example of a Binomial random variable. For such a variable, we need all of these conditions: Each trial results in a Success or a Failure and we are interested in Y = # of successes [B] The outcomes of the trials are independent of each other [I] The number of trials (n) is fixed in advanced [n] The probability of a success (p) stays the same from trial to trial [S] These are called the BInS requirements (see p.110) all four must be met in order for a RV to be a Binomial RV. Note that not all S/F random variables are Binomial two counterexamples are (1) count the number of cards until we see the 5 th Face card, and (2) if the n = 3 cards above are drawn without replacement. For Y a Binomial RV with parameters n and p we write Y ~ Bin(n,p) then the probability of exactly k successes can be computed using the binomial probability formula:,, Here,! and (n factorial) n! = n(n 1) (2)(1) and 0! = 1.!! Most calculators can find n C k or we can use Table 2 on p.615. For the Binomial distribution and only for this distribution the following short cut formulas hold for the mean and variance: Y = np, and Y 2 = np(1 p) = npq (where q = 1 p) Page 9 of 11

For the card example above, a success = a face card, n = 3, p = 3/13 (recall Y is the number of face cards!), so Pr{Y = 0} =!!! (3/13)0 (10/13) 3 = 0.455166 Pr{Y = 1} =!!! (3/13)1 (10/13) 2 = 0.409650 Pr{Y = 2} =!!! (3/13)2 (10/13) 1 = 0.122895 Pr{Y = 3} =!!! (3/13)3 (10/13) 0 = 0.012289 Of course, the probabilities sum to one. In n = 3 draws (with replacement), the expected number of face cards is Y = np = 3(0.2308) = 0.6923, give or take Y =.. = 0.7298. Another example variation on Ex. 3.6.5 on p.112 Drosophila, where 30% are black (B) and 70% are gray (G). Randomly select n = 10 such flies. (Even though sampling here is without replacement, the population of flies is infinite, so we can act as though sampling here is with replacement.) We get: a) The probability exactly 2 flies are B. Pr{Y = 2} =!!! (0.3)2 (0.7) 8 = 0.233474 b) The probability that exactly 20% of the flies is B. c) The probability that all of the flies are G. Pr{Y = 0} =!!! (0.3)0 (0.7) 10 = 0.028248 d) The probability that at least one of the flies is B. Pr{Y 1} = Pr{Y > 0} = 1 Pr{Y = 0} = 0.971752 e) The expected number of B flies and the give or take number. = 10(0.3) = 3 B flies give or take =.. = 1.45 flies. Page 10 of 11

The probability histogram for this example is the following: 30 Histogram of Y 25 20 Percent 15 10 5 0 0 2 4 Y 6 8 10 [Skip Section 3.7 for now.] Page 11 of 11