E509A: Principle of Biostatistics. GY Zou

Size: px
Start display at page:

Download "E509A: Principle of Biostatistics. GY Zou"

Transcription

1 E509A: Principle of Biostatistics (Week 2: Probability and Distributions) GY Zou

2 Reporting of continuous data If approximately symmetric, use mean (SD), e.g., Antibody titers ranged from 25 to 347 ng/ml and had a mean (SD) of 110 ng/ml (43 ng/ml). If markedly asymmetric, use median (interquartile range), e.g., Antibody titers ranged from 25 to 347 ng/ml, with a median (interquartile range) of 110 g/ml (61 to 159 ng/ml). It s better not to use: mean ± SD or mean ± s.e. Why?

3 The ultimate goal of statistical inference is to use sample to draw conclusions about the population of interest. This boils down to use sample statistics to estimate population parameters (unknown). It seems then the logic to understand what is going on in the population, if we know the population parameters. Probability theory is the key to accomplish this.

4 Probability: definitions a vehicle to convey an opinion: may be, might be, perhaps,...; Relative frequency in the action-outcome scenario, i.e. the ratio of events over all the possible outcome (sample space). Flip a coin outcome: the coin will fall each time (deterministic); outcome: head or tail (not sure before the experiment is done) random; probability of a random event (getting a head): if we flip the coin n times, we d know the relative frequency of getting a head, or the probability of getting a head; in a long run (flip many times), the probability will be 50%, or 0.5.

5 Event: before talking about probability, we must define the event of interest. Example 3.1 (p. 89): a population of N = 6 people with gender and age information, we could be interested in female: Pr = 3/6 = 0.5 male: Pr = 3/6 = 0.5 subject over 65: Pr = 2/6 = 0.3 odds is not probability! odds = Pr(A) 1 Pr(A) Probability of complex event is what makes people dizzy, yet we cannot always avoid it. e.g., female over 65, male younger than 65...

6 Relationships between two simple events The Complement of an event consists of all outcomes in the sample space that are not in the event. e.g., male complement female, because normal people can either be a male or female, but not both. Pr(A) = 1 Pr(A ) mutually exclusive: when one happens the other cannot happen. e.g., a thrown of a die may show a one or two, but not both. They have nothing in common. Union: either, or. denote as A B Intersection: and. denote as A B

7 Space A B Addition rule Pr(A B) = Pr(A) + Pr(B) P(A B) P(A B) = Pr(A) + Pr(B) Pr(A B)

8 Example 3.2A (p. 93). Educational level of pts went to allergy clinic (contingency table) gender < > 17 Total Male Female Total Randomly select a patient, What is the probability that The pt is male: 78/ male or had at most 12 yrs:, since there were 78 males, among 208 females there are had at most 12 yrs ed male and had at most 12 yrs: =

9 Conditional probability (very hard to apply) The probability of one event (A), given that the other event (B) has happened already. Pr(A B) =? Can we define Pr(B A) =? Is Pr(A B) = Pr(B A)? Let A denote AIDS, and B test positive. The probability of a person has aids, given s/he tested positive Pr(A B) The probability of a person tested positive, given that s/he has aids. Pr(B A)

10 Development of conditional probability (Change space or denominator) Space A B A B Conditional: new space Pr(A B) = Pr(A B)/ Pr(B) Pr(B A) = Pr(A B)/ Pr(A)

11 Example 3.2A (p. 93). Educational level of pts went to allergy clinic (contingency table) gender < > 17 Total Male Female Total Randomly select a patient, Pr(< 9 Male)? This is to say suppose the patient is a male, what is the probability that he has less than 9 yrs of ed? Pr(< 9 Male) =? = / = Pr(Male < 9)? This is to say suppose the patient had less than 9 yrs ed, what is the probability that he is a male? Pr(Male < 9) = = / = Pr(< 9 Male) = 15 78

12 Again Pr(A B) Pr(B A) Fail to recognize this has become the tragedy of modern statistics.

13 In the development of a diagnostic test, one must evaluate the accuracy of the test from two aspect, 1) if a subject has the targeted disease, what is the probability of test being positive, i.e., Pr(T + D + ); and 2) if a subject does not have the disease, what is the probability of test being negative, ie., Pr(T D ). Now, a subject has a positive test result (T + ), what is the probability that s/he has the disease (D + ), i.e., Pr(D + T + ). Another piece of info is Pr(D + ), some rough idea before ordering the test. Pr(D + T + ) = Pr(D+ T + ) Pr(T + ) = Pr(D+ T + )Pr(T + ) Pr(T + ) = Pr(D + T + ) Did not achieve anything? What if we write Pr(D + T + ) = Pr(T + D + )Pr(D + )?, ie. Pr(D + T + ) = Pr(T+ D + ) Pr(D + ) Pr(T + ),

14 Space D + T + We have Pr(D + T + ) = Pr(T+ D + )Pr(D + ) Pr(T + ) T + may be regarded as a sum of two parts: T + D + and T + D Pr(T + ) = Pr(T + D + ) + Pr(T + D ) = Pr(T + D + )Pr(D + ) + Pr(T + D )Pr(D ) = Pr(T + D + )Pr(D + ) + [1 Pr(T D )][(1 Pr(D + )]

15 Now, a subject has a negative test result (T ), what is the probability that s/he does not have the disease (D ), i.e., Pr(D T ). Similar argument can be used to calculate this. From diagnostic accuracy to accurate diagnosis (Zou 2004 Medical Decision Making 24: ). From sensitivity Pr(T + D + ) and specificity Pr(T D ) to positive predicted value Pr(T + D + ), Pr(T D ) Pr(D+ ) Pr(D + T + ) and From sensitivity Pr(T + D + ) and specificity Pr(T D ) to negative predicted value Pr(T + D + ), Pr(T D ) Pr(D+ ) Pr(D T )

16 A and B are independent means (no association) Pr(A B) = Pr(A) and Pr(B A) = Pr(B) Pr(A B) = Pr(A B) Pr(B) Pr(A B) Pr(B) = Pr(A B) Indep Pr(A) Pr(B)

17 Example 3.2A (p. 93). Educational level of pts went to allergy clinic (contingency table) gender 12 > 12 Total Male Female Total Is education and gender independent? Pr(male) = Pr(male ( 12)) = and Pr( 12) = Pr(male) Pr( 12) Note these calculations do not take random error into account.

18 Random variable and its distribution Random variable: a characteristic can take multiple values with given probabilities. e.g., outcome (X) of flipping a coin can either be head (= 1) or tail (= 0), each with probability of 0.5. The set of probabilities make up a (probability) distribution. The distribution of flipping a coin is f(x) = 0.5, X = 1 = 0.5, X = 0 or Pr(X = i) = 0.5, i = 0, 1 This distribution is known as Bernoulli distribution.

19 Let flip two coins at once. Four possible outcomes with equal probability of 1/4: 1) Head, head 2) Head, tail 3) Tail, head 4) Tail, tail. Let s define the event of interest be total number of heads, X X could be 0, 1, 2 Pr(X = 0) = 1/4 Pr(X = 1) = 1/4 + 1/4 Pr(X = 2) = 1/4

20 Binomial distribution We can extend the flipping of two coins to n coins, and define the event of interest as the total number of heads, X. A key assumption here is independence. We could enumerate the probabilities for Y = 0, 1,, n as we did for n = 2 coins, but that will take a while, instead we will use the formula presented in the book (p. 104) P(X = x) = ( n x ) p x (1 p) n x = n! x!(n x)! px (1 p) n x where p is the probability of head showing. For unbiased coin, p = 0.5. Note, binomial distribution is determined by n, p, which we call parameters for binomial, B(n, p). B(n, p) = n B(p): Binomial is a sum of Bernoulli s

21 Let n = 2, we get Pr(X = 0) = 2! 0!(2 0)! p0 (1 p) 2 0 = p, similar calculation can bet Pr(X = 1) and Pr(X = 2).

22 Ex 3.5 (p. 105) Application of Binomial distribution: Antibiotic with p = 0.7 If the antibiotic is given to 5 unrelated pts, what is the probability that it will be effective in exactly 3? Binomial (n = 5, p = 0.7) Pr(X = 3) = 5! 3!(5 3)! (0.7)3 (1 0.7) 5 3 = The probability that the antibiotic will be effect in none of the 5 pts. Pr(X = 0) = 5! 0!(5 0)! (0.7)0 (1 0.7) 5 0 = (0.3) 5 =

23 Population mean and variance Mean, also called expectation (E(X)): it is a sum of (each possible value multiplied by its probability). For Bernoulli(p): E(X) = 0 (1 p) + 1 p = p Variance (σ 2 ): average of squared deviation from the mean. For Bernoulli (p) σ 2 = (0 p) 2 Pr(X = 0) + (1 p) 2 Pr(X = 1) = p 2 (1 p) + (1 p) 2 p = p p 2 = p(1 p)

24 Properties of mean and variance Let X be a random variable, c be a constant E(X ± c) = E(X) ± c V ariance(x ± c) = V ariance(x) = σ 2 Measure the average height of a population, let s denote it as E(X) = µ. Now, we add a constant to height by measuring the height of people standing on a box with height c, so the mean will be µ + c. We can also subtract a constant from µ by measuring the height of people standing in a hole with depth c, so the mean will be µ c. Either people standing on a box or in a hole, the variability has not changed, so the variance will still be σ 2.

25 What happen to cx? E(cX) =? E(cX) = c E(X) = cµ? V ariance(cx) =? = (cxi cµ) 2 N 1 c 2 (X i µ) 2 = N 1 = c 2 σ 2

26 How about X ± Y, where X and Y are independent variables? Imagine measuring the height (X) of people standing on boxes of random height (Y ). The mean should be E(X) + E(Y ). Since some short people could be on the small boxes, while tall people could be on the large boxes, the variability is thus certainly increased. In fact V ariance(x + Y ) = σ 2 X + σ 2 Y Imagine measure the heights above ground levels of our people standing in holes of random depth. The mean above ground would be E(X) E(Y ). Since some short people could be in deep holes while some tall people in shallow holes, the variability of heights above ground would be increased. In fact, V ariance(x Y ) = σ 2 X+σ 2 Y

27 Mean and variance of binomial distribution (n, p) Recall, Binomial is the sum of n Bernoulli s (p) The mean for Bernoulli (p) is p, the mean for binomial must be np The variance for Bernoulli (p) is p(1 p), the variance for Binomial (n, p) must be np(1 p). Compare to p. 108

28 Continuous distribution For discrete variable in general and binomial in particular, we obtain the distributions by simply counting them. Although it is tedious, but still countable. Image what would happen if we try to assign probabilities to numbers in an interval of [0,1] We don t even known how many possible numbers in there!

29 How about we assign probability not to a single number, but to an interval!

30 Normal distribution One essential continuous distribution is normal distribution, or Gaussian distribution, after Karl Fredrich Gauss ( ). Pr(a < X < b) = b f(x)dx, where f(x) = 1 σ (x µ) 2 2π e 2σ 2, < x <, a f(x) is called density function. e x = y x = lny = log y. e =? π =?

31 Empirical rule comes from N(µ, σ 2 ): Pr(µ 1σ < X < µ + 1σ).68 Pr(µ 2σ < X < µ + 2σ).95 Pr(µ 3σ < X < µ + 3σ).99 Mean = Median This implies Pr(X < µ) = Pr(X > µ) =.50

32 Example 3.7 (p. 110). µ = 70in and σ = 2in. Heights for a specific age and gender are approximately normally distributed This is an assumption we have to make. Otherwise we cannot just make something out of nothing. X N(70, 2 2 ) Suppose a male age 25 is selected at random.

33 Ex 3.7 (cont) a) What is the probability that his height is more than 70? Pr(X > 70) = Pr(X > µ) = 0.50 b) What is the probability that his height is between 70 to 72? Pr(70 < X < 72) = Pr(µ < X < µ + σ) = 1/2 Pr(µ 1σ < X < µ + 1σ) =.68/2 =.34 c) What is the probability that his height exceeds 72? Pr(X > 72) = Pr(X > µ + σ) = Pr(X > µ) Pr(µ < X < µ + σ) = = 0.16

34 From the normal density f(x) = 1 σ (x µ) 2 2π e 2σ 2, < x <, we can see a family of functions, determined by µ, σ. Denote normal distribution as N(µ, σ 2 ). 2 Since it is rather hard to do 1 1 σ 2π (x µ) 2 e 2σ 2 dx, e.g. suppose we want to know the probability of X (1, 2), where X N(2, 3 2 ) Pr(1 < X < 2) = (x 2)2 e dx =? 2π 3 2

35 Use tables! But we need to standardized the variable first. subtract X from population mean µ X µ standardize by standard deviation σ: X µ σ call this Z (has nothing to do with Zou!) it is really Z N(0, 1). Z = X µ σ

36 The normal law of errors stands out in the endeavors of mankind as one of the broadest generalization of natural philosophy. It serves as the guiding instrument for research in the physical and social sciences and in medicine, agricultural and in the biological world in general.it is a tool important for the analyses and interpretations of the basic information obtained by observation and by experimental design to produce fundamental statistical inferences.

37 Example 3.8 (p. 114): BP X is assume to be N(108, 14 2 ) in a population. If a person is randomly sampled from this population. a) What is that probability that this person s BP is below 112. Pr(X < 112) = Pr(Z < ) = Pr(Z <.29) =.6141 Interpretation based on random sample: The probability of a person s BP below 112 is % of the population have BP less than 112. b) c) g) are left for your reading.

38 Numbers to remember Percentile Z Notation Z Z Z Z 1 0.2

39 Table B.2 on pp You should remember and % %

40 Flip an unbiased coin 5 time Probability of all HEADS Probability of all TAILS The sum is 6.25% 5%

41 Example 3.9. (p. 120) BP X N(108, 14 2 ). a) Find the 5 percentile in BP X = ( 1.645) 14 b) Find the 90 percentile in BP X =

42 Normal to approximate binomial Example 3.10 (p. 122). Antibiotic with p = 70% =.7 success rate to give to 25 people. a) What is the probability that the antibiotic is effective in more than 15 patients. Pr(X > 15) Pr(X = x) = ( ) n x p x (1 p) n x n = 25, p = 0.7 Pr(X > 15) = Pr(X = 16) + Pr(x = 17) + + Pr(X = 25) = ( ) (0.7) 16 (1 0.7) ( ) (0.7) 25 =

43 µ = np = = 17.5 σ 2 = npq = = 5.25 Standardize as Z X µ σ = X np npq Z X =.25/ 5.25 = 1.09 Pr(Z > 1.09) = 1 Pr(Z < 1.09) = = Recall exact calculation result is

44 To make the approximation more accurate, we use Z = X Why? See my chicken scratch on board. Pr(X > 15) Z > X = Pr(Z > 0.87) = Suppose we want to know the probability of the antibiotic is effective fewer than 12 people?, i.e. Pr(X < 12). Not Pr(X 12) (see that in the book).

45 Pr(X < 12) Z < Pr(Z < 2.62) = No adjustment calculation results in Exact calculation result is

Chapter 5. Sampling Distributions

Chapter 5. Sampling Distributions Lecture notes, Lang Wu, UBC 1 Chapter 5. Sampling Distributions 5.1. Introduction In statistical inference, we attempt to estimate an unknown population characteristic, such as the population mean, µ,

More information

The Bernoulli distribution

The Bernoulli distribution This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Random Variables Handout. Xavier Vilà

Random Variables Handout. Xavier Vilà Random Variables Handout Xavier Vilà Course 2004-2005 1 Discrete Random Variables. 1.1 Introduction 1.1.1 Definition of Random Variable A random variable X is a function that maps each possible outcome

More information

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability. An intro for calculus students P= Figure 1: A normal integral Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided

More information

MA : Introductory Probability

MA : Introductory Probability MA 320-001: Introductory Probability David Murrugarra Department of Mathematics, University of Kentucky http://www.math.uky.edu/~dmu228/ma320/ Spring 2017 David Murrugarra (University of Kentucky) MA 320:

More information

Probability: Week 4. Kwonsang Lee. University of Pennsylvania February 13, 2015

Probability: Week 4. Kwonsang Lee. University of Pennsylvania February 13, 2015 Probability: Week 4 Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu February 13, 2015 Kwonsang Lee STAT111 February 13, 2015 1 / 21 Probability Sample space S: the set of all possible

More information

BIO5312 Biostatistics Lecture 5: Estimations

BIO5312 Biostatistics Lecture 5: Estimations BIO5312 Biostatistics Lecture 5: Estimations Yujin Chung September 27th, 2016 Fall 2016 Yujin Chung Lec5: Estimations Fall 2016 1/34 Recap Yujin Chung Lec5: Estimations Fall 2016 2/34 Today s lecture and

More information

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation. 1) If n 100 and p 0.02 in a binomial experiment, does this satisfy the rule for a normal approximation? Why or why not? No, because np 100(0.02) 2. The value of np must be greater than or equal to 5 to

More information

4: Probability. What is probability? Random variables (RVs)

4: Probability. What is probability? Random variables (RVs) 4: Probability b binomial µ expected value [parameter] n number of trials [parameter] N normal p probability of success [parameter] pdf probability density function pmf probability mass function RV random

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41

STA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41 STA258H5 Al Nosedal and Alison Weir Winter 2017 Al Nosedal and Alison Weir STA258H5 Winter 2017 1 / 41 NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION. Al Nosedal and Alison Weir STA258H5 Winter 2017

More information

Section 0: Introduction and Review of Basic Concepts

Section 0: Introduction and Review of Basic Concepts Section 0: Introduction and Review of Basic Concepts Carlos M. Carvalho The University of Texas McCombs School of Business mccombs.utexas.edu/faculty/carlos.carvalho/teaching 1 Getting Started Syllabus

More information

Random Variables and Probability Functions

Random Variables and Probability Functions University of Central Arkansas Random Variables and Probability Functions Directory Table of Contents. Begin Article. Stephen R. Addison Copyright c 001 saddison@mailaps.org Last Revision Date: February

More information

A useful modeling tricks.

A useful modeling tricks. .7 Joint models for more than two outcomes We saw that we could write joint models for a pair of variables by specifying the joint probabilities over all pairs of outcomes. In principal, we could do this

More information

Central Limit Theorem (cont d) 7/28/2006

Central Limit Theorem (cont d) 7/28/2006 Central Limit Theorem (cont d) 7/28/2006 Central Limit Theorem for Binomial Distributions Theorem. For the binomial distribution b(n, p, j) we have lim npq b(n, p, np + x npq ) = φ(x), n where φ(x) is

More information

4: Probability. Notes: Range of possible probabilities: Probabilities can be no less than 0% and no more than 100% (of course).

4: Probability. Notes: Range of possible probabilities: Probabilities can be no less than 0% and no more than 100% (of course). 4: Probability What is probability? The probability of an event is its relative frequency (proportion) in the population. An event that happens half the time (such as a head showing up on the flip of a

More information

Review. Binomial random variable

Review. Binomial random variable Review Discrete RV s: prob y fctn: p(x) = Pr(X = x) cdf: F(x) = Pr(X x) E(X) = x x p(x) SD(X) = E { (X - E X) 2 } Binomial(n,p): no. successes in n indep. trials where Pr(success) = p in each trial If

More information

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise. Math 224 Q Exam 3A Fall 217 Tues Dec 12 Version A Problem 1. Let X be the continuous random variable defined by the following pdf: { 1 x/2 when x 2, f(x) otherwise. (a) Compute the mean µ E[X]. E[X] x

More information

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial Lecture 23 STAT 225 Introduction to Probability Models April 4, 2014 approximation Whitney Huang Purdue University 23.1 Agenda 1 approximation 2 approximation 23.2 Characteristics of the random variable:

More information

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence continuous rv Let X be a continuous rv. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that for any two numbers a and b with a b, P(a X b) = b a f (x)dx.

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes

Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes MDM 4U Probability Review Properties of Probability Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes Theoretical

More information

The normal distribution is a theoretical model derived mathematically and not empirically.

The normal distribution is a theoretical model derived mathematically and not empirically. Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.

More information

CS 237: Probability in Computing

CS 237: Probability in Computing CS 237: Probability in Computing Wayne Snyder Computer Science Department Boston University Lecture 12: Continuous Distributions Uniform Distribution Normal Distribution (motivation) Discrete vs Continuous

More information

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution The Central Limit Theorem Sec. 8.1: The Random Variable it s Distribution Sec. 8.2: The Random Variable it s Distribution X p and and How Should You Think of a Random Variable? Imagine a bag with numbers

More information

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Announcements: There are some office hour changes for Nov 5, 8, 9 on website Week 5 quiz begins after class today and ends at

More information

Statistics and Probability

Statistics and Probability Statistics and Probability Continuous RVs (Normal); Confidence Intervals Outline Continuous random variables Normal distribution CLT Point estimation Confidence intervals http://www.isrec.isb-sib.ch/~darlene/geneve/

More information

Module 4: Probability

Module 4: Probability Module 4: Probability 1 / 22 Probability concepts in statistical inference Probability is a way of quantifying uncertainty associated with random events and is the basis for statistical inference. Inference

More information

Law of Large Numbers, Central Limit Theorem

Law of Large Numbers, Central Limit Theorem November 14, 2017 November 15 18 Ribet in Providence on AMS business. No SLC office hour tomorrow. Thursday s class conducted by Teddy Zhu. November 21 Class on hypothesis testing and p-values December

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic Probability Distributions: Binomial and Poisson Distributions Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College

More information

Introduction to Business Statistics QM 120 Chapter 6

Introduction to Business Statistics QM 120 Chapter 6 DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 6 Spring 2008 Chapter 6: Continuous Probability Distribution 2 When a RV x is discrete, we can

More information

Sampling Distributions For Counts and Proportions

Sampling Distributions For Counts and Proportions Sampling Distributions For Counts and Proportions IPS Chapter 5.1 2009 W. H. Freeman and Company Objectives (IPS Chapter 5.1) Sampling distributions for counts and proportions Binomial distributions for

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 5 Continuous Random Variables and Probability Distributions Ch. 5-1 Probability Distributions Probability Distributions Ch. 4 Discrete Continuous Ch. 5 Probability

More information

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution Section 7.6 Application of the Normal Distribution A random variable that may take on infinitely many values is called a continuous random variable. A continuous probability distribution is defined by

More information

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10 ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10 Fall 2011 Lecture 8 Part 2 (Fall 2011) Probability Distributions Lecture 8 Part 2 1 / 23 Normal Density Function f

More information

Bernoulli and Binomial Distributions

Bernoulli and Binomial Distributions Bernoulli and Binomial Distributions Bernoulli Distribution a flipped coin turns up either heads or tails an item on an assembly line is either defective or not defective a piece of fruit is either damaged

More information

2. Modeling Uncertainty

2. Modeling Uncertainty 2. Modeling Uncertainty Models for Uncertainty (Random Variables): Big Picture We now move from viewing the data to thinking about models that describe the data. Since the real world is uncertain, our

More information

Statistics 6 th Edition

Statistics 6 th Edition Statistics 6 th Edition Chapter 5 Discrete Probability Distributions Chap 5-1 Definitions Random Variables Random Variables Discrete Random Variable Continuous Random Variable Ch. 5 Ch. 6 Chap 5-2 Discrete

More information

II. Random Variables

II. Random Variables II. Random Variables Random variables operate in much the same way as the outcomes or events in some arbitrary sample space the distinction is that random variables are simply outcomes that are represented

More information

Statistical Methods in Practice STAT/MATH 3379

Statistical Methods in Practice STAT/MATH 3379 Statistical Methods in Practice STAT/MATH 3379 Dr. A. B. W. Manage Associate Professor of Mathematics & Statistics Department of Mathematics & Statistics Sam Houston State University Overview 6.1 Discrete

More information

STA 6166 Fall 2007 Web-based Course. Notes 10: Probability Models

STA 6166 Fall 2007 Web-based Course. Notes 10: Probability Models STA 6166 Fall 2007 Web-based Course 1 Notes 10: Probability Models We first saw the normal model as a useful model for the distribution of some quantitative variables. We ve also seen that if we make a

More information

Binomial distribution

Binomial distribution Binomial distribution Jon Michael Gran Department of Biostatistics, UiO MF9130 Introductory course in statistics Tuesday 24.05.2010 1 / 28 Overview Binomial distribution (Aalen chapter 4, Kirkwood and

More information

Theoretical Foundations

Theoretical Foundations Theoretical Foundations Probabilities Monia Ranalli monia.ranalli@uniroma2.it Ranalli M. Theoretical Foundations - Probabilities 1 / 27 Objectives understand the probability basics quantify random phenomena

More information

Part 10: The Binomial Distribution

Part 10: The Binomial Distribution Part 10: The Binomial Distribution The binomial distribution is an important example of a probability distribution for a discrete random variable. It has wide ranging applications. One readily available

More information

We use probability distributions to represent the distribution of a discrete random variable.

We use probability distributions to represent the distribution of a discrete random variable. Now we focus on discrete random variables. We will look at these in general, including calculating the mean and standard deviation. Then we will look more in depth at binomial random variables which are

More information

MATH 3200 Exam 3 Dr. Syring

MATH 3200 Exam 3 Dr. Syring . Suppose n eligible voters are polled (randomly sampled) from a population of size N. The poll asks voters whether they support or do not support increasing local taxes to fund public parks. Let M be

More information

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management BA 386T Tom Shively PROBABILITY CONCEPTS AND NORMAL DISTRIBUTIONS The fundamental idea underlying any statistical

More information

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82 Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 212 Course Overview Instructor: Daniel B. Neill (neill@cs.cmu.edu) TAs: Eli (Han) Liu, Kats Sasanuma, Sriram Somanchi, Skyler Speakman, Quan Wang, Yiye Zhang (see

More information

A.REPRESENTATION OF DATA

A.REPRESENTATION OF DATA A.REPRESENTATION OF DATA (a) GRAPHS : PART I Q: Why do we need a graph paper? Ans: You need graph paper to draw: (i) Histogram (ii) Cumulative Frequency Curve (iii) Frequency Polygon (iv) Box-and-Whisker

More information

MATH 264 Problem Homework I

MATH 264 Problem Homework I MATH Problem Homework I Due to December 9, 00@:0 PROBLEMS & SOLUTIONS. A student answers a multiple-choice examination question that offers four possible answers. Suppose that the probability that the

More information

Continuous Probability Distributions & Normal Distribution

Continuous Probability Distributions & Normal Distribution Mathematical Methods Units 3/4 Student Learning Plan Continuous Probability Distributions & Normal Distribution 7 lessons Notes: Students need practice in recognising whether a problem involves a discrete

More information

PROBABILITY DISTRIBUTIONS

PROBABILITY DISTRIBUTIONS CHAPTER 3 PROBABILITY DISTRIBUTIONS Page Contents 3.1 Introduction to Probability Distributions 51 3.2 The Normal Distribution 56 3.3 The Binomial Distribution 60 3.4 The Poisson Distribution 64 Exercise

More information

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables Chapter 5 Continuous Random Variables and Probability Distributions 5.1 Continuous Random Variables 1 2CHAPTER 5. CONTINUOUS RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS Probability Distributions Probability

More information

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, Abstract

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, Abstract Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, 2013 Abstract Introduct the normal distribution. Introduce basic notions of uncertainty, probability, events,

More information

MVE051/MSG Lecture 7

MVE051/MSG Lecture 7 MVE051/MSG810 2017 Lecture 7 Petter Mostad Chalmers November 20, 2017 The purpose of collecting and analyzing data Purpose: To build and select models for parts of the real world (which can be used for

More information

x is a random variable which is a numerical description of the outcome of an experiment.

x is a random variable which is a numerical description of the outcome of an experiment. Chapter 5 Discrete Probability Distributions Random Variables is a random variable which is a numerical description of the outcome of an eperiment. Discrete: If the possible values change by steps or jumps.

More information

Binomial Random Variables. Binomial Random Variables

Binomial Random Variables. Binomial Random Variables Bernoulli Trials Definition A Bernoulli trial is a random experiment in which there are only two possible outcomes - success and failure. 1 Tossing a coin and considering heads as success and tails as

More information

BIOL The Normal Distribution and the Central Limit Theorem

BIOL The Normal Distribution and the Central Limit Theorem BIOL 300 - The Normal Distribution and the Central Limit Theorem In the first week of the course, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are

More information

Counting Basics. Venn diagrams

Counting Basics. Venn diagrams Counting Basics Sets Ways of specifying sets Union and intersection Universal set and complements Empty set and disjoint sets Venn diagrams Counting Inclusion-exclusion Multiplication principle Addition

More information

Business Statistics. Chapter 5 Discrete Probability Distributions QMIS 120. Dr. Mohammad Zainal

Business Statistics. Chapter 5 Discrete Probability Distributions QMIS 120. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 5 Discrete Probability Distributions QMIS 120 Dr. Mohammad Zainal Chapter Goals After completing this chapter, you should

More information

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Chapter 14: random variables p394 A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Consider the experiment of tossing a coin. Define a random variable

More information

Business Statistics 41000: Probability 4

Business Statistics 41000: Probability 4 Business Statistics 41000: Probability 4 Drew D. Creal University of Chicago, Booth School of Business February 14 and 15, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office:

More information

Chapter 5: Statistical Inference (in General)

Chapter 5: Statistical Inference (in General) Chapter 5: Statistical Inference (in General) Shiwen Shen University of South Carolina 2016 Fall Section 003 1 / 17 Motivation In chapter 3, we learn the discrete probability distributions, including Bernoulli,

More information

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment

More information

Binomial Distribution. Normal Approximation to the Binomial

Binomial Distribution. Normal Approximation to the Binomial Binomial Distribution Normal Approximation to the Binomial /29 Homework Read Sec 6-6. Discussion Question pg 337 Do Ex 6-6 -4 2 /29 Objectives Objective: Use the normal approximation to calculate 3 /29

More information

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics Chapter 5 Student Lecture Notes 5-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 5 Discrete Probability Distributions QMIS 120 Dr. Mohammad Zainal Chapter Goals

More information

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations MLLunsford 1 Activity: Central Limit Theorem Theory and Computations Concepts: The Central Limit Theorem; computations using the Central Limit Theorem. Prerequisites: The student should be familiar with

More information

Class 12. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 12. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 12 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 2017 by D.B. Rowe 1 Agenda: Recap Chapter 6.1-6.2 Lecture Chapter 6.3-6.5 Problem Solving Session. 2

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial Lecture 8 The Binomial Distribution Probability Distributions: Normal and Binomial 1 2 Binomial Distribution >A binomial experiment possesses the following properties. The experiment consists of a fixed

More information

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution? Distributions 1. What are distributions? When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution? In other words, if we have a large number of

More information

Chapter 7: Point Estimation and Sampling Distributions

Chapter 7: Point Estimation and Sampling Distributions Chapter 7: Point Estimation and Sampling Distributions Seungchul Baek Department of Statistics, University of South Carolina STAT 509: Statistics for Engineers 1 / 20 Motivation In chapter 3, we learned

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.

More information

Chapter 6: Random Variables and Probability Distributions

Chapter 6: Random Variables and Probability Distributions Chapter 6: Random Variables and Distributions These notes reflect material from our text, Statistics, Learning from Data, First Edition, by Roxy Pec, published by CENGAGE Learning, 2015. Random variables

More information

AP Statistics Ch 8 The Binomial and Geometric Distributions

AP Statistics Ch 8 The Binomial and Geometric Distributions Ch 8.1 The Binomial Distributions The Binomial Setting A situation where these four conditions are satisfied is called a binomial setting. 1. Each observation falls into one of just two categories, which

More information

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions Announcements Unit2: Probabilityanddistributions 3. Normal and binomial distributions Sta 101 - Fall 2017 Duke University, Department of Statistical Science Formatting of problem set submissions: Bad:

More information

4 Random Variables and Distributions

4 Random Variables and Distributions 4 Random Variables and Distributions Random variables A random variable assigns each outcome in a sample space. e.g. called a realization of that variable to Note: We ll usually denote a random variable

More information

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley Outline: 1) Review of Variation & Error 2) Binomial Distributions 3) The Normal Distribution 4) Defining the Mean of a population Goals:

More information

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial. Lecture 21,22, 23 Text: A Course in Probability by Weiss 8.5 STAT 225 Introduction to Probability Models March 31, 2014 Standard Sums of Whitney Huang Purdue University 21,22, 23.1 Agenda 1 2 Standard

More information

AMS7: WEEK 4. CLASS 3

AMS7: WEEK 4. CLASS 3 AMS7: WEEK 4. CLASS 3 Sampling distributions and estimators. Central Limit Theorem Normal Approximation to the Binomial Distribution Friday April 24th, 2015 Sampling distributions and estimators REMEMBER:

More information

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Math 224 Fall 207 Homework 5 Drew Armstrong Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Section 3., Exercises 3, 0. Section 3.3, Exercises 2, 3, 0,.

More information

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x

More information

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS A random variable is the description of the outcome of an experiment in words. The verbal description of a random variable tells you how to find or calculate

More information

5.2 Random Variables, Probability Histograms and Probability Distributions

5.2 Random Variables, Probability Histograms and Probability Distributions Chapter 5 5.2 Random Variables, Probability Histograms and Probability Distributions A random variable (r.v.) can be either continuous or discrete. It takes on the possible values of an experiment. It

More information

Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny.

Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny. Distributions September 17 Random variables Anything that can be measured or categorized is called a variable If the value that a variable takes on is subject to variability, then it the variable is a

More information

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by Normal distribution The normal distribution is the most important distribution. It describes well the distribution of random variables that arise in practice, such as the heights or weights of people,

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions Announcements Unit2: Probabilityanddistributions 3. Normal and binomial distributions Sta 101 - Summer 2017 Duke University, Department of Statistical Science PS: Explain your reasoning + show your work

More information

Chapter 4 Continuous Random Variables and Probability Distributions

Chapter 4 Continuous Random Variables and Probability Distributions Chapter 4 Continuous Random Variables and Probability Distributions Part 2: More on Continuous Random Variables Section 4.5 Continuous Uniform Distribution Section 4.6 Normal Distribution 1 / 27 Continuous

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 7 Random Variables and Discrete Probability Distributions 7.1 Random Variables A random variable is a function or rule that assigns a number to each outcome of an experiment. Alternatively, the

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

Random Variables and Applications OPRE 6301

Random Variables and Applications OPRE 6301 Random Variables and Applications OPRE 6301 Random Variables... As noted earlier, variability is omnipresent in the business world. To model variability probabilistically, we need the concept of a random

More information

Business Statistics 41000: Probability 3

Business Statistics 41000: Probability 3 Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404

More information

Probability Distributions II

Probability Distributions II Probability Distributions II Summer 2017 Summer Institutes 63 Multinomial Distribution - Motivation Suppose we modified assumption (1) of the binomial distribution to allow for more than two outcomes.

More information

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Chapter 3 - Lecture 5 The Binomial Probability Distribution Chapter 3 - Lecture 5 The Binomial Probability October 12th, 2009 Experiment Examples Moments and moment generating function of a Binomial Random Variable Outline Experiment Examples A binomial experiment

More information

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution? Distributions 1. What are distributions? When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution? In other words, if we have a large number of

More information

Chapter 7: Random Variables

Chapter 7: Random Variables Chapter 7: Random Variables 7.1 Discrete and Continuous Random Variables 7.2 Means and Variances of Random Variables 1 Introduction A random variable is a function that associates a unique numerical value

More information