Binomial and normal distributions

1 Binomial and normal distributions Business Statistics Fall

2 Topics
1. Sums of random variables
2. Binomial distribution
3. Normal distribution
4. Vignettes

3 Topic: sums of random variables Sums of random variables are important for two reasons: 1. Because we often care about aggregates and totals (sales, revenue, employees, etc). 2. Because averages are basically sums, and probabilities are basically averages (of dummy variables), when we go to estimate probabilities, we will end up using sums of random variables a lot. This second point is the topic of the next lecture. For now, we focus on the direct case. 3

4 A sum of two random variables Suppose X is a random variable denoting the profit from one wager and Y is a random variable denoting the profit from another wager. If we want to consider our total profit, we may consider the random variable that is the sum of the two wagers, S = X + Y. To determine the distribution of S, we must first know the joint distribution of (X, Y ). 4

5 A sum of two random variables Suppose that (X, Y) has a joint distribution in which one wager pays -$200, $100, or $200 and the other pays $0 or $100. [Joint probability table not recoverable.] So S can take the values {-200, -100, 100, 200, 300}. Notice that there are two ways that S can be $200.

6 A sum of two random variables We can directly determine the distribution of S as: S s P(S = s) -$200 +$0 0 -$200 + $ $100 + $0 1 9 $100 + $100 or $200 + $ = 5 9 $200 + $ When determining the distribution of sums of random variables, we lose information about individual values and aggregate the probability of events giving the same sum. 6

7 Topic: binomial distribution A binomial random variable can be constructed as the sum of independent Bernoulli random variables. Familiarity with the binomial distribution eases many practical probability calculations. See OpenIntro sections 3.4 and

8 Sums of Bernoulli RVs When rolling two dice, what is the probability of rolling two ones? By independence we can calculate this probability as \(P(1, 1) = \frac{1}{6}\left(\frac{1}{6}\right) = \frac{1}{36}\). Now with three dice, what is the probability of rolling exactly two 1s?

9 Sums of Bernoulli RVs (cont'd) The event A = "rolling a one" can be described as a Bernoulli random variable with \(p = 1/6\). We can denote the three independent rolls by writing \(X_i \overset{iid}{\sim} \text{Bernoulli}(p)\), \(i = 1, 2, 3\). The notation "iid" is shorthand for "independent and identically distributed". Determining the probability of rolling exactly two 1s can be done by considering the random variable \(Y = X_1 + X_2 + X_3\) and asking for \(P(Y = 2)\).

10 Sums of Bernoulli random variables (cont'd) Consider the distribution of \(Y = X_1 + X_2 + X_3\).

Event                y    P(Y = y)
000                  0    \((1 - p)^3\)
001 or 100 or 010    1    \((1 - p)(1 - p)p + p(1 - p)(1 - p) + (1 - p)p(1 - p)\)
011 or 110 or 101    2    \((1 - p)p^2 + p^2(1 - p) + p(1 - p)p\)
111                  3    \(p^3\)

Remember that for this example \(p = 1/6\).

11 Sums of Bernoulli random variables (cont'd) Determining the probability of a certain number of successes requires knowing 1) the probability of each individual success and 2) the number of ways that number of successes can arise.

Event                y    P(Y = y)
000                  0    \((1 - p)^3\)
001 or 100 or 010    1    \(3(1 - p)^2 p\)
011 or 110 or 101    2    \(3(1 - p)p^2\)
111                  3    \(p^3\)

We find that \(P(Y = 2) = 3p^2(1 - p) = 3(1/36)(5/6) = \frac{5}{6(12)} = \frac{5}{72} \approx 0.069\).
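The counting argument for the three dice can be checked by brute force. The following is a minimal sketch (the function name `prob_exact_successes` is an illustrative choice, not something from the slides): it enumerates every 0/1 outcome of the rolls and adds up the probability of those with the requested number of ones, using exact fractions.

```python
from fractions import Fraction
from itertools import product


def prob_exact_successes(n_trials, k, p):
    """P(exactly k successes in n_trials independent Bernoulli(p) trials), by enumeration."""
    total = Fraction(0)
    for outcome in product([0, 1], repeat=n_trials):
        if sum(outcome) == k:
            # Multiply p for each success and (1 - p) for each failure.
            prob = Fraction(1)
            for x in outcome:
                prob *= p if x == 1 else (1 - p)
            total += prob
    return total


p = Fraction(1, 6)  # probability of rolling a one
print(prob_exact_successes(3, 2, p))  # 5/72
```

Enumeration only scales to a handful of trials, which is why a counting formula is preferable for larger N.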

12 Sums of Bernoulli random variables (cont'd) What if we had four rolls, and the probability of success was \(1/3\)?

13 Sums of Bernoulli random variables (cont'd) Summing up the probabilities for each of the values of Y, we find:

y    P(Y = y)
0    \((1 - p)^4\)
1    \(4(1 - p)^3 p\)
2    \(6(1 - p)^2 p^2\)
3    \(4(1 - p)p^3\)
4    \(p^4\)

Substituting \(p = 1/3\) we can now find \(P(Y = y)\) for any y = 0, 1, 2, 3, 4.

14 Definition: N choose y The number of ways we can arrange y successes among N trials can be calculated efficiently by a computer. We denote this number with a special expression. N choose y: the notation \(\binom{N}{y} = \frac{N!}{(N - y)!\,y!}\) designates the number of ways that y items can be assigned to N possible positions. This notation can be used to summarize the entries in the previous tables for various values of N and y.

15 Definition: Binomial distribution A random variable Y has a binomial distribution with parameters N and p if its probability distribution function is of the form \(p(y) = \binom{N}{y} p^y (1 - p)^{N - y}\) for integer values of y between 0 and N.
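As a rough illustration, the binomial formula translates directly into a few lines of Python using the standard-library `math.comb` for "N choose y". The function name `binomial_pmf` is an assumption made for this sketch; the example evaluates the four-roll case with success probability 1/3.

```python
from math import comb


def binomial_pmf(y, n, p):
    """P(Y = y) for Y ~ Binomial(n, p): (n choose y) * p^y * (1 - p)^(n - y)."""
    return comb(n, y) * p**y * (1 - p) ** (n - y)


# Four rolls with success probability 1/3.
for y in range(5):
    print(y, round(binomial_pmf(y, 4, 1 / 3), 4))
```

The five printed probabilities should sum to one, which is a quick sanity check on any hand-built table.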

16 Example: drunk batter What is the probability that our alcoholic major-leaguer gets more than 2 hits in a game in which he has 5 at bats? Let X = number of hits. We model X as a binomial random variable with parameters N = 5 and p =

x    P(X = x)
0    \((1 - p)^5\)
1    \(5(1 - p)^4 p\)
2    \(10(1 - p)^3 p^2\)
3    \(10(1 - p)^2 p^3\)
4    \(5(1 - p)p^4\)
5    \(p^5\)

Substituting p = we calculate P(X > 2) =

17 Example: winning a best-of-seven play-off Assume that the Chicago Bulls have probability 0.4 of beating the Miami Heat in any given game and that the outcomes of individual games are independent. What is the probability that the Bulls win a seven game series against the Heat? 17

18 Example: winning a best-of-seven play-off (cont'd) Consider the number of games won by the Bulls over a full seven games against the Heat. We model this as a binomial random variable Y with parameters N = 7 and p = 0.4, which we express with the notation \(Y \sim \text{Bin}(7, 0.4)\). The symbol \(\sim\) is read "distributed as". "Bin" is short for "binomial". The numbers which follow are the values of the two binomial parameters: the number of independent Bernoulli trials (N) and the probability of success at each trial (p).

19 Example: winning a best-of-seven play-off (cont'd) Although we never see all seven games played (because the series stops as soon as one team wins four games), we note that in this expanded event space any event with at least four Bulls wins corresponds to an observable Bulls series win, and conversely any event corresponding to an observed Bulls series win has at least four total Bulls wins.

20 Example: winning a best-of-seven play-off (cont d) For example, the observable sequence (where a 1 stands for a Bulls win) has two possible completions, or Any hypothetical games played beyond the series-ending fourth win can only increase the total number of wins tallied by Y. Conversely, the sequence is an event corresponding to Y = 5 and we can associate it with the observable subsequence , a Bulls series win in six games. 20

21 Example: winning a best-of-seven play-off (cont'd) Therefore, the events corresponding to "Bulls win the series" are precisely those corresponding to \(Y \geq 4\). We may conclude that the probability of a series win for the Bulls is \(P(Y \geq 4) = P(Y = 4) + P(Y = 5) + P(Y = 6) + P(Y = 7) \approx 0.29\).

22 Example: winning a best-of-seven play-off (cont'd) We can arrive at this answer without reference to the binomial random variable Y if we are willing to do our own counting. \(P(\text{Bulls series win}) = p^4 + \binom{4}{3} p^4 (1 - p) + \binom{5}{3} p^4 (1 - p)^2 + \binom{6}{3} p^4 (1 - p)^3 \approx 0.29\). This calculation explicitly accounts for the fact that Bulls series wins necessarily conclude with a Bulls game win.
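The two routes to the series-win probability can be compared numerically. This is a small sketch under the slides' assumptions (independent games, p = 0.4); the function names are hypothetical. The first function sums the binomial tail, the second counts series that end with a Bulls win in game 4, 5, 6, or 7.

```python
from math import comb


def binomial_tail(n, p, k_min):
    """P(Y >= k_min) for Y ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(k_min, n + 1))


def series_win_by_counting(p, wins_needed=4):
    """Probability of reaching wins_needed wins before the opponent does.

    A series with j losses ends in game wins_needed + j: the last game is a win,
    and wins_needed - 1 of the earlier wins_needed - 1 + j games are wins.
    """
    return sum(
        comb(wins_needed - 1 + j, wins_needed - 1) * p**wins_needed * (1 - p) ** j
        for j in range(wins_needed)
    )


p = 0.4
print(round(binomial_tail(7, p, 4), 4))        # 0.2898
print(round(series_win_by_counting(p), 4))     # 0.2898
```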

23 Example: double lottery winners In 1971, Jane Adams won the lottery twice in one year! If you read of a double winner in your daily newspaper, how surprised should you be? To answer this question we need to make some assumptions. Consider 40 state lotteries. Assume that each one has a 1 in 18 million chance of winning. Assume that each one has 1 million people that play it daily (say, 250 times a year), and that each one buys 5 tickets. Given these conditions, what is the probability that in one calendar year there is at least one double winner? 23

24 Example: double lottery winners (cont'd) Let \(X_i\) be the random variable denoting how many winning tickets person i has: \(X_i \sim \text{Binomial}(5(250),\ p = (1/18) \times 10^{-6})\). Now let \(Y_i\) be the dummy variable for the event \(X_i > 1\), which is the event that person i is a double (or more) winner: \(Y_i \sim \text{Bernoulli}(q)\). We can compute \(q = 1 - \Pr(X_i = 0) - \Pr(X_i = 1) \approx 2.4 \times 10^{-9}\).

25 Example: double lottery winners (cont'd) To account for the million people playing the lottery in each of 40 states, we consider \(Z = \sum_{i=1}^{N} Y_i\), which is another binomial random variable: \(Z \sim \text{Binomial}(N = 40 \times 10^6, q)\). Finally, the probability that \(Z > 0\) can be found as \(1 - P(Z = 0) = 1 - (1 - q)^N \approx 1/11\). Not so rare!
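The double-winner calculation involves very small probabilities, so a naive floating-point subtraction can lose precision. The sketch below, with a hypothetical function name and under the slides' stated assumptions (1,250 tickets per person per year, a 1-in-18-million chance per ticket, 40 million players), computes q exactly with `fractions.Fraction` and then uses `log1p`/`expm1` for the final step.

```python
from fractions import Fraction
from math import expm1, log1p


def double_winner_probability(n_tickets, p_win, n_players):
    """Return (q, P(at least one player wins two or more times))."""
    # q = 1 - P(X = 0) - P(X = 1) for X ~ Binomial(n_tickets, p_win), computed exactly.
    pr0 = (1 - p_win) ** n_tickets
    pr1 = n_tickets * p_win * (1 - p_win) ** (n_tickets - 1)
    q = float(1 - pr0 - pr1)
    # 1 - (1 - q)^N, written in a numerically stable form.
    prob_any = -expm1(n_players * log1p(-q))
    return q, prob_any


q, prob_any = double_winner_probability(5 * 250, Fraction(1, 18_000_000), 40_000_000)
print(q)         # roughly 2.4e-09
print(prob_any)  # roughly 0.09, i.e. about 1 in 11
```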

26 Example: rural vs. urban hospitals About as many boys as girls are born in hospitals. In a small Country Hospital only a few babies are born every week. In the urban center, many babies are born every week at City General. Say that a normal week is one where between 45% and 55% of the babies are female. An unusual week is one where more than 55% are girls or more than 55% are boys. Which of the following is true? Unusual weeks occur equally often at Country Hospital and at City General. Unusual weeks are more common at Country Hospital than at City General. Unusual weeks are less common at Country Hospital than at City General. 26

27 Example: rural vs. urban hospital (cont'd) We can model the births in the two hospitals as two independent random variables. Let X = number of baby girls born at Country Hospital and Y = number of baby girls born at City General. \(X \sim \text{Binomial}(N_1, p)\) and \(Y \sim \text{Binomial}(N_2, p)\). Assume that p = 0.5. The key difference is that \(N_1\) is much smaller than \(N_2\). To illustrate, assume that \(N_1 = 20\) and \(N_2 = 500\).

28 Example: rural vs. urban hospital (cont'd) During a usual week at the rural hospital between \(0.45N_1 = 0.45(20) = 9\) and \(0.55N_1 = 0.55(20) = 11\) baby girls are born. The probability of a usual week is \(P(9 \leq X \leq 11) \approx 0.50\), so the probability of an unusual week is \(1 - P(9 \leq X \leq 11) = P(X < 9) + P(X > 11) \approx 0.5\). Note: satisfying the condition \(X < 9\) is the same as not satisfying the condition \(X \geq 9\); strict versus non-strict inequalities make a difference.

29 Example: rural vs. urban hospital (cont'd) [Figure: Country Hospital, probability by number of births]

30 Example: rural vs. urban hospital (cont'd) In a usual week at the city hospital between \(0.45N_2 = 0.45(500) = 225\) and \(0.55N_2 = 0.55(500) = 275\) baby girls are born. Then the probability of a usual week is \(P(225 \leq Y \leq 275) = 0.978\), so the probability of an unusual week is \(1 - P(225 \leq Y \leq 275) = P(Y < 225) + P(Y > 275) = 0.022\).
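Both hospital probabilities can be computed by summing binomial probabilities directly. This is a minimal sketch assuming p = 0.5 with 20 births per week at Country Hospital and 500 at City General; the helper name `prob_between` is made up for the example.

```python
from math import comb


def prob_between(n, p, lo, hi):
    """P(lo <= X <= hi) for X ~ Binomial(n, p), inclusive at both ends."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(lo, hi + 1))


usual_country = prob_between(20, 0.5, 9, 11)
usual_city = prob_between(500, 0.5, 225, 275)

print(round(1 - usual_country, 3))  # unusual week at Country Hospital: about 0.5
print(round(1 - usual_city, 3))     # unusual week at City General: about 0.02
```

The inclusive bounds matter here: shifting either endpoint by one birth changes the small-hospital answer noticeably.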

31 Example: rural vs. urban hospital (cont'd) [Figure: City General, probability by number of births]

32 Variance of a sum of independent random variables A useful fact: Variance of linear combinations of independent random variables. A weighted sum/difference of independent random variables \(Y = \sum_{i=1}^{m} a_i X_i\) has variance \(V(Y) = \sum_{i=1}^{m} a_i^2 V(X_i)\). How can this be used to derive the expression for the variance of a binomial random variable?

33 Variance of binomial random variable A binomial random variable X with parameters N and p has variance \(V(X) = Np(1 - p)\).

34 Variance of a proportion By dividing through by the total number of babies born each week we can consider the proportion of girl babies. Define the random variables \(P_1 = X / N_1\) and \(P_2 = Y / N_2\). Then it follows that \(V(P_1) = \frac{V(X)}{N_1^2} = \frac{N_1 p(1 - p)}{N_1^2} = \frac{p(1 - p)}{N_1}\) and \(V(P_2) = \frac{V(Y)}{N_2^2} = \frac{N_2 p(1 - p)}{N_2^2} = \frac{p(1 - p)}{N_2}\).

35 Law of Large Numbers An arithmetical average of random variables is itself a random variable. As more and more individual random variables are averaged up, the variance decreases but the mean stays the same. As a result, the distribution of the averaged random variable becomes more and more concentrated around its expected value. 35

36 Law of Large Numbers Distribution of sample proportion (N = 10, p = 0.7)

37 Law of Large Numbers Distribution of sample proportion (N = 20, p = 0.7)

38 Law of Large Numbers Distribution of sample proportion (N = 50, p = 0.7)

39 Law of Large Numbers Distribution of sample proportion (N = 150, p = 0.7)

40 Law of Large Numbers Distribution of sample proportion (N = 300, p = 0.7)
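The concentration shown in this sequence of plots can also be expressed numerically. The sketch below is an illustration rather than part of the original slides: for each N it prints the variance p(1 - p)/N of the sample proportion and the probability that the proportion lands within 0.05 of p = 0.7. The tolerance of 0.05 and the function name are assumptions chosen for the example; fractions are used so the boundary comparison is exact.

```python
from fractions import Fraction
from math import comb


def prob_proportion_within(n, p, tol):
    """P(|Y/n - p| <= tol) for Y ~ Binomial(n, p); p and tol are Fractions."""
    total = 0.0
    for k in range(n + 1):
        if abs(Fraction(k, n) - p) <= tol:
            total += comb(n, k) * float(p) ** k * float(1 - p) ** (n - k)
    return total


p = Fraction(7, 10)
for n in (10, 20, 50, 150, 300):
    variance = float(p * (1 - p) / n)
    within = prob_proportion_within(n, p, Fraction(1, 20))
    print(n, round(variance, 5), round(within, 3))
```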

41 Example: Schlitz Super Bowl taste test 41

42 Bell curve approximation to binomial The binomial distributions can be approximated by a smooth density function for large N. [Figure: normal approximation for binomial distribution with N = 20, p = 0.5; probability mass / density against x]

43 Bell curve approximation to binomial [Figure: normal approximation for binomial distribution with N = 60, p = 0.1; probability mass / density against x]

44 Bell curve approximation to binomial [Figure: normal approximation for binomial distribution with N = 500, p = 0.8; probability mass / density against x] What are some reasons that very small p or small N lead to bad approximations?

45 Central limit theorem The normal distribution can be justified via its relationship to the binomial distribution. Roughly: if a random outcome is the combined result of many individual random events, its distribution will follow a normal curve. The quincunx or Galton box is a device which physically simulates such a scenario using ball bearings and pins stuck in a board. The CLT can be stated more precisely, but the practical impact is just this: random variables which arise as sums of many other random variables (not necessarily normally distributed) tend to be normally distributed.

46 Normal distributions The normal family of densities has two parameters, typically denoted \(\mu\) and \(\sigma^2\), which govern the location and scale, respectively. [Figure: Gaussian densities for various location parameters; f(x) against x]

47 Normal distributions (cont'd) I will use the terms normal distribution, normal density and normal random variable more or less interchangeably. [Figure: mean-zero Gaussian densities with differing scale parameters; f(x) against x] The normal distribution is also called the Gaussian distribution or the bell curve.

48 Normal means and variances Mean and variance of a normal random variable: a normal random variable X, with parameters \(\mu\) and \(\sigma^2\), is denoted \(X \sim N(\mu, \sigma^2)\). The mean and variance of X are \(E(X) = \mu\) and \(V(X) = \sigma^2\). The density function is symmetric and unimodal, so the median and mode of X are also given by the location parameter \(\mu\). The standard deviation of X is given by \(\sigma\).

49 Normal approximation to binomial The binomial distributions can be approximated by a normal distribution. Normal approximation to the binomial: a \(\text{Bin}(N, p)\) distribution can be approximated by a \(N(Np, Np(1 - p))\) distribution for N large enough. Notice that this just matches the mean and variance of the two distributions.
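One way to get a feel for "N large enough" is to compare the binomial probabilities with the matching normal density at each integer. The sketch below does this for the three parameter settings plotted earlier; measuring the largest pointwise gap is an illustrative choice made here, not a criterion from the slides, and the function names are hypothetical.

```python
from math import comb, exp, pi, sqrt


def binomial_pmf(k, n, p):
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def normal_pdf(x, mean, var):
    return exp(-((x - mean) ** 2) / (2 * var)) / sqrt(2 * pi * var)


def max_abs_gap(n, p):
    """Largest |binomial pmf - normal density| over k = 0..n, with matched mean and variance."""
    mean, var = n * p, n * p * (1 - p)
    return max(abs(binomial_pmf(k, n, p) - normal_pdf(k, mean, var)) for k in range(n + 1))


for n, p in [(20, 0.5), (60, 0.1), (500, 0.8)]:
    print(n, p, round(max_abs_gap(n, p), 4))
```

The N = 60, p = 0.1 case shows the largest gap of the three, because the binomial is visibly skewed when p is small.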

50 Linear transformation of normal RVs We can add a fixed number to a normal random variable and/or multiply it by a fixed number and get a new normal random variable. This sort of operation is called a linear transformation. Linear transformation of normal random variables: if \(X \sim N(\mu, \sigma^2)\) and \(Y = a + bX\) for fixed numbers a and b, then \(Y \sim N(a + b\mu, b^2\sigma^2)\). For example, if \(X \sim N(1, 2)\) and \(Y = 3 - 5X\), then \(Y \sim N(-2, 50)\).

51 Standard normal RV A standard normal random variable is one with mean 0 and variance 1. It is often denoted by the letter Z: \(Z \sim N(0, 1)\). We can write any normal random variable as a linear transformation of a standard normal RV. For normal random variable \(X \sim N(\mu, \sigma^2)\), we can write \(X = \mu + \sigma Z\).
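The linear-transformation rule and the standardization idea combine into a short recipe for normal probabilities. The following sketch uses hypothetical helper names and expresses the standard normal CDF through the standard-library `math.erf`; it reproduces the example Y = 3 - 5X with X distributed N(1, 2).

```python
from math import erf, sqrt


def linear_transform(mu, var, a, b):
    """Mean and variance of Y = a + b*X when X ~ N(mu, var)."""
    return a + b * mu, b**2 * var


def normal_cdf(x, mu, var):
    """P(X <= x) for X ~ N(mu, var), via the standardized value z = (x - mu) / sigma."""
    z = (x - mu) / sqrt(var)
    return 0.5 * (1 + erf(z / sqrt(2)))


mean_y, var_y = linear_transform(1, 2, a=3, b=-5)
print(mean_y, var_y)                           # -2 50
print(round(normal_cdf(0, mean_y, var_y), 3))  # P(Y <= 0), about 0.61
```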

52 The empirical rule It is convenient to characterize where the bulk of the probability mass of a normal distribution resides by providing an interval, in terms of standard deviations, about the mean. [Figure: \(N(\mu, \sigma)\) density with shaded central percentage; x axis marked from \(\mu - 4\sigma\) to \(\mu + 4\sigma\)]

53 The empirical rule (cont'd) The widespread application of the normal distribution has led this to be dubbed the empirical rule. [Figure: \(N(\mu, \sigma)\) density with shaded central percentage; x axis marked from \(\mu - 4\sigma\) to \(\mu + 4\sigma\)]

54 The empirical rule (cont'd) It is, for obvious reasons, sometimes called the 68-95-99.7 rule. [Figure: \(N(\mu, \sigma)\) density with shaded central percentage; x axis marked from \(\mu - 4\sigma\) to \(\mu + 4\sigma\)]

55 The empirical rule (cont'd) To revisit some earlier examples: 68% of Chicago daily highs in the winter season are between 19 and 48 degrees. 95% of NBA players are between 6ft and 7ft 2in. In 99.7% of weeks, the proportion of baby girls born at City General is between about 0.433 and 0.567.
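The percentages in the empirical rule come from the standard normal distribution: the probability of falling within k standard deviations of the mean is erf(k / sqrt(2)). The sketch below checks the three figures and, as an added illustration, applies the three-standard-deviation interval to the City General proportion, assuming p = 0.5, N = 500, and the normal approximation.

```python
from math import erf, sqrt


def prob_within_k_sd(k):
    """P(|Z| <= k) for a standard normal Z."""
    return erf(k / sqrt(2))


for k in (1, 2, 3):
    print(k, round(prob_within_k_sd(k), 4))  # 0.6827, 0.9545, 0.9973

# City General: proportion of girls with p = 0.5 and N = 500 births.
p, n = 0.5, 500
sd = sqrt(p * (1 - p) / n)
low, high = p - 3 * sd, p + 3 * sd
print(round(low, 3), round(high, 3))  # 0.433 0.567
```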

56 Sums of normal random variables Weighted sums of independent normal random variables are also normally distributed. For example, if \(X_1 \sim N(5, 20)\) and \(X_2 \sim N(1, 0.5)\), then for \(Y = 0.1X_1 + 0.9X_2\) we have \(Y \sim N(m, v)\), where \(m = 0.1(5) + 0.9(1) = 1.4\) and \(v = 0.1^2(20) + 0.9^2(0.5) = 0.605\).

57 Linear combinations of normal RVs Linear combinations of independent normal random variables: for \(i = 1, \ldots, n\), let \(X_i \sim N(\mu_i, \sigma_i^2)\) be independent. Define \(Y = \sum_{i=1}^{n} a_i X_i\) for weights \(a_1, a_2, \ldots, a_n\). Then \(Y \sim N(m, v)\), where \(m = \sum_{i=1}^{n} a_i \mu_i\) and \(v = \sum_{i=1}^{n} a_i^2 \sigma_i^2\).
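The rule for the mean and variance of a weighted sum is a one-liner each in code. This sketch (the function name is an illustrative choice) assumes the variables are independent and reproduces the two-variable example with weights 0.1 and 0.9.

```python
def linear_combination_params(weights, means, variances):
    """Mean m and variance v of Y = sum(a_i * X_i) for independent X_i."""
    m = sum(a * mu for a, mu in zip(weights, means))
    v = sum(a**2 * s2 for a, s2 in zip(weights, variances))
    return m, v


m, v = linear_combination_params([0.1, 0.9], [5, 1], [20, 0.5])
print(round(m, 3), round(v, 3))  # 1.4 0.605
```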

58 Example: two-stock portfolio Consider two stocks, A and B, with annual returns (in percent of investment) distributed according to normal distributions X A N(5, 20) and X B N(1, 0.5). What fraction of our investment should we put into stock A, with the remainder put in stock B? 58

59 Example: two-stock portfolio (cont'd) For a given fraction \(\alpha\), the total return on our portfolio is \(Y = \alpha X_A + (1 - \alpha) X_B\) with distribution \(Y \sim N(m, v)\), where \(m = 5\alpha + (1 - \alpha)\) and \(v = 20\alpha^2 + 0.5(1 - \alpha)^2\).

60 Example: two-stock portfolio (cont'd) Suppose we want to find \(\alpha\) so that \(P(Y \leq 0)\) is as small as possible. [Figure: two-stock portfolio; densities of percent return for Stock A, Stock B, and portfolios] The blue distributions correspond to varying values of \(\alpha\).

61 Example: two-stock portfolio (cont'd) We can plot the probability of a loss as a function of \(\alpha\). [Figure: probability of a loss against \(\alpha\)] We see that this probability is minimized when \(\alpha\) is approximately 11%. This is the LLN at work!
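The minimizing fraction can be located with a simple grid search over the loss probability. This is a sketch under the slides' assumptions (independent returns distributed N(5, 20) and N(1, 0.5)); the grid step of 0.001 and the function names are choices made for the illustration.

```python
from math import erf, sqrt


def normal_cdf(x, mean, var):
    return 0.5 * (1 + erf((x - mean) / sqrt(2 * var)))


def loss_probability(alpha):
    """P(Y <= 0) for Y = alpha * X_A + (1 - alpha) * X_B with independent returns."""
    m = 5 * alpha + (1 - alpha)
    v = 20 * alpha**2 + 0.5 * (1 - alpha) ** 2
    return normal_cdf(0, m, v)


alphas = [i / 1000 for i in range(1001)]
best = min(alphas, key=loss_probability)
print(best)  # close to 0.11
print(round(loss_probability(0.0), 3), round(loss_probability(best), 3), round(loss_probability(1.0), 3))
```

Setting the derivative of m / sqrt(v) to zero gives the same answer in closed form, alpha = 1/9.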

62 Variance of a sum of correlated random variables For correlated (dependent) random variables, we have a modified formula. Variance of linear combinations of two correlated random variables: a weighted sum/difference of random variables \(Y = a_1 X_1 + a_2 X_2\) has variance \(V(Y) = a_1^2 V(X_1) + a_2^2 V(X_2) + 2 a_1 a_2 \text{Cov}(X_1, X_2)\). There is a homework problem that asks you to find the variance of portfolios of stocks, as in the example above, for stocks which are related to one another (in a common industry, for example).
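A short sketch of the two-variable formula follows. The covariance value of 1.0 used in the second call is purely hypothetical, chosen to show how positive covariance raises the portfolio variance; with zero covariance the function reduces to the independent case.

```python
def variance_of_weighted_sum(a1, a2, var1, var2, cov=0.0):
    """V(a1*X1 + a2*X2) = a1^2 V(X1) + a2^2 V(X2) + 2 a1 a2 Cov(X1, X2)."""
    return a1**2 * var1 + a2**2 * var2 + 2 * a1 * a2 * cov


# Independent case: matches the earlier two-variable example.
print(round(variance_of_weighted_sum(0.1, 0.9, 20, 0.5), 3))           # 0.605
# Hypothetical covariance of 1.0 between the two returns.
print(round(variance_of_weighted_sum(0.1, 0.9, 20, 0.5, cov=1.0), 3))  # 0.785
```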

63 Vignettes
1. Differential dispersion
2. Average number of sex partners
3. Mean reversion

64 Vignette: a difference in dispersion In this vignette we observe how selection (in the sense of evolution, or hiring, or admissions) can turn higher variability into over-representation. The analysis uses the ideas of random variables, distribution functions, and conditional probability. For more background, read the article Sex Ed from the February 2005 issue of the New Republic (available at the course home page). 64

65 A difference in dispersion Consider two groups of college graduates with employee fitness scores following the distributions shown below. [Figures: distribution of capabilities for Group A and for Group B; probability against score] These distributions have the same mean, the same median, and the same mode. But they differ in their dispersion, or variability.

66 A difference in dispersion (cont'd) Let X denote the random variables recording the scores and let A and B denote membership in the respective groups. [Figures: distribution of capabilities for Group A and for Group B; probability against score] \(V(X \mid A) = 5.87\) and \(V(X \mid B) =\) The corresponding standard deviations are \(\sigma(X \mid A) = 2.42\) and \(\sigma(X \mid B) =\)

67 A difference in dispersion (cont'd) But now consider only elite jobs, for which it is necessary that fitness score \(X \geq 4\). [Figures: distribution of capabilities for Group A and for Group B; probability against score] We can use Bayes' rule to calculate \(P(A \mid X \geq 4)\) and \(P(B \mid X \geq 4)\).

68 A difference in dispersion (cont'd) If we assume a priori that \(P(A) = P(B) = 1/2\), we find \(P(A \mid X \geq 4) = \frac{P(X \geq 4 \mid A) P(A)}{P(X \geq 4 \mid A) P(A) + P(X \geq 4 \mid B) P(B)} = \frac{0.094(0.5)}{0.094(0.5) + P(X \geq 4 \mid B)(0.5)}\). Why don't we need to calculate \(P(B \mid X \geq 4)\) separately?

69 Larry Summers and women-in-science "Summers's critics have repeatedly mangled his suggestion that innate differences might be one cause of gender disparities... into the claim that they must be the only cause. And they have converted his suggestion that the statistical distributions of men's and women's abilities are not identical to the claim that all men are talented and all women are not, as if someone heard that women typically live longer than men and concluded that every woman lives longer than every man.... In many traits, men show greater variance than women, and are disproportionately found at both the low and high ends of the distribution. Boys are more likely to be learning disabled or retarded but also more likely to reach the top percentiles in assessments of mathematical ability, even though boys and girls are similar in the bulk of the bell curve...." Steven Pinker in The New Republic

70 Example: gender and aptitudes revisited Assume that job aptitude can be represented as a continuous random variable and that the distribution of scores differs by gender. [Figure: aptitude distribution; density against score for women and men] For women, 93.7% of the scores are between the vertical dashed lines, whereas only 68.6% of the men's scores fall in this range.

71 Example: gender and aptitudes revisited (cont'd) The corresponding CDFs reveal the same difference. [Figure: cumulative distribution function F(x) against score] These distributions are meant to be illustrative rather than factual.

72 Sex partners vignette: which average? Here is a torn-from-the-headlines example of why it pays to know a little probability. Everyone knows men are promiscuous by nature...surveys bear this out. In study after study and in country after country, men report more, often many more, sexual partners than women... But there is just one problem, mathematicians say. It is logically impossible for heterosexual men to have more partners on average than heterosexual women. Those survey results cannot be true. 72

73 A sex-partners statistical model Question: is it possible for men to have more sex partners, on average, than women? To answer this question, we will consider a toy probability model for Homo sapiens mating behavior, with three men (John, Lenny, Romeo) and three women (Sally, Chastity, Maude). Let's call it the summer camp model.

74 A sex-partners random variable The quantity of interest is the number of sex partners. In our model, this will be a number between 0 and 3. For each individual we can compute the distribution of this random variable. We will denote individuals by their first initial. A red initial means they partnered, a black initial means they did not. We will assume independence. This means, for example, that Sally hooking up with Romeo makes it neither more nor less likely that she will hook up with Lenny. 74

75 Sally's sex-partner distribution

Event (partners among J, L, R)    x    \(P(X_s = x)\)
none                              0    (1 - 0.07)(1 - 0.06)(1 - 0.05)
exactly one                       1    (0.07)(1 - 0.06)(1 - 0.05) + (1 - 0.07)(0.06)(1 - 0.05) + (1 - 0.07)(1 - 0.06)(0.05)
exactly two                       2    (0.07)(0.06)(1 - 0.05) + (1 - 0.07)(0.06)(0.05) + (0.07)(1 - 0.06)(0.05)
all three                         3    (0.07)(0.06)(0.05)

Can you see the probability laws in action here?

76 Sally's sex-partner distribution

Event (partners among J, L, R)    x    \(p_s(x) = P(X_s = x)\)
none                              0    0.83
exactly one                       1    0.16
exactly two                       2    0.01
all three                         3    0.0002

Here is what it looks like after the calculation (rounded a bit). We can do similarly for each individual.

77 Sally's sex-partners distribution Here is a picture of Sally's sex partner distribution. [Figure: distribution of sex partners for Sally; probability against number of partners] The mean is \(0(0.83) + 1(0.16) + 2(0.01) + 3(0.0002) \approx 0.18\). What is the mode? What is the median?
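Sally's distribution can be reproduced by enumerating the eight partnered/not-partnered combinations. This is a minimal sketch assuming the three independent pairing probabilities 0.07, 0.06, and 0.05 that appear in her table; the function name is hypothetical.

```python
from itertools import product


def partner_count_distribution(probs):
    """Distribution of the number of partners given independent pairing probabilities."""
    dist = {k: 0.0 for k in range(len(probs) + 1)}
    for outcome in product([0, 1], repeat=len(probs)):
        prob = 1.0
        for hit, p in zip(outcome, probs):
            prob *= p if hit else (1 - p)
        dist[sum(outcome)] += prob
    return dist


sally = partner_count_distribution([0.07, 0.06, 0.05])
mean = sum(k * v for k, v in sally.items())
for k, v in sally.items():
    print(k, round(v, 4))
print(round(mean, 2))  # 0.18
```

The mean equals the sum of the three pairing probabilities, which is a useful check on the rounded table.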

78 Female sex-partner distribution To get the distribution for all females, we sum over the individual women. We apply the law of total probability using all three conditional distributions: \(p_{\text{female}}(x) = p_s(x) P(\text{Sally}) + p_c(x) P(\text{Chastity}) + p_m(x) P(\text{Maude})\). We assume that the women are selected at random with equal probability \(P(\text{Maude}) = P(\text{Chastity}) = P(\text{Sally}) = 1/3\).

79 Female sex-partner distribution At the end we get a distribution like this. [Figure: distribution of sex partners for females; probability against number of partners] The mean is 0.62, the mode is 0, and the median is 0.

80 Male sex-partner distribution We can do the same thing for the males, and we get this. [Figure: distribution of sex partners for males; probability against number of partners] The mean is 0.62, the mode is 1, and the median is 1.

81 Sex-partners vignette recap The narrow lesson is that it pays to be specific about which measure of central tendency you're talking about! The more general lesson is that using probability models and a little bit of algebra can help us see a situation more clearly. This example uses the concepts of random variable, independence, conditional distribution, mean, median...and others.

82 Idea: statistical null hypotheses The hypothesis that events are independent often makes a nice contrast to other explanations, namely that random events are somehow related. This vantage point allows us to judge if those other explanations fit the facts any better than the uninteresting null explanation that events are independent. 82

83 Vignette: making better pilots Flight instructors have a policy of berating pilots who make bad landings. They notice that good landings met with praise mostly result in subsequently less-good landings, while bad landings met with harsh criticism mostly result in subsequently improved landings. Is their causal reasoning necessarily valid? To stress-test their judgment that criticism works we consider the evidence in light of the null hypothesis that subsequent landings are in fact independent of one another, regardless of criticism or praise. 83

84 Example: making better pilots (cont'd) Contrary to the assumptions of the instructors, consider each landing as independent of subsequent landings (irrespective of feedback). Assume that landings can be classified into three types: bad, adequate, or good. Further assume the following probabilities:

Event          bad      adequate    good
Probability    \(p_b\)  \(p_a\)     \(p_g\)

Remember that \(p_b + p_a + p_g = 1\).

85 Example: making better pilots (cont'd) Assume that the policy of criticism is judged to work when a bad landing is followed by a not-bad landing. Then \(P(\text{criticism seems to work}) = P(\text{not bad}_2 \mid \text{bad}_1) = P(\text{not bad}_2) = p_a + p_g\) by independence. Conversely, the policy of praise appears to work when a good landing is followed by another good landing. So \(P(\text{good}_2 \mid \text{good}_1) = P(\text{good}_2) = p_g\). Praise always appears to work less often than criticism!
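The independence argument can be illustrated with a small simulation in which feedback has no effect at all. The landing probabilities used here (0.2 bad, 0.6 adequate, 0.2 good) are hypothetical values chosen only for the illustration, as are the function name and the fixed seed.

```python
import random


def simulate_feedback(p_bad, p_adequate, p_good, n_pairs=100_000, seed=0):
    """Simulate independent pairs of landings; return how often each policy 'seems to work'."""
    rng = random.Random(seed)
    labels = ["bad", "adequate", "good"]
    weights = [p_bad, p_adequate, p_good]
    first = rng.choices(labels, weights=weights, k=n_pairs)
    second = rng.choices(labels, weights=weights, k=n_pairs)
    after_bad = [s for f, s in zip(first, second) if f == "bad"]
    after_good = [s for f, s in zip(first, second) if f == "good"]
    # Criticism "works" if a bad landing is followed by a not-bad one.
    criticism_works = sum(s != "bad" for s in after_bad) / len(after_bad)
    # Praise "works" if a good landing is followed by another good one.
    praise_works = sum(s == "good" for s in after_good) / len(after_good)
    return criticism_works, praise_works


criticism, praise = simulate_feedback(0.2, 0.6, 0.2)
print(round(criticism, 2), round(praise, 2))  # near p_a + p_g = 0.8 and p_g = 0.2
```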

86 Remark: null and alternative hypotheses The previous example shows that the evidence can appear to favor criticism over praise even if criticism and praise are totally irrelevant. Does this mean that criticism does not work? No, it just means that the observed facts are not compelling evidence that criticism works, because they are entirely consistent with the null hypothesis that landing quality is independent of previous landings and feedback. In cases like this we say we "fail to reject the null hypothesis". We'll revisit this terminology a couple weeks from now.

87 Example: making better pilots (continuous version) What if we want to take pilot skill into account? We will model this situation using normal random variables and see if the same conclusions (that praise appears to hurt performance and criticism seems to boost it) could arise by chance. 87

88 Example: making better pilots (continuous version, cont'd) Assume that each pilot has a certain ability level, call it A. Each individual landing score arises as a combination of this ability and certain random fluctuations, call them \(\epsilon\). The landing score at time t can be expressed as \(S_t = A + \epsilon_t\). Assuming that \(\epsilon_t \overset{iid}{\sim} N(0, \sigma^2)\), then \(S_t \sim N(A, \sigma^2)\).

89 Example: making better pilots (continuous version, cont'd) Denote an average landing score as M. Consider a pilot with \(A > M\). When he makes an exceptional landing, because \(\epsilon_1 > 2\sigma\), he is unlikely to best it on his next landing. [Figure: distribution of landing scores; density with M, A, \(A + \epsilon_1\), and \(S_2\) marked] For this reason, praise is unlikely to work even though landings are independent of one another.

90 Example: making better pilots (continuous version, cont'd) For a poor pilot with \(A < M\) a similar argument holds. When he makes a very poor landing, because \(\epsilon_1 < -2\sigma\), he is unlikely to do worse on his next landing. [Figure: distribution of landing scores; density with \(A + \epsilon_1\), A, M, and \(S_2\) marked] For this reason, criticism is likely to work even though landings are independent.
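Under the model S_t = A + ε_t with independent normal fluctuations, the chance that the next landing beats the current one depends only on how extreme the current fluctuation was. The sketch below (hypothetical function names) evaluates P(S_2 > S_1) given ε_1 = cσ, which equals 1 - Φ(c) because S_2 exceeds S_1 exactly when ε_2 exceeds ε_1.

```python
from math import erf, sqrt


def std_normal_cdf(z):
    return 0.5 * (1 + erf(z / sqrt(2)))


def prob_next_is_better(c):
    """P(S_2 > S_1 | eps_1 = c * sigma) = P(eps_2 > c * sigma) = 1 - Phi(c)."""
    return 1 - std_normal_cdf(c)


print(round(prob_next_is_better(2), 4))   # after an exceptional landing: 0.0228
print(round(prob_next_is_better(-2), 4))  # after a very poor landing: 0.9772
```

Neither number depends on the pilot's ability A or on any feedback, which is the point of the example.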

91 Idea: mean reversion The previous example illustrates an idea known as mean reversion. This name refers to the fact that subsequent observations tend to be pulled back towards the overall mean even if the events are independent of one another. Mean reversion describes a probabilistic fact, not a physical process. What might the flight instructors have done (as an experiment) to really get to the bottom of their question? 91


More information

Chapter 5: Probability

Chapter 5: Probability Chapter 5: These notes reflect material from our text, Exploring the Practice of Statistics, by Moore, McCabe, and Craig, published by Freeman, 2014. quantifies randomness. It is a formal framework with

More information

Chapter 6. The Normal Probability Distributions

Chapter 6. The Normal Probability Distributions Chapter 6 The Normal Probability Distributions 1 Chapter 6 Overview Introduction 6-1 Normal Probability Distributions 6-2 The Standard Normal Distribution 6-3 Applications of the Normal Distribution 6-5

More information

Chapter 4 Probability Distributions

Chapter 4 Probability Distributions Slide 1 Chapter 4 Probability Distributions Slide 2 4-1 Overview 4-2 Random Variables 4-3 Binomial Probability Distributions 4-4 Mean, Variance, and Standard Deviation for the Binomial Distribution 4-5

More information

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions Chapter 4 Probability Distributions 4-1 Overview 4-2 Random Variables 4-3 Binomial Probability Distributions 4-4 Mean, Variance, and Standard Deviation for the Binomial Distribution 4-5 The Poisson Distribution

More information

PROBABILITY DISTRIBUTIONS

PROBABILITY DISTRIBUTIONS CHAPTER 3 PROBABILITY DISTRIBUTIONS Page Contents 3.1 Introduction to Probability Distributions 51 3.2 The Normal Distribution 56 3.3 The Binomial Distribution 60 3.4 The Poisson Distribution 64 Exercise

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 5 Continuous Random Variables and Probability Distributions Ch. 5-1 Probability Distributions Probability Distributions Ch. 4 Discrete Continuous Ch. 5 Probability

More information

Statistics and Probability

Statistics and Probability Statistics and Probability Continuous RVs (Normal); Confidence Intervals Outline Continuous random variables Normal distribution CLT Point estimation Confidence intervals http://www.isrec.isb-sib.ch/~darlene/geneve/

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom Review for Final Exam 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom THANK YOU!!!! JON!! PETER!! RUTHI!! ERIKA!! ALL OF YOU!!!! Probability Counting Sets Inclusion-exclusion principle Rule of product

More information

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial. Lecture 21,22, 23 Text: A Course in Probability by Weiss 8.5 STAT 225 Introduction to Probability Models March 31, 2014 Standard Sums of Whitney Huang Purdue University 21,22, 23.1 Agenda 1 2 Standard

More information

Section Random Variables and Histograms

Section Random Variables and Histograms Section 3.1 - Random Variables and Histograms Definition: A random variable is a rule that assigns a number to each outcome of an experiment. Example 1: Suppose we toss a coin three times. Then we could

More information

FINAL REVIEW W/ANSWERS

FINAL REVIEW W/ANSWERS FINAL REVIEW W/ANSWERS ( 03/15/08 - Sharon Coates) Concepts to review before answering the questions: A population consists of the entire group of people or objects of interest to an investigator, while

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Random Variables and Applications OPRE 6301

Random Variables and Applications OPRE 6301 Random Variables and Applications OPRE 6301 Random Variables... As noted earlier, variability is omnipresent in the business world. To model variability probabilistically, we need the concept of a random

More information

The topics in this section are related and necessary topics for both course objectives.

The topics in this section are related and necessary topics for both course objectives. 2.5 Probability Distributions The topics in this section are related and necessary topics for both course objectives. A probability distribution indicates how the probabilities are distributed for outcomes

More information

Statistics. Marco Caserta IE University. Stats 1 / 56

Statistics. Marco Caserta IE University. Stats 1 / 56 Statistics Marco Caserta marco.caserta@ie.edu IE University Stats 1 / 56 1 Random variables 2 Binomial distribution 3 Poisson distribution 4 Hypergeometric Distribution 5 Jointly Distributed Discrete Random

More information

Section 0: Introduction and Review of Basic Concepts

Section 0: Introduction and Review of Basic Concepts Section 0: Introduction and Review of Basic Concepts Carlos M. Carvalho The University of Texas McCombs School of Business mccombs.utexas.edu/faculty/carlos.carvalho/teaching 1 Getting Started Syllabus

More information

Midterm Exam III Review

Midterm Exam III Review Midterm Exam III Review Dr. Joseph Brennan Math 148, BU Dr. Joseph Brennan (Math 148, BU) Midterm Exam III Review 1 / 25 Permutations and Combinations ORDER In order to count the number of possible ways

More information

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability. An intro for calculus students P= Figure 1: A normal integral Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided

More information

Review of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet...

Review of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet... Recap Review of commonly missed questions on the online quiz Lecture 7: ] Statistics 101 Mine Çetinkaya-Rundel OpenIntro quiz 2: questions 4 and 5 September 20, 2011 Statistics 101 (Mine Çetinkaya-Rundel)

More information

TOPIC: PROBABILITY DISTRIBUTIONS

TOPIC: PROBABILITY DISTRIBUTIONS TOPIC: PROBABILITY DISTRIBUTIONS There are two types of random variables: A Discrete random variable can take on only specified, distinct values. A Continuous random variable can take on any value within

More information

Discrete Random Variables; Expectation Spring 2014

Discrete Random Variables; Expectation Spring 2014 Discrete Random Variables; Expectation 18.05 Spring 2014 https://en.wikipedia.org/wiki/bean_machine#/media/file: Quincunx_(Galton_Box)_-_Galton_1889_diagram.png http://www.youtube.com/watch?v=9xubhhm4vbm

More information

Statistics for Business and Economics: Random Variables:Continuous

Statistics for Business and Economics: Random Variables:Continuous Statistics for Business and Economics: Random Variables:Continuous STT 315: Section 107 Acknowledgement: I d like to thank Dr. Ashoke Sinha for allowing me to use and edit the slides. Murray Bourne (interactive

More information

Lecture III. 1. common parametric models 2. model fitting 2a. moment matching 2b. maximum likelihood 3. hypothesis testing 3a. p-values 3b.

Lecture III. 1. common parametric models 2. model fitting 2a. moment matching 2b. maximum likelihood 3. hypothesis testing 3a. p-values 3b. Lecture III 1. common parametric models 2. model fitting 2a. moment matching 2b. maximum likelihood 3. hypothesis testing 3a. p-values 3b. simulation Parameters Parameters are knobs that control the amount

More information

Review. What is the probability of throwing two 6s in a row with a fair die? a) b) c) d) 0.333

Review. What is the probability of throwing two 6s in a row with a fair die? a) b) c) d) 0.333 Review In most card games cards are dealt without replacement. What is the probability of being dealt an ace and then a 3? Choose the closest answer. a) 0.0045 b) 0.0059 c) 0.0060 d) 0.1553 Review What

More information

Module 4: Probability

Module 4: Probability Module 4: Probability 1 / 22 Probability concepts in statistical inference Probability is a way of quantifying uncertainty associated with random events and is the basis for statistical inference. Inference

More information

Binomial Random Variable - The count X of successes in a binomial setting

Binomial Random Variable - The count X of successes in a binomial setting 6.3.1 Binomial Settings and Binomial Random Variables What do the following scenarios have in common? Toss a coin 5 times. Count the number of heads. Spin a roulette wheel 8 times. Record how many times

More information

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution Section 7.6 Application of the Normal Distribution A random variable that may take on infinitely many values is called a continuous random variable. A continuous probability distribution is defined by

More information

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Section 2: Estimation, Confidence Intervals and Testing Hypothesis Section 2: Estimation, Confidence Intervals and Testing Hypothesis Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/

More information

Numerical Descriptive Measures. Measures of Center: Mean and Median

Numerical Descriptive Measures. Measures of Center: Mean and Median Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where

More information

AMS7: WEEK 4. CLASS 3

AMS7: WEEK 4. CLASS 3 AMS7: WEEK 4. CLASS 3 Sampling distributions and estimators. Central Limit Theorem Normal Approximation to the Binomial Distribution Friday April 24th, 2015 Sampling distributions and estimators REMEMBER:

More information

Commonly Used Distributions

Commonly Used Distributions Chapter 4: Commonly Used Distributions 1 Introduction Statistical inference involves drawing a sample from a population and analyzing the sample data to learn about the population. We often have some knowledge

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions. ME3620 Theory of Engineering Experimentation Chapter III. Random Variables and Probability Distributions Chapter III 1 3.2 Random Variables In an experiment, a measurement is usually denoted by a variable

More information

Lecture 9. Probability Distributions. Outline. Outline

Lecture 9. Probability Distributions. Outline. Outline Outline Lecture 9 Probability Distributions 6-1 Introduction 6- Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7- Properties of the Normal Distribution

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Chapter 5. Sampling Distributions

Chapter 5. Sampling Distributions Lecture notes, Lang Wu, UBC 1 Chapter 5. Sampling Distributions 5.1. Introduction In statistical inference, we attempt to estimate an unknown population characteristic, such as the population mean, µ,

More information

One sample z-test and t-test

One sample z-test and t-test One sample z-test and t-test January 30, 2017 psych10.stanford.edu Announcements / Action Items Install ISI package (instructions in Getting Started with R) Assessment Problem Set #3 due Tu 1/31 at 7 PM

More information

5.3 Statistics and Their Distributions

5.3 Statistics and Their Distributions Chapter 5 Joint Probability Distributions and Random Samples Instructor: Lingsong Zhang 1 Statistics and Their Distributions 5.3 Statistics and Their Distributions Statistics and Their Distributions Consider

More information

Section Sampling Distributions for Counts and Proportions

Section Sampling Distributions for Counts and Proportions Section 5.1 - Sampling Distributions for Counts and Proportions Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin Distributions When dealing with inference procedures, there are two different

More information

Statistics 511 Supplemental Materials

Statistics 511 Supplemental Materials Gaussian (or Normal) Random Variable In this section we introduce the Gaussian Random Variable, which is more commonly referred to as the Normal Random Variable. This is a random variable that has a bellshaped

More information

Lecture 9. Probability Distributions

Lecture 9. Probability Distributions Lecture 9 Probability Distributions Outline 6-1 Introduction 6-2 Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7-2 Properties of the Normal Distribution

More information

Bin(20,.5) and N(10,5) distributions

Bin(20,.5) and N(10,5) distributions STAT 600 Design of Experiments for Research Workers Lab 5 { Due Thursday, November 18 Example Weight Loss In a dietary study, 14 of 0 subjects lost weight. If weight is assumed to uctuate up or down by

More information

The Bernoulli distribution

The Bernoulli distribution This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic

More information

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence continuous rv Let X be a continuous rv. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that for any two numbers a and b with a b, P(a X b) = b a f (x)dx.

More information

Prof. Thistleton MAT 505 Introduction to Probability Lecture 3

Prof. Thistleton MAT 505 Introduction to Probability Lecture 3 Sections from Text and MIT Video Lecture: Sections 2.1 through 2.5 http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-041-probabilistic-systemsanalysis-and-applied-probability-fall-2010/video-lectures/lecture-1-probability-models-and-axioms/

More information

Statistical Methods in Practice STAT/MATH 3379

Statistical Methods in Practice STAT/MATH 3379 Statistical Methods in Practice STAT/MATH 3379 Dr. A. B. W. Manage Associate Professor of Mathematics & Statistics Department of Mathematics & Statistics Sam Houston State University Overview 6.1 Discrete

More information

Probability Distribution Unit Review

Probability Distribution Unit Review Probability Distribution Unit Review Topics: Pascal's Triangle and Binomial Theorem Probability Distributions and Histograms Expected Values, Fair Games of chance Binomial Distributions Hypergeometric

More information

Part V - Chance Variability

Part V - Chance Variability Part V - Chance Variability Dr. Joseph Brennan Math 148, BU Dr. Joseph Brennan (Math 148, BU) Part V - Chance Variability 1 / 78 Law of Averages In Chapter 13 we discussed the Kerrich coin-tossing experiment.

More information

Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7

Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7 Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7 Lew Davidson (Dr.D.) Mallard Creek High School Lewis.Davidson@cms.k12.nc.us 704-786-0470 Probability & Sampling The Practice of Statistics

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1 8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions For Example: On August 8, 2011, the Dow dropped 634.8 points, sending shock waves through the financial community.

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter7 Probability Distributions and Statistics Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number of boys in

More information

Statistical Methods for NLP LT 2202

Statistical Methods for NLP LT 2202 LT 2202 Lecture 3 Random variables January 26, 2012 Recap of lecture 2 Basic laws of probability: 0 P(A) 1 for every event A. P(Ω) = 1 P(A B) = P(A) + P(B) if A and B disjoint Conditional probability:

More information

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Statistics 431 Spring 2007 P. Shaman. Preliminaries Statistics 4 Spring 007 P. Shaman The Binomial Distribution Preliminaries A binomial experiment is defined by the following conditions: A sequence of n trials is conducted, with each trial having two possible

More information

Introduction to Statistics I

Introduction to Statistics I Introduction to Statistics I Keio University, Faculty of Economics Continuous random variables Simon Clinet (Keio University) Intro to Stats November 1, 2018 1 / 18 Definition (Continuous random variable)

More information

What is the probability of success? Failure? How could we do this simulation using a random number table?

What is the probability of success? Failure? How could we do this simulation using a random number table? Probability Ch.4, sections 4.2 & 4.3 Binomial and Geometric Distributions Name: Date: Pd: 4.2. What is a binomial distribution? How do we find the probability of success? Suppose you have three daughters.

More information

Introduction to Probability and Inference HSSP Summer 2017, Instructor: Alexandra Ding July 19, 2017

Introduction to Probability and Inference HSSP Summer 2017, Instructor: Alexandra Ding July 19, 2017 Introduction to Probability and Inference HSSP Summer 2017, Instructor: Alexandra Ding July 19, 2017 Please fill out the attendance sheet! Suggestions Box: Feedback and suggestions are important to the

More information

Chapter 6: Random Variables and Probability Distributions

Chapter 6: Random Variables and Probability Distributions Chapter 6: Random Variables and Distributions These notes reflect material from our text, Statistics, Learning from Data, First Edition, by Roxy Pec, published by CENGAGE Learning, 2015. Random variables

More information

Discrete Random Variables and Probability Distributions

Discrete Random Variables and Probability Distributions Chapter 4 Discrete Random Variables and Probability Distributions 4.1 Random Variables A quantity resulting from an experiment that, by chance, can assume different values. A random variable is a variable

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number

More information

MA 1125 Lecture 14 - Expected Values. Wednesday, October 4, Objectives: Introduce expected values.

MA 1125 Lecture 14 - Expected Values. Wednesday, October 4, Objectives: Introduce expected values. MA 5 Lecture 4 - Expected Values Wednesday, October 4, 27 Objectives: Introduce expected values.. Means, Variances, and Standard Deviations of Probability Distributions Two classes ago, we computed the

More information

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Section 2: Estimation, Confidence Intervals and Testing Hypothesis Section 2: Estimation, Confidence Intervals and Testing Hypothesis Tengyuan Liang, Chicago Booth https://tyliang.github.io/bus41000/ Suggested Reading: Naked Statistics, Chapters 7, 8, 9 and 10 OpenIntro

More information

6. Continous Distributions

6. Continous Distributions 6. Continous Distributions Chris Piech and Mehran Sahami May 17 So far, all random variables we have seen have been discrete. In all the cases we have seen in CS19 this meant that our RVs could only take

More information

Theoretical Foundations

Theoretical Foundations Theoretical Foundations Probabilities Monia Ranalli monia.ranalli@uniroma2.it Ranalli M. Theoretical Foundations - Probabilities 1 / 27 Objectives understand the probability basics quantify random phenomena

More information

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4 Week 7 Oğuz Gezmiş Texas A& M University Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4 Oğuz Gezmiş (TAMU) Topics in Contemporary Mathematics II Week7 1 / 19

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution January 31, 2018 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The

More information

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality Point Estimation Some General Concepts of Point Estimation Statistical inference = conclusions about parameters Parameters == population characteristics A point estimate of a parameter is a value (based

More information

Discrete Random Variables

Discrete Random Variables Discrete Random Variables In this chapter, we introduce a new concept that of a random variable or RV. A random variable is a model to help us describe the state of the world around us. Roughly, a RV can

More information

Descriptive Statistics (Devore Chapter One)

Descriptive Statistics (Devore Chapter One) Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf

More information

STAT 201 Chapter 6. Distribution

STAT 201 Chapter 6. Distribution STAT 201 Chapter 6 Distribution 1 Random Variable We know variable Random Variable: a numerical measurement of the outcome of a random phenomena Capital letter refer to the random variable Lower case letters

More information

But suppose we want to find a particular value for y, at which the probability is, say, 0.90? In other words, we want to figure out the following:

But suppose we want to find a particular value for y, at which the probability is, say, 0.90? In other words, we want to figure out the following: More on distributions, and some miscellaneous topics 1. Reverse lookup and the normal distribution. Up until now, we wanted to find probabilities. For example, the probability a Swedish man has a brain

More information

The Normal Distribution

The Normal Distribution Will Monroe CS 09 The Normal Distribution Lecture Notes # July 9, 207 Based on a chapter by Chris Piech The single most important random variable type is the normal a.k.a. Gaussian) random variable, parametrized

More information

BIOL The Normal Distribution and the Central Limit Theorem

BIOL The Normal Distribution and the Central Limit Theorem BIOL 300 - The Normal Distribution and the Central Limit Theorem In the first week of the course, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution January 31, 2019 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The

More information

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation Name In a binomial experiment of n trials, where p = probability of success and q = probability of failure mean variance standard deviation µ = n p σ = n p q σ = n p q Notation X ~ B(n, p) The probability

More information

Section Distributions of Random Variables

Section Distributions of Random Variables Section 8.1 - Distributions of Random Variables Definition: A random variable is a rule that assigns a number to each outcome of an experiment. Example 1: Suppose we toss a coin three times. Then we could

More information

Review of the Topics for Midterm I

Review of the Topics for Midterm I Review of the Topics for Midterm I STA 100 Lecture 9 I. Introduction The objective of statistics is to make inferences about a population based on information contained in a sample. A population is the

More information

Statistics, Measures of Central Tendency I

Statistics, Measures of Central Tendency I Statistics, Measures of Central Tendency I We are considering a random variable X with a probability distribution which has some parameters. We want to get an idea what these parameters are. We perfom

More information

Part 10: The Binomial Distribution

Part 10: The Binomial Distribution Part 10: The Binomial Distribution The binomial distribution is an important example of a probability distribution for a discrete random variable. It has wide ranging applications. One readily available

More information

MATH MW Elementary Probability Course Notes Part IV: Binomial/Normal distributions Mean and Variance

MATH MW Elementary Probability Course Notes Part IV: Binomial/Normal distributions Mean and Variance MATH 2030 3.00MW Elementary Probability Course Notes Part IV: Binomial/Normal distributions Mean and Variance Tom Salisbury salt@yorku.ca York University, Dept. of Mathematics and Statistics Original version

More information

Probability Theory. Probability and Statistics for Data Science CSE594 - Spring 2016

Probability Theory. Probability and Statistics for Data Science CSE594 - Spring 2016 Probability Theory Probability and Statistics for Data Science CSE594 - Spring 2016 What is Probability? 2 What is Probability? Examples outcome of flipping a coin (seminal example) amount of snowfall

More information

Chapter 7: Point Estimation and Sampling Distributions

Chapter 7: Point Estimation and Sampling Distributions Chapter 7: Point Estimation and Sampling Distributions Seungchul Baek Department of Statistics, University of South Carolina STAT 509: Statistics for Engineers 1 / 20 Motivation In chapter 3, we learned

More information