Dr. Allen Back. Oct. 28, 2016

Size: px
Start display at page:

Download "Dr. Allen Back. Oct. 28, 2016"

Transcription

1 Dr. Allen Back Oct. 28, 2016

2 A coffee vending machine dispenses coffee into a paper cup. You re supposed to get 10 ounces of coffee., but the amount varies slightly from cup to cup. The amounts measured in a random sample of 20 cups have summary statistics as given below. Is there evidence that the machine is shortchanging customers? x = s =.1986.

3 Notation: Let µ denote the mean amount of coffee in a dispensed cup. Hypotheses: H 0 : µ = 10 (or µ 10) H a : µ < 10

4 As usual with, we are interested in whether the observed statistic of x = is reasonably consistent with the sampling distribution assuming H 0 is true.

5 If s = σ, we d look at a Z-statistic Z = x µ 0 σ n = where we ve written H 0 more abstractly as µ = µ 0, µ 0 being the hypothesized value, 10 in this case.

6 Because s will not exactly match σ, we actually get a bit of extra error here. This is compensated for by viewing as a t-statistic. t = x µ 0 s = n = =

7 Since the error in approximating σ by s varies with the sample size, there is a different t-distribution for each sample size. These are labeled by the degrees of freedom which for a 1-sample t-test is: df = n 1.

8 These are all critical values t. For example P(t > 1.328) =.10 for the t distribution with 19 df.

9 Our t-statistic of is more extreme than any on the df=19 row of the table. The picture shows what the critical value t = for a tail prob. of.005 means.

10 So by symmetry P(T < 2.861) =.005 as well.

11 So our tail probability and p-value are both less than.005 and we reject the null. The machine does appear to be shortchanging.

12 Sps. our t-statistic had been with the same 1-sided hypotheses H 0 : µ = 10 (or µ 10) H a : µ < 10 What P-value would we report?

13 Sps. our t-statistic had been with the same 1-sided hypotheses H 0 : µ = 10 (or µ 10) H a : µ < 10 What P-value would we report? Answer: A tail probability and P-value of between.025 and.05.

14 Sps. instead our t-statistic had been 2.00 with 2-sided hypotheses H 0 : µ = 10 H a : µ 10 What P-value would we report?

15 Sps. instead our t-statistic had been 2.00 with 2-sided hypotheses H 0 : µ = 10 H a : µ 10 What P-value would we report? Answer: Our tail probability is still between.025 and.05 but our P-value is now between.05 and.10.

16 Sps. instead our t-statistic had been 2.00 with 2-sided hypotheses H 0 : µ = 10 H a : µ 10 What P-value would we report? In using the table, always be conservative. If you want df=138 and there is a df=120 as well as a df=140 row, the principle of being conservative means to use the df=120 row. When you say I am 95% confident... or I reject H 0, you ll be delivering what you promised; with better tables you d just be able to report your level of confidence as somewhat higher.

17 By Calculator TI 83/84: 2nd distr tcdf(-100,-2,19) to find P(T < 2) for a t-distribution with df = 19.

18 By Calculator TI 83/84: 2nd distr tcdf(-100,-2,19) to find P(T < 2) for a t-distribution with df = 19. TI 89: catalog F3 2nd-alpha t... tcdf(-100,-2,19) to find P(T < 2) for a t-distribution with df = 19.

19 A t-hypothesis test with is resolved using a t-statistic of H 0 : µ = µ 0 t = x µ 0 s. n

20 A t-hypothesis test with is resolved using a t-statistic of H a can be any of the three 1 µ µ 0 2 µ > µ 0 3 µ < µ 0 H 0 : µ = µ 0 t = x µ 0 s. n

21 A t confidence interval for µ is x ± t s n for the same reason as the corresponding formula in the proportion case:

22 The two sampling distributions at the border of the CI. The blue lines delimit central regions on the sampling distributions. The CI is between the green lines on the horizontal axis.

23 In the coffee machine example, this gives a 95% CI for µ of ± 2.093(.0444) = ±.093 = (9.752, 9.938) (since n = 20 and SE( x) = s =.1986 =.0444 here.) n 20 You did expect it to not include 10, didn t you?

24 t-distributions Let X 0, X 1,... X d be d + 1 independent standard normal random variables.

25 t-distributions Let X 0, X 1,... X d be d + 1 independent standard normal random variables. The t-distribution with d degrees of freedom is defined to be the random variable X 0. X1 2 + X X d 2

26 t-distributions The t-distribution with d degrees of freedom is defined to be the random variable X 0 X X X 2 d Think of this as keeping track of the t-statistic x µ 0 s = n x µ 0 σ n s σ with the top X 0 keeping track of the numerator and the denominator X1 2 + X X d 2 keeping track of the ratio of s to σ..

27 t-distributions Just as the normal distribution has the density formula f (x) = e (x µ) 2 2σ 2 2πσ

28 t-distributions Methods of calculus show that the density formula for the t-distribution with d degrees of freedom is f (x) = Γ( d+1 2 ) Γ( d 2 ) 1 dπ 1 (1 + x2 d ) d+1 2 where d is the number of degrees of freedom and Γ(n) = 0 t n 1 e t dt is the gamma function, a generalized factorial.

29 t-distributions It will be important when we think about 2-sample tests to realize that the degrees of freedom is just a parameter in the above formula.

30 t-distributions A little more probability in the tails

31 t-distributions Cumulative Dist. Fcn (Graphical Form of Table Z)

32 t-distributions This picture shows why t can be very different from z.

33 t-distributions

34 Conditions for t Tests random sampling 10% condition nearly normal condition n < 15 unimodal and symmetric or only slight skew 15 <= n < 30 avoid strong (maybe even moderate) skewness and outliers (unimodal and sym best) 30 n < 60 moderate skewness ok n 60 pretty much (not really always) ok even with strong skew

35 Conditions for t Tests random sampling 10% condition nearly normal condition n < 15 unimodal and symmetric or only slight skew 15 <= n < 30 avoid strong (maybe even moderate) skewness and outliers (unimodal and sym best) 30 n < 60 moderate skewness ok n 60 pretty much (not really always) ok even with strong skew What Happens if Not Satisfied: random sampling - could be critical; might be ok if representative representative hard/impossible to define

36 Conditions for t Tests random sampling 10% condition nearly normal condition n < 15 unimodal and symmetric or only slight skew 15 <= n < 30 avoid strong (maybe even moderate) skewness and outliers (unimodal and sym best) 30 n < 60 moderate skewness ok n 60 pretty much (not really always) ok even with strong skew What Happens if Not Satisfied: 10% condition - results in overestimation of samp. dist. st dev gradual breakdown in formulas, not method

37 Conditions for t Tests random sampling 10% condition nearly normal condition n < 15 unimodal and symmetric or only slight skew 15 <= n < 30 avoid strong (maybe even moderate) skewness and outliers (unimodal and sym best) 30 n < 60 moderate skewness ok n 60 pretty much (not really always) ok even with strong skew What Happens if Not Satisfied: nearly normal - no guarantee progressive reduction of accuracy

38 Conditions for t Tests random sampling 10% condition nearly normal condition n < 15 unimodal and symmetric or only slight skew 15 <= n < 30 avoid strong (maybe even moderate) skewness and outliers (unimodal and sym best) 30 n < 60 moderate skewness ok n 60 pretty much (not really always) ok even with strong skew What Happens if Not Satisfied:

39 Conditions for t Tests For 2-sample inference, we add the independence groups assumption. The chance of an individual in one of the groups assuming a certain value should be independent of the values assumed by any of the individuals in the other group.

40 Formulas for t-inference and regression inference are based on assumptions of normality of the data. Yet most distributions are not normal. (Although the CLT makes averages of lots normal.) So it is perhaps remarkable that t-inference methods for moderate size data sets without outliers are typically pretty good. Why is this?

41 Formulas for t-inference and regression inference are based on assumptions of normality of the data. Yet most distributions are not normal. (Although the CLT makes averages of lots normal.) So it is perhaps remarkable that t-inference methods for moderate size data sets without outliers are typically pretty good. Why is this? One answer is robustness against non-normality.

42 Actual confidence level vs kurtosis for some symmetric n = 25 test distributions in a 1975 Biometrika paper of Pearson and Please. (kurtosis=3 in the normal case.)

43 Actual confidence level vs kurtosis for some skewed n = 25 test distributions in a 1975 Biometrika paper of Pearson and Please.

44 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? haiku: A Japanese poem composed of three unrhymed lines of five, seven, and five syllables. Haiku often reflect on some aspect of nature.

45 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this?

46 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Randomized Experiment on Creativity in poetry: Grades on creativity recorded. (Higher is better.) Both groups filled out questionnaires before writing poetry. Intrinsic group first did questionnaire emphasizing intrinsic motivations. Extrinsic group first did questionnaire emphasizing extrinsic motivations.

47 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? People then wrote Haiku poems which were graded by experienced poets.

48 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? The Results Group n x s Intrinsic Extrinsic

49 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Notation: Let µ i and µ e denote the respective mean haiku scores for the intrinsically and the extrinsically motivated groups.

50 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Notation: Let µ i and µ e denote the respective mean haiku scores for the intrinsically and the extrinsically motivated groups. Hypotheses: H 0 : µ i = µ e H a : µ i µ e

51 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Hypotheses: H 0 : µ i = µ e H a : µ i µ e Under H 0, the sampling distribution of x i x e has mean 0 and approx. std deviation si 2 SE( x i x e ) = + s2 e n i n e

52 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Hypotheses: H 0 : µ i = µ e H a : µ i µ e Under H 0, the sampling distribution of x i x e has mean 0 and approx. std deviation si 2 SE( x i x e ) = + s2 e n i n e SE( x i x e ) = = 1.42.

53 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Hypotheses: H 0 : µ i = µ e H a : µ i µ e SE( x i x e ) = = Our t-statistic is t = = 2.92.

54 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Hypotheses: H 0 : µ i = µ e H a : µ i µ e Our t-statistic is SE( x i x e ) = t = = = For homework and hand computation on exams, we suggest you use df the smaller of 24-1 and 23-1 to interpret the t-statistic. To be explained later.

55 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Hypotheses: H 0 : µ i = µ e H a : µ i µ e With df = 22, 2.92 is more extreme than the biggest t in table T (which is 2.819) so our tail probability is less than.005 and our P-value is less than.01.

56 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Hypotheses: H 0 : µ i = µ e H a : µ i µ e With df = 22, 2.92 is more extreme than the biggest t in table T (which is 2.819) so our tail probability is less than.005 and our P-value is less than.01. We reject H 0. Intrinsic vs. extrinsic motivation does seem to make a difference.

57 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Hypotheses: H 0 : µ i = µ e H a : µ i µ e We reject H 0. Intrinsic vs. extrinsic motivation does seem to make a difference. A 95% CI for µ i µ e would be ± = 4.15 ± 2.95 = (1.20, 7.10).

58 Haiku Poetry: Intrinsic or Extrinsic Motivation Different in Effectiveness? How can you study this? Hypotheses: H 0 : µ i = µ e H a : µ i µ e Scope of the Inference: Causal relationship seems to be established. Doesn t say much about a population since not an SRS.

59 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y.

60 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. But the sampling distribution of x ȳ can be well approximated by a t-distribution with the following degrees of freedom:

61 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 y n y ) 2

62 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 y n y ) 2 Let min(n x 1, n y 1) denote the smallest of n x 1 and n y 1. This df is always between min(n x 1, n y 1) and n x + n y 2.

63 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 Special Case s x = s y = s, n x = n y = n. df = ( s 2 y n y ) 2 ( ) 2s 2 2 n = 2(n 1). 1 2s 4 n 1 n 2

64 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. Special Case s2 x n x df = << s2 y n y df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 ) 2 y n y 1 n y 1 ( s 2 y n y ) 2 ( s 2 y n y ) 2 = n y 1.

65 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 y n y ) 2 Calculators and computers will use this df formula. For hand computation on HW and exams, we suggest you never use this formula. Instead use the conservative min(n x 1, n y 1).

66 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 y n y ) 2 Calculators and computers will use this df formula. For hand computation on HW and exams, we suggest you never use this formula. Instead use the conservative min(n x 1, n y 1). Your t will be a little too big so your CI will be a little too wide and your HT a little harder to show statistical significance. But this is not so bad.

67 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 y n y ) 2 In your published papers and lab reports, use a computer which will likely automatically use this formula.

68 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 y n y ) 2 derived this formula using the fact that the exact mean and variance of the sampling distribution of x ȳ is easy to calculate. He asked, What df in the t-distribution would give the right µ and σ. The answer is the above.

69 In the 2-independent sample case, even if the individual data is normal, the difference x ȳ does not follow a t-distribution unless σ x = σ y. df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 y n y ) 2 f (x) = Γ( d+1 2 ) Γ( d 2 ) 1 dπ 1 (1 + x2 d ) d+1 2

70

71

72

73

74

75

76

77

78

79

80

81

82 In the 2-independent sample case, x ȳ can usually be well approximated by a t-distribution with the following degrees of freedom: df = ( s 2 x n x + s2 y n y ) 2 ( ) 1 s 2 2 x n x 1 n x + 1 n y 1 ( s 2 y n y ) 2

83 The graphs below, from the java applet at illustrate the fact that this df is always between min(n x 1, n y 1) and n x + n y 2.

84

85

86 Pulse Rate Does regular exercise reduce resting pulse rates? 10 volunteers do 20 minutes of exercise 3 times a week for 6 weeks. Their resting pulse rates before and after were measured. (Beats/min.)

87 Pulse Rate Subject Before After Allen Brandon Carlos David Edwin Franco Graeme Hans Ivan Jorge 79 76

88 Pulse Rate Subject Before After Allen Brandon Carlos David Edwin Franco Graeme Hans Ivan Jorge Notice that 8 out of 10 subjects had reduced RPR s. There is a non-parametric test called the sign test that could be used to conclude that RPR s tend to decrease as hypothesized. Can you see how binomial distribution ideas could be used for this?

89 Pulse Rate Notation: Let µ b be the mean resting pulse rate before the exercise program and µ a the mean resting pulse rate after.

90 Pulse Rate Notation: Let µ b be the mean resting pulse rate before the exercise program and µ a the mean resting pulse rate after. Hypotheses: H 0 : µ b = µ a (or µ b µ a 0 or... ) H a : µ b > µ a

91 Pulse Rate Since individuals have important characteristics besides their exercise programs which determine their RPR s, statistical tests based on Var( x b x a ) = Var( x b ) + Var( x a ) would not be appropriate here.

92 Pulse Rate In MP, we look at the differences and apply 1-sample ideas to the differences.

93 Pulse Rate Differences d i Subject Before - After Allen 0 Brandon 4 Carlos 4 David 1 Edwin 4 Franco 8 Graeme 3 Hans 2 Ivan -1 Jorge 3

94 Pulse Rate The mean d = 2.8 with a standard deviation s of Note you need to directly calculate s for the differences; it is not determined by s a and s b.

95 Pulse Rate The mean d = 2.8 with a standard deviation s of t = = 3.50 with df=9. Our P-value is <.005 and we reject H 0. Exercise does seem to reduce RPR s.

96 vs. Indep. Group1 Group 2 x 1 y 1 x 2 y x n... y m x ȳ mean s x s y std. dev

97 vs. Indep. Group1 Group 2 x 1 y 1 x 2 y x n... y m x ȳ mean s x s y std. dev Independent Each x i and each y j are (except to the extent that they come from one of the groups) independent of each other. The sample sizes m and n can be different.

98 vs. Indep. Group1 Group 2 x 1 y 1 x 2 y x n... y m x ȳ mean s x s y std. dev The i th observation x i in group 1 is more closely related (in a relevant way) to the i th observation y i in group 2 than a general x i is to a general y j. The sample sizes m and n must be the same.

99 or Indep? Chapter 25 # 24: In a test of breaking performance, a tire manufacturer measured the stopping distance for one of its tire models. On a test track, a car made repeated stops from 60 miles per hour. The test was run on both dry and wet pavement, with results as shown in the table. Are wet stopping distances greater?

100 or Indep? Stopping Distance (ft) Dry Pavement Wet Pavement

101 or Indep? Chapter 25 # 24: In a test of breaking performance, a tire manufacturer measured the stopping distance for one of its tire models. On a test track, a car made repeated stops from 60 miles per hour. The test was run on both dry and wet pavement, with results as shown in the table. Are wet stopping distances greater? The wet pavement figure on e.g. the fifth line bears no closer relationship to the fifth line in the dry column than it does to any other dry measurement. There is no matching. This is independent 2-sample.

102 or Indep? Had the problem referred to 10 different cars or 10 different drivers each trying both wet and dry, it would have been matched pairs.

103 or Indep? Summary of the Data Mean Std. Dev. dry wet

104 or Indep? Notation: Let µ d be the mean dry stopping distance. and µ w be the mean wet stopping distance.

105 or Indep? Notation: Let µ d be the mean dry stopping distance. and µ w be the mean wet stopping distance. Hypotheses: H 0 : µ d = µ w (or µ w µ d 0 or... ) H a : µ w > µ d

106 or Indep? Hypotheses: H 0 : µ d = µ w (or µ w µ d 0 or... ) H a : µ w > µ d n w = n d = 10, so we use df = 9 for hand calculation.

107 or Indep? Hypotheses: H 0 : µ d = µ w (or µ w µ d 0 or... ) H a : µ w > µ d n w = n d = 10, so we use df = 9 for hand calculation SE = =

108 or Indep? Hypotheses: H 0 : µ d = µ w (or µ w µ d 0 or... ) H a : µ w > µ d n w = n d = 10, so we use df = 9 for hand calculation SE = = t = =

109 or Indep? Hypotheses: H 0 : µ d = µ w (or µ w µ d 0 or... ) H a : µ w > µ d n w = n d = 10, so we use df = 9 for hand calculation SE = = t = = Our P-value is very tiny (<.005 from the tables) so we reject the null; Wet stopping distances appear to be longer.

110 or Indep? Chapter 25 # 19: A company institutes an exercise break for its workers to see if this will improve job satisfaction, as measured by a questionnaire that assesses workers satisfaction. Scores of 10 randomly selected workers before and after the implementation of the exercise program are shown.

111 or Indep? Job Satisfaction Index Worker Number Before After

112 or Indep? Worker (e.g.) 5 s job satisfaction after the program clearly has some extra association with his job satisfaction before. We have a clear matching here and MP is appropriate.

113 or Indep? Summary of the Data Mean Std. Dev. before -after

114 or Indep? Notation: Let µ b be the mean job satisfaction before. and µ a be the mean job satisfaction after.

115 or Indep? Notation: Let µ b be the mean job satisfaction before. and µ a be the mean job satisfaction after. Hypotheses: H 0 : µ b = µ a H a : µ a > µ b

116 or Indep? Hypotheses: H 0 : µ b = µ a H a : µ a > µ b n b = n a = 10, so we use df = 9 for hand calculation.

117 or Indep? Hypotheses: H 0 : µ b = µ a H a : µ a > µ b n b = n a = 10, so we use df = 9 for hand calculation. SE = s n = = 2.36

118 or Indep? Hypotheses: H 0 : µ b = µ a H a : µ a > µ b n b = n a = 10, so we use df = 9 for hand calculation. SE = s n = = 2.36 t = =

119 or Indep? Hypotheses: H 0 : µ b = µ a H a : µ a > µ b n b = n a = 10, so we use df = 9 for hand calculation. SE = s = 7.47 = 2.36 n 10 t = = Our P-value is very tiny (<.005 from the tables) so we reject the null; The program appears to improve job satisfaction.

120 or Indep? Chapter 25 # 21: Do these data suggest that there is a significant difference in calories between servings of strawberry and vanilla yogurt?

121 or Indep? Calories per Serving Brand Strawberry Vanilla America s Choice Breyer s Lowfat Columbo Dannon Light n Fit Dannon Lowfat Dannon lacreme Great Value La Yogurt Mountain High Stonyfield Farm Yoplait Custard Yoplait Light

122 or Indep? Since the two flavors are the same brand, MP is appropriate. There is also an outlier Great Value which we will exclude from our analysis.

123 or Indep? Summary of the Data (w/o Great Value) Mean Std. Dev. strawberry-vanilla

124 or Indep? Notation: Let µ s be the mean calories in a strawberry serving. and µ v be the mean calories in a vanilla serving.

125 or Indep? Notation: Let µ s be the mean calories in a strawberry serving. and µ v be the mean calories in a vanilla serving. Hypotheses: H 0 : µ s = µ v H a : µ s µ v

126 or Indep? Hypotheses: H 0 : µ s = µ v H a : µ s µ v n s = n v = 11, so we use df = 10 for hand calculation.

127 or Indep? Hypotheses: H 0 : µ s = µ v H a : µ s µ v n s = n v = 11, so we use df = 10 for hand calculation. SE = s n = = 5.45

128 or Indep? Hypotheses: H 0 : µ s = µ v H a : µ s µ v n s = n v = 11, so we use df = 10 for hand calculation. SE = s n = = 5.45 t = =.835

129 or Indep? Hypotheses: H 0 : µ s = µ v H a : µ s µ v n s = n v = 11, so we use df = 10 for hand calculation. SE = s = = 5.45 n 11 t = =.835 Our tail probability is above.1 from the tables and so our P-value is above.2. We retain the null. We haven t proven there are any differences in calories between strawberry and vanilla of a give brand.

130 MP as more Sensitive Sales figures for two car brands by district.

131 MP as more Sensitive

132 MP as more Sensitive

133 MP as more Sensitive

134 MP as more Sensitive

135 MP as more Sensitive

136 MP as more Sensitive

137 MP as more Sensitive

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means Chapter 11: Inference for Distributions 11.1 Inference for Means of a Population 11.2 Comparing Two Means 1 Population Standard Deviation In the previous chapter, we computed confidence intervals and performed

More information

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Chapter 7 Sampling Distributions and Point Estimation of Parameters Chapter 7 Sampling Distributions and Point Estimation of Parameters Part 1: Sampling Distributions, the Central Limit Theorem, Point Estimation & Estimators Sections 7-1 to 7-2 1 / 25 Statistical Inferences

More information

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics CONTENTS Estimating parameters The sampling distribution Confidence intervals for μ Hypothesis tests for μ The t-distribution Comparison

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. STAT 515 -- Chapter 5: Continuous Distributions Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. Continuous distributions typically are represented by

More information

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY 1 THIS WEEK S PLAN Part I: Theory + Practice ( Interval Estimation ) Part II: Theory + Practice ( Interval Estimation ) z-based Confidence Intervals for a Population

More information

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. STAT 515 -- Chapter 5: Continuous Distributions Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. Continuous distributions typically are represented by

More information

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1 8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions For Example: On August 8, 2011, the Dow dropped 634.8 points, sending shock waves through the financial community.

More information

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority Chapter 235 Analysis of 2x2 Cross-Over Designs using -ests for Non-Inferiority Introduction his procedure analyzes data from a two-treatment, two-period (2x2) cross-over design where the goal is to demonstrate

More information

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Statistics 431 Spring 2007 P. Shaman. Preliminaries Statistics 4 Spring 007 P. Shaman The Binomial Distribution Preliminaries A binomial experiment is defined by the following conditions: A sequence of n trials is conducted, with each trial having two possible

More information

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions SGSB Workshop: Using Statistical Data to Make Decisions Module 2: The Logic of Statistical Inference Dr. Tom Ilvento January 2006 Dr. Mugdim Pašić Key Objectives Understand the logic of statistical inference

More information

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether. Lecture 34 Section 10.2 Hampden-Sydney College Fri, Oct 31, 2008 Outline 1 2 3 4 5 6 7 8 Exercise 10.4, page 633. A psychologist is studying the distribution of IQ scores of girls at an alternative high

More information

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 7 Statistical Intervals Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to

More information

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: Chapter 17 Inference about a Population Mean Conditions for inference Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: (1) Our data (observations)

More information

The Two-Sample Independent Sample t Test

The Two-Sample Independent Sample t Test Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 The General Formula The Equal-n Formula 4 5 6 Independence Normality Homogeneity of Variances 7 Non-Normality Unequal

More information

Uniform Probability Distribution. Continuous Random Variables &

Uniform Probability Distribution. Continuous Random Variables & Continuous Random Variables & What is a Random Variable? It is a quantity whose values are real numbers and are determined by the number of desired outcomes of an experiment. Is there any special Random

More information

Module 3: Sampling Distributions and the CLT Statistics (OA3102)

Module 3: Sampling Distributions and the CLT Statistics (OA3102) Module 3: Sampling Distributions and the CLT Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chpt 7.1-7.3, 7.5 Revision: 1-12 1 Goals for

More information

12.1 One-Way Analysis of Variance. ANOVA - analysis of variance - used to compare the means of several populations.

12.1 One-Way Analysis of Variance. ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate

More information

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate

More information

Chapter 18: The Correlational Procedures

Chapter 18: The Correlational Procedures Introduction: In this chapter we are going to tackle about two kinds of relationship, positive relationship and negative relationship. Positive Relationship Let's say we have two values, votes and campaign

More information

Business Statistics 41000: Probability 3

Business Statistics 41000: Probability 3 Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404

More information

Chapter 8 Statistical Intervals for a Single Sample

Chapter 8 Statistical Intervals for a Single Sample Chapter 8 Statistical Intervals for a Single Sample Part 1: Confidence intervals (CI) for population mean µ Section 8-1: CI for µ when σ 2 known & drawing from normal distribution Section 8-1.2: Sample

More information

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed. The Central Limit Theorem The central limit theorem (clt for short) is one of the most powerful and useful ideas in all of statistics. The clt says that if we collect samples of size n with a "large enough

More information

Chapter 7. Inferences about Population Variances

Chapter 7. Inferences about Population Variances Chapter 7. Inferences about Population Variances Introduction () The variability of a population s values is as important as the population mean. Hypothetical distribution of E. coli concentrations from

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

1 Inferential Statistic

1 Inferential Statistic 1 Inferential Statistic Population versus Sample, parameter versus statistic A population is the set of all individuals the researcher intends to learn about. A sample is a subset of the population and

More information

Homework: (Due Wed) Chapter 10: #5, 22, 42

Homework: (Due Wed) Chapter 10: #5, 22, 42 Announcements: Discussion today is review for midterm, no credit. You may attend more than one discussion section. Bring 2 sheets of notes and calculator to midterm. We will provide Scantron form. Homework:

More information

Two-Sample T-Test for Superiority by a Margin

Two-Sample T-Test for Superiority by a Margin Chapter 219 Two-Sample T-Test for Superiority by a Margin Introduction This procedure provides reports for making inference about the superiority of a treatment mean compared to a control mean from data

More information

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions. ME3620 Theory of Engineering Experimentation Chapter III. Random Variables and Probability Distributions Chapter III 1 3.2 Random Variables In an experiment, a measurement is usually denoted by a variable

More information

Two-Sample T-Test for Non-Inferiority

Two-Sample T-Test for Non-Inferiority Chapter 198 Two-Sample T-Test for Non-Inferiority Introduction This procedure provides reports for making inference about the non-inferiority of a treatment mean compared to a control mean from data taken

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

Data Distributions and Normality

Data Distributions and Normality Data Distributions and Normality Definition (Non)Parametric Parametric statistics assume that data come from a normal distribution, and make inferences about parameters of that distribution. These statistical

More information

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley. Appendix: Statistics in Action Part I Financial Time Series 1. These data show the effects of stock splits. If you investigate further, you ll find that most of these splits (such as in May 1970) are 3-for-1

More information

One sample z-test and t-test

One sample z-test and t-test One sample z-test and t-test January 30, 2017 psych10.stanford.edu Announcements / Action Items Install ISI package (instructions in Getting Started with R) Assessment Problem Set #3 due Tu 1/31 at 7 PM

More information

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables Chapter 5 Continuous Random Variables and Probability Distributions 5.1 Continuous Random Variables 1 2CHAPTER 5. CONTINUOUS RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS Probability Distributions Probability

More information

IOP 201-Q (Industrial Psychological Research) Tutorial 5

IOP 201-Q (Industrial Psychological Research) Tutorial 5 IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution January 31, 2019 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The

More information

Statistics Class 15 3/21/2012

Statistics Class 15 3/21/2012 Statistics Class 15 3/21/2012 Quiz 1. Cans of regular Pepsi are labeled to indicate that they contain 12 oz. Data Set 17 in Appendix B lists measured amounts for a sample of Pepsi cans. The same statistics

More information

AP Stats Review. Mrs. Daniel Alonzo & Tracy Mourning Sr. High

AP Stats Review. Mrs. Daniel Alonzo & Tracy Mourning Sr. High AP Stats Review Mrs. Daniel Alonzo & Tracy Mourning Sr. High sdaniel@dadeschools.net Agenda 1. AP Stats Exam Overview 2. AP FRQ Scoring & FRQ: 2016 #1 3. Distributions Review 4. FRQ: 2015 #6 5. Distribution

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.) Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop

More information

Learning Objectives for Ch. 7

Learning Objectives for Ch. 7 Chapter 7: Point and Interval Estimation Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 7 Obtaining a point estimate of a population parameter

More information

Statistics & Statistical Tests: Assumptions & Conclusions

Statistics & Statistical Tests: Assumptions & Conclusions Degrees of Freedom Statistics & Statistical Tests: Assumptions & Conclusions Kinds of degrees of freedom Kinds of Distributions Kinds of Statistics & assumptions required to perform each Normal Distributions

More information

Chapter 8 Estimation

Chapter 8 Estimation Chapter 8 Estimation There are two important forms of statistical inference: estimation (Confidence Intervals) Hypothesis Testing Statistical Inference drawing conclusions about populations based on samples

More information

AP Stats. Review. Mrs. Daniel Alonzo & Tracy Mourning Sr. High

AP Stats. Review. Mrs. Daniel Alonzo & Tracy Mourning Sr. High AP Stats Review Mrs. Daniel Alonzo & Tracy Mourning Sr. High sdaniel@dadeschools.net Agenda 1. AP Stats Exam Overview 2. AP FRQ Scoring & FRQ: 2016 #1 3. Distributions Review 4. FRQ: 2015 #6 5. Distribution

More information

15.063: Communicating with Data Summer Recitation 4 Probability III

15.063: Communicating with Data Summer Recitation 4 Probability III 15.063: Communicating with Data Summer 2003 Recitation 4 Probability III Today s Content Normal RV Central Limit Theorem (CLT) Statistical Sampling 15.063, Summer '03 2 Normal Distribution Any normal RV

More information

Section The Sampling Distribution of a Sample Mean

Section The Sampling Distribution of a Sample Mean Section 5.2 - The Sampling Distribution of a Sample Mean Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin The Sampling Distribution of a Sample Mean Example: Quality control check of light

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Fall 2011 Exam Score: /75. Exam 3

Fall 2011 Exam Score: /75. Exam 3 Math 12 Fall 2011 Name Exam Score: /75 Total Class Percent to Date Exam 3 For problems 1-10, circle the letter next to the response that best answers the question or completes the sentence. You do not

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution January 31, 2018 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The

More information

8.1 Binomial Distributions

8.1 Binomial Distributions 8.1 Binomial Distributions The Binomial Setting The 4 Conditions of a Binomial Setting: 1.Each observation falls into 1 of 2 categories ( success or fail ) 2 2.There is a fixed # n of observations. 3.All

More information

Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at

Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at mailto:msfrisbie@pfrisbie.com. 1. Let X represent the savings of a resident; X ~ N(3000,

More information

Chapter 6: Random Variables. Ch. 6-3: Binomial and Geometric Random Variables

Chapter 6: Random Variables. Ch. 6-3: Binomial and Geometric Random Variables Chapter : Random Variables Ch. -3: Binomial and Geometric Random Variables X 0 2 3 4 5 7 8 9 0 0 P(X) 3???????? 4 4 When the same chance process is repeated several times, we are often interested in whether

More information

Study Ch. 11.2, #51, 63 69, 73

Study Ch. 11.2, #51, 63 69, 73 May 05, 014 11. Inferences for σ's, Populations Study Ch. 11., #51, 63 69, 73 Statistics Home Page Gertrude Battaly, 014 11. Inferences for σ's, Populations Procedures that assume = σ's 1. Pooled t test.

More information

STAT 201 Chapter 6. Distribution

STAT 201 Chapter 6. Distribution STAT 201 Chapter 6 Distribution 1 Random Variable We know variable Random Variable: a numerical measurement of the outcome of a random phenomena Capital letter refer to the random variable Lower case letters

More information

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 7. Sampling Distributions and the Central Limit Theorem Chapter 7. Sampling Distributions and the Central Limit Theorem 1 Introduction 2 Sampling Distributions related to the normal distribution 3 The central limit theorem 4 The normal approximation to binomial

More information

Section Sampling Distributions for Counts and Proportions

Section Sampling Distributions for Counts and Proportions Section 5.1 - Sampling Distributions for Counts and Proportions Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin Distributions When dealing with inference procedures, there are two different

More information

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Chapter 14: random variables p394 A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Consider the experiment of tossing a coin. Define a random variable

More information

1. Confidence Intervals (cont.)

1. Confidence Intervals (cont.) Math 1125-Introductory Statistics Lecture 23 11/1/06 1. Confidence Intervals (cont.) Let s review. We re in a situation, where we don t know µ, but we have a number from a normal population, either an

More information

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD MAJOR POINTS Sampling distribution of the mean revisited Testing hypotheses: sigma known An example Testing hypotheses:

More information

Chapter 8. Binomial and Geometric Distributions

Chapter 8. Binomial and Geometric Distributions Chapter 8 Binomial and Geometric Distributions Lesson 8-1, Part 1 Binomial Distribution What is a Binomial Distribution? Specific type of discrete probability distribution The outcomes belong to two categories

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

Statistics for Business and Economics: Random Variables:Continuous

Statistics for Business and Economics: Random Variables:Continuous Statistics for Business and Economics: Random Variables:Continuous STT 315: Section 107 Acknowledgement: I d like to thank Dr. Ashoke Sinha for allowing me to use and edit the slides. Murray Bourne (interactive

More information

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality Point Estimation Some General Concepts of Point Estimation Statistical inference = conclusions about parameters Parameters == population characteristics A point estimate of a parameter is a value (based

More information

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment

More information

The normal distribution is a theoretical model derived mathematically and not empirically.

The normal distribution is a theoretical model derived mathematically and not empirically. Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.

More information

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :

More information

Data Analysis. BCF106 Fundamentals of Cost Analysis

Data Analysis. BCF106 Fundamentals of Cost Analysis Data Analysis BCF106 Fundamentals of Cost Analysis June 009 Chapter 5 Data Analysis 5.0 Introduction... 3 5.1 Terminology... 3 5. Measures of Central Tendency... 5 5.3 Measures of Dispersion... 7 5.4 Frequency

More information

The Central Limit Theorem for Sample Means (Averages)

The Central Limit Theorem for Sample Means (Averages) The Central Limit Theorem for Sample Means (Averages) By: OpenStaxCollege Suppose X is a random variable with a distribution that may be known or unknown (it can be any distribution). Using a subscript

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 7 Estimation: Single Population Copyright 010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-1 Confidence Intervals Contents of this chapter: Confidence

More information

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 7. Sampling Distributions and the Central Limit Theorem Chapter 7. Sampling Distributions and the Central Limit Theorem 1 Introduction 2 Sampling Distributions related to the normal distribution 3 The central limit theorem 4 The normal approximation to binomial

More information

Chapter 6 Confidence Intervals

Chapter 6 Confidence Intervals Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) VOCABULARY: Point Estimate A value for a parameter. The most point estimate of the population parameter is the

More information

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions Random Variables Examples: Random variable a variable (typically represented by x) that takes a numerical value by chance. Number of boys in a randomly selected family with three children. Possible values:

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc.

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc. The Standard Deviation as a Ruler and the Normal Mol Copyright 2009 Pearson Education, Inc. The trick in comparing very different-looking values is to use standard viations as our rulers. The standard

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION In Inferential Statistic, ESTIMATION (i) (ii) is called the True Population Mean and is called the True Population Proportion. You must also remember that are not the only population parameters. There

More information

Statistical Methods in Practice STAT/MATH 3379

Statistical Methods in Practice STAT/MATH 3379 Statistical Methods in Practice STAT/MATH 3379 Dr. A. B. W. Manage Associate Professor of Mathematics & Statistics Department of Mathematics & Statistics Sam Houston State University Overview 6.1 Discrete

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

The Normal Distribution

The Normal Distribution Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

Chapter 6 Continuous Probability Distributions. Learning objectives

Chapter 6 Continuous Probability Distributions. Learning objectives Chapter 6 Continuous s Slide 1 Learning objectives 1. Understand continuous probability distributions 2. Understand Uniform distribution 3. Understand Normal distribution 3.1. Understand Standard normal

More information

Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual.

Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual. Chapter 06: The Standard Deviation as a Ruler and the Normal Model This is the worst chapter title ever! This chapter is about the most important random variable distribution of them all the normal distribution.

More information

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2] 1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders

More information

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82 Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections

More information

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems Interval estimation September 29, 2017 STAT 151 Class 7 Slide 1 Outline of Topics 1 Basic ideas 2 Sampling variation and CLT 3 Interval estimation using X 4 More general problems STAT 151 Class 7 Slide

More information

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Chapter 14: random variables p394 A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Consider the experiment of tossing a coin. Define a random variable

More information

MAKING SENSE OF DATA Essentials series

MAKING SENSE OF DATA Essentials series MAKING SENSE OF DATA Essentials series THE NORMAL DISTRIBUTION Copyright by City of Bradford MDC Prerequisites Descriptive statistics Charts and graphs The normal distribution Surveys and sampling Correlation

More information

Review: Population, sample, and sampling distributions

Review: Population, sample, and sampling distributions Review: Population, sample, and sampling distributions A population with mean µ and standard deviation σ For instance, µ = 0, σ = 1 0 1 Sample 1, N=30 Sample 2, N=30 Sample 100000000000 InterquartileRange

More information

CHAPTER 5 SAMPLING DISTRIBUTIONS

CHAPTER 5 SAMPLING DISTRIBUTIONS CHAPTER 5 SAMPLING DISTRIBUTIONS Sampling Variability. We will visualize our data as a random sample from the population with unknown parameter μ. Our sample mean Ȳ is intended to estimate population mean

More information

MA 1125 Lecture 05 - Measures of Spread. Wednesday, September 6, Objectives: Introduce variance, standard deviation, range.

MA 1125 Lecture 05 - Measures of Spread. Wednesday, September 6, Objectives: Introduce variance, standard deviation, range. MA 115 Lecture 05 - Measures of Spread Wednesday, September 6, 017 Objectives: Introduce variance, standard deviation, range. 1. Measures of Spread In Lecture 04, we looked at several measures of central

More information

SLIDES. BY. John Loucks. St. Edward s University

SLIDES. BY. John Loucks. St. Edward s University . SLIDES. BY John Loucks St. Edward s University 1 Chapter 10, Part A Inference About Means and Proportions with Two Populations n Inferences About the Difference Between Two Population Means: σ 1 and

More information

Continuous Distributions

Continuous Distributions Quantitative Methods 2013 Continuous Distributions 1 The most important probability distribution in statistics is the normal distribution. Carl Friedrich Gauss (1777 1855) Normal curve A normal distribution

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

χ 2 distributions and confidence intervals for population variance

χ 2 distributions and confidence intervals for population variance χ 2 distributions and confidence intervals for population variance Let Z be a standard Normal random variable, i.e., Z N(0, 1). Define Y = Z 2. Y is a non-negative random variable. Its distribution is

More information

Putting Things Together Part 2

Putting Things Together Part 2 Frequency Putting Things Together Part These exercise blend ideas from various graphs (histograms and boxplots), differing shapes of distributions, and values summarizing the data. Data for, and are in

More information

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 8-26-2016 On Some Test Statistics for Testing the Population Skewness and Kurtosis:

More information