6.1, 7.1 Estimating with confidence (CIS: Chapter 10)

Size: px
Start display at page:

Download "6.1, 7.1 Estimating with confidence (CIS: Chapter 10)"

Transcription

1 Objectives 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) Statistical confidence (CIS gives a good explanation of a 95% CI) Confidence intervals Choosing the sample size t distributions One-sample t confidence interval for a population mean How confidence intervals behave Adapted from authors slides 2012 W.H. Freeman and Company

2 Overview of Inference Sample population, and sample mean population mean µ. But we do not know the value of µ, and if we want to make any conclusions about µ then we have to use to do so. x x Methods for drawing conclusions about a population from sample data are called statistical inference. There are two main types of inference: Confidence Intervals - estimating the value of a population parameter, and Tests of Significance - assessing evidence for a claim (hypothesis) about a population. Inference is appropriate when data are produced by either a random sample or a randomized experiment.

3 Introducing con4idence intervals q It is very unlikely that the sample mean based on a sample will ever equal the true mean. Our aim is to construct an interval around the sample mean which is `likely to contain the mean. This is called a confidence interval. In the first lecture we considered a Gallop poll for the proportion of the electorate that would vote for Obama. Gallup predicted that the Obama vote would be in the interval [45%,51%] with 95% confidence. The Obama vote turned out to be 50.5%, so the interval did capture the true proportion. You may be asking yourself how do we understand 95%, since 50.5% lies in this interval, there does not appear to be any uncertainty in it. In the next few slides, our objective is to understand how a confidence interval is constructed and how to understand it.

4 Review: properties of the sample mean x The sample mean is a unique number for any particular sample. If you had obtained a different sample (by chance) you almost certainly would have had a different value for your sample mean. In fact, you could get many different values for the sample mean, and virtually none of them would actually equal the true population mean, µ.

5 Because the sampling distribution of distribution, by a factor of n. x is narrower than the population The the estimates x n Sample means, n subjects x tend to be closer to the population parameter µ than individual observations are. σ n Population, x individual subjects σ µ If the population is normally distributed N(µ,σ), the sampling distribution is N(µ,σ/ n),

6 Using the empirical distribution, since the sample mean is close to normal, 95% of the time it will be within 2 standard errors of the mean, that is if I had a hundred sample means, then about 95 times the sample mean lies in the interval [µ 2 σ/ n, µ + 2 σ/ n]. Now we make a small correction to the empirical rule. It is not 2 standard deviations of the mean, but 1.96 standard deviations from the mean. To see why, look up 1.96 in the z-tables. q But the mean is unknown, so our objective is to locate the true mean based on the sample mean. To do this we turn the story around, if the sample mean lies in the interval [µ 1.96 σ/ n, µ+1.96 σ/ n], this is the same as saying the mean µ lies in the interval [sample mean 1.96 σ/ n, sample mean σ/ n]. Thus 95% of the time, the true mean (that we want to estimate) will be in the interval [sample mean 1.96 σ/ n, sample mean σ/ n]. This is an interval which is centered about the sample mean. In the next slide we illustrate what we mean by 95%.

7 If multiple samples were possible 95% of all sample means will be within 1.96 (roughly 2) standard deviations (1.96 σ/ n) of the population parameter µ. σ n This implies that the population parameter µ will be within 1.96 standard deviations from the sample average samples. x, in 95% of all This reasoning is the essence of statistical inference. Red dot: mean value of individual sample

8 Mean height sample size one q Human heights are approximately a normal distribution. The standard deviation of a human height is 3.8 inches. Our objective is to construct a confidence interval for the mean height. We start with a very crude estimator and use just one height to estimate the mean, this is the same as using a sample of size one. In this case the standard error is 3.8/ 1 = 3.8. Each of you construct a 95% confidence interval for the mean height using your height as the sample: [your height , your height ] [your height 7.44, your height ].For example, in my case the interval is [ , ] = [55.56,70.44]. Each of you do this too. In fact it is known that the mean height of a person is 67 inches. Does you interval contain the mean? The proportion of intervals that contain the mean should be approx 95%.

9 Mean height sample size two In the previous experiment the we used just one individual to estimate the mean height. The `cost of using one individual was that the confidence interval was very wide. We repeat the experiment, but this time each of you buddy up with your neighbour and calculate the average height between the two of you (ie. (your height plus neighbour s height)/2). You and your buddy for a sample of size two. We know that this the sample mean based on a sample of size n=2. has the standard error 3.8/ 2 = Each group construct the interval [sample mean , sample mean ] = [sample mean ±5.26]. The mean height is 67 inches, does your interval contain the mean? What proportion of the intervals in the class contain the mean?

10 Observations We see that the length of confidence interval when using just one person in the sample is = 14.88, this is quite long, and does not really allow us to pinpoint the mean. Whereas the length of interval using two people to calculate the sample mean is 10.52, this is quite a big reduction in length! If ten people were used to calculate the sample mean the corresponding interval length would be 14.88/ 10 = 4.7. We see that for any given interval either the mean is in this interval or not. The 95% comes into play when we look at the proportion of intervals that contain the mean. In reality: We do not know the true mean µ, so will never know whether the interval contained the mean or not. We only observe one sample of size n, and thus have one CI. One confidence interval contain information about the mean. This is why we say with 95% confidence the mean lies in it.

11 Implications We do not need to (and cannot, anyway) take a lot of random samples to rebuild the sampling distribution and find µ at its center. n Sample Population µ n All we need is one SRS of size n and we can rely on the properties of the sampling distribution to infer reasonable values for the population mean µ.

12 Multiple samples revisited With 95% confidence, we can say that µ should be within 1.96 standard deviations (1.96 σ/ n) from our sample mean. In 95% of all possible samples of this size n, µ will indeed fall in our confidence interval. In only 5% of samples will farther from µ. be Confidence = the proportion of possible samples that give us a correct conclusion. x x σ n

13 Calculation practice You want to rent an unfurnished one-bedroom apartment in Dallas. The mean monthly rent for 10 randomly sampled apartments is 980 dollars. Assume that monthly rents follow a normal distribution with standard deviation 280 dollars. Construct a 95% confidence interval for the mean monthly rent of a one-bedroom apartment. The standard error for the sample mean is 280/ 10 = Thus the 95% CI is [980 ± ] = [806,1153]. With 95% confidence we believe the mean price of one-bedroom apartments in Dallas lies in this interval. Does the above confidence interval mean that 95% of all rents should lie in this interval? No, it is the interval for the mean. If we want the interval where 95% of all rents should lie it is [980 ±1.96( )] = [257,1720]. You do not have to understand the calculation, but you will notice this interval is much wider. The reason is that it must capture 95% of all rents, which are extremely varied. The previous CI was just capturing the mean rent, based on the sample mean, which is much less varied.

14 Calculation practice Hypokalemia is diagnosed when the blood potassium level is below 3.5mEq/dl. The potassium in a blood sample varies from sample to sample and follows a normal distribution with standard deviation 0.2. A patient s potassium is measured taken over 4 days. The sample mean level over these 4 days is 3.7. Construct a 95% confidence interval for the mean potassium and discuss whether the patient is likely to be diagnosed with Hypokalemia. The standard error for the sample mean is 0.2/ 2 = 0.1. Thus the 95% confidence interval for the mean potassium level is [3.7± ] = q [3.504,3.894]. This means with 95% confidence we believe the mean lies in this interval. Since 3.5 or less does not lie in this interval, with 95% confidence I can say that the patient does not have this condition.

15 Con4idence interval misunderstandings Suppose 400 alumni were asked to rate the University of Okoboji the university counseling services on a scale 1 to 10. The sample mean was found to be 8.6 and it is known that the standard deviation is σ=2. Ima Bitlost has done the analysis, but has made some mistakes. Ima computes the 95% CI interval for the mean satisfaction score as [8.6±1.96 2]. What is her mistake? Ima has not taken into account that the sample mean has a much smaller standard deviation (standard error) than the population. The standard error is 2/ 400 = 0.1. Thus the true CI is [8.6± ] = [8.4,8.796]. After correcting her mistake, she states that I am 95% confident that the sample mean lies in the interval [8.4,8.796] What is wrong with her statement? This is a meaningless statement, for sure the sample mean lies in this interval! It is the population mean that we are 95% confident lies there.

16 She quickly realizes her mistake and instead states the probability that the mean lies in the interval [8.4,8.796] is 95%, what misinterpretation is she making now? By 95%, we mean that if we repeated the experiment many times over about 95% of the time the intervals will contain the mean. For any given interval the mean is either in there or not. There is no probability attached to it. To overcome, this issue we say that with we have 95% confidence in the mean lies in this interval. Finally, in her defense for using the normal distribution to determine the confidence coefficient (1.96) she says Because the sample size is quite large, the population of alumni ratings will be close to normal. Explain to Ima her misunderstanding. The distribution of the population always stays the same, regardless of the sample size (in this case, it is clear that variables that take integer values between 1 to 10 cannot be normal). However, the sample mean does get closer to normal as the sample size grow. With a sample size of 400, the distribution of the sample mean will be very close to normal.

17 Different levels of con4idence There is no need to restrict ourselves to 95% confidence intervals. The level of confidence we use really depends on how much confidence we want. For example, you would expect a 99% confidence interval is more likely to contain the mean than a 95% confidence interval. To construct a 99% confidence interval we use exactly the same prescription as used to construct a 95% confidence interval, the only thing that changes is 1.96 goes to 2.57 (if you look up in the z- tables you will see this corresponds to 0.5%, so 99% of the time the sample mean will lie within 2.57 standard errors from the mean). q A 99% CI for the mean one-bedroom apartment price is [980± ]. Length of interval is A 90% CI for the mean one-bedroom apartment price is [980± ]. Length of interval is What does a 100% confidence interval look like? In a 100% CI we are sure to find the mean, but this interval is so wide it is not informative.

18 Sample size and length of the CI Let us return to the apartment example. We recall that for the confidence interval for the mean price is [980 ± ] = [806,1153]. The length of this interval is = 347. What happens to the length of interval if I increase the sample size? Suppose I take a SRS of 100 apartments in Dallas, the sample mean based on this sample is 1000, what will the CI be? The standard error is 280/ 100 = 28 (much smaller than when the sample size is 10), and the CI is [980 ± ]. The length of this interval is =109. What we observe is: The length of the interval does not depend on the sample mean, this is just the centralizing factor. It only depends on 1.96, the standard deviation and the sample size. The length of the interval gets smaller as the sample size grow. This suggests that if we want the interval to have a certain level of precision, we can choose the sample size accordingly.

19 Margin of Error Margin of error is the lingo used for the plus and minus part in the confidence interval. That is the confidence interval is [sample mean±1.96 σ/ n], the margin of error is 1.96 σ/ n. q For example, in the previous example the margin of error for the CI based on 10 apartments is q The margin of error for the CI based on 100 apartments is q The margin of error in some sense, is a measure of accuracy. The smaller the margin error the more precisely we can pinpoint the true mean. q Suppose we want the margin or error to be equal to some value, then we can find the sample size such that we obtain that margin of error. Solve for n the equation MoE = 1.96 σ/ n (the Margin of Error and the standard deviation σ are given). See the next slide for an example.

20 Calculation practice: What sample size for a given margin of error? Annual coffee sales: A marketing firm plans to study the annual sales in coffee shops. They want to estimate the mean annual sales to within $0.2 million, this time with 98% confidence. How many coffee shops should they sample to obtain a margin of error of at most $0.2 million with a confidence level of 98%? From a previous study they guess σ $1.03 million. To solve the formula we need to find the correct z-score that will give a 98% CI. Looking up the tables we see The z* = Thus we solve the equation: 2 2 z * σ n = 12.0 = 144. m 0.2 From the calculation, we see they need 144 observations such that the margin of error is 0.2million.

21 Calculation practice q In a study of bone turn over in young women with a medical condition, serum TRAP was measured in 31 subjects. The sample mean was 13.2 units per liter. Assume the standard deviation is known to be 6.5U/l. Find the 80% CI for the mean serum level. Look up 10% in the z-tables, this gives The standard error for the sample mean is 6.5/ 31 = Altogether this gives the CI [13.2± ] =[11.7,14.6]. This means with we believe with 80% confidence the mean level of serum for women with this medical condition should lie in this interval. By choosing such a low level of confidence our interval is quite narrow, but our confidence in this interval is relatively low. How large a sample size should we choose such that the 80% CI for the mean has the margin of error 1U/l. q This means solving / n = 1, n=( /1) 2 =70.

22 A confidence interval for µ can be expressed two ways. x ± m. m is called the margin of error Egg carton example: 64.17g ± 2.83 g. We say We conclude that µ is within 2.83g of 64.17g, with 95% confidence. Two endpoints of an interval: ( m) to ( + m). Egg carton example: 61.34g to 67.00g. We say We conclude that µ is between 61.43g and 67.00g, with 95% confidence. Again, the confidence level C is the proportion of possible samples for which the conclusion is correct. That is, it is the proportion of possible samples for which the interval contains µ. (C usually is given in %.) But there is an important issue to deal with. x We do not know the value of σ any more than we know the value of µ. x

23 When σ is unknown In the case the we can estimate the standard deviation from the data. The sample standard deviation s provides an estimate of the population standard deviation σ. When the sample size is large, the sample is likely to contain elements representative of the whole population. Then s is a good estimate of σ. Population distribution But when the sample size is small, the sample contains only a few individuals. Then s is a mediocre estimate of σ. The data is unlikely to contain values in the tails and, s is likely to underestimate σ. Large sample Small sample

24 The z- transform with estimated standard deviation Simply replacing the true standard deviation with the estimated standard deviation can have severe consequences on the confidence interval if we do not correct for it. To see why consider the z-transforms of the sample mean with known and estimated standard deviations: (sample mean - µ)/(σ/ n) (sample mean - µ)/(s/ n) In the first case, z-transform will be a standard normal. In the second case the estimated standard deviation adds extra variability into the `system. In particular, because s can be small then σ, this means the z-transform can be larger and take higher values then we would expect for a standard normal. In the next few slides we show that when we estimate the standard deviation the z-transform is no longer a standard normal, but the so called t-distribution.

25 How brewers saved statistics Just over 100 years ago, W.S. Gosset was a biometrician who worked for Guiness Brewery in Dublin, Ireland. Gosset realized that his inferences with small sample data seemed to be incorrect too often his true confidence level was less than it was stated to be! He worked out the proper method that took into account substituting s for σ. But he had to publish under a pseudonym: Student. Gosset s theory is based on the distribution of the quantity t = x s µ. n This looks like the z-score for x, except that s replaces σ in the denominator.

26 Student s t distributions Suppose that an SRS of size n is drawn from an Normal(µ,σ) population. x µ When σ is known, the sampling distribution for z = σ n is Normal(0,1). q q When σ is estimated from the sample standard deviation s, the x µ sampling distribution for t = will be very close to normal if the s n sample size n is large. This is because for large n, s will be a very reliable estimator of σ. However, in the case that n is not so large, the variability in s will have an impact on the distribution. It is clear that the impact it has depends on the sample size.

27 Student s t distributions When σ is estimated from the sample standard deviation s, the sampling distribution for t = x s µ will depend on the sample size. n The sample distribution of x µ t = s n is a t distribution with n 1 degrees of freedom. q The degrees of freedom (df) is a measure of how well s estimates σ. The larger the degrees of freedom, the better σ is estimated. This means we need a new set of tables!

28 When n is very large, s is a very good estimate of σ, and the corresponding t distributions are very close to the normal distribution. The t distributions become wider (thicker tailed) for smaller sample sizes, reflecting that s can be smaller than σ, so the corresponding t- transform is more likely to take extreme values than the z-transform.

29 Impact on con4idence intervals Suppose we want to construct the C% confidence interval for the mean. The standard deviation is unknown, so as well as estimating the mean we also estimate the standard deviation from the sample. Practical use of t: t* t* is related to the chosen confidence level C. C C is the area under Student s t curve between t* and t*. The confidence interval is thus: x ± t * s n t* Example: For an 80% confidence level C, 80% of Student s t curve s area is contained in the interval. t*

30 Con=idence level and the margin of error The confidence level C determines the value of t* (in table D). The margin of error also depends on t*. Higher confidence C implies a larger margin of error m (thus less precision in our estimates). A lower confidence level C produces a smaller margin of error m (thus better precision in our estimates). * m= t s n C We find t* in the line of Table D for df = n 1 and confidence level C. t* t*

31 Table D When σ is unknown, we use a t distribution with n 1 degrees of freedom (df). Table D shows the z-values and t-values corresponding to landmark P-values/ confidence levels. t= When the sample is very large, we use the normal distribution and the standardized z-value. x µ s n

32 Focus first on 2.5%. For each n, the 2.5% corresponds to the area on the left and right tails of the t-distribution with n degrees of freedom. Remember a distribution gives the chance/likelihood of certain outcomes. Recall that for a normal distribution, the point where we get 2.5% on the left and the right of the tails of the distribution is If we go down the table. we see that as the sample size, n, increases the value corresponding to 2.5, goes from (for n=1) to a number that is very close to 1.96 for extremely large n. This means for small n the variability on the standard deviation s means that the chance of the t-transform being extreme is relatively large. However, as n grows, the estimator of the standard deviation improves, and the t-transform gets closer to a normal distribution. You will observe the same is true for other percentages. Take a look at 5% and 0.5% and look down the table.

33 Calculation practice (red wine 1) It has been suggested that drinking red wine in moderation may protect against heart attacks. This is because red wind contains polyphenols which act on blood cholesterol. To see if moderate red wine consumption increases the average blood level of polyphenols, a group of nine randomly selected healthy men were assigned to drink half a bottle of red wine daily for two weeks. The percent change in their blood polyphenol levels are presented here: Sample average = 5.50 Sample standard deviation s = Degrees of freedom df = n 1 = 8 x We will encounter two problems when doing the analysis. The first is that the sample size is not huge so we have to hope that the sample mean is close to normal. The second is the standard deviation is unknown and has to be estimated from the data.

34 q What is the 95% confidence interval for the average percent change? First, we determine what t* is. The degrees of freedom are df = n 1 = 8 and C = 95%. From Table D we get t* = ( ) The margin of error m is: m = t* s/ n = / So the 95% confidence interval is 5.50 ± 1.93, or 3.57 to We can say With 95% confidence, the mean of percent increase is between 3.57% and 7.43%. What if we want a 99% confidence interval instead? For C = 99% and df = 8, we find t* = Thus m = / Now, with 99% confidence, we only can conclude the mean is between 2.69 and (A big price to pay for the extra confidence.)

35 Calculation practice (red wine 2) Let us return to the same study, but this time we increase the sample size to 15 men. The data is now: 0.7,3.5,4,4.9,5.5,7,7.4,8.1,8.4, 3.2,0.8,4.3,-0.2,-0.6,7.5 The sample mean in this case is 4.3 and the sample standard deviation is Since the sample size has increased, it is likely that the sample standard deviation is a more reliable estimator of the true standard deviation. The number of degrees of freedom is 14. Just as in the previous example we can construct a 95% confidence interval but now we use 14df instead of 8dfs.

36 More calculation practice Let us return to the example of prices of apartments in Dallas. 10 apartments are randomly sampled. The sample mean and the sample standard deviation based on this sample is 980 dollars and 250 dollars (both are estimators based on a sample of size ten). Construct a 95% confidence interval for the mean: q The standard error is 250/ 10 = 79. Looking up the t-tables at 2.5% and 9 degrees of freedom gives The 95% confidence interval for the mean is [980 ± ]=[801,1159]. Suppose we want to know whether the price of apartments have increased since last year, where the mean price was 850 dollars. q Based on this interval we see that 850 dollars and greater is contained in this interval. This means the mean could be 850 dollars or higher. There given the sample it is unclear whether the mean price of apartments has increased since last year or not.

37 Example: comparing z and t- values We want to calculate a 99% CI for the mean weight of a newborn calf. To do this upload the calf data into Statcrunch. Go to Stat, from here you have two options. If we treat the standard deviation of calve weights as known (not random), then we can use the z-statistic, else we need to use the t-statistic. Suppose we choose t-statistic option -> one-sample -> with data -> then choose the variable of weights at birth (wt 0). To get the 99% CI we need to select 99% on the second pages of the options. We get the 99% interval (using a t-distribution with 43 degrees of freedom) [90.05,96.37] pounds. This means we 99% confidence we believe the mean weight of new born calves lies in this interval. To see how well the normal distribution works, we do the same, but choose the z-statistic option. This gives the 99% confidence interval [90.19,96.23]. Notice, that it is slightly narrower, because it does not take into account the underestimation of the standard deviation.

38 More calculation practice Let us return to the M&M data. Suppose we want to calculate a 99% confidence interval for the mean number of M&Ms in plain, peanut butter and peanut M&Ms. These can be calculated using the summary statistics output: Summary statistics for Total: Group by: Type Type n Mean Variance Std. Dev. Std. Err. Median Range Min Max Q1 Q3 M P PB Using this output we can calculate the confidence intervals for the mean number of M&Ms in each type. Do this.

39 Statcrunch will also give the CIs Go to Stats -> t-statistics -> one-sample -> with data -> select the column you want to analyse (choose the Group by if you want it grouped), on the next page select confidence interval and the level you want it at. Sample mean Std. err DF L Limit U limit Looking at the intervals, do you think it that the mean number of M&Ms in a plain and peanut bag could be the same. What about the mean number in peanut and peanut butter? Later on we shall make a formal test on these questions.

40 Calculation practice: coffee shop sales A marketing firm randomly samples 45 coffee shops and determines their annual sales. The sample has an average of $2.67 million and a standard deviation of $1.03 million. What can we say with 90% confidence about the mean annual sales for the population of all coffee shops? x ± t * s n The degrees of freedom is 45 1 = 44. For 90% confidence, we find t* = The margin of error is / 45 = So the interval for the true mean is 2.67 ± We conclude that the mean annual sales of all coffee shops is between $2.41 million and $2.93 million, with 90% confidence.

41 Summary of con4idence interval for µ. The confidence interval for a population mean µ is t* is obtained from Student s t distribution using n 1 degrees of freedom. (Table D in the textbook.) t* is the value such that the confidence level C is the area between t* and t*. Confidence is the proportion of samples that lead to a correct conclusion (for a specific method of inference). The investigator chooses the confidence level C. Tradeoff: more confidence means bigger margin of error, wider intervals. The degrees of freedom is associated with s, the estimate for σ. * / ts * x ± t s n. n The margin of error also depends on the sample size: larger samples are better.

42 Sample size and experimental design An investigator may need a certain margin of error m (e.g., in a marketing survey, in a drug trial, etc.). So plan ahead what sample size to use to achieve that margin of error. You will have to guess the value of σ, perhaps from historical data, and you will not know the degrees of freedom at first. But you can do a rough calculation. σ z * σ m z* n. n m This is done in the planning stages of the study. It is not an inference or conclusion and there are no data yet. Remember, too, that there typically are costs and constraints associated with large samples. Economy and feasibility are factors that will tend to keep sample sizes smaller. 2

43 Interpretation of con=idence, again The confidence level C is the proportion of all possible random samples (of size n) that will give results leading to a correct conclusion, for a specific method. In other words, if many random samples were obtained and confidence intervals were constructed from their data with C = 95% then 95% of the intervals would contain the true parameter value. In the same way, if an investigator always uses C = 95% then 95% of the confidence intervals he constructs will contain the parameter value being estimated. But he never knows which ones do! Changing the method (such as changing the value of t*) will change the confidence level. Once computed, any individual confidence interval either will or will not contain the true population parameter value. It is not random. It is not correct to say C is the probability that the true value falls in the particular interval you have computed.

44 Cautions about using * x ± t s/ n This formula is only for inference about µ, the population mean. Different formulas are used for inference about other parameters. The data must be a simple random sample from the population. The formula is not quite correct for other sampling designs. (But see a statistician to get the right inference method.) Confidence intervals based on t* are not resistant to outliers. If n is small and the population is not normal, the true confidence level could be smaller than C. (Usually n 30 suffices unless the data are highly skewed.) This inference cannot rescue sampling bias, badly produced data or computational errors.

45

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 14 (MWF) The t-distribution Suhasini Subba Rao Review of previous lecture Often the precision

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 14 (MWF) The t-distribution Suhasini Subba Rao Review of previous lecture Often the precision

More information

STAT Chapter 7: Confidence Intervals

STAT Chapter 7: Confidence Intervals STAT 515 -- Chapter 7: Confidence Intervals With a point estimate, we used a single number to estimate a parameter. We can also use a set of numbers to serve as reasonable estimates for the parameter.

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION In Inferential Statistic, ESTIMATION (i) (ii) is called the True Population Mean and is called the True Population Proportion. You must also remember that are not the only population parameters. There

More information

1 Inferential Statistic

1 Inferential Statistic 1 Inferential Statistic Population versus Sample, parameter versus statistic A population is the set of all individuals the researcher intends to learn about. A sample is a subset of the population and

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 13 (MWF) Designing the experiment: Margin of Error Suhasini Subba Rao Terminology: The population

More information

Lecture 2 INTERVAL ESTIMATION II

Lecture 2 INTERVAL ESTIMATION II Lecture 2 INTERVAL ESTIMATION II Recap Population of interest - want to say something about the population mean µ perhaps Take a random sample... Recap When our random sample follows a normal distribution,

More information

Chapter 8 Estimation

Chapter 8 Estimation Chapter 8 Estimation There are two important forms of statistical inference: estimation (Confidence Intervals) Hypothesis Testing Statistical Inference drawing conclusions about populations based on samples

More information

Statistics 13 Elementary Statistics

Statistics 13 Elementary Statistics Statistics 13 Elementary Statistics Summer Session I 2012 Lecture Notes 5: Estimation with Confidence intervals 1 Our goal is to estimate the value of an unknown population parameter, such as a population

More information

Chapter 5. Sampling Distributions

Chapter 5. Sampling Distributions Lecture notes, Lang Wu, UBC 1 Chapter 5. Sampling Distributions 5.1. Introduction In statistical inference, we attempt to estimate an unknown population characteristic, such as the population mean, µ,

More information

Confidence Intervals and Sample Size

Confidence Intervals and Sample Size Confidence Intervals and Sample Size Chapter 6 shows us how we can use the Central Limit Theorem (CLT) to 1. estimate a population parameter (such as the mean or proportion) using a sample, and. determine

More information

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate. Chapter 7 Confidence Intervals and Sample Sizes 7. Estimating a Proportion p 7.3 Estimating a Mean µ (σ known) 7.4 Estimating a Mean µ (σ unknown) 7.5 Estimating a Standard Deviation σ In a recent poll,

More information

8.1 Estimation of the Mean and Proportion

8.1 Estimation of the Mean and Proportion 8.1 Estimation of the Mean and Proportion Statistical inference enables us to make judgments about a population on the basis of sample information. The mean, standard deviation, and proportions of a population

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Review of previous lecture: Why confidence intervals? Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Suppose you want to know the

More information

1. Confidence Intervals (cont.)

1. Confidence Intervals (cont.) Math 1125-Introductory Statistics Lecture 23 11/1/06 1. Confidence Intervals (cont.) Let s review. We re in a situation, where we don t know µ, but we have a number from a normal population, either an

More information

Elementary Statistics

Elementary Statistics Chapter 7 Estimation Goal: To become familiar with how to use Excel 2010 for Estimation of Means. There is one Stat Tool in Excel that is used with estimation of means, T.INV.2T. Open Excel and click on

More information

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics CONTENTS Estimating parameters The sampling distribution Confidence intervals for μ Hypothesis tests for μ The t-distribution Comparison

More information

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: Chapter 17 Inference about a Population Mean Conditions for inference Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: (1) Our data (observations)

More information

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems Interval estimation September 29, 2017 STAT 151 Class 7 Slide 1 Outline of Topics 1 Basic ideas 2 Sampling variation and CLT 3 Interval estimation using X 4 More general problems STAT 151 Class 7 Slide

More information

The Two-Sample Independent Sample t Test

The Two-Sample Independent Sample t Test Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 The General Formula The Equal-n Formula 4 5 6 Independence Normality Homogeneity of Variances 7 Non-Normality Unequal

More information

Chapter 8 Statistical Intervals for a Single Sample

Chapter 8 Statistical Intervals for a Single Sample Chapter 8 Statistical Intervals for a Single Sample Part 1: Confidence intervals (CI) for population mean µ Section 8-1: CI for µ when σ 2 known & drawing from normal distribution Section 8-1.2: Sample

More information

Sampling Distributions For Counts and Proportions

Sampling Distributions For Counts and Proportions Sampling Distributions For Counts and Proportions IPS Chapter 5.1 2009 W. H. Freeman and Company Objectives (IPS Chapter 5.1) Sampling distributions for counts and proportions Binomial distributions for

More information

Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions

Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions Objectives: Students will: Define a sampling distribution. Contrast bias and variability. Describe the sampling distribution of a proportion (shape, center, and spread).

More information

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1 Stat 226 Introduction to Business Statistics I Spring 2009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:30-10:50 a.m. Chapter 6, Section 6.1 Confidence Intervals Confidence Intervals

More information

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD MAJOR POINTS Sampling distribution of the mean revisited Testing hypotheses: sigma known An example Testing hypotheses:

More information

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Chapter 7 Sampling Distributions and Point Estimation of Parameters Chapter 7 Sampling Distributions and Point Estimation of Parameters Part 1: Sampling Distributions, the Central Limit Theorem, Point Estimation & Estimators Sections 7-1 to 7-2 1 / 25 Statistical Inferences

More information

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions SGSB Workshop: Using Statistical Data to Make Decisions Module 2: The Logic of Statistical Inference Dr. Tom Ilvento January 2006 Dr. Mugdim Pašić Key Objectives Understand the logic of statistical inference

More information

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Chapter 14 : Statistical Inference 1 Chapter 14 : Introduction to Statistical Inference Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Data x

More information

The "bell-shaped" curve, or normal curve, is a probability distribution that describes many real-life situations.

The bell-shaped curve, or normal curve, is a probability distribution that describes many real-life situations. 6.1 6.2 The Standard Normal Curve The "bell-shaped" curve, or normal curve, is a probability distribution that describes many real-life situations. Basic Properties 1. The total area under the curve is.

More information

Homework: (Due Wed) Chapter 10: #5, 22, 42

Homework: (Due Wed) Chapter 10: #5, 22, 42 Announcements: Discussion today is review for midterm, no credit. You may attend more than one discussion section. Bring 2 sheets of notes and calculator to midterm. We will provide Scantron form. Homework:

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Midterm Exam ٩(^ᴗ^)۶ In class, next week, Thursday, 26 April. 1 hour, 45 minutes. 5 questions of varying lengths.

More information

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means Chapter 11: Inference for Distributions 11.1 Inference for Means of a Population 11.2 Comparing Two Means 1 Population Standard Deviation In the previous chapter, we computed confidence intervals and performed

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://wwwstattamuedu/~suhasini/teachinghtml Suhasini Subba Rao Review of previous lecture The main idea in the previous lecture is that the sample

More information

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates A point estimate is a single number, a confidence interval provides additional information about the variability of the estimate Lower

More information

Sampling & Confidence Intervals

Sampling & Confidence Intervals Sampling & Confidence Intervals Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 24/10/2017 Principles of Sampling Often, it is not practical to measure every subject in a population.

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes. Standardizing normal distributions The Standard Normal Curve

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes. Standardizing normal distributions The Standard Normal Curve 6.1 6.2 The Standard Normal Curve Standardizing normal distributions The "bell-shaped" curve, or normal curve, is a probability distribution that describes many reallife situations. Basic Properties 1.

More information

FEEG6017 lecture: The normal distribution, estimation, confidence intervals. Markus Brede,

FEEG6017 lecture: The normal distribution, estimation, confidence intervals. Markus Brede, FEEG6017 lecture: The normal distribution, estimation, confidence intervals. Markus Brede, mb8@ecs.soton.ac.uk The normal distribution The normal distribution is the classic "bell curve". We've seen that

More information

Terms & Characteristics

Terms & Characteristics NORMAL CURVE Knowledge that a variable is distributed normally can be helpful in drawing inferences as to how frequently certain observations are likely to occur. NORMAL CURVE A Normal distribution: Distribution

More information

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY 1 THIS WEEK S PLAN Part I: Theory + Practice ( Interval Estimation ) Part II: Theory + Practice ( Interval Estimation ) z-based Confidence Intervals for a Population

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Chapter 4: Estimation

Chapter 4: Estimation Slide 4.1 Chapter 4: Estimation Estimation is the process of using sample data to draw inferences about the population Sample information x, s Inferences Population parameters µ,σ Slide 4. Point and interval

More information

Confidence Intervals. σ unknown, small samples The t-statistic /22

Confidence Intervals. σ unknown, small samples The t-statistic /22 Confidence Intervals σ unknown, small samples The t-statistic 1 /22 Homework Read Sec 7-3. Discussion Question pg 365 Do Ex 7-3 1-4, 6, 9, 12, 14, 15, 17 2/22 Objective find the confidence interval for

More information

Lecture 7 Random Variables

Lecture 7 Random Variables Lecture 7 Random Variables Definition: A random variable is a variable whose value is a numerical outcome of a random phenomenon, so its values are determined by chance. We shall use letters such as X

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 10 (MWF) Checking for normality of the data using the QQplot Suhasini Subba Rao Review of previous

More information

σ. Step 3: Find the corresponding area using the Z-table.

σ. Step 3: Find the corresponding area using the Z-table. Section 6.2: Applications of the Normal Distribution Suppose that the scores for a standardized test are normally distributed, have a mean of 100, and have a standard deviation of 15. Then we can use the

More information

If the distribution of a random variable x is approximately normal, then

If the distribution of a random variable x is approximately normal, then Confidence Intervals for the Mean (σ unknown) In many real life situations, the standard deviation is unknown. In order to construct a confidence interval for a random variable that is normally distributed

More information

Sampling Distributions and the Central Limit Theorem

Sampling Distributions and the Central Limit Theorem Sampling Distributions and the Central Limit Theorem February 18 Data distributions and sampling distributions So far, we have discussed the distribution of data (i.e. of random variables in our sample,

More information

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2 Determining Sample Size Slide 1 E = z α / 2 ˆ ˆ p q n (solve for n by algebra) n = ( zα α / 2) 2 p ˆ qˆ E 2 Sample Size for Estimating Proportion p When an estimate of ˆp is known: Slide 2 n = ˆ ˆ ( )

More information

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 16 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 7. - 7.3 Lecture Chapter 8.1-8. Review Chapter 6. Problem Solving

More information

Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean)

Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean) Statistics 16_est_parameters.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean) Some Common Sense Assumptions for Interval Estimates

More information

Sampling and sampling distribution

Sampling and sampling distribution Sampling and sampling distribution September 12, 2017 STAT 101 Class 5 Slide 1 Outline of Topics 1 Sampling 2 Sampling distribution of a mean 3 Sampling distribution of a proportion STAT 101 Class 5 Slide

More information

Chapter 7. Sampling Distributions

Chapter 7. Sampling Distributions Chapter 7 Sampling Distributions Section 7.1 Sampling Distributions and the Central Limit Theorem Sampling Distributions Sampling distribution The probability distribution of a sample statistic. Formed

More information

Sampling Distributions

Sampling Distributions AP Statistics Ch. 7 Notes Sampling Distributions A major field of statistics is statistical inference, which is using information from a sample to draw conclusions about a wider population. Parameter:

More information

Institute for the Advancement of University Learning & Department of Statistics

Institute for the Advancement of University Learning & Department of Statistics Institute for the Advancement of University Learning & Department of Statistics Descriptive Statistics for Research (Hilary Term, 00) Lecture 4: Estimation (I.) Overview of Estimation In most studies or

More information

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE 19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE We assume here that the population variance σ 2 is known. This is an unrealistic assumption, but it allows us to give a simplified presentation which

More information

The topics in this section are related and necessary topics for both course objectives.

The topics in this section are related and necessary topics for both course objectives. 2.5 Probability Distributions The topics in this section are related and necessary topics for both course objectives. A probability distribution indicates how the probabilities are distributed for outcomes

More information

STAT 201 Chapter 6. Distribution

STAT 201 Chapter 6. Distribution STAT 201 Chapter 6 Distribution 1 Random Variable We know variable Random Variable: a numerical measurement of the outcome of a random phenomena Capital letter refer to the random variable Lower case letters

More information

Central Limit Theorem

Central Limit Theorem Central Limit Theorem Lots of Samples 1 Homework Read Sec 6-5. Discussion Question pg 329 Do Ex 6-5 8-15 2 Objective Use the Central Limit Theorem to solve problems involving sample means 3 Sample Means

More information

Chapter 6: The Normal Distribution

Chapter 6: The Normal Distribution Chapter 6: The Normal Distribution Diana Pell Section 6.1: Normal Distributions Note: Recall that a continuous variable can assume all values between any two given values of the variables. Many continuous

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny.

Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny. Distributions September 17 Random variables Anything that can be measured or categorized is called a variable If the value that a variable takes on is subject to variability, then it the variable is a

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions

More information

Part V - Chance Variability

Part V - Chance Variability Part V - Chance Variability Dr. Joseph Brennan Math 148, BU Dr. Joseph Brennan (Math 148, BU) Part V - Chance Variability 1 / 78 Law of Averages In Chapter 13 we discussed the Kerrich coin-tossing experiment.

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 7 Estimation: Single Population Copyright 010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-1 Confidence Intervals Contents of this chapter: Confidence

More information

Confidence Intervals Introduction

Confidence Intervals Introduction Confidence Intervals Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample mean X is a point estimate of the population mean μ

More information

Expected Value of a Random Variable

Expected Value of a Random Variable Knowledge Article: Probability and Statistics Expected Value of a Random Variable Expected Value of a Discrete Random Variable You're familiar with a simple mean, or average, of a set. The mean value of

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

STAT243 LS: Intro to Probability and Statistics Final Exam, Mar 20, 2017 KEY

STAT243 LS: Intro to Probability and Statistics Final Exam, Mar 20, 2017 KEY STAT243 LS: Intro to Probability and Statistics Final Exam, Mar 20, 2017 KEY This is a 110-min exam. Students may use a page of note (front and back), and a calculator, but nothing else is allowed. 1.

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 10 (MWF) Checking for normality of the data using the QQplot Suhasini Subba Rao Checking for

More information

Lecture 10 - Confidence Intervals for Sample Means

Lecture 10 - Confidence Intervals for Sample Means Lecture 10 - Confidence Intervals for Sample Means Sta102/BME102 October 5, 2015 Colin Rundel Confidence Intervals in the Real World A small problem Lets assume we are collecting a large sample (n=200)

More information

CHAPTER 8 Estimating with Confidence

CHAPTER 8 Estimating with Confidence CHAPTER 8 Estimating with Confidence 8.2 Estimating a Population Proportion The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Estimating a Population

More information

Name: Period: Date: 1. Suppose we are interested in the average weight of chickens in America.

Name: Period: Date: 1. Suppose we are interested in the average weight of chickens in America. Name: Period: Date: Statistics Review MM4D1. Using simulation, students will develop the idea of the central limit theorem. MM4D2. Using student-generated data from random samples of at least 30 members,

More information

Lecture 6: Chapter 6

Lecture 6: Chapter 6 Lecture 6: Chapter 6 C C Moxley UAB Mathematics 3 October 16 6.1 Continuous Probability Distributions Last week, we discussed the binomial probability distribution, which was discrete. 6.1 Continuous Probability

More information

1. Statistical problems - a) Distribution is known. b) Distribution is unknown.

1. Statistical problems - a) Distribution is known. b) Distribution is unknown. Probability February 5, 2013 Debdeep Pati Estimation 1. Statistical problems - a) Distribution is known. b) Distribution is unknown. 2. When Distribution is known, then we can have either i) Parameters

More information

Data Analysis. BCF106 Fundamentals of Cost Analysis

Data Analysis. BCF106 Fundamentals of Cost Analysis Data Analysis BCF106 Fundamentals of Cost Analysis June 009 Chapter 5 Data Analysis 5.0 Introduction... 3 5.1 Terminology... 3 5. Measures of Central Tendency... 5 5.3 Measures of Dispersion... 7 5.4 Frequency

More information

Chapter 6: The Normal Distribution

Chapter 6: The Normal Distribution Chapter 6: The Normal Distribution Diana Pell Section 6.1: Normal Distributions Note: Recall that a continuous variable can assume all values between any two given values of the variables. Many continuous

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

FINAL REVIEW W/ANSWERS

FINAL REVIEW W/ANSWERS FINAL REVIEW W/ANSWERS ( 03/15/08 - Sharon Coates) Concepts to review before answering the questions: A population consists of the entire group of people or objects of interest to an investigator, while

More information

Sampling Distributions Chapter 18

Sampling Distributions Chapter 18 Sampling Distributions Chapter 18 Parameter vs Statistic Example: Identify the population, the parameter, the sample, and the statistic in the given settings. a) The Gallup Poll asked a random sample of

More information

Chapter Seven: Confidence Intervals and Sample Size

Chapter Seven: Confidence Intervals and Sample Size Chapter Seven: Confidence Intervals and Sample Size A point estimate is: The best point estimate of the population mean µ is the sample mean X. Three Properties of a Good Estimator 1. Unbiased 2. Consistent

More information

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether. Lecture 34 Section 10.2 Hampden-Sydney College Fri, Oct 31, 2008 Outline 1 2 3 4 5 6 7 8 Exercise 10.4, page 633. A psychologist is studying the distribution of IQ scores of girls at an alternative high

More information

Lecture 9. Probability Distributions. Outline. Outline

Lecture 9. Probability Distributions. Outline. Outline Outline Lecture 9 Probability Distributions 6-1 Introduction 6- Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7- Properties of the Normal Distribution

More information

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Announcements: There are some office hour changes for Nov 5, 8, 9 on website Week 5 quiz begins after class today and ends at

More information

(# of die rolls that satisfy the criteria) (# of possible die rolls)

(# of die rolls that satisfy the criteria) (# of possible die rolls) BMI 713: Computational Statistics for Biomedical Sciences Assignment 2 1 Random variables and distributions 1. Assume that a die is fair, i.e. if the die is rolled once, the probability of getting each

More information

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation. 1) If n 100 and p 0.02 in a binomial experiment, does this satisfy the rule for a normal approximation? Why or why not? No, because np 100(0.02) 2. The value of np must be greater than or equal to 5 to

More information

STA 320 Fall Thursday, Dec 5. Sampling Distribution. STA Fall

STA 320 Fall Thursday, Dec 5. Sampling Distribution. STA Fall STA 320 Fall 2013 Thursday, Dec 5 Sampling Distribution STA 320 - Fall 2013-1 Review We cannot tell what will happen in any given individual sample (just as we can not predict a single coin flip in advance).

More information

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean.

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean. Lecture 3 Sampling distributions. Counts, Proportions, and sample mean. Statistical Inference: Uses data and summary statistics (mean, variances, proportions, slopes) to draw conclusions about a population

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Lecture 9. Probability Distributions

Lecture 9. Probability Distributions Lecture 9 Probability Distributions Outline 6-1 Introduction 6-2 Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7-2 Properties of the Normal Distribution

More information

Lecture 6: Confidence Intervals

Lecture 6: Confidence Intervals Lecture 6: Confidence Intervals Taeyong Park Washington University in St. Louis February 22, 2017 Park (Wash U.) U25 PS323 Intro to Quantitative Methods February 22, 2017 1 / 29 Today... Review of sampling

More information

Estimation. Focus Points 10/11/2011. Estimating p in the Binomial Distribution. Section 7.3

Estimation. Focus Points 10/11/2011. Estimating p in the Binomial Distribution. Section 7.3 Estimation 7 Copyright Cengage Learning. All rights reserved. Section 7.3 Estimating p in the Binomial Distribution Copyright Cengage Learning. All rights reserved. Focus Points Compute the maximal length

More information

CHAPTER 6 Random Variables

CHAPTER 6 Random Variables CHAPTER 6 Random Variables 6.1 Discrete and Continuous Random Variables The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Discrete and Continuous Random

More information

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr.

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr. Department of Quantitative Methods & Information Systems Business Statistics Chapter 6 Normal Probability Distribution QMIS 120 Dr. Mohammad Zainal Chapter Goals After completing this chapter, you should

More information

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82 Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections

More information

A point estimate is a single value (statistic) used to estimate a population value (parameter).

A point estimate is a single value (statistic) used to estimate a population value (parameter). Shahzad Bashir. 1 Chapter 9 Estimation & Confidence Interval Interval Estimation for Population Mean: σ Known Interval Estimation for Population Mean: σ Unknown Determining the Sample Size 2 A point estimate

More information

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 7 Statistical Intervals Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to

More information

Business Statistics 41000: Probability 3

Business Statistics 41000: Probability 3 Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404

More information

= 0.35 (or ˆp = We have 20 independent trials, each with probability of success (heads) equal to 0.5, so X has a B(20, 0.5) distribution.

= 0.35 (or ˆp = We have 20 independent trials, each with probability of success (heads) equal to 0.5, so X has a B(20, 0.5) distribution. Chapter 5 Solutions 51 (a) n = 1500 (the sample size) (b) The Yes count seems like the most reasonable choice, but either count is defensible (c) X = 525 (or X = 975) (d) ˆp = 525 1500 = 035 (or ˆp = 975

More information