Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Similar documents
Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

8.1 Estimation of the Mean and Proportion

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

Chapter 5. Sampling Distributions

Statistics 13 Elementary Statistics

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance

STAT Chapter 7: Confidence Intervals

Chapter 8 Statistical Intervals for a Single Sample

Confidence Intervals Introduction

Confidence Intervals and Sample Size

Chapter 4: Estimation

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

χ 2 distributions and confidence intervals for population variance

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

4.1 Introduction Estimating a population mean The problem with estimating a population mean with a sample mean: an example...

FEEG6017 lecture: The normal distribution, estimation, confidence intervals. Markus Brede,

CHAPTER 8 Estimating with Confidence

Chapter 8 Estimation

Elementary Statistics

Probability. An intro for calculus students P= Figure 1: A normal integral

Data Analysis and Statistical Methods Statistics 651

Discrete Probability Distribution

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

Section 7-2 Estimating a Population Proportion

Sampling and sampling distribution

Confidence Intervals for Paired Means with Tolerance Probability

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

1 Inferential Statistic

Statistics 511 Additional Materials

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

Time Observations Time Period, t

Statistical Methods in Practice STAT/MATH 3379

Lecture 2 INTERVAL ESTIMATION II

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Non-Inferiority Tests for Two Means in a 2x2 Cross-Over Design using Differences

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

Data Analysis and Statistical Methods Statistics 651

1/12/2011. Chapter 5: z-scores: Location of Scores and Standardized Distributions. Introduction to z-scores. Introduction to z-scores cont.

Section The Sampling Distribution of a Sample Mean

Statistics and Probability

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Lecture 6: Confidence Intervals

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

Chapter 8. Introduction to Statistical Inference

Chapter 9: Sampling Distributions

Lecture 9. Probability Distributions. Outline. Outline

1 Introduction 1. 3 Confidence interval for proportion p 6

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Estimation. Focus Points 10/11/2011. Estimating p in the Binomial Distribution. Section 7.3

5.3 Interval Estimation

HOMEWORK: Due Mon 11/8, Chapter 9: #15, 25, 37, 44

Lecture 9. Probability Distributions

Discrete Random Variables

Statistics for Managers Using Microsoft Excel 7 th Edition

Confidence Interval and Hypothesis Testing: Exercises and Solutions

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

4.2 Bernoulli Trials and Binomial Distributions

UNIVERSITY OF VICTORIA Midterm June 2014 Solutions

Lecture 9 - Sampling Distributions and the CLT

Today s plan: Section 4.4.2: Capture-Recapture method revisited and Section 4.4.3: Public Opinion Polls

The topics in this section are related and necessary topics for both course objectives.

6.5: THE NORMAL APPROXIMATION TO THE BINOMIAL AND

Data Analysis and Statistical Methods Statistics 651

Statistical Intervals (One sample) (Chs )

MATH 3200 Exam 3 Dr. Syring

A point estimate is a single value (statistic) used to estimate a population value (parameter).

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean.

Sampling Distribution

Central Limit Theorem (cont d) 7/28/2006

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics

Estimation Y 3. Confidence intervals I, Feb 11,

MATH 446/546 Homework 1:

Statistics for Business and Economics

Chapter Seven: Confidence Intervals and Sample Size

MATH 264 Problem Homework I

Chapter 8: Sampling distributions of estimators Sections

10/1/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

ECON 214 Elements of Statistics for Economists 2016/2017

Tests for Paired Means using Effect Size

Chapter 5. Statistical inference for Parametric Models

The Binomial and Geometric Distributions. Chapter 8

Confidence Intervals. σ unknown, small samples The t-statistic /22

Introduction to Statistics I

1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by. Cov(X, Y ) = E(X E(X))(Y E(Y ))

6.1, 7.1 Estimating with confidence (CIS: Chapter 10)

Martingales, Part II, with Exercise Due 9/21

Section Sampling Distributions for Counts and Proportions

ECON 214 Elements of Statistics for Economists

Two-Sample Z-Tests Assuming Equal Variance

Section 4.3 Objectives

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Transcription:

Chapter 14 : Statistical Inference 1 Chapter 14 : Introduction to Statistical Inference Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Data x 1, x 2,..., x n is a random sample from some population with mean µ. We saw in Chapter 11 that the random variable x has expected value µ no matter what the value of µ is. x is an unbiased estimator of µ. It is also called a point estimator of µ. x is a random variable and it is typically not equal to the population parameter µ. What is the real value of µ? Can we say anything about the value of µ based on a random sample of data? Statistical Inference studies and gives methods to draw conclusions (make inferences) about a population from a sample of data. Remark : An inference is similar to a deduction, but the conclusion is not guaranteed with certainty, as opposed to a deduction where the conclusion is guaranteed with certainty as in a mathematical deduction. Some of this distinction will be clear from our examples. Specifically inferences are based on random samples and so if an experiment is repeated the data will change and the inferences will change, but typically not in a very major or important way. Some Very Simple Conditions for Inferences about a Population Mean 1. A simple random sample is obtained from a population. There is no non-response or other practical difficulties with the data 2. the variable we measure has exactly a normal distribution N(µ, sd = σ) 3. we do not know µ, but we know σ. Items 2 and 3 are typically not true. After seeing how we can make inferences about µ in this special case we will then see how to make more realistic conditions for making

Chapter 14 : Statistical Inference 2 inferences. In particular as in Chapter 11, we will have in place of the exact normal distribution for x a property of an approximate normal distribution for x. For point 3, we will have another way of approximating the population mean, in particular by using the sample variance to estimate the population variance σ 2.

Chapter 14 : Statistical Inference 3 Confidence Interval Here is an overview of the reasoning and the method used in inference 1. For a given value of µ, the random variable x has distribution Normal with mean µ, standard deviation σ n 2. For a given value of µ and using the 95% rule we have with probability 0.95 that x will fall into the interval µ 1.96 σ n, µ + 1.96 σ n Written in another fashion we have µ 1.96 σ n x µ + 1.96 σ n (1) with probability 0.95 The 95% rule will replace the critical value 1.96 by the value 2 and so we will have µ 2 σ n x µ + 2 σ n with probability 0.95 3. For a given sample we observe the value of x. We then use equation (1) to solve for µ in terms of this observed value of x. This gives x 1.96 σ µ x + 1.96 σ (2) n n or in terms of the 95% rule with 1.96 replaced by 2 x 2 σ n µ x + 2 σ n Equation (2) is called a 95% confidence interval, since it is based on a 0.95 probability or proportion central interval for the normal distribution of x. Notice a probability or proportion of 0.95 is also the same as 95% probability.

Chapter 14 : Statistical Inference 4 Item or property 2 gives the values in terms of an interval that x falls into with large probability, that is the ones that are reasonably or consistent with the given value of µ. This is equation (1). Item or property 3 then inverts this statement to ask which values of µ are consistent with the observed value of x. This is Equation (2). Aside: For those who are interested in a little of the algebra we can see how we get equation (2) from equation (1)? For those not interested in the algebra just skip these next few lines. Equation (1) is actually two inequalities µ 1.96 σ x n and x µ + 1.96 σ n From the first we get, by adding 1.96 σ n to both sides, we obtain µ x + 1.96 σ n From the second we get, by subtracting 1.96 σ n to both sides we obtain x 1.96 σ n µ Putting these inequalities together two together we get equation (2). End of Aside

Chapter 14 : Statistical Inference 5 For now pretend that the grades for test 1 followed a normal distribution with mean µ and standard deviation σ = 4.04, the actual standard deviation for the grades. The 95% confidence interval for the true population grade mean, based on a sample of size n = 10 is then given by (each line will follow from the previous line) x 1.96 4.04 10 µ x + 1.96 4.04 10 x 1.96 1.278 µ x + 1.96 1.278 x 2.50 µ x + 2.50 Notice that we can write this formula down even before taking a random sample of size n = 10. Take a random sample from the test 1 population. When I did this I obtained data 27, 25, 16, 25, 24, 29, 27, 27, 28, 24 For this data the sample mean is x = 25.2. Based on the normal assumptions above we have a 95% confidence interval x ± 4.04 10 = 25.2 ± 2.50 = [22.70, 27.70] Thus based on this random sample of n = 10 data points we have learned (actually inferred) that the actual population mean in reasonably thought to be between 22.7 and 27.7. This calculation means that with 95% confidence the true population mean (which we typically do not know) is a number between 22.70 and 27.70. In particular a claim that the true population mean parameter value µ falls into this interval and not outside this interval; thus for example it is not reasonable (at 95% confidence level) that the value of µ is 20.

Chapter 14 : Statistical Inference 6 Confidence Interval Our estimate of an interval of reasonable or consistent values of µ is of the form x ± margin or error The margin of error depends on the sample size through n the population standard deviation σ. Aside : when we generalize this method the dependence on the population variance (again typically unknown) will be through the sample standard deviation s 2 = s. Recall from our earlier discussion in Chapter 11 that the sample variance s 2 is an unbiased estimator of the true population variance. the confidence level as determined by the corresponding critical value for the sampling distribution of the estimator x. This interval is called a confidence interval. We can choose the probability interval (central 0.90, central 0.95 or central 0.99) and this yields the corresponding confidence interval through the relationships (1) and (2). The central 0.90 probability interval corresponds to a 90% confidence interval, a central 0.95 probability interval corresponds to a 95% confidence interval, and central 0.99 probability interval corresponds to a 99% confidence interval. A confidence interval is a random interval. The true value µ falls into this interval with the corresponding probability level. x z σ n µ x + z σ n (3) or in another notation x ± z σ n. The critical value z is chosen so the probability interval for x corresponds to the given confidence level. For the simple exact normal distribution assumptions we have the corresponding critical values

Chapter 14 : Statistical Inference 7 Confidence interval Probability interval probability in upper tail Critical value z 1.99 99% 0.99 =.005 2 2.58 1.95 95% 0.95 =.025 2 1.96 1.90 90% 0.90 =.05 2 1.65 For a 95% confidence interval the true value of the parameter falls into a confidence interval with probability 0.95. Thus on average 1 out 20 confidence intervals will not contain the true value of µ. A confidence interval is a random interval. Similarly for a 90% (or 99%) confidence interval, on average 9 out of 10 (99 out of 100) of these intervals contains the true population mean value µ. To illustrate this idea we have done a simulation experiment with M = 20 replicates. For each replicate a simple random sample of size n = 10 is taken from a normal distribution with mean = µ = 23.6 and standard deviation = σ = 4.04. For each random sample the corresponding 95% confidence interval is calculated. According to the probability rules about 1 in 20 (that is 5%) of such intervals on average will NOT contain the true value of µ = 23.6. This is shown in Figure 1. Each line in this plot is one of the confidence intervals. A centre dashed vertical line corresponding to µ = 23.6. In this plot all the confidence intervals overlap the value µ = 23.6, except for 1 interval.

Chapter 14 : Statistical Inference 8 18 20 22 24 26 28 5 10 15 20 Confidence Interval 1:M 20 confidence intervals, n = 10, N(23.6, sd = 4.04) Figure 1: M = 20 random confidence intervals

Chapter 14 : Statistical Inference 9 How can we obtain a more precise estimate of the true value of the population mean? In terms of our confidence interval our estimate of the values of µ that are consistent with the observed data is of the form x ± margin or error or more precisely x ± z σ n. Thus we can make the estimate more precise by use a smaller value of z, which means a lower confidence level, and hence less likely to contain the true value of µ use a larger value for n, that is increase the sample size. This will be more expensive, so it may not be possible. The increase in precision is proportional to 1 over n. Thus to make the confidence interval one half as long requires that n gets changed to 2 n = 4n, so that 4 imtes as many data points are required. make the population variance smaller. This typically cannot be done, as we are working with the population that is given, and the random sample that we obtain from it. However sometimes an experimental design such as matched pairs or a paired design will allow us to obtain data with the given population mean but smaller variability. Recall for example the shock absorbers example where this is possible. How can we guarantee that our confidence interval contains the true value? If we use a 100% confidence interval then z = and our confidence interval is x ±, or every possible value of µ. This interval is of course useful in helping us to learn or understand what value of µ are reasonable and which values of µ are not reasonable.

Chapter 14 : Statistical Inference 10 How many observations should we take? Ideally we want to take as many as possible. However in many cases it costs resources (typical scientific or engineering experiment, experimental units), time (all types of studies) and money (typically all types of experiments : lab assistants, poll questioners). Thus for practical considerations one cannot take arbitrarily large samples. On the other hand we might need to obtain a certain degree of precision. For example an opinion poll might want to measure the proportion of voters who favour the ruling party, but it is sufficient to know this to a margin or precision of plus or minus 3 percentage points. In using a drug to control blood pressure we might want to know the blood pressure to a precision of 5 units. We can now translate this question into the following : for a given precision m at say 95% confidence level, how big should the sample size n so that m = z σ n Where does this come from? The confidence interval form is x ± z σ n = x ± m and so we match up m and z σ n. Since n is the only unknown we the solve for n, yielding n = (z ) 2 σ 2 m 2 = ( z ) σ 2. m Since we can only take whole numbers (integers) of observations (how can one take.6 of an observation?) we will then take n to be the value of the right hand side, but rounded up to the next integer. Consider the test 1 grades example again. For the purpose of this calculation we pretend the distribution of grades is normal and that the population size is very large. Here we have the population standard deviation σ = 4.04. How big should the sample size be to have precision m = 3, that is our 95% confidence interval will be x ± 3.

Chapter 14 : Statistical Inference 11 We will need to take ( z ) σ 2 n = m ( 4.04 1.96 = m ( 4.04 1.96 = 3 = 2.64 2 = 6.97 or more specifically since n is an integer we take n = 7 by rounding up. We would have rounded up even if the calculated of the expression were 6.01. That is because we need an integer or whole number of observations and it has to be at least 6.01 (bigger than 6.01). For different value of precision, again at confidence level 95%, we have ) 2 ) 2 m ( ) z 2 σ m n 3.0 7.0 7 2.0 15.7 16 1.0 62.7 63 0.5 250.8 251 Aside : Comment on Opinion Polls It is for this reason that opinion polls take a random sample of approximately 1600 individuals. This will result in a 95% confidence interval of a population proportion (which is the same as a sample mean of success and failure counts) which is of the form ˆp ±.03 This is often reported as a margin of error of 3% 19 times out of 20.

Chapter 14 : Statistical Inference 12 Recall the beginning of our discussion of confidence intervals. We had some Very Simple Conditions for Inferences about a Population Mean, which are given here again. 1. A simple random sample is obtained from a population. There is no non-response or other practical difficulties with the data 2. the variable we measure has exactly a normal distribution N(µ, sd = σ) 3. we do not know µ, but we know σ. Suppose that instead of property 3 we do not known σ. This is much more realistic. What can we do now? Recall also that be based our confidence interval on the following idea. For a given value of µ and using the 95% rule we have with probability 0.95 that x will fall into the interval µ 1.96 σ n, µ + 1.96 σ n This used the property that x has exactly a normal distribution with mean µ and standard deviation σ n, or equivalently x µ σ n N(0, 1) Here is a shorthand for saying distributed as. When σ is not known we can use in place of σ the sample variance s = s 2, where s 2 is the sample variance. However the random variable x µ s n no longer has a standard normal distribution, but instead a distribution called the Student s t distribution with degrees of freedom n 1. The n 1 is related to the divisor n 1 in the formula for the sample variance. The critical values for the Student s t distribution are given in Table C near the end of the text. We discuss later how the corresponding confidence interval gets changed.