Chapter 8 Estimation

Similar documents
Estimation. Focus Points 10/11/2011. Estimating p in the Binomial Distribution. Section 7.3

Section 8.1 Estimating μ When σ is Known

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

Statistical Intervals (One sample) (Chs )

3/28/18. Estimation. Focus Points. Focus Points. Estimating p in the Binomial Distribution. Estimating p in the Binomial Distribution. Section 7.

If the distribution of a random variable x is approximately normal, then

ECON 214 Elements of Statistics for Economists 2016/2017

STAT Chapter 7: Confidence Intervals

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Chapter 7. Sampling Distributions

Statistics 13 Elementary Statistics

8.1 Estimation of the Mean and Proportion

Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) Estimating Population Parameters

CH 5 Normal Probability Distributions Properties of the Normal Distribution

ECON 214 Elements of Statistics for Economists

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Chapter 8 Statistical Intervals for a Single Sample

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Chapter 6 Confidence Intervals

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Statistical Methods in Practice STAT/MATH 3379

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION

MA131 Lecture 9.1. = µ = 25 and σ X P ( 90 < X < 100 ) = = /// σ X

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

2011 Pearson Education, Inc

Statistics for Business and Economics: Random Variables:Continuous

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance

CHAPTER 5 Sampling Distributions

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

The normal distribution is a theoretical model derived mathematically and not empirically.

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Chapter 6. The Normal Probability Distributions

Math 251: Practice Questions Hints and Answers. Review II. Questions from Chapters 4 6

Confidence Intervals Introduction

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Statistics and Probability

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

AMS7: WEEK 4. CLASS 3

The Binomial Distribution

Data Analysis. BCF106 Fundamentals of Cost Analysis

Confidence Intervals. σ unknown, small samples The t-statistic /22

The Binomial Distribution

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Chapter Seven: Confidence Intervals and Sample Size

6.1, 7.1 Estimating with confidence (CIS: Chapter 10)

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions

Unit 5: Sampling Distributions of Statistics

1 Small Sample CI for a Population Mean µ

Unit 5: Sampling Distributions of Statistics

Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Continuous Probability Distributions & Normal Distribution

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Contents. The Binomial Distribution. The Binomial Distribution The Normal Approximation to the Binomial Left hander example

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Normal Probability Distributions

Part V - Chance Variability

1. Statistical problems - a) Distribution is known. b) Distribution is unknown.

Unit 2: Statistics Probability

Confidence Intervals and Sample Size

Expected Value of a Random Variable

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

Math 14 Lecture Notes Ch The Normal Approximation to the Binomial Distribution. P (X ) = nc X p X q n X =

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

MidTerm 1) Find the following (round off to one decimal place):

Chapter ! Bell Shaped

Math 227 Elementary Statistics. Bluman 5 th edition

GETTING STARTED. To OPEN MINITAB: Click Start>Programs>Minitab14>Minitab14 or Click Minitab 14 on your Desktop

Section Introduction to Normal Distributions

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Statistics (This summary is for chapters 18, 29 and section H of chapter 19)

Sampling Distributions

Learning Objectives for Ch. 7

Introduction to Statistics I

Lecture 2 INTERVAL ESTIMATION II

PROBABILITY DISTRIBUTIONS

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1

Elementary Statistics Triola, Elementary Statistics 11/e Unit 14 The Confidence Interval for Means, σ Unknown

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes. Standardizing normal distributions The Standard Normal Curve

Lecture 6: Chapter 6

Y i % (% ( ( ' & ( # % s 2 = ( ( Review - order of operations. Samples and populations. Review - order of operations. Review - order of operations

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

The Normal Probability Distribution

The "bell-shaped" curve, or normal curve, is a probability distribution that describes many real-life situations.

MATH 264 Problem Homework I

Transcription:

Chapter 8 Estimation There are two important forms of statistical inference: estimation (Confidence Intervals) Hypothesis Testing Statistical Inference drawing conclusions about populations based on samples of the population parameter unknown : μ, σ A parameter is a number that describes the population of interest. Since we usually cannot examine the entire population of interest, parameters are generally unknown. statistic known:, s A statistic is a number that is computed from sample data. We often use a statistic to estimate an unknown population parameter. sample statistic and population parameter Notation μ = population mean (unknown) = sample mean (computed from the data we have on hand from a sample of the population) σ = population standard deviation (unknown) s = sample standard deviation (computed from the data we have on hand from a sample of the population) estimates μ s estimates Point Estimate An estimate of a population parameter given by a single number. A point estimation of a population parameter is an estimate of the parameter using a single number. is a point estimate of μ S is a point estimate of 1 P a g e

Sampling Variability Example: What is the average weight of women 5 1 tall between the ages of 21-45? The American Medical Association takes a sample of 1000 women between the ages of 21-45 years and with height 5 1 They find that that the mean weight is = 13 2 lbs Question: If our goal is to estimate the mean weight of the population, how should we deal with the fact that different samples yield different estimates of the mean weight?? The basic fact that the value of a sample statistic varies in (hypothetical) repeated random sampling is called sampling variability. E ample: If another sample of 1000 women was chosen from the same population of 5 1 women between 21-45 years old, the value of would almost certainly be different something other than 136.2 lbs. Answer: Allow a margin of error that takes sampling variability into account. Confidence Intervals Confidence intervals are generally of the form point estimate ± margin of error ± margin of error Question: Why should we estimate μ, true population mean, with an interval of numbers? Why not just use the point estimate as our estimate of μ? Answer: (1) Using an interval estimate (i.e. confidence interval) takes sampling variability into consideration, and (2) we can attach a level of confidence to an interval estimate which we cannot do with a point estimate. A confidence interval for μ has two parts: 1) A margin of error says how close lies to μ 2) A level of confidence says what percent of all possible samples satisfy the margin of error. 2 P a g e

A confidence level, c, is any value between 0 and 1 that corresponds to the area under the standard normal curve between zc and +zc. Margin of Error Even if we take a very large sample size, may differ from µ. Critical Values For an interval of numbers there is a left endpoint and a right endpoint. (lower bound, upper bound) For a confidence level c, the critical value is the number such that the area under the standard normal curve between and equals c (your confidence level) Example - Which of the following correctly expresses the confidence interval shown below? a)p 0.99 < z < 1 = 2.58 b) P 2.58 < z < 2.58 = 0.99 c) P 0 < z < 0.99 = 5.16 d) P 2.58 < z < 2.58 = 0.01 3 P a g e

Common Confidence Levels Area = 0.9 or 90% Area = 0.95 or 95% -1.645 1.645-1.96 1.96 Area = 0.98 or 98% Area = 0.99 or 99% -2.33 2.33-2.58 2.58 Notice as the confidence level increase the interval gets wider When constructing a confidence interval, you must decide on the risk you are willing to take of being wrong. A confidence interval is wrong if it doesn t contain the true value of the population parameter. 99% confidence ==> 1% chance of being wrong 95% confidence ==> 5% chance of being wrong 90% confidence ==> 10% chance of being wrong How confidence intervals behave High confidence says that our method almost always gives correct answers. A small margin of error says that we have pinned down the parameter quite precisely. 4 P a g e

The margin of error determines the width of the confidence interval. 1) The margin of error is larger for higher confidence levels. To obtain a smaller margin of error from the same data, you must be willing to accept lower confidence. 2) The margin of error is larger for smaller sample sizes. 3) The margin of error is larger for populations that have lots of variability. Interpreting confidence levels Take 95% confidence, for example. Practical Interpretation: We are confident that the is between and,on average. Statistical Interpretation: If we repeatedly take random samples of size n from the population and construct 95% confidence intervals for each sample, then in the long run 95% of these confidence intervals will capture the true value of μ Our sample is either one of the 95% for which the calculated interval captures μ, or one of the unlucky 5% that do not The idea of sampling distribution Take many samples from the same population. Collect the s from all the samples Display the distribution of the histogram, for example). s (in a The histogram will be bell-shaped and symmetric, centered at the population mean. The sampling distribution of distribution! is a normal 5 P a g e

Sampling Distribution The sampling distribution of a statistic is the distribution of values taken by the statistic in all possible samples of the same size from the same population. ampling Distribution of : eans that if we collect all the possible samples of si e n from the population of interest and collect all of the s and plot them in a histogram, that histogram will be bell shaped, symmetric and centered at μ with standard deviation σ/ n Facts about the sampling distribution of These facts describe how varies from one sample to the next: 1) In repeated sampling, will sometimes fall above the true value of μ and sometimes below it, but there is no systematic tendency for to overestimate or underestimate μ The sampling distribution of is centered at μ, and so is called an unbiased estimator of μ 2) The values of from larger samples are less variable than those from smaller samples. The standard deviation of the sampling distribution of is Mean of x : x x Standard Deviation of x: x x n ( ) Confidence interval for μ for 95% Confidence σ is known σ is unknown. ( ). ( ) If σ is known then we use Z c If σ is unknown then we have s, then we use t c 6 P a g e

Maximal Margin of Error Since µ is unknown, the margin of error µ is unknown. Using confidence level c, we can say that differs from µ by at most:.. = = The Probability Statement ( ( < < ) = < < ) = In words, c is the probability that the sample mean,, will differ from the population mean, µ, by at most, margin of error. 7 P a g e

Confidence Intervals A c confidence interval for µ is an interval computer from sample data in such a way that c is the probability of generating an interval containing the actual value of µ Example - For a population of domesticated geese, the standard deviation of the mass is 1.3 kg. A sample of 45 geese has a mean mass of 5.7 kg. Find the confidence interval for the population mean at the 95% confidence level. Notice that we have (population standard deviation) so we can use Z c Calculator: STAT, TEST, 7: Z-Interval, Choose STAT Critical Thinking Since is a random variable, so are the endpoints. After the confidence interval is numerically fixed for a specific sample, it either does or does not contain µ. If we repeated the confidence interval process by taking multiple random samples of equal size, some intervals would capture µ and some would not! The equation ( all intervals containing µ will be c. < < ) = states that the proportion of 8 P a g e

Estimating µ When σ is Unknown In most cases, researchers will have to estimate σ with s (the standard deviation of the sample). The sampling distribution for will follow a non-normal distribution called the Student s t distribution. The t Distribution Assume that x has a normal distribution with mean μ. For samples of size n with sample mean and sample standard deviation s, the t variable is = degrees of freedom = n-1 has a tudent s t distribution with Properties of the t-distribution bell shaped and symmetric and centered at zero there is more area in the tails in the t-distribution than there is in the N(0,1) distribution the t-distribution is really a family of density curves such that each one is significantly different depending on the degrees of freedom as degrees of freedom gets larger and larger the t-density curve looks more and more identical to the N(0,1) For different levels of Confidence: For 95% Confidence Interval For 90% Confidence Interval For 99% Confidence Interval 9 P a g e

Example -Find the t-value for the following data: a). 27.62 b). 0.11 x 55.2, 58.1, s 4.2, n 40 c). 8.95 d). 4.37 To find values of t c you use Table 6 of Appendix II to find the critical values t c for a confidence level c. Degrees of freedom, df, are the row headings. Confidence levels, c, are the column headings Maximal Margin of Error If we are using the t distribution: 10 P a g e

What Distribution Should We Use? 11 P a g e

Notes on Calculator: For Normal Distribution For Proportion σ is unknown σ is known Test Statistic t obs z obs Calculator Stat Test T-test Stat Test Z-test Stat Test 1-Prop Z-test Confidence ( ) ( σ Interval ) Calculator Stat Test T-Interval Stat Test Z-Interval Stat Test 1-Prop Z-Interval Example - A study was done to determine the average number of homes that a homeowner owns in his or her lifetime. For the 60 homeowners surveyed, the sample average was 4.2 and the sample standard deviation was 2.1. Calculate the 95% confidence interval for the true average number of homes that a person owns in his or her lifetime. Example - A study was done to determine the average number of homes that a homeowner owns in his or her lifetime. Suppose that this time sigma is known to be 2.8. Assume that we collect a sample of 60 homeowners and compute the sample average to be 4.2. Calculate the 95% confidence interval for the true average number of homes that a person owns in his or her lifetime. 12 P a g e

Example: The numbers of advertisements seen or heard in one week for 30 randomly selected people in the United States are listed below. Construct a 95% confidence interval for the true mean number of advertisements. 598 494 441 595 728 690 684 486 735 808 481 298 135 846 764 317 649 732 582 677 734 588 590 540 673 727 545 486 702 703 13 P a g e

Estimating p in the Binomial Distribution We will use large-sample methods in which the sample size, n, is fixed. We assume the normal curve is a good approximation to the binomial distribution if both np > 5 and nq = n(1 p) > 5. Point Estimates in the Binomial Case Margin of Error The magnitude of the difference between the actual value of p and its estimate is the margin of error. The Distribution of For large samples, the distribution is well approximated by a normal distribution. A Probability Statement With confidence level c, as before. 14 P a g e

Example - Suppose that 800 students were randomly selected from the student body of 20,000 and are given shots to prevent a certain type of flu. All 800 students were exposed to the flu, and 600 of them did not get the flu. Let p represent the probability that the shot will be successful for any single student selected at random from the entire population of 20,000. a) What are the point estimates for p and q? What is the value of n and r? b) Is the number of trials large enough to justify a normal approximation to the binomial? 15 P a g e

c) Find a 99% confidence interval for p. Calculator: STAT, TESTS, A: 1- Prop Z Int. The value of x = r Example: A survey of 300 fatal accidents showed that 123 were alcohol related. Construct a 98% confidence interval for the proportion of fatal accidents that were alcohol related. 16 P a g e

Choosing Sample Sizes When designing statistical studies, it is good practice to decide in advance: The confidence level The maximal margin of error Then, we can calculate the required minimum sample size to meet these goals. Sample Size for Estimating μ If σ is unknown, use σ from a previous study or conduct a pilot study to obtain s. Always round n up to the next integer!! Sample Size for Estimating If we have a preliminary estimate for p, use the following. If we have no preliminary estimate for p, use the following modification: Example A wildlife study is designed to find the mean weight of salmon caught be an Alaskan fishing company. A preliminary study of a random sample of 50 salmon showed = 2.15 pounds. How large a sample should be taken to be 90% confident that the sample mean, is within 0.20 pounds of the true mean weight μ? 17 P a g e

Example: A researcher wishes to estimate the number of households with two cars. How large a sample is needed in order to be 98% confident that the sample proportion will not differ from the true proportion by more than 5%? A previous study indicates that the proportion of households with two cars is 19%. 18 P a g e