Estimation Y 3. Confidence intervals I, Feb 11,

Similar documents
Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Confidence Intervals Introduction

STAT Chapter 7: Confidence Intervals

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

Statistics 13 Elementary Statistics

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Chapter 8. Introduction to Statistical Inference

Chapter 8 Statistical Intervals for a Single Sample

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

Chapter 7 Sampling Distributions and Point Estimation of Parameters

8.1 Estimation of the Mean and Proportion

Chapter 7.2: Large-Sample Confidence Intervals for a Population Mean and Proportion. Instructor: Elvan Ceyhan

Stat 213: Intro to Statistics 9 Central Limit Theorem

Chapter 5. Sampling Distributions

Simple Random Sampling. Sampling Distribution

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Section 7-2 Estimating a Population Proportion

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

BIO5312 Biostatistics Lecture 5: Estimations

5.3 Interval Estimation

Lecture 9 - Sampling Distributions and the CLT

Sampling and sampling distribution

Central Limit Theorem

Statistics for Business and Economics

Data Analysis and Statistical Methods Statistics 651

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.

FEEG6017 lecture: The normal distribution, estimation, confidence intervals. Markus Brede,

Class 12. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Chapter Four: Introduction To Inference 1/50

Point Estimation. Principle of Unbiased Estimation. When choosing among several different estimators of θ, select one that is unbiased.

Making Sense of Cents

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

. (i) What is the probability that X is at most 8.75? =.875

MATH 3200 Exam 3 Dr. Syring

Statistics and Probability

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

Lecture 12: The Bootstrap

Data Analysis and Statistical Methods Statistics 651

Section 7.2. Estimating a Population Proportion

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

Chapter 4: Estimation

Lecture 6: Confidence Intervals

Applied Statistics I

The Central Limit Theorem (Solutions) COR1-GB.1305 Statistics and Data Analysis

Statistical Intervals (One sample) (Chs )

Two Populations Hypothesis Testing

SAMPLING DISTRIBUTIONS. Chapter 7

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

The Central Limit Theorem

Statistics Class 15 3/21/2012

Chapter 7 - Lecture 1 General concepts and criteria

Business Statistics 41000: Probability 4

Data Analysis and Statistical Methods Statistics 651

CHAPTER 5 Sampling Distributions

Lecture 10 - Confidence Intervals for Sample Means

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

Multiple-Choice Questions

Stat 139 Homework 2 Solutions, Fall 2016

χ 2 distributions and confidence intervals for population variance

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

4.2 Probability Distributions

CHAPTER 5 SAMPLING DISTRIBUTIONS

Lecture 2 INTERVAL ESTIMATION II

HOMEWORK: Due Mon 11/8, Chapter 9: #15, 25, 37, 44

1 Introduction 1. 3 Confidence interval for proportion p 6

Statistical Methods in Practice STAT/MATH 3379

Chapter 5. Statistical inference for Parametric Models

STATISTICS and PROBABILITY

1. Variability in estimates and CLT

Chapter 8 Estimation

6.1, 7.1 Estimating with confidence (CIS: Chapter 10)

Sampling & Confidence Intervals

Back to estimators...

Lecture 9 - Sampling Distributions and the CLT. Mean. Margin of error. Sta102/BME102. February 6, Sample mean ( X ): x i

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

Introduction to Statistical Data Analysis II

Continuous Probability Distributions & Normal Distribution

STATISTICS and PROBABILITY

Lecture 6: Chapter 6

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Lecture 2. Probability Distributions Theophanis Tsandilas

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

Y i % (% ( ( ' & ( # % s 2 = ( ( Review - order of operations. Samples and populations. Review - order of operations. Review - order of operations

MVE051/MSG Lecture 7

Data Analysis and Statistical Methods Statistics 651

Statistics for Managers Using Microsoft Excel 7 th Edition

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Simulation Lecture Notes and the Gentle Lentil Case

Statistics for Business and Economics: Random Variables:Continuous

σ 2 : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

A point estimate is a single value (statistic) used to estimate a population value (parameter).

Introduction to Probability and Inference HSSP Summer 2017, Instructor: Alexandra Ding July 19, 2017

Learning Objectives for Ch. 7

Transcription:

Estimation Example: Cholesterol levels of heart-attack patients Data: Observational study at a Pennsylvania medical center blood cholesterol levels patients treated for heart attacks measurements 2, 4, and 14 days after the attack Id Y 1 Y 2 Y 3 Id Y 1 Y 2 Y 3 1 270 218 156 15 294 240 264 2 236 234 193 16 282 294 220 3 210 214 242 17 234 220 264 4 142 116 120 18 224 200 213 5 280 200 181 19 276 220 188 6 272 276 256 20 282 186 182 7 160 146 142 21 360 352 294 8 220 182 216 22 310 202 214 9 226 238 248 23 280 218 170 10 242 288 298 24 278 248 198 11 186 190 168 25 288 278 236 12 266 236 236 26 288 248 256 13 206 244 238 27 244 270 280 14 318 258 200 28 236 242 204 Aim: Make inference on distribution of cholesterol level 14 days after the attack: Y 3 decrease in cholesterol level: D = Y 1 Y 3 relative decrease in cholesterol level: R = Y 1 Y 3 Y 3 Confidence intervals I, Feb 11, 2004-1 -

Estimation Data: d 1,..., d 28 observed decrease in cholesterol level In this example, parameters of interest might be µ D = E(D) the mean decrease in cholesterol level, σ 2 D = var(d) the variation of the cholesterol level, p D = P(D 0) probability of no decrease in cholesterol level These parameters are naturally estimated by the following sample statistics: ˆµ D = 1 n d i (sample mean) n ˆσ 2 D = 1 n i=1 n (d i d) 2, (sample mean) i=1 ˆp D = #{d i d i 0} (sample proportion) n Such statistics are point estimators since they estimate the corresponding parameter by a single numerical value. Point estimates provide no information about their chance variation. Estimates without an indication of their variability are of limited value. Confidence intervals I, Feb 11, 2004-2 -

Confidence Intervals for the Mean Recall: CLT for the sample mean: For large n we have X N ( ) µ, σ2 n 68-95-99 rule: With 95% probability the sample differs from its mean µ by less that two standard deviations. More precisely, we have P ( µ 1.96 σ n X µ + 1.96 σ n ) = 0.95, or equivalently, after rearranging the terms, P ( X 1.96 σ n µ X + 1.96 σ n ) = 0.95. Interpretation: There is 95% probability that the random interval X 1.96 σ n, X + 1.96 σ n will cover the mean µ. Example: Cholesterol levels d = 36.89, σ = 51.00, n = 28. Therefore, the 95% confidence interval for µ is 18.00, 55.78. Confidence intervals I, Feb 11, 2004-3 -

Confidence Intervals for the Mean Assumption: The population standard deviation σ is known. In the next lecture, we will drop this unrealistic assumption. Assumption is approximately satisfied for large sample sizes, since then ˆσ σ by the law of large numbers. Definition: Confidence interval for µ (σ known) The interval X zα/2 σ n, X + z α/2 σ n is called a 1 α confidence interval for the population mean µ. (1 α) is the confidence level. For large sample sizes n, an approximate (1 α) confidence interval for µ is given by X z α/2 ˆσ n, X + z α/2 ˆσ n. Here, z α is the α-critical value of the standard normal distribution: z α has area α to its right Φ(z α ) = 1 α f(x) 0.4 0.3 0.2 0.1 0.0 3 2 1 α 0 1 z 2 z α 3 Confidence intervals I, Feb 11, 2004-4 -

Confidence Interval for the Mean Example: Community banks Community banks are banks with less than a billion dollars of assets. Approximately 7500 such banks in the United States. Annual survey of the Community Bankers Council of the American Bankers Association (ABA) Population: Community banks in the United States. Variable of interest: Total assets of community banks. Sample size: n = 110 Sample mean: X = 220 millions of dollars Sample standard deviation: SD = 161 millions of dollars Histogram of sampled values: 20 Assets of Community Banks in the U.S. (sample of 110 community banks) 15 Frequency 10 5 0 0 100 200 300 400 500 600 700 800 900 1000 Assets (in millions of dollars) Suppose we want to give a 95% confidence interval for the mean total assets of all community banks in the United States. α = 0.05, z α/2 = 1.96 A 95% confidence interval for the mean assets (in millions of dollars) is 161 161 220 1.96, 220 + 1.96 190, 250. 110 110 Confidence intervals I, Feb 11, 2004-5 -

Example: Cholesterol levels Sample Size Suppose we want a 99% confidence interval for the decrease in cholesterol level: α = 0.01, z 0.005 = 2.58 The 99% confidence interval for µ D is 36.89 2.58 50.93, 36.89 + 2.58 50.93 12.06, 61.72. 28 28 Note: If we raise the confidence level, the confidence interval becomes wider. Suppose we want to obtain increase the confidence level without increasing the error of estimation (indicated by the half-width of the confidence interval). For this we have to increase the sample size n. Question: What sample size n is needed to estimate the mean decrease in cholesterol with error e = 20 and confidence level 99%? The error (half-width of the confidence interval) is e = z α/2 σ n Therefore the sample size n e needed is given by ( zα/2 ) σ 2 ( ) 2.58 50.93 2 n e = = 43.16, e 20 that is, a sample of 44 patients is needed to estimate µ D with error e = 20 and 99% confidence. Confidence intervals I, Feb 11, 2004-6 -

Estimation of the Mean Example: Banks loan-to-deposit ratio The ABA survey of community banks also asked about the loan-to-deposit ratio (LTDR), a bank s total loans as a percent of its total deposits. Sample statistics: n = 110 ˆµ LTDR = 76.7 ˆσ LTDR = 12.3 Frequency 18 15 12 9 6 3 Loan To Deposit Ratio of Community Banks (sample of 110 community banks) 0 50 60 70 80 90 100 110 120 LTDR (in %) Construction of 95% confidence interval: α = 0.05, z α/2 = 1.96 Standard error σ X = σ LT DR n = 1.17 95% confidence interval for µ LTDR : σ LT DR X zα/2, X σ LT DR + z α/2 = 74.4, 79.0 n n To get an estimation with error e = 3.0 (half-width of confidence interval) it suffices to sample n e banks, ( ) zα/2 σ 2 ( ) 2 LT DR 1.96 12.3 n e = = 64.6. e 3.0 Thus a sample of n e = 65 banks it sufficient. Confidence intervals I, Feb 11, 2004-7 -