Confidence Intervals for Paired Means with Tolerance Probability

Similar documents
Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Confidence Intervals for One Variance with Tolerance Probability

Confidence Intervals for Pearson s Correlation

Confidence Intervals for One-Sample Specificity

Tests for One Variance

Two-Sample Z-Tests Assuming Equal Variance

Confidence Intervals for an Exponential Lifetime Percentile

Tolerance Intervals for Any Data (Nonparametric)

Tests for Two Variances

Tests for Paired Means using Effect Size

Non-Inferiority Tests for the Ratio of Two Means in a 2x2 Cross-Over Design

Non-Inferiority Tests for the Ratio of Two Means

Non-Inferiority Tests for Two Means in a 2x2 Cross-Over Design using Differences

Two-Sample T-Tests using Effect Size

PASS Sample Size Software

Confidence Intervals for One Variance using Relative Error

Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X

Tests for Intraclass Correlation

Tests for the Difference Between Two Linear Regression Intercepts

Tests for the Difference Between Two Poisson Rates in a Cluster-Randomized Design

Superiority by a Margin Tests for the Ratio of Two Proportions

Tests for Two ROC Curves

Mixed Models Tests for the Slope Difference in a 3-Level Hierarchical Design with Random Slopes (Level-3 Randomization)

Equivalence Tests for the Difference of Two Proportions in a Cluster- Randomized Design

Equivalence Tests for the Odds Ratio of Two Proportions

Tests for Two Means in a Cluster-Randomized Design

Non-Inferiority Tests for the Ratio of Two Proportions

Equivalence Tests for One Proportion

Conditional Power of One-Sample T-Tests

Equivalence Tests for Two Correlated Proportions

Conover Test of Variances (Simulation)

Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design

Tests for the Matched-Pair Difference of Two Event Rates in a Cluster- Randomized Design

Conditional Power of Two Proportions Tests

Two-Sample T-Test for Superiority by a Margin

Two-Sample T-Test for Non-Inferiority

Tests for Two Exponential Means

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Confidence Intervals Introduction

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Group-Sequential Tests for Two Proportions

Non-Inferiority Tests for the Odds Ratio of Two Proportions

Tests for Two Means in a Multicenter Randomized Design

Tests for Two Independent Sensitivities

Non-Inferiority Tests for the Difference Between Two Proportions

Mendelian Randomization with a Binary Outcome

Tests for Multiple Correlated Proportions (McNemar-Bowker Test of Symmetry)

STAT Chapter 7: Confidence Intervals

Statistics for Business and Economics

Point-Biserial and Biserial Correlations

Mendelian Randomization with a Continuous Outcome

Statistics for Managers Using Microsoft Excel 7 th Edition

Chapter 8 Statistical Intervals for a Single Sample

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

8.1 Estimation of the Mean and Proportion

GETTING STARTED. To OPEN MINITAB: Click Start>Programs>Minitab14>Minitab14 or Click Minitab 14 on your Desktop

ESG Yield Curve Calibration. User Guide

1 Inferential Statistic

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

R & R Study. Chapter 254. Introduction. Data Structure

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Finite Element Method

Risk Analysis. å To change Benchmark tickers:

One Proportion Superiority by a Margin Tests

Estimation and Confidence Intervals

MgtOp S 215 Chapter 8 Dr. Ahn

Uniform Probability Distribution. Continuous Random Variables &

STATISTICS and PROBABILITY

Monte Carlo Simulation (General Simulation Models)

Statistics for Managers Using Microsoft Excel/SPSS Chapter 6 The Normal Distribution And Other Continuous Distributions

General Instructions

Confidence Intervals for the Median and Other Percentiles

Statistical Methods in Practice STAT/MATH 3379

Solutions of Equations in One Variable. Secant & Regula Falsi Methods

Accounting with MYOB Accounting Plus v18. Chapter Four Accounts Payable

Statistics 13 Elementary Statistics

Discrete Random Variables

5/5/2014 یادگیري ماشین. (Machine Learning) ارزیابی فرضیه ها دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی. Evaluating Hypothesis (بخش دوم)

Chapter 5 Discrete Probability Distributions. Random Variables Discrete Probability Distributions Expected Value and Variance

Gamma Distribution Fitting

Getting started with WinBUGS

Chapter 6 Analyzing Accumulated Change: Integrals in Action

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

Chapter 8 Estimation

Confidence Intervals. σ unknown, small samples The t-statistic /22

Descriptive Statistics

One-Sample Cure Model Tests

Sampling and sampling distribution

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

You should already have a worksheet with the Basic Plus Plan details in it as well as another plan you have chosen from ehealthinsurance.com.

Estimation and Confidence Intervals

NCSS Statistical Software. Reference Intervals

Chapter 7.2: Large-Sample Confidence Intervals for a Population Mean and Proportion. Instructor: Elvan Ceyhan

Transcription:

Chapter 497 Confidence Intervals for Paired Means with Tolerance Probability Introduction This routine calculates the sample size necessary to achieve a specified distance from the paired sample mean difference to the confidence limit(s) with a given tolerance probability at a stated confidence level for a confidence interval about a single mean difference when the underlying data distribution is normal. Technical Details For a paired sample mean difference from a normal distribution with unknown variance, a two-sided, 100(1 α)% confidence interval is calculated by X t ± ˆ σ n / 2, n 1 where X is the mean of the paired differences of the sample, and σ diff is the known standard deviation of paired sample differences. A one-sided 100(1 α)% upper confidence limit is calculated by X t +, n 1 Similarly, the one-sided 100(1 α)% lower confidence limit is X t, n 1 Each confidence interval is calculated using an estimate of the mean difference plus and/or minus a quantity that represents the distance from the mean difference to the edge of the interval. For two-sided confidence intervals, this distance is sometimes called the precision, margin of error, or half-width. We will label this distance, D. The basic equation for determining sample size when D has been specified is t D = n ˆ σ n ˆ σ n ˆ / 2, n 1σ 497-1

Solving for n, we obtain t n = / 2, n 1σ This equation can be solved for any of the unknown quantities in terms of the others. The value α/2 is replaced by α when a one-sided interval is used. There is an additional subtlety that arises when the standard deviation is to be chosen for estimating sample size. The sample sizes determined from the formula above produce confidence intervals with the specified widths only when the future sample has a sample standard deviation of differences that is no greater than the value specified. As an example, suppose that 15 pairs of individuals are sampled in a pilot study, and a standard deviation estimate of 3.5 is obtained from the sample. The purpose of a later study is to estimate the mean difference within 10 units. Suppose further that the sample size needed is calculated to be 57 pairs using the formula above with 3.5 as the estimate for the standard deviation. The sample of size 57 pairs is then obtained from the population, but the standard deviation of the 57 paired differences turns out to be 3.9 rather than 3.5. The confidence interval is computed and the distance from the mean difference to the confidence limits is greater than 10 units. This example illustrates the need for an adjustment to adjust the sample size such that the distance from the mean difference to the confidence limits will be below the specified value with known probability. Such an adjustment for situations where a previous sample is used to estimate the standard deviation is derived by Harris, Horvitz, and Mood (1948) and discussed in Zar (1984) and Hahn and Meeker (1991). The adjustment is D ˆ 2 t ˆ / 2, n 1σ n = F1 γ ; n 1, m 1 D where 1 γ is the probability that the distance from the mean difference to the confidence limit(s) will be below the specified value, and m is the sample size in the previous paired sample that was used to estimate the standard deviation. The corresponding adjustment when no previous sample is available is discussed in Kupper and Hafner (1989) and Hahn and Meeker (1991). The adjustment in this case is t n = 2 2 χ1 n 1 n 1 ˆ / 2,n 1 γ, D σ where, again, 1 γ is the probability that the distance from the mean difference to the confidence limit(s) will be below the specified value. Each of these adjustments accounts for the variability in a future estimate of the standard deviation. In the first adjustment formula (Harris, Horvitz, and Mood, 1948), the distribution of the standard deviation is based on the estimate from a previous paired sample. In the second adjustment formula, the distribution of the standard deviation is based on a specified value that is assumed to be the population standard deviation of differences. 2 497-2

Finite Population Size The above calculations assume that samples are being drawn from a large (infinite) population. When the population is of finite size (N), an adjustment must be made. The adjustment reduces the standard deviation as follows: σ finite = σ 1 n N This new standard deviation replaces the regular standard deviation in the above formulas. Confidence Level The confidence level, 1 α, has the following interpretation. If thousands of samples of n items are drawn from a population using simple random sampling and a confidence interval is calculated for each sample, the proportion of those intervals that will include the true population mean difference is 1 α. Procedure Options This section describes the options that are specific to this procedure. These are located on the Design tab. For more information about the options of other tabs, go to the Procedure Window chapter. Design Tab The Design tab contains most of the parameters and options that you will be concerned with. Solve For Solve For This option specifies the parameter to be solved for from the other parameters. One-Sided or Two-Sided Interval Interval Type Specify whether the interval to be used will be a one-sided or a two-sided confidence interval. Population Population Size This is the number of pairs in the population. Usually, you assume that samples are drawn from a very large (infinite) population. Occasionally, however, situations arise in which the population of interest is of limited size. In these cases, appropriate adjustments must be made. This option sets the population size. 497-3

Confidence and Tolerance Confidence Level (1 Alpha) The confidence level, 1 α, has the following interpretation. If thousands of samples of n items are drawn from a population using simple random sampling and a confidence interval is calculated for each sample, the proportion of those intervals that will include the true population mean difference is 1 α. Often, the values 0.95 or 0.99 are used. You can enter single values or a range of values such as 0.90 0.95 0.99 or 0.90 to 0.99 by 0.01. Tolerance Probability This is the probability that a future interval with sample size N and the specified confidence level will have a distance from the mean paired difference to the limit(s) that is less than or equal to the distance specified. If a tolerance probability is not used, as in the 'Confidence Intervals for Paired Means' procedure, the sample size is calculated for the expected distance from the mean paired difference to the limit(s), which assumes that the future standard deviation will also be the one specified. Using a tolerance probability implies that the standard deviation of the future sample will not be known in advance, and therefore, an adjustment is made to the sample size formula to account for the variability in the standard deviation. Use of a tolerance probability is similar to using an upper bound for the standard deviation in the 'Confidence Intervals for Paired Means' procedure. Values between 0 and 1 can be entered. The choice of the tolerance probability depends upon how important it is that the distance from the interval limit(s) to the mean difference is at most the value specified. You can enter a range of values such as 0.70 0.80 0.90 or 0.70 to 0.95 by 0.05. Sample Size (Number of Pairs) N (Sample Size or Number of Pairs) Enter one or more values for the sample size. This is the number of pairs selected at random from the population to be in the study. You can enter a single value or a range of values. Precision Distance from Mean erence to Limit(s) This is the distance from the confidence limit(s) to the mean paired difference. For two-sided intervals, it is also known as the precision, half-width, or margin of error. You can enter a single value or a list of values. The value(s) must be greater than zero. 497-4

Standard Deviation of Paired erences Standard Deviation Source This procedure permits two sources for estimates of the standard deviation of paired differences: S is a Population Standard Deviation This option should be selected if there is no previous sample that can be used to obtain an estimate of the standard deviation of the paired differences. In this case, the algorithm assumes that future sample obtained will be from a population with standard deviation S. S from a Previous Sample This option should be selected if the estimate of the standard deviation of the paired differences is obtained from a previous random sample from the same distribution as the one to be sampled. The sample size of the previous sample must also be entered under 'Sample Size of Previous Sample'. Standard Deviation of Paired erences S is a Population Standard Deviation S (Standard Deviation) Enter an estimate of the standard deviation of paired differences (must be positive). In this case, the algorithm assumes that future samples obtained will be from a population with standard deviation S. One common method for estimating the standard deviation is the range divided by 4, 5, or 6. You can enter a range of values such as 1 2 3 or 1 to 10 by 1. Press the Standard Deviation Estimator button to load the Standard Deviation Estimator window. Standard Deviation of Paired erences S from a Previous Sample S (SD Estimated from a Previous Sample) Enter an estimate of the standard deviation of paired differences from a previous (or pilot) study. This value must be positive. A range of values may be entered. Press the Standard Deviation Estimator button to load the Standard Deviation Estimator window. Sample Size (# of Pairs) of Previous Sample Enter the sample size (number of pairs) that was used to estimate the standard deviation entered in S (SD Estimated from a Previous Sample). This value is entered only when 'Standard Deviation Source:' is set to 'S from a Previous Sample'. 497-5

Example 1 Calculating Sample Size A researcher would like to estimate the mean difference in weight following a specific diet with 95% confidence. It is very important that the mean difference is estimated within 5 lbs. Data available from a previous study are used to provide an estimate of the standard deviation. The estimate of the standard deviation of before/after differences is 16.7 lbs, from a sample of size 17 individuals. The goal is to determine the sample size necessary to obtain a two-sided confidence interval such that the mean weight is estimated within 5 lbs. Tolerance probabilities of 0.70 to 0.95 will be examined. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the procedure window by expanding Means, then expanding Paired Means, then clicking on Confidence Interval, and then clicking on. You may then make the appropriate entries as listed below, or open Example 1 by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Sample Size Interval Type... Two-Sided Population Size... Infinite Confidence Level... 0.95 Tolerance Probability... 0.70 to 0.95 by 0.05 Distance from Mean to Limit(s)... 5 Standard Deviation Source... S from a Previous Sample S... 16.7 Sample Size of Previous Sample... 17 Annotated Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results Numeric Results for Two-Sided Confidence Intervals Target Actual Sample Dist from Dist from Standard Confidence Size Mean Mean Deviation Tolerance Level (N) to Limits to Limits (S) Probability 0.95 58 5.000 4.970 16.700 0.70 0.95 61 5.000 4.996 16.700 0.75 0.95 66 5.000 4.967 16.700 0.80 0.95 71 5.000 4.985 16.700 0.85 0.95 79 5.000 4.973 16.700 0.90 0.95 92 5.000 4.981 16.700 0.95 Sample size for estimate of S from previous paired sample = 17. References Hahn, G. J. and Meeker, W.Q. 1991. Statistical Intervals. John Wiley & Sons. New York. Zar, J. H. 1984. Biostatistical Analysis. Second Edition. Prentice-Hall. Englewood Cliffs, New Jersey. Harris, M., Horvitz, D. J., and Mood, A. M. 1948. 'On the Determination of Sample Sizes in Designing Experiments', Journal of the American Statistical Association, Volume 43, No. 243, pp. 391-402. 497-6

Report Definitions Confidence level is the proportion of confidence intervals (constructed with this same confidence level, sample size, etc.) that would contain the population mean difference. N is the size of the sample (or number of pairs) drawn from the population. Dist from Mean to Limit is the distance from the confidence limit(s) to the mean paired difference. For two-sided intervals, it is also know as the precision, half-width, or margin of error. Target Dist from Mean to Limit is the value of the distance that is entered into the procedure. Actual Dist from Mean to Limit is the value of the distance that is obtained from the procedure. The standard deviation (S) is the standard deviation of the paired differences. Tolerance Probability is the probability that a future interval with sample size N and corresponding confidence level will have a distance from the mean difference to the limit(s) that is less than or equal to the specified distance. Summary Statements The probability is 0.70 that a sample size of 58 will produce a two-sided 95% confidence interval with a distance from the mean paired difference to the limits that is less than or equal to 4.970 if the population standard deviation is estimated to be 16.700 by a previous paired sample of size 17. This report shows the calculated sample size for each of the scenarios. Plots Section This plot shows the sample size versus the tolerance probability. 497-7

Example 2 Validation This procedure uses the same mechanics as the Confidence Intervals for One Mean with Tolerance Probability procedure. The validation of this procedure is given in Examples 2, 3, and 4 of the Confidence Intervals for One Mean with Tolerance Probability procedure. 497-8