Two-Sample T-Tests using Effect Size

Similar documents
Tests for Paired Means using Effect Size

Two-Sample Z-Tests Assuming Equal Variance

Tests for Two Variances

Superiority by a Margin Tests for the Ratio of Two Proportions

Non-Inferiority Tests for the Odds Ratio of Two Proportions

Tests for One Variance

Non-Inferiority Tests for the Ratio of Two Proportions

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Non-Inferiority Tests for the Ratio of Two Means

Equivalence Tests for the Odds Ratio of Two Proportions

Tests for Two Independent Sensitivities

Tests for Intraclass Correlation

Tests for the Difference Between Two Linear Regression Intercepts

Tests for Two Means in a Cluster-Randomized Design

Tests for Two Means in a Multicenter Randomized Design

Non-Inferiority Tests for the Difference Between Two Proportions

Tests for Two Exponential Means

Equivalence Tests for the Difference of Two Proportions in a Cluster- Randomized Design

Mixed Models Tests for the Slope Difference in a 3-Level Hierarchical Design with Random Slopes (Level-3 Randomization)

Group-Sequential Tests for Two Proportions

Tests for the Difference Between Two Poisson Rates in a Cluster-Randomized Design

Non-Inferiority Tests for Two Means in a 2x2 Cross-Over Design using Differences

PASS Sample Size Software

Confidence Intervals for Paired Means with Tolerance Probability

Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X

Mendelian Randomization with a Binary Outcome

Tests for Two ROC Curves

Equivalence Tests for One Proportion

Non-Inferiority Tests for the Ratio of Two Means in a 2x2 Cross-Over Design

Conditional Power of One-Sample T-Tests

Mendelian Randomization with a Continuous Outcome

Conditional Power of Two Proportions Tests

Tests for the Matched-Pair Difference of Two Event Rates in a Cluster- Randomized Design

Confidence Intervals for Pearson s Correlation

Tests for Multiple Correlated Proportions (McNemar-Bowker Test of Symmetry)

Conover Test of Variances (Simulation)

Equivalence Tests for Two Correlated Proportions

Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design

Two-Sample T-Test for Superiority by a Margin

Two-Sample T-Test for Non-Inferiority

Confidence Intervals for One-Sample Specificity

Tolerance Intervals for Any Data (Nonparametric)

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

One-Sample Cure Model Tests

Point-Biserial and Biserial Correlations

Confidence Intervals for an Exponential Lifetime Percentile

One Proportion Superiority by a Margin Tests

R & R Study. Chapter 254. Introduction. Data Structure

GETTING STARTED. To OPEN MINITAB: Click Start>Programs>Minitab14>Minitab14 or Click Minitab 14 on your Desktop

Confidence Intervals for One Variance with Tolerance Probability

Risk Analysis. å To change Benchmark tickers:

Two Populations Hypothesis Testing

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Confidence Intervals for One Variance using Relative Error

7.1 Comparing Two Population Means: Independent Sampling

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

Gamma Distribution Fitting

NCSS Statistical Software. Reference Intervals

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

The Two-Sample Independent Sample t Test

Lecture 35 Section Wed, Mar 26, 2008

Lecture 8: Single Sample t test

Chapter 6 Confidence Intervals

Tests for Two Correlations

Chapter 8 Statistical Intervals for a Single Sample

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Chapter 7. Inferences about Population Variances

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

The Two Sample T-test with One Variance Unknown

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER

Chapter 8 Estimation

σ 2 : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

Experimental Design and Statistics - AGA47A

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples

Mean GMM. Standard error

Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) Estimating Population Parameters

Confidence Intervals Introduction

Chapter 7.2: Large-Sample Confidence Intervals for a Population Mean and Proportion. Instructor: Elvan Ceyhan

Copyright 2005 Pearson Education, Inc. Slide 6-1

Chapter Seven: Confidence Intervals and Sample Size

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

Agresso User Manual Enquiries

Jacob: The illustrative worksheet shows the values of the simulation parameters in the upper left section (Cells D5:F10). Is this for documentation?

Confidence Intervals for the Median and Other Percentiles

Chapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.

Financial Econometrics Review Session Notes 4

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Some Discrete Distribution Families

Getting started with WinBUGS

Learning Objectives for Ch. 7

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Transcription:

Chapter 419 Two-Sample T-Tests using Effect Size Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the effect size is specified rather than the means and variance(s). The details of procedure are given in Cohen (1988). The design corresponding to this test procedure is sometimes referred to as a parallel-groups design. In this design, two groups from independent, normally distributed populations are compared by considering the difference in their means scaled be their common standard deviation. This procedure is specific to the two-sample t-test assuming equal variance. If the variances are known to be significantly different, this procedure can still be used if the group sample sizes are equal and the average of the variances is used. Test Assumptions When running a two-sample equal-variance t-test, the basic assumptions are that the distributions of the two populations are approximately normal, and that the variances of the two distributions are the same. If the variances are different, this procedure can still be used if the two group sample sizes are nearly equal. Test Procedure If we assume that μ 1 and μ represent the means of the two populations of interest and their common (unknown) standard deviation is σ, the effect size is represented by d where dd = μμ 1 μμ σσ The null hypothesis is H 0: d = 0 and the alternative hypothesis depends on the number of sides of the test: Two-Sided: H 1 : dd 0 or H 1 : μμ 1 μμ 0 Upper One-Sided: H 1 : dd > 0 or H 1 : μμ 1 μμ > 0 Lower One-Sided: H 1 : dd < 0 or H 1 : μμ 1 μμ < 0 A suitable Type I error probability (α) is chosen for the test, the data is collected, and a t-statistic is generated using the formula: t = ( n 1) s + ( n 1) 1 n 1 1 + n x 1 x s 1 1 + n1 n 419-1

This t-statistic follows a t distribution with n 1 + n degrees of freedom. The null hypothesis is rejected in favor of the alternative if, for H 1 : dd 0 or H 1 : μμ 1 μμ 0 for H 1 : dd > 0 or H 1 : μμ 1 μμ > 0 Or, for H 1 : dd < 0 or H 1 : μμ 1 μμ < 0 t < t α / or > t1 α / t > t 1 α, t < t α. t, Comparing the t-statistic to the cut-off t-value (as shown here) is equivalent to comparing the p-value to α. Power Calculation The power is calculated using the same formulation as in the Two-Sample T-Tests Assuming Equal Variances procedure with the modification that the σ used in that procedure is set equal to one. If the variances cannot be assumed to be equal, the modification suggested by Cohen (1988) is used. This modification is to substitute an average value of the two variances and then proceeding as if the variances were equal. The average value is computed using σσ = σσ 1 + σσ Cohen remarks that this method is only accurate if the two sample sizes are (nearly) equal. The Effect Size If we assume that μ 1 and μ represent the means of the two populations of interest and their common (unknown) standard deviation is σ, the effect size is represented by d where dd = μμ 1 μμ σσ Cohen (1988) proposed the following interpretation of the d values. A d near 0. is a small effect, a d near 0.5 is a medium effect, and a d near 0.8 is a large effect. These values for small, medium, and large effects are popular in the social sciences. However, this convention is not as popular among the medical sciences since the scale of the effect is left unstated which makes interpretation difficult. 419-

Procedure Options This section describes the options that are specific to this procedure. These are located on the Design tab. For more information about the options of other tabs, go to the Procedure Window chapter. Design Tab The Design tab contains most of the parameters and options that you will be concerned with. Solve For Solve For This option specifies the parameter to be solved for from the other parameters. The parameters that may be selected are Power, Sample Size, Effect Size, and Alpha. In most situations, you will likely select either Power or Sample Size. The Solve For parameter is the parameter that will be displayed on the vertical axis of any plots that are shown. Test Direction Alternative Hypothesis Specify whether the alternative hypothesis of the test is one-sided or two-sided. If a one-sided test is chosen, the hypothesis test direction is chosen based on whether the effect size is greater than or less than zero. Two-Sided Hypothesis Test H0: d = 0 vs. H1: d 0 One-Sided Hypothesis Tests Upper: H0: d 0 vs. H1: d > 0 Lower: H0: d 0 vs. H1: d < 0 Power and Alpha Power Power is the probability of rejecting the null hypothesis when it is false. Power is equal to 1 - Beta, so specifying power implicitly specifies beta. Beta is the probability obtaining a false negative with the statistical test. That is, it is the probability of accepting a false null hypothesis. The valid range is 0 to 1. Different disciplines have different standards for setting power. The most common choice is 0.90, but 0.80 is also popular. You can enter a single value, such as 0.90, or a series of values, such as.70.80.90, or.70 to.90 by.1. When a series of values is entered, PASS will generate a separate calculation result for each value of the series. Alpha Alpha is the probability of obtaining a false positive with the statistical test. That is, it is the probability of rejecting a true null hypothesis. The null hypothesis is usually that the parameters of interest (means, proportions, etc.) are equal. Since Alpha is a probability, it is bounded by 0 and 1. Commonly, it is between 0.001 and 0.10. Alpha is often set to 0.05 for two-sided tests and to 0.05 for one-sided tests. You can enter a single value, such as 0.05, or a series of values, such as.05.10.15, or.05 to.15 by.01. When a series of values is entered, PASS will generate a separate calculation result for each value of the series. 419-3

Sample Size (When Solving for Sample Size) Group Allocation Select the option that describes the constraints on N1 or N or both. The options are Equal (N1 = N) This selection is used when you wish to have equal sample sizes in each group. Since you are solving for both sample sizes at once, no additional sample size parameters need to be entered. Enter N1, solve for N Select this option when you wish to fix N1 at some value (or values), and then solve only for N. Please note that for some values of N1, there may not be a value of N that is large enough to obtain the desired power. Enter N, solve for N1 Select this option when you wish to fix N at some value (or values), and then solve only for N1. Please note that for some values of N, there may not be a value of N1 that is large enough to obtain the desired power. Enter R = N/N1, solve for N1 and N For this choice, you set a value for the ratio of N to N1, and then PASS determines the needed N1 and N, with this ratio, to obtain the desired power. An equivalent representation of the ratio, R, is N = R * N1. Enter percentage in Group 1, solve for N1 and N For this choice, you set a value for the percentage of the total sample size that is in Group 1, and then PASS determines the needed N1 and N with this percentage to obtain the desired power. N1 (Sample Size, Group 1) This option is displayed if Group Allocation = Enter N1, solve for N N1 is the number of items or individuals sampled from the Group 1 population. N1 must be. You can enter a single value or a series of values. N (Sample Size, Group ) This option is displayed if Group Allocation = Enter N, solve for N1 N is the number of items or individuals sampled from the Group population. N must be. You can enter a single value or a series of values. R (Group Sample Size Ratio) This option is displayed only if Group Allocation = Enter R = N/N1, solve for N1 and N. R is the ratio of N to N1. That is, R = N / N1. Use this value to fix the ratio of N to N1 while solving for N1 and N. Only sample size combinations with this ratio are considered. N is related to N1 by the formula: N = [R N1], where the value [Y] is the next integer Y. 419-4

For example, setting R =.0 results in a Group sample size that is double the sample size in Group 1 (e.g., N1 = 10 and N = 0, or N1 = 50 and N = 100). R must be greater than 0. If R < 1, then N will be less than N1; if R > 1, then N will be greater than N1. You can enter a single or a series of values. Percent in Group 1 This option is displayed only if Group Allocation = Enter percentage in Group 1, solve for N1 and N. Use this value to fix the percentage of the total sample size allocated to Group 1 while solving for N1 and N. Only sample size combinations with this Group 1 percentage are considered. Small variations from the specified percentage may occur due to the discrete nature of sample sizes. The Percent in Group 1 must be greater than 0 and less than 100. You can enter a single or a series of values. Sample Size (When Not Solving for Sample Size) Group Allocation Select the option that describes how individuals in the study will be allocated to Group 1 and to Group. The options are Equal (N1 = N) This selection is used when you wish to have equal sample sizes in each group. A single per group sample size will be entered. Enter N1 and N individually This choice permits you to enter different values for N1 and N. Enter N1 and R, where N = R * N1 Choose this option to specify a value (or values) for N1, and obtain N as a ratio (multiple) of N1. Enter total sample size and percentage in Group 1 Choose this option to specify a value (or values) for the total sample size (N), obtain N1 as a percentage of N, and then N as N - N1. Sample Size Per Group This option is displayed only if Group Allocation = Equal (N1 = N). The Sample Size Per Group is the number of items or individuals sampled from each of the Group 1 and Group populations. Since the sample sizes are the same in each group, this value is the value for N1, and also the value for N. The Sample Size Per Group must be. You can enter a single value or a series of values. N1 (Sample Size, Group 1) This option is displayed if Group Allocation = Enter N1 and N individually or Enter N1 and R, where N = R * N1. N1 is the number of items or individuals sampled from the Group 1 population. N1 must be. You can enter a single value or a series of values. 419-5

N (Sample Size, Group ) This option is displayed only if Group Allocation = Enter N1 and N individually. N is the number of items or individuals sampled from the Group population. N must be. You can enter a single value or a series of values. R (Group Sample Size Ratio) This option is displayed only if Group Allocation = Enter N1 and R, where N = R * N1. R is the ratio of N to N1. That is, R = N/N1 Use this value to obtain N as a multiple (or proportion) of N1. N is calculated from N1 using the formula: where the value [Y] is the next integer Y. N=[R x N1], For example, setting R =.0 results in a Group sample size that is double the sample size in Group 1. R must be greater than 0. If R < 1, then N will be less than N1; if R > 1, then N will be greater than N1. You can enter a single value or a series of values. Total Sample Size (N) This option is displayed only if Group Allocation = Enter total sample size and percentage in Group 1. This is the total sample size, or the sum of the two group sample sizes. This value, along with the percentage of the total sample size in Group 1, implicitly defines N1 and N. The total sample size must be greater than one, but practically, must be greater than 3, since each group sample size needs to be at least. You can enter a single value or a series of values. Percent in Group 1 This option is displayed only if Group Allocation = Enter total sample size and percentage in Group 1. This value fixes the percentage of the total sample size allocated to Group 1. Small variations from the specified percentage may occur due to the discrete nature of sample sizes. The Percent in Group 1 must be greater than 0 and less than 100. You can enter a single value or a series of values. Effect Size d Enter one or more values for d, the effect size, that you wish to detect. This is a standardized difference between the group means. The effect size is calculated using d = (μ1 - μ) / σ where μ1 and μ are the group means assumed by the alternative hypothesis and σ is your estimate of the population standard deviation. The value of d can be any non-zero value (positive or negative). However, it is usually between -3 and 3, excluding 0. You can enter a single value such as 0.5 or a series of values such as 0. 0.5 0.8 or 0. to 0.8 by 0.1. When a series of values is entered, PASS will generate a separate calculation result for each value of the series. 419-6

Cohen's Effect Size Table Cohen (1988) gave the following interpretation of d values that is still popular. Small d = 0. or 0% of σ Medium d = 0.5 or 50% of σ Large d = 0.8 or 80% of σ Unequal Group Variances If the two group variance are markedly different, the average of the two variances is computed and used: σ = [(σ₁² + σ₂²)/] This estimate of σ requires that the two sample sizes, N1 and N, be (nearly) equal. If they are not, this method may be inaccurate. Using d is criticized because it obscures the true scale of the data. 419-7

Example 1 Finding the Sample Size Researchers wish to compare two types of local anesthesia using a balanced, parallel-group design. Subjects in pain will be randomized to one of two treatment groups, the treatment will be administered, and the subject s evaluation of pain intensity will be measured on a seven-point scale. The researchers would like to determine the sample sizes required to detect a small, medium, and large effect size with a two-sided t-test when the power is 80% or 90% and the significance level is 0.05. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the procedure. You may then make the appropriate entries as listed below, or open Example 1 by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Sample Size Alternative Hypothesis... Two-Sided Power... 0.80 0.90 Alpha... 0.05 Group Allocation... Equal (N1 = N) d... 0. 0.5 0.8 Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results Numeric Results for Two-Sample T-Test Alternative Hypothesis: H1: d 0 Effect Target Actual Size Power Power N1 N N d Alpha 0.80 0.8004 393 393 786 0.0 0.050 0.90 0.900 56 56 105 0.0 0.050 0.80 0.8015 64 64 18 0.50 0.050 0.90 0.903 86 86 17 0.50 0.050 0.80 0.8075 6 6 5 0.80 0.050 0.90 0.9015 34 34 68 0.80 0.050 References Cohen, Jacob. 1988. Statistical Power Analysis for the Behavioral Sciences. Lawrence Erlbaum Associates. Hillsdale, New Jersey Julious, S. A. 010. Sample Sizes for Clinical Trials. Chapman & Hall/CRC. Boca Raton, FL. Machin, D., Campbell, M., Tan, B. T., Tan, S. H. 009. Sample Size Tables for Clinical Studies, 3rd Edition. Wiley-Blackwell. Ryan, Thomas P. 013. Sample Size Determination and Power. John Wiley & Sons. New Jersey. Report Definitions Target Power is the desired power. May not be achieved because of integer N1 and N. Actual Power is the achieved power. Because N1 and N are integers, this value is often (slightly) larger than the target power. N1 and N are the number of items sampled from each population. N is the total sample size, N1 + N. Effect Size: d = (μ1 - μ) / σ is the effect size. Cohen recommended Low = 0., Medium = 0.5, and High = 0.8. Alpha is the probability of rejecting a true null hypothesis. 419-8

Summary Statements Group sample sizes of 393 and 393 achieve 80.04% power to reject the null hypothesis of zero effect size when the population effect size is 0.0 and the significance level (alpha) is 0.050 using a two-sided two-sample equal-variance t-test. These reports show the values of each of the parameters, one scenario per row. Chart Section These plots show the relationship between effect size, power, and sample size. 419-9

Example Finding the Power Researchers wish to compare two types of local anesthesia using a balanced, parallel-group design. Subjects in pain will be randomized to one of two treatment groups, the treatment will be administered, and the subject s evaluation of pain intensity will be measured on a seven-point scale. The researchers would like to determine the power to detect a small, medium, and large effect size with a twosided t-test for group sample sizes of 5, 50, 100, 00, 400 and a significance level of 0.05. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the procedure. You may then make the appropriate entries as listed below, or open Example by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Power Alternative Hypothesis... Two-Sided Alpha... 0.05 Group Allocation... Equal (N1 = N) Sample Size Per Group... 5 50 100 00 400 d... 0. 0.5 0.8 Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results Numeric Results for Two-Sample T-Test Alternative Hypothesis: H1: d 0 Effect Size Power N1 N N d Alpha 0.1066 5 5 50 0.0 0.050 0.1677 50 50 100 0.0 0.050 0.906 100 100 00 0.0 0.050 0.5141 00 00 400 0.0 0.050 0.8073 400 400 800 0.0 0.050 0.4101 5 5 50 0.50 0.050 0.6969 50 50 100 0.50 0.050 0.9404 100 100 00 0.50 0.050 0.9988 00 00 400 0.50 0.050 1.0000 400 400 800 0.50 0.050 0.7915 5 5 50 0.80 0.050 0.9773 50 50 100 0.80 0.050 0.9999 100 100 00 0.80 0.050 1.0000 00 00 400 0.80 0.050 1.0000 400 400 800 0.80 0.050 This report presents the results for various values of d and sample size. 419-10

Example 3 Validation using Another Procedure This procedure should give identical results to the Two-Sample Tests Assuming Equal Variance procedure when the value of σ there is set to one. We will use this fact to provide a validation problem for this procedure. If we run that procedure with power = 0.90, alpha = 0.05, μ 1 = 1, μ = 0, σ = 1, and solve for sample size with N1 = N, we obtain N1 = N = 3. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the procedure. You may then make the appropriate entries as listed below, or open Example 3 by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Sample Size Alternative Hypothesis... Two-Sided Power... 0.90 Alpha... 0.05 Group Allocation... Equal (N1 = N) d... 1 Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results Numeric Results for Two-Sample T-Test Alternative Hypothesis: H1: d 0 Effect Target Actual Size Power Power N1 N N d Alpha 0.90 0.915 3 3 46 1.00 0.050 This procedure also calculated N1 = N = 3, thus the procedure is validated. 419-11