Abdul Latif Jameel Poverty Action Lab Executive Training: Evaluating Social Programs Spring 2009

Similar documents
Planning Sample Size for Randomized Evaluations Esther Duflo J-PAL

Tests for Intraclass Correlation

Elementary Statistics Triola, Elementary Statistics 11/e Unit 14 The Confidence Interval for Means, σ Unknown

Planning Sample Size for Randomized Evaluations

Budget Estimator Tool & Budget Template

Tests for Two Means in a Multicenter Randomized Design

RANDOMIZED TRIALS Technical Track Session II Sergio Urzua University of Maryland

Social Protection Floor Costing Tool. User Manual

Finance Mathematics. Part 1: Terms and their meaning.

Research Wizard: UPGRADE (March 2006) Descriptions and Screenshots

CashManager Home. Getting Started and Installation Guide

You should already have a worksheet with the Basic Plus Plan details in it as well as another plan you have chosen from ehealthinsurance.com.

Tests for Two Means in a Cluster-Randomized Design

Name Student ID # Instructor Lab Period Date Due. Lab 6 The Tangent

Social Protection Floor Costing Tool. User Manual

Entering Credit Card Charges

Two-Sample Z-Tests Assuming Equal Variance

Lesson 11. Tracking and Paying Sales Tax

SFSU FIN822 Project 1

OFFICE OF UNIVERSITY BUDGETS AND FINANCIAL PLANNING WMU BUDGET REPORTING INSTRUCTIONS (FOR THE WEB)

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

University of Texas at Dallas School of Management. Investment Management Spring Estimation of Systematic and Factor Risks (Due April 1)

F1 Results. News vs. no-news

Equivalence Tests for the Difference of Two Proportions in a Cluster- Randomized Design

MLC at Boise State Polynomials Activity 2 Week #3

starting on 5/1/1953 up until 2/1/2017.

Discrete Probability Distributions

Your Action Items. Add ADX & ATR to Graph Model your 1 st Delta Neutral Iron Condor Save trade in OptionsPro Put Condor Week on your calendar

Discrete Probability Distributions

Part 2 Handout Introduction to DemProj

Volcone Users Manual V2.0

Tests for the Difference Between Two Poisson Rates in a Cluster-Randomized Design

MLC at Boise State Logarithms Activity 6 Week #8

Tests for Two ROC Curves

Math 130 Jeff Stratton. The Binomial Model. Goal: To gain experience with the binomial model as well as the sampling distribution of the mean.

Summary of Statistical Analysis Tools EDAD 5630

Historic Volatility Calculator (HVC) Tutorial (Ver )

Formulating Models of Simple Systems using VENSIM PLE

AP Statistics Chapter 6 - Random Variables

1 Roy model: Chiswick (1978) and Borjas (1987)

Sampsize. Sample size and Power Version 0.6 November 9, Philippe Glaziou

Lab#3 Probability

WEB APPENDIX 8A 7.1 ( 8.9)

Elementary Statistics

Mixed Models Tests for the Slope Difference in a 3-Level Hierarchical Design with Random Slopes (Level-3 Randomization)

Equestrian Professional s Horse Business Challenge. Member s Support Program Workbook. Steps 1-3

Questions & Answers (Q&A)

R & R Study. Chapter 254. Introduction. Data Structure

Tests for Two Variances

Randomized Evaluation Start to finish

Studio 8: NHST: t-tests and Rejection Regions Spring 2014

Tolerance Intervals for Any Data (Nonparametric)

RPI Library Documentation Version 1.0.3

The Binomial Distribution

THE 2018 VAT CHANGE Updating VAT in QuickBooks Manually. Creating new VAT Codes a VAT Checklist

The Binomial Distribution

Insurance Tracking with Advisors Assistant

Confidence Intervals for an Exponential Lifetime Percentile

Risk Analysis. å To change Benchmark tickers:

Genium INET PRM User's Guide

Eligibility Troubleshooting 101

SESAM Web user guide

User Guide 24 May 2016 Copyright GMO-Z.com Forex HK Ltd. All rights reserved.

The following content is provided under a Creative Commons license. Your support

User guide for employers not using our system for assessment

Two-Sample T-Tests using Effect Size

Data Sheet for Trendline Trader Pro

Non-Inferiority Tests for the Ratio of Two Means

Manual Asset Based Finance Manager

Web Extension: Continuous Distributions and Estimating Beta with a Calculator

Bidding Decision Example

LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL

Getting started with WinBUGS

The Advanced Budget Project Part D The Budget Report

Technology Assignment Calculate the Total Annual Cost

Note on Using Excel to Compute Optimal Risky Portfolios. Candie Chang, Hong Kong University of Science and Technology

Module 6 Portfolio risk and return

Gamma Distribution Fitting

MMF Investment Policy Management

Tests for the Difference Between Two Linear Regression Intercepts

BUILDSMART DEBTORS. SmartAct. Authorized Training Manual

ESG Yield Curve Calibration. User Guide

The mathematical definitions are given on screen.

Chapter 5. Asset Allocation - 1. Modern Portfolio Concepts

Tests for the Matched-Pair Difference of Two Event Rates in a Cluster- Randomized Design

INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS. 20 th May Subject CT3 Probability & Mathematical Statistics

Binomial Distributions

ExcelSim 2003 Documentation

Expectation Exercises.

GuruFocus User Manual: Interactive Charts

Software Tutorial ormal Statistics

FOR USE FROM APRIL 2019

Gatekeeper Module Gatekeeper Version 3.5 June

Tests for Two Exponential Means

GOVERNMENT POLICIES AND POPULARITY: HONG KONG CASH HANDOUT

Report 2 Instructions - SF2980 Risk Management

Statistics 431 Spring 2007 P. Shaman. Preliminaries

FTS Real Time Project: Managing Duration

Banner Budget Reallocation Step-by-Step Training Guide. Process Opens March 12 and Closes April 5PM

Transcription:

MIT OpenCourseWare http://ocw.mit.edu Abdul Latif Jameel Poverty Action Lab Executive Training: Evaluating Social Programs Spring 2009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

Sample size calculation with clustered design In the panchayat case, you were introduced to the concept of clustering. Your evaluation team was interested in measuring the effect of a treatment (reservations for women) on outcomes at the village and household level. However, the randomization of women council leaders was done at the Gram Panchayat level it was done on a cluster of villages. It could be that the outcome of interest is correlated for villages that belong to the same Panchayat. For example, all the villages in a Panchayat will be subject to similar rainfall and other economic shocks. This means that when one village in the Panchayat does particularly well for this random reason (e.g. a good rainfall shock) all the villages in the same Panchayat will also do better. This will lead to more noise, and hence larger standard error than in the usual case of independent sampling: in effect, we have less variation than we think. When planning both the sample size and the best way to sample villages and Panchayats, we need to take this into account. This exercise will help you understand how to do that. Should you sample all the villages in a Gram Panchayat? Should you sample 1 village from 100 Gram Panchayats? Or should you sample 3 villages in 40 Panchayats? How do you decide? We will work through these questions by determining the sample size that allows us to detect a specific effect with at least 80% power. Remember power is the likelihood that when the treatment has an effect you will be able to distinguish it from zero in your sample. This exercise shows you how the power of your sample changes with the number of clusters, the size of the clusters, the size of the treatment effect and the Intraclass Correlation Coefficient. We will use a software program developed by Steve Raudebush with funding from the William T. Grant Foundation. You can find additional resources on clustered designs on their web site. Section I: Using the OD Software First download the OD software from the website (a software manual is also available): http://sitemaker.umich.edu/group-based/optimal_design_software After you have downloaded it, open the executable and you will see a screen which looks like the one below. Select the menu option: Cluster Randomized Trial.

Under the Cluster Randomized Trial menu you will see several options to generate graphs. Select the Power vs. number of clusters (J) option. Another menu pops up: Select α (alpha). You ll see it is already set to.050 for a 95% significance level. Next click on n, the number of villages per cluster. Suppose you are interested in knowing what kind of power you can get if you sample 2 villages from each Gram Panchayat. Fill in n(1) with 2 and click OK. Now we have to determine δ (delta), the standardized effect size (the effect size divided by the standard deviation of the variable of interest). Assume you are interested in detecting whether there is an increase of 30% in the investment in drinking water undertaken in Gram Panchayats reserved for women over two years. In a small pilot survey, you determined that, in Panchayats that are not reserved, there are on average 15 instances of investment in the past two years, and that the standard deviation of the number of investments is 19. We want to detect an effect size of 30% of 15, which is 4.5. We then divide this by the standard deviation to get δ equal to 4.5/19 or 0.24. Select δ from the menu. In the dialogue box that appears there is a prefilled value of 0.200 for delta(1). Change the value to 0.24, and change the value of delta (2) to empty. Select OK. Finally we need to choose ρ, which is the intra-cluster correlation. ρ tells us how strongly the outcomes are correlated for units within the same cluster. If the villages within a Gram Panchayat are in fact independent with respect to the variable of interest, in this case drinking water, then ρ will equal 0. If on the other hand the villages in the Gram Panchayat were identical (no variation) with respect to drinking water outcomes, then ρ would equal 1. You have determined in your pilot study that ρ is 0.07. Fill in rho(1) to 0.07, and set rho (2) to be empty.

You should see a graph similar to the one below. How does the number of clusters change the power of the sample? How many clusters do you need to sample to get 80% power? You can click on the graph with your mouse to see the exact power and number of clusters for a particular point:

You have seen how many clusters you need for 80% power, sampling two villages per Panchayat. Now suppose you are interested in sampling 5 villages from each Gram Panchayat. Change n to 5. What does the new graph look? What is the new number of clusters you need for 80% power? Finally, let s see how the Intraclass Correlation Coefficient (ρ) changes power of a given sample. Leave rho(1) to be 0.07 but for comparison change rho(2) to 0.00. You should see a graph like the one below. The solid blue curve is the one with the parameters you ve set - based on your pretesting estimates of the effect of reservations for women on drinking water. The blue dashed curve is there for comparison to see how much power you would get from your sample if ρ were zero. Look carefully at the graph. How does the power of the sample change with the Intraclass Correlation Coefficient (ρ)? You can rescale the x and y axis using the menu options:

Other Cluster Randomized Trial Menu Options To take a look at some of the other menu options, close the graph by clicking on the in the top right hand corner of the inner window. Select the Cluster Randomized Trial menu again. Try generating graphs for how power changes with cluster size (n), intra-class correlation (rho) and effect size (delta). You will have to re-enter your pre-test parameters each time you open a new graph. Section II: Working Within a Budget Now that you have some familiarity with the parameters that determine power in clustered randomization, you need to incorporate your budget constraints. See the associated Excel exercise. The Excel exercise is available online at: http://povertyactionlab.org/course/cambridge_2007/ex.php