Medical Expenditure Panel Survey. Household Component Statistical Estimation Issues. Copyright 2007, Steven R. Machlin,

Similar documents
Estimates of Medical Expenditures from the Medical Expenditure Panel Survey: Gains in Precision from Combining Consecutive Years of Data

Proc SurveyCorr. Jessica Hampton, CCSU, New Britain, CT

Effects of Poststratification and Raking Adjustments on Precision of MEPS Estimates Sadeq R. Chowdhury

The coverage of young children in demographic surveys

COMPARISON of WITH- REPLACEMENT and WITHOUT- REPLACEMENT VARIANCE ESTIMATES for a COMPLEX SURVEY

Applications of Data Analysis (EC969) Simonetta Longhi and Alita Nandi (ISER) Contact: slonghi and

Health Status, Health Insurance, and Health Services Utilization: 2001

PART B Details of ICT collections

RECOMMENDATIONS AND PRACTICAL EXAMPLES FOR USING WEIGHTING

Considerations for Sampling from a Skewed Population: Establishment Surveys

Incorporating a Finite Population Correction into the Variance Estimation of a National Business Survey

Lectures 04, 05, 06: Sample weights

Lap-Ming Wun and Trena M. Ezzati-Rice and Robert Baskin and Janet Greenblatt and Marc Zodet and Frank Potter and Nuria Diaz-Tena and Mourad Touzani

An Evaluation of Nonresponse Adjustment Cells for the Household Component of the Medical Expenditure Panel Survey (MEPS) 1

STRATEGIES FOR THE ANALYSIS OF IMPUTED DATA IN A SAMPLE SURVEY

Current Population Survey (CPS)

Design of a Multi-Stage Stratified Sample for Poverty and Welfare Monitoring with Multiple Objectives

Marital Disruption and the Risk of Loosing Health Insurance Coverage. Extended Abstract. James B. Kirby. Agency for Healthcare Research and Quality

Multiple Imputation of Family Income and Personal Earnings in the National Health Interview Survey: Methods and Examples

THE VALUE OF AN INVESTMENT & INSURANCE CUSTOMER TO A BANK

Design Issues for a Longitudinal Employer Health Insurance Survey to Facilitate Analysis of Policy Changes

National Health Interview Survey Early Release Program

The American Panel Survey. Study Description and Technical Report Public Release 1 November 2013

1 PEW RESEARCH CENTER

Introduction to Current Population Survey (CPS) Hsueh-Sheng Wu Center for Family and Demographic Research November 14, 2016

SURVEY OF INSURANCE STATUS 2006 METHODOLOGICAL REPORT

Introduction to Survey Weights for National Adult Tobacco Survey. Sean Hu, MD., MS., DrPH. Office on Smoking and Health

Weighting Survey Data: How To Identify Important Poststratification Variables

Medicaid Undercount in the American Community Survey (ACS)

Original data included. The datasets harmonised are:

Guide for Investigators. The American Panel Survey (TAPS)

WikiLeaks Document Release

Random Group Variance Adjustments When Hot Deck Imputation Is Used to Compensate for Nonresponse 1

GTSS. Global Adult Tobacco Survey (GATS) Sample Weights Manual

Russia Longitudinal Monitoring Survey (RLMS) Sample Attrition, Replenishment, and Weighting in Rounds V-VII

Weighting in Survey Sampling

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001

Healthy Incentives Pilot (HIP) Interim Report

Efficiency and Distribution of Variance of the CPS Estimate of Month-to-Month Change

STATISTICAL BRIEF #172

Benchmark Report for the 2008 American National Election Studies Time Series and Panel Study. ANES Technical Report Series, no. NES

New SAS Procedures for Analysis of Sample Survey Data

MEDICAID UNDERCOUNT IN THE AMERICAN COMMUNITY SURVEY

Table 1. Underinsured Indicators Among Adults Ages Insured All Year, 2003, 2005, 2010, 2012, 2014, 2016

Anomalies under Jackknife Variance Estimation Incorporating Rao-Shao Adjustment in the Medical Expenditure Panel Survey - Insurance Component 1

EXAMPLE 6: WORKING WITH WEIGHTS AND COMPLEX SURVEY DESIGN

Poststratification with PROC SURVEYMEANS

New Construction Program Participating Owner Survey

Measuring the Cost of Employment: Work-Related Expenses in the Supplemental Poverty Measure. No. 279 SEHSD No

AGING, DEMOGRAPHICS AND MEMORY STUDY (ADAMS) Sample Design, Weighting and Analysis for ADAMS. Report prepared by:

Medicaid Undercount in the American Community Survey: Preliminary Results

Survey Information and Methodology. Introduction

The Urban-Brookings Tax Policy Center Microsimulation Model: Documentation and Methodology for Version 0304

7 Construction of Survey Weights

THE MASSACHUSETTS HEALTH REFORM SURVEY: METHODOLOGY REPORT FOR 2006 TO 2012

Health Insurance Coverage: Early Release of Estimates From the National Health Interview Survey, January March 2016

Insurance, Access, and Quality of Care Among Hispanic Populations Chartpack

USE OF AN EXISTING SAMPLING FRAME TO COLLECT BROAD-BASED HEALTH AND HEALTH- RELATED DATA AT THE STATE AND LOCAL LEVEL

Introduction to the European Union Statistics on Income and Living Conditions (EU-SILC) Dr Alvaro Martinez-Perez ICOSS Research Associate

Designing a Multipurpose Longitudinal Incentive Experiment for the SIPP

Click to edit Master text styles

Mission Report for a short-term mission of the specialist in sampling for household surveys From 10 to 31 October 2015 David J.

National Longitudinal Survey of Youth 1997 (NLSY97) Technical Sampling Report

November 1, 2010 I. Survey Methodology Selection of Households

2019 Colorado Health Access Survey (CHAS) Survey Administrator Request for Proposal (RFP) April 2018

Balancing Cross-sectional and Longitudinal Design Objectives for the Survey of Doctorate Recipients

CLS Cohort. Studies. Centre for Longitudinal. Studies CLS. Nonresponse Weight Adjustments Using Multiple Imputation for the UK Millennium Cohort Study

2006 Health Care Survey of DoD Beneficiaries:

HRS Documentation Report

IMPROVING ON PROBABILITY WEIGHTING FOR HOUSEHOLD SIZE ANDREW GELMAN THOMAS C. LITTLE. Introduction. Method

S E P T E M B E R Comparing Federal Government Surveys that Count Uninsured People in America

Results from the 2009 Virgin Islands Health Insurance Survey

How Couples Meet and Stay Together Project

Small Area Estimation: Part I. Partha Lahiri JPSM, Univ. of Maryland, College Park, USA May 18, 2011

REDESIGNING THE NATIONAL HEALTH INTERVIEW SURVEY QUESTIONNAIRE

PREPARING DATA FOR TAX POLICY AND REVENUE ANALYSIS. George Ramsey Franchise Tax Board, P.O. Box 2229, Sacramento, CA

1 PEW RESEARCH CENTER

No K. Swartz The Urban Institute

Background Notes SILC 2014

Comparative Study of Electoral Systems (CSES) Module 4: Design Report (Sample Design and Data Collection Report) September 10, 2012

The Serbia 2013 Enterprise Surveys Data Set

HEALTH AND RETIREMENT STUDY Prescription Drug Study Final Release V1.0, November 2008 (Sensitive Health Data) Data Description and Usage

PSID Technical Report. Construction and Evaluation of the 2009 Longitudinal Individual and Family Weights. June 21, 2011

Central Statistical Bureau of Latvia INTERMEDIATE QUALITY REPORT EU-SILC 2011 OPERATION IN LATVIA

Journal of Global Business and Trade

CYPRUS FINAL QUALITY REPORT

Steven B. Cohen, Jill J. Braden, Agency for Health Care Policy and Research Steven B. Cohen, AHCPR, 2101 E. Jefferson St., Rockville, Maryland

Small Area Health Insurance Estimates from the Census Bureau: 2008 and 2009

BZComparative Study of Electoral Systems (CSES) Module 3: Sample Design and Data Collection Report June 05, 2006

HEALTH AND RETIREMENT STUDY Prescription Drug Study Final Release V1.0, March 2011 (Sensitive Health Data) Data Description and Usage

Tanzania - National Panel Survey , Wave 4

FINAL QUALITY REPORT EU-SILC

Designing a Multipurpose Longitudinal Incentives Experiment for the Survey of Income and Program Participation

$5,615 $15,745. The Kaiser Family Foundation - AND - Employer Health Benefits. Annual Survey. -and-

Chartpack Examining Sources of Supplemental Insurance and Prescription Drug Coverage Among Medicare Beneficiaries: August 2009

Older Immigrants and Health Insurance: Differences by Region of Origin in Patterns and Sources of Coverage

REPORT OF THE COUNCIL ON MEDICAL SERVICE

Health and Retirement Study. Imputations for Employer-Sponsored Pension Wealth from Current Jobs in Data Description and Usage

Description of the Sample and Limitations of the Data

Health Insurance Coverage in Oklahoma: 2008

Transcription:

Medical Expenditure Panel Survey Household Component Statistical Estimation Issues

Overview Annual person-level estimates Overlapping panels Estimation variables Weights Variance Pooling multiple years of annual data Longitudinal analysis of MEPS panels Two-year period Family-level estimation Other miscellaneous issues

Annual Person-Level Files

MEPS Annual Files Year Panel 1 (96-97) 97) 2 (97-98) 98) 3 (98-99) 99) 1997 Yr. 2 Yr. 1 1998 Yr. 2 Yr. 1 1999 Yr. 2 2000 2001 4 (99-00) 5 (00-01) 01) 6 (01-02) 02) Yr. 1 Yr. 2 Yr. 1 Yr. 2 Yr. 1

MEPS Annual Files Year Panel 6 (01-02) 02) 7 (02-03) 03) 8 (03-04) 04) 9 (04-05) 05) 2002 Yr. 2 Yr. 1 2003 Yr. 2 Yr. 1 2004 Yr. 2 Yr. 1

MEPS Annual Person Level Estimation 1997 1998 1999 2000 2001 File Number HC- 028 HC- 020 HC- 038 HC- 050 HC- 060 Persons with weight > 0 32,636 22,953 23,565 23,839 32,122 Weighted Persons: All INSC1231=1 (in target pop. at end of year) 271.3 million 267.7 million 273.5 million 270.1 million 276.4 million 273.0 million 278.4 million 275.2 million 284.2 million 280.8 million

MEPS Annual Person Level Estimation (continued) 2002 2003 2004 File Number HC-070 HC-079 HC-089 Persons with weight > 0 Weighted Persons: All INSC1231=1 (in target pop. at end of year) 37,418 288.2 million 284.6 million 32,681 290.6 million 286.8 million 32,737 293.5 million 289.7 million

Weights and Variance Estimation Variables

MEPS Sample Design Each panel is sub-sample of household respondents for the previous year s National Health Interview Survey (NHIS) NHIS sponsor is National Center for Health Statistics NHIS sample based on complex stratified multi-stage probability design Civilian non-institutionalized population

NHIS Sample Design (1995-2004) U.S. partitioned into 1,995 Primary Sampling Units (Counties or groups of adjacent counties) PSU s grouped into 237 design strata 358 PSU s sampled across strata Second Stage Units (SSU s) Clusters of housing units Oversample of SSU s with large Black/Hispanic populations MEPS based on subsample of about 200 PSU s from NHIS

Oversampling in MEPS Every year: Blacks and Hispanics Carryover from NHIS 1997: Selected subpopulations Functionally impaired adults Children with activity limitations Adults 18-64 predicted to have high medical expenditures Low income Adults with other impairments 2002 and beyond: Asians Low income Additional oversampling of blacks in 2004

Estimation from Complex Surveys Estimates need to be weighted to reflect sample design and survey nonresponse Unweighted estimates are biased Use appropriate method to compute standard errors to account for complex design Assuming simple random sampling usually underestimates sampling error

Base Weight (NHIS) Development of Person Weights Compensates for oversampling and nonresponse Adjustments for Household nonresponse (MEPS Round 1) Attrition of persons (Subsequent Rounds) Poststratification (Census Population Estimates) Trimming of extreme weights Final Person Weight Weight > 0: person selected and in-scope for survey Weight = 0 (about 5% in 2004): person not in-scope for survey but living in household with in-scope person(s)

Distribution of MEPS Sample Person Final Weights 1997 1998 1999 2000 2001 Average 8,312 11,917 11,730 11,679 8,849 Minimum 299 321 307 454 336 Maximum 68,518 84,587 80,062 78,157 67,537 Variable Name WTDPER97 WTDPER98 PERWT99F PERWT00F PERWT01F

Distribution of Sample Person Final Weights (continued) 2002 2003 2004 Average 7,702 8,892 8,966 Minimum 367 401 425 Maximum 46,766 60,273 63,728 Variable Name PERWT02F PERWT03F PERWT04F

Types of Basic Point Estimates Means Proportions Totals Differences between subgroups

Variance Estimation Basic software procedures assume simple random sampling (SRS) MEPS not SRS Point estimates correct (if weighted) Standard errors usually too small Software to account for complex design using Taylor Series approach SUDAAN (stand-alone alone or callable within SAS) STATA (svy commands) SAS 8.2 (survey procedures) SPSS (new complex survey features in 13.0)

Estimation Example: Average Total Expenditures, 2004 Weighted mean = $3,284 per capita Unweighted mean of $2,944 is biased SE based on Taylor Series = 89 SAS: PROC SURVEYMEANS SUDAAN: PROC DESCRIPT Stata: svymean SE assuming SRS = 56 (too low) SAS: PROC UNIVARIATE or MEANS

Computing Standard Errors for MEPS Estimates Document on MEPS website http://www.meps.ahrq.gov/mepsweb/sur vey_comp/standard_errors.jsp

Example (Point estimates and SEs): SAS V8.2 proc surveymeans data=work.h89 mean; stratum varstr; cluster varpsu; weight perwt04f; var totexp04;

Example (Point estimates and SEs): SUDAAN (SAS-callable) First need to sort file by varstr & varpsu proc descript data=work.h89 filetype=sas design=wr wr; nest varstr varpsu; weight perwt04f; var totexp04;

Example (Point estimates and SEs): Stata svyset [pweight=perwt04f], strata(varstr) psu (varpsu) svymean(totexp04)

Analysis of Subpopulations Analyzing files that contain only a subset of MEPS sample may produce error messages or incorrect standard errors Each software package has capability to produce subpopulation estimates from entire person-level file See Computing Standard Errors for MEPS Estimates http://www.meps.ahrq.gov/mepsweb/survey _comp/standard_errors.jsp

Sample Sizes Assessing Precision/Reliability of Estimates Standard Errors/Confidence Intervals Relative Standard Errors standard error of estimate estimate

Example: Average total expenses per capita, 2004 Sample Size = 32,737 Estimate = $3,284 Standard Error = 89 95% Confidence Interval: (3109, 3458) Relative Standard Error (RSE) or Coefficient of Variation (CV) = 89 3284 =.027 = 2.7%

Means Types of Basic Point Estimates: Examples Annual per capita expenses in 2004 = $3,284 Proportions Percent with some health expenses in 2004 = 84.7% Two methods to generate estimates: Totals percents obtained from frequency tables means of dichotomous variable Expenses in 2004 = $963.9 billion Number of persons (weighted) = 293,527,003 Differences between subgroups

Pooling Multiple Years of MEPS Data

Reasons for Pooling Reduce standard error of estimate(s) Stabilize trend analysis Enhance ability to analyze small subgroups

Minimum Sample Sizes CFACT Standards Minimum unweighted sample of 100 Flag estimates with RSE > 30% Confidence intervals become problematic with small samples and/or highly skewed data Consider larger minimum sample sizes for highly skewed variables Analysts may be comfortable with smaller minimums for less skewed variables ASA Paper: Yu and Machlin (Skewness( Skewness) http://www.meps.ahrq.gov/mepsweb/data_files/pu blications/workingpapers/wp_04002.pdf

Example: Annual Sample Sizes (Unpooled) Year 1996 1997 1998 1999 Total Population 21,571 32,636 22,953 23,565 Children 0-5 2,018 3,082 2,114 2,156 Asian/PI Children* 0-50 58 78 82 93 * Sample sizes do not meet AHRQ minimum requirement (n=100) to publish estimates.

Pooled Sample Sizes Years Total Sample Children 0-5 Asian/PI Children 0-50 1996-1997 1997 54,207 5,100 136 1998-1999 1999 46,518 4,270 175 1996-1999 1999 100,725 9,370 311

Relative Standard Errors for Estimated Mean Expenditures: Asian/PI Children 0-50 50% Relative Standard Error 40% 30% 20% 10% 0% 96 97 98 99 96-97 98-99 96-99 Annual 2 year 4 year

Creating a Pooled File for Analysis (1996-2002) Need to work with Pooled Estimation File (HC-036) when 1+ years being pooled include any year from 1996 through 2001 Stratum and PSU variables obtained from HC-036 for 1996-2004 Documentation for HC-036 provides instructions on how to properly create pooled analysis file Stratum (varstr( varstr) ) and PSU (varpsu( varpsu) ) variables properly standardized for pooling years from 2002 onward (i.e., do not need HC-036)

Creating Pooled Files: Summary of Important Steps Rename analytic and weight variables from different years to common names. Expenditures: TOTEXP99 & TOTEXP00 = TOTEXP Weights: PERWT99F & PERWT00F = POOLWT Divide weight variable by number of years pooled to produce estimates for an average year during the period. Keep original weight value if estimating total for period Concatenate annual files Merge variance estimation variables from HC-036 onto file (only if 1+ years prior to 2002) Strata variable: STRA9604 PSU variable: PSU9604

Estimates from Pooled Files Produce estimates in analogous fashion as for individual years Estimates interpreted as average annual for pooled period Example: Pooled 1996-99 data The average annual total health care expenditures for Asian/Pacific Islander children under 6 years of age during the period from 1996-1999 1999 was $525 (SE=97).

Pooling Annual Data: Lack of Independence Across Years Legitimate to pool data for persons in consecutive years Each yr. constitutes nationally representative sample Pooling produces average annual estimates Stratum & PSU variables sufficient to account for lack of independence between years Lack of independence actually begins with first stage of sample selection Same PSUs are used to select each MEPS panel See HC-036 documentation

Longitudinal Analysis of MEPS Panels

MEPS Longitudinal Analysis: Panel 4: 1999-2000 Πανελ 4: 1999 2000 1/1/1999 Ρουνδ 1 1999 2000 12/31/2000 Ρουνδ 2 Ρουνδ 3 Ρουνδ 4 Ρουνδ 5

MEPS Longitudinal Analysis National estimates of person-level changes over two-year period two-year period is relatively short Examine characteristics associated with changes mainly round 1 data

Variables that may change between years or rounds Insurance coverage Monthly indicators (24 measures) Annual summary (2 measures per person) Health status Each round (5 measures) Having a usual source of care Rounds 2 & 4 (2 measures) Use and expenditures Annual (2 measures per person)

MEPS Longitudinal Weight Files Currently Available (Oct 2007) MEPS Panel 1 2 3 4 5 6 7 8 Years Covered 1996-97 97 1997-98 98 1998-99 1999-00 2000-01 01 2001-02 02 2002-03 03 2003-04 04 PUF Number HC-023 HC-035 HC-048 HC-058 HC-065 HC-071 HC-080 HC-086

Creating Longitudinal Files (Panel 4) : Summary of Important Steps Select Panel 4 records from annual files 1999 (PUF HC-038) 2000 (PUF HC-050) Obtain MEPS Longitudinal File (HC-058) Contains weight and variance estimation variables Contains variable indicating whether complete data are available for 1 or both years of panel Link using DUPERSID

Longitudinal Weight Variable Name: LONGWTP# Produces estimates for persons in civilian noninstitutionalized population in two consecutive years when applied to persons participating in both years of a given panel (YRINDP# = 1)

Examples: Longitudinal Estimates Of those without insurance at any time in 1999, estimated 76.9% (SE=1.6) also uninsured throughout 2000 Estimated 8.2% (SE=0.4) of the population had no insurance throughout 1999-2000 Of those with no expenses in 1999, estimated 47.6% (SE=1.3) had some expenses in 2000 Of top 5% of spenders in 1996, 30% retain this position in 1997.

Family-Level Estimation

Family-Level Estimation Need to create families from person- level files (see documentation) Two family type options: MEPS: includes unmarried couples/foster children CPS: unmarried couples not family unit Two time frame options: December 31 (MEPS, CPS) Any time during year (MEPS only)

MEPS Annual Files: Family Sample Sizes, 2004 Unweighted MEPS Full yr. 13,018 MEPS Dec 31 12,913 CPS Dec 31 13,349 Weighted 123.0 million 121.8 million 125.8 million Family Weight Variable Name FAMWT04F FAMWT04F FAMWT04C

Family-Level Estimation Example: Average Expenses per MEPS Family, 2004 Based on MEPS families in scope at any time during year Average number of persons per family is about 2.4. Family size All 1 2 3 4 5+ Estimate $7,674 $5,337 $9,670 $7,435 $8,815 $8,265 SE 187 270 370 286 702 405

Other Miscellaneous Estimation Issues

Medical Event as Unit of Analysis Can use event files to estimate average expense per event Examples: In 2004, mean facility expense per inpatient stay was $8,679 (SE=403). mean expense per office visit to a medical provider was $141 (SE=3)

Special Supplements Self Administered Questionnaire (SAQ) Use SAQ weight Parent Administered Questionnaire (PAQ) 2000 only Use PAQ weight Diabetes Care Survey (DCS) Use DCS weight Variables on person-level files Consult documentation for appropriate weight