On Stratification in Medicare Investigations. Acknowledgments. Outline. Don Edwards

Similar documents
Statistical Sampling Approach for Initial and Follow-Up BMP Verification

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Week 1 Quantitative Analysis of Financial Markets Distributions B

Audit Sampling: Steering in the Right Direction

Business Statistics 41000: Probability 4

Chapter 7: Point Estimation and Sampling Distributions

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Learning Objectives for Ch. 7

Some Characteristics of Data

The Central Limit Theorem

Chapter 5. Sampling Distributions

Chapter 8 Estimation

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

Lecture 5: Sampling Distributions

Sampling and sampling distribution

STAT Chapter 6: Sampling Distributions

SAMPLING DISTRIBUTIONS. Chapter 7

Anticipating Medicare's Alphabet Soup of Audit Contractors, Ranging from ZPICs and RACs to CERTs and MACs

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.

Math 227 Elementary Statistics. Bluman 5 th edition

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

Sampling. Marc H. Mehlman University of New Haven. Marc Mehlman (University of New Haven) Sampling 1 / 20.

Confidence Intervals for the Median and Other Percentiles

Reminders. Quiz today - please bring a calculator I ll post the next HW by Saturday (last HW!)

Statistics 13 Elementary Statistics

Medicaid Overpayments: Challenging State Audit Allegations of Overpayment and the Use of Statistical Sampling and Extrapolation

RAC Audits, Extrapolation and Defensive Strategies

Chapter 5: Statistical Inference (in General)

Module 4: Probability

Introduction to Alternative Statistical Methods. Or Stuff They Didn t Teach You in STAT 101

Section 1.4: Learning from data

DATA SUMMARIZATION AND VISUALIZATION

Distribution of the Sample Mean

Statistics for Business and Economics: Random Variables:Continuous

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

6 Central Limit Theorem. (Chs 6.4, 6.5)

Chapter 8 Statistical Intervals for a Single Sample

1. Variability in estimates and CLT

Systems. Ruyue Yuan. May 3, Valparaiso University Department of Mathematics and Computer Science. Probabilistic Analysis of the Economic

Sampling & Confidence Intervals

Sampling & Statistical Methods for Compliance Professionals. Frank Castronova, PhD, Pstat Wayne State University

Math 140 Introductory Statistics

An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1

Aspects of Sample Allocation in Business Surveys

Chapter 15: Sampling distributions

Making Sense of Cents

Sampling Distributions

The Serbia 2013 Enterprise Surveys Data Set

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Chapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.

STA Module 3B Discrete Random Variables

Agenda. Fraud, Waste, and Abuse. Extrapolation: Understanding the Statistics What to do When it Happens to your Audit Results 3/17/2015

UNIVERSITY OF VICTORIA Midterm June 2014 Solutions

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Refunds and Reporting Overpayments. David M. Glaser Fredrikson & Byron, P.A. (612)

Medicaid Audit Overpayments: Challenging Statistical Sampling and Extrapolation

The Assumption(s) of Normality

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

Week 1 Quantitative Analysis of Financial Markets Probabilities

The Armenia 2013 Enterprise Surveys Data Set

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

The Macedonia 2013 Enterprise Surveys Data Set

3.2 Binomial and Hypergeometric Probabilities

Medicaid Performance Audit. My Brief Resume 2/5/2014. Molina Healthcare of Washington: Blue Cross and Blue Shield: An Emerging Challenge for MCOs

Statistics, Their Distributions, and the Central Limit Theorem

P1: TIX/XYZ P2: ABC JWST JWST075-Goos June 6, :57 Printer Name: Yet to Come. A simple comparative experiment

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Problem Set #4. Econ 103. (b) Let A be the event that you get at least one head. List all the basic outcomes in A.

Lattice Valuation of Options. Outline

1.1 Interest rates Time value of money

Section 0: Introduction and Review of Basic Concepts

ASA Section on Business & Economic Statistics

Math 140 Introductory Statistics

Estimating the Parameters of Closed Skew-Normal Distribution Under LINEX Loss Function

Examiners commentaries 2011

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, Abstract

Chapter 7 Study Guide: The Central Limit Theorem

Sampling Strategies in Sales Tax Audits: Selecting an Appropriate Methodology and Negotiating With Auditors

An approximate sampling distribution for the t-ratio. Caution: comparing population means when σ 1 σ 2.

Sampling Distributions and the Central Limit Theorem

4. DESCRIPTIVE STATISTICS

Elementary Statistics Lecture 5

Data Analysis and Statistical Methods Statistics 651

Chapter 6.1 Confidence Intervals. Stat 226 Introduction to Business Statistics I. Chapter 6, Section 6.1

The Binomial Distribution

CAST: Canvass Audits by Sampling and Testing.

The Updated OIG Self-Disclosure Protocol and Statistical Sampling for Non-Statisticians

Derivative Instruments

Probability. An intro for calculus students P= Figure 1: A normal integral

Integrating Contract Risk with Schedule and Cost Estimates

Contents. The Binomial Distribution. The Binomial Distribution The Normal Approximation to the Binomial Left hander example

Provider Healthcare Portal Demonstration:

The Binomial Distribution

St. Xavier s College Autonomous Mumbai T.Y.B.A. Syllabus For 5 th Semester Courses in Statistics (June 2016 onwards)

Statistics and Probability

Statistics and Probabilities

Sampling Methods, Techniques and Evaluation of Results

Normal Probability Distributions

Transcription:

On Stratification in Medicare Investigations Don Edwards Dept. of Statistics University of South Carolina edwards@stat.sc.edu October 14, 2009 13 th Annual Medicare / Medicaid Statistics and Data Analysis Conference Omaha, NE Acknowledgments This research would not have been possible without the support of Palmetto GBA, Inc. and Tricenturion, Inc. I am especially grateful for advice and feedback received from: Gail Ward-Besser, Palmetto GBA Petko Kostadinov, TriCenturion Jennifer Lasecki, Palmetto GBA Outline Three Payment Populations What Are We Doing? In God We Trust the rest of us need to have data Stratification by Payment Amount Summary Appendix I: samptest Appendix II: the Minimum Sum Method 1 1

Three Payment Populations 1. Pediatric Services Shape: Right skew, no separation from 0 N = 3000, total pmt = $3.1M Frequency 0 200 400 600 800 1000 0 1 0 0 0 2 0 0 0 3 0 0 0 4 0 0 0 5 0 0 0 p t m t a m Three Payment Populations 2. Power Wheelchairs Shape: Similar pmt amts, separation from 0 N = 250, total pmt = $1.0M Frequency 0 50 100 150 200 0 1 0 0 0 2 0 0 0 3 0 0 0 4 0 0 0 5 0 0 0 p t m t a m Three Payment Populations 3. Home Health Shape: in-between examples 1 and 2 N = 9000, total pmt = $1.1M Frequency 0 1000 2000 3000 4000 5000 0 2 0 0 4 0 0 6 0 0 8 0 0 1 0 0 0 p m t a m t 2 2

What Are We Doing? We are sampling overpayments - the sampled objects are payments, but the observation in each case is an overpayment. (a) The operating characteristics of the sampling and extrapolation plan depends on the distribution of the overpayments. (b) Any planning calculations made using payment amounts are potentially very misleading unless the denial rate is 100% (c) We do not know the overpayment distribution, but we can infer a lot about its plausible shape from the pmt distribution. Wheelchair example: one Plausible Overpayment Population Compare to slide 5. This assumes an 80% denial rate, denials evenly distributed among payments Frequency 0 50 100 150 0 1 0 0 0 2 0 0 0 3 0 0 0 4 0 0 0 5 0 0 0 O p m t a m t Home Health example: one Plausible Overpayment Population Compare to slide 6. This assumes an 80% denial rate, denials evenly distributed among payments Frequency 0 1000 2000 3000 4000 0 2 0 0 4 0 0 6 0 0 8 0 0 1 0 0 0 O p m t a m t 3 3

What Are We Doing? We are not estimating the total universe overpayment. We are finding a 90% lower confidence bound for the total overpayment. (a) Regardless of the true nature of the overpayment population, the sampling and extrapolation plan should underrecoup 90% of the time. (b) Subject to (a) and available resources, we seek to recover as much of the universe overpayment (small or large) as possible. (c) All else equal, it s better to recoup 90% from each of 4 providers than 95% from 1 provider. In God We Trust We do not know the overpayment distribution, but we can infer a lot about its plausible shape from the payment distribution. If the payment distribution is right-skewed and not separated from 0, the overpayment distribution will almost certainly be right-skewed. If the payments are similar in size and separated from 0, under a high denial rate the overpayment distribution will almost certainly be bimodal and left-skewed. In God We Trust Cochran ( God) 3 rd ed. p.41: when we sample from positively skew populations the frequency with which y (1.96)s y is greater than µ is less than 2.5% Attempted translation by Edwards (not God): When we sample from a right-skewed overpayment population, the actual underrecoupment rate of a Central Limit Theorem-based 90% lower confidence bound is greater than 90%. 4 4

In God We Trust The bad news is that vice-versa holds: When we sample from a left-skewed overpayment population, the actual underrecoupment rate of a Central Limit Theorem-Based 90% lower confidence bound is less than 90%. In God We Trust Don t take my word for it. We can easily sample repeatedly from the overpayment population shown on slide 7, and discover that the 90% lower bound only works 86.5% of the time when n = 45. 500 600 700 800 900 0 5 0 0 1 0 0 0 1 5 0 0 2 0 0 0 2 5 0 0 3 0 0 0 i t e r a t i o n In God We Trust Often this problem gets worse with higher denial rates. Repeat the process just described for denial rates between 0.1 and 1.0 (wheelchair pop). A plot of the underrecoupment rate vs. the denial rate: Underrecoupment Rate (%) 5 5

In God We Trust An even more serious problem: the CLT lower bound can be greater than the total amount paid, rendering it completely indefensible as a recoupment demand. The chance of this happening approaches 40% for the wheelchair population when the denial rate is close to 1. Using a lower bound based on the ratio estimator fixes the above problem, but not the problem with the underrecoupment rate being << 90%. For any population with payment amounts similar and separated from 0, use the minimum-sum (Edwards et al 2005) extrapolation method. In God We Trust Don t abandon CLT methods for all populations. It is conservative when the overpayment population is right-skewed, e.g. for the pediatric services and home health (shown below, n=45) populations. Underrecoupment Rate (%) In God We Trust BOTTOM LINE: In this day and age, there is no reason to use a sampling-and-extrapolation plan without testing it thoroughly. TEST YOUR SAMPLING PLAN BEFORE YOU USE IT 6 6

Stratified Sampling by Payment Amounts Scheaffer et al ( God), 4 th ed., pp 98-99: Stratification may produce a smaller bound on the error of estimation than would be produced by a simple random sample of the same size. This result is particularly true if the measurements within strata are homogeneous. Stratified Sampling by Payment Amounts If we stratify the payment population by payment amounts, will we achieve homogeneous strata, which will lead to a smaller bound on the error of estimation (i.e. tighter lower bounds for total overpayment?) P a y m e n t S tr a ta 1 2 3 4 5 H o m e H e a lth P o p u la ti o n 0 2 0 0 4 0 0 6 0 0 8 0 0 D o lla r s T o ta l N = 9 0 0 0, T o ta l n = 4 5, T o ta l P m t= $ 1 1 4 7 K Stratified Sampling by Payment Amounts NOT NECESSARILY for the overpayments! Imagine each of these strata with a 90% denial rate each would extend from 0 to their largest payment amounts. Moreover, several of these overpayment strata would be left-skewed. P a y m e n t S t r a t a 1 2 3 4 5 H o m e H e a lth P o p u la ti o n 0 2 0 0 4 0 0 6 0 0 8 0 0 D o lla r s T o ta l N = 9 0 0 0, T o ta l n = 4 5, T o ta l P m t= $ 1 1 4 7 K 7 7

Stratified Sampling by Payment Amounts In short, if you stratify by payment amount, YOU RE STRATIFYING THE WRONG POPULATION Stratified Sampling by Payment Amounts A comparison for the Home Health population (#3): at left are the results for a simple random sample of size 45, at right for a stratified sample (5 strata, n i =9 each). Underrecoupment Rate (%) Underrecoupment rate (%) Stratified Sampling by Payment Amounts In the right conditions, careful stratification can help. For the pediatric services population (#1; pmts right skewed, no separation from 0): at left are results for a simple random sample n=45, at right a stratified sample (2 strata, n i =22,23, cutpoint $1145). Underrecoupment Rate (%) Underrecoupment rate (%) 8 8

Stratified Sampling by Payment Amounts Pediatric services population: Average overpayment recovery. At left: simple random sample of n=45, at right: stratified sample (2 strata,n=22,23 cutpt $1145). Ave. Overpmt Recovery (%) Ave. Overpmt Recovery (%) 0 0 Summary Lower Confidence Bounds using the Central Limit Theorem are usually conservative if the overpayment population is right-skewed, liberal if it is left-skewed. Careless stratification can destroy the confidence level of an otherwise perfectly adequate simple random sample, rendering it invalid for use. Careful stratification can sometimes yield a valid procedure with higher overpayment recovery than a simple random sample of the same size the best candidates are payment populations which are right-skewed and not separated from 0. Summary UNLESS YOU ARE DIVINELY OMNISCIENT, TEST YOUR SAMPLING PLAN BEFORE YOU USE IT 9 9

REFERENCES 1. Cochran, W.G. (1977). Sampling Techniques, 3rd edition. New York: John Wiley and Sons. 2. Edwards, Don; Ward-Besser, Gail; Lasecki, Jennifer; Parker, Brenda; Wieduwilt, Kristin; Wu, Fuming; and Moorhead, Philip (2003). The Minimum Sum Method: a Distribution-Free Sampling Procedure for Medicare Fraud Investigations. Health Services and Outcomes Research Methodology 4: 241-263. (published 2005) 3. Scheaffer, R.E., Mendenhall, W., and Ott, L. (1990). Elementary Survey Sampling, 4th edition. Boston: PWS- Kent. Appendix I: samptest R programs for Medicare studies functions included (* = used in this talk): simpic* SRStest* (CLT and Minimum Sum) get.ld* (formerly get.nel) StRStest* StRS.cutpts*; StRS.allocate Dblesamptest project These programs are free send me an email Developed under support of Palmetto GBA and TriCenturion You need to get R: www.r-project.org (also free) Appendix II: The Minimum Sum Method New notation: N = # of population pmts D = unknown # of denied pop. Pmts n = sample size (Simple RS) d = # of denied sample payments 10 10

The Minimum Sum Method Ref: Edwards et al, 2003 (appeared 2005). Step 1: Calculate L D, a 90% lower confidence bound for D by inverting the level-0.10 test for H 0 : D D 0 vs. H A : D > D 0 based on the hypergeometric distribution. The Minimum Sum Method Example: Suppose a simple random sample of n=45 payments from the wheelchair population shows d=40 denied payments. Use get.ld in samptest or do the following directly in R: > N = 250 > n = 45 > d = 40 > Pvalues = 1-phyper( d-1, 0:N, N:0, n) > min((0:n)[ Pvalues >= 0.10 ]) [1] 203 Red line: P-values for H 0 : D D 0 vs. H A : D > D 0 for D 0 = 0,1,2,...,N = 250 given n=45,d=40 P = Pr{d 40} Pvalues 0.0 0.2 0.4 0.6 0.8 1.0 203! 0 50 100 150 200 250 D0 values 11 11

Dramatization for Step 1 n=45,d=40 Suppose I claim that 50% of the population is justifiable 0.00 0.04 0.08 0.12 Hypergeom. pdf for d (N=250,D 0 =125,n=50) Pr{ d 40} = 2.11x10-9 0 2 4 6 8 11 14 17 20 23 26 29 32 35 38 41 44 Dramatization for Step 1 n=45,d=40 Suppose I claim that 40% of the population is justifiable 0.00 0.04 0.08 0.12 Hypergeom. pdf for d (N=250,D 0 =150,n=50) Pr{ d 40} = 3.86x10-6 0 2 4 6 8 11 14 17 20 23 26 29 32 35 38 41 44 Dramatization for Step 1 n=45,d=40 Suppose I claim that 30% of the population is justifiable 0.00 0.04 0.08 0.12 Hypergeom. pdf for d (N=250,D 0 =175,n=50) Pr{ d 40} = 0.0011 0 2 4 6 8 11 14 17 20 23 26 29 32 35 38 41 44 12 12

Dramatization for Step 1 n=45,d=40 Suppose I claim that 20% of the population is justifiable 0.00 0.05 0.10 0.15 Hypergeom. pdf for d (N=250,D 0 =200,n=50) Pr{ d 40} = 0.0698 0 2 4 6 8 11 14 17 20 23 26 29 32 35 38 41 44 Dramatization for Step 1 n=45,d=40 Suppose I claim that 18.8% of the population is justifiable 0.00 0.05 0.10 0.15 Hypergeom. pdf for d (N=250,D 0 =203,n=50) Pr{ d 40} = 0.1025 0 2 4 6 8 11 14 17 20 23 26 29 32 35 38 41 44 The Minimum Sum Method Step 2: Considering the results of Step 1, the MinSum recoupment demand is the sum of all sample overpayments plus the sum of the smallest remaining unsampled payments which would be in error. Example: For the wheelchair population example, there would be at least 203 denied population payments. The minimum sum extrapolation is thus: (40 sample overpayments) + (the smallest 163 non-sampled payments) 13 13

The Minimum Sum Method Operating Characteristics for the Wheelchair payment population, n=45 Underrecoupment Rate (%) Ave. Overpmt Recovery (%) proj. method=ms 0 proj. method=ms The Minimum Sum Method The MinSum method has some good properties: Because Step 1 works for any population size and any sample size, the method is mathematically guaranteed to under-recoup in at least 90% of repeated samples. The minimum sum extrapolation cannot be greater than the total payment amount The Minimum Sum Method The MinSum method recoups very well when payments are all nearly equal OR if the error rate is very high. However, because of the added conservatism of Step 2, for some populations the MinSum method will be too conservative to be useful...e.g. it would not recoup well for our example populations 1 or 3 unless the denial rate 95% or more. 14 14

The Minimum Sum Method Operating Characteristics for the Pediatric Services payment population, n=45 Underrecoupment Rate (%) Ave. Overpmt Recovery (%) proj. method=ms 0 proj. method=ms 15 15