Proc SurveyCorr. Jessica Hampton, CCSU, New Britain, CT
|
|
- Jessica Bates
- 6 years ago
- Views:
Transcription
1 Proc SurveyCorr Jessica Hampton, CCSU, New Britain, CT ABSTRACT This paper provides background information on survey design, with data from the Medical Expenditures Panel Survey (MEPS) as an example. SAS survey procedures in SAS/STAT used to analyze such data sets include PROC SURVEYMEANS, PROC SURVEYFREQ, PROC SURVEYREG, and PROC SURVEYLOGISTIC. There is no PROC SURVEYCORR. One solution is to use a macro approach to extract r-squared values from multiple iterations of PROC SURVEYREG when there are many possible predictor variables to examine. INTRODUCTION The Medical Expenditures Panel Survey (MEPS) is a household survey administered annually by the U.S. Department of Health and Human Services since 1996, with data released for public use. Although it is a nationally representative sample, it is not a random sample of the population and must be weighted to produce reliable population estimates. The panel survey is designed featuring several rounds of interviews and overlapping panels over two full calendar years with multiple components (household, insurance/employer, and medical provider). The household component used in this analysis covers the following topics: demographics, household income, employment, diagnosed health conditions, additional health status issues, medical expenditures and utilization, satisfaction with and access to care, and insurance coverage. This project is based on the household component 2010 full year consolidated data file released in September 2012 (H138.SSP or H138.DAT). MEPS makes the data available both as a SAS transport data file and in ASCII format, and provides sample SAS code to either read the ASCII file into SAS using a DATA step or to convert the.ssp file into a SAS data set using PROC XCOPY. The latter method was used to convert the SAS transport data file into a SAS dataset. MEPS 2010 data consists of data from two overlapping panels: Rounds 3-5 of Panel 14 and Rounds 1-3 of Panel 15 (MEPS, 2012; MEPS-HC Panel Design and Collection Process). SURVEY DESIGN METHODS MEPS survey design is complex, using both stratification and multi-stage clustering techniques; that is, as previously noted, it is not a random sample of the population. Stratification is a survey design technique which is typically done by demographic variables such as age, race, sex, income, etc. The goal is to maximize homogeneity within strata and heterogeneity between strata. Sometimes stratification is used when it is desirable to oversample certain groups under-represented in the general population or with interesting characteristics relevant to what is being studied (for example, MEPS oversamples blacks, Hispanics, and low-income households). Clustering is typically done by geography in order to reduce survey costs, where it is not feasible or cost-effective to do a random sample of the entire population of the U.S., for example. Within-cluster correlation underestimates variance/error, as two families in the same neighborhood are more likely to be similar demographically (in regard to income, for instance). Therefore, we want clusters to be spatially close for cost effectiveness but as heterogeneous within as possible for reasonable variance. Sometimes a multi-stage clustering approach is used, as in MEPS; for example, a sample of counties is taken, then a sample of blocks is taken from that sample of counties, and finally individuals/households are surveyed from the sample of blocks (Carrington et al, 2000). Information about how the survey was designed is then stored in survey design variables, such as those shown in Table 2. These survey design variables are used to obtain population means and estimates and can also be used in regression analysis. The original consolidated dataset contains 32,846 rows unique at the person level, and 31,228 with positive person weights, as well as 1,911 variables. For the full list of original variables, see the MEPS HC-138 codebook (2012). MEPS includes a unique person-level identifier, DUPERSID, along with the following survey design variables: estimation weight variable PERWT10F, sampling strata variable VARSTR, and primary sampling unit (or cluster) VARPSU. These survey design variables are used in SAS survey procedures such as PROC SURVEYMEANS, PROC SURVEYFREQ, and PROC SURVEYREG (SAS Institute, 2008). Survey procedures are designed to take into account survey design methods such as those used by MEPS where some groups are oversampled for both sample size and policy reasons (MEPS, 2012, B. Background, 1.0 household component). Person weights can also be used in some non-survey procedures such as PROC REG and PROC CORR. 1
2 SAS SURVEY PROCEDURES Using SAS PROC SURVEYMEANS and PROC SURVEYFREQ for an overview of the data, we can obtain estimated mean expenditures for the population, as well as estimated population sizes. PROC SURVEYFREQ outputs sample frequency, weighted (estimated population) frequency, percent, and standard error of percent for each variable selected in the tables statement. In the tables statement below, these variables are PRIEU10 (covered by private employer/union group insurance), PRING10 (covered by private non-group insurance), and INSCOV10 (values 1-3 indicate any private insurance coverage in 2010, only public insurance coverage, and uninsured). Survey design variables are referenced in the strata, cluster, and weight statements as follows: PROC SURVEYFREQ DATA=PQI.MEPS_2010; STRATA VARSTR; CLUSTER VARPSU; TABLES PRIEU10 PRING10 INSCOV10; Figure 1. We can use PROC SURVEYMEANS to look at mean expenditures for the overall population and for selected segments of the population. Output variables include sample N, estimated mean, standard error of the mean, and 95% confidence intervals. For example, the following SAS statements output the mean total (TOTEXP10) and out of pocket (TOTSLF10) expenditures for those with private non-group insurance coverage (domain = PRING10) as well as for the total population. Figures 1 and 2 show sample output for PROC SURVEYFREQ and PROC SURVEYMEANS. 2
3 PROC SURVEYMEANS DATA=PQI.MEPS_2010; STRATA VARSTR; CLUSTER VARPSU; DOMAIN PRING10; VAR TOTEXP10 TOTSLF10; Figure 2. PROC SURVEYREG and PROC SURVEYLOGISTIC have some of the same options available for output/diagnostics as do their non-survey counterparts, PROC REG and PROC LOGISTIC. Default output includes fit statistics (R squared, AIC, and Schwartz's criterion), chi-squared tests of the global null hypothesis, degrees of freedom, and coefficient estimates for each parameter along with standard error of coefficient estimates and p-values. PROC SURVEYLOGISTIC also includes odds ratio point estimates and 95% Wald confidence intervals for each input parameter, as does PROC LOGISTIC. The survey procedures are more limited in some ways, though. For example, PROC LOGISTIC can use an option such as stepwise selection to restrict the output to only predictors with significance above a certain level; there is also an option to rank those predictors. These options do not work with PROC SURVEYLOGISTIC, which makes the output more unwieldy with a large number of predictors. Most notably in terms of differences, PROC LOGISTIC automatically outputs a chi-squared test of the residuals for each input variable; however, any analysis of residuals is irrelevant for the survey procedures since assumptions of normality and equal variance are not applicable due to survey design. Tabled residuals are not output at all for the survey procedures, although covariance matrices are available for both as a non-default option. Similarly, influential observations/outliers are also not analyzed due to the use of person weights. As long as we use person weights, we would get the same coefficients with a regular PROC REG as we would with PROC SURVEYREG, but standard error estimates would be different and predictor significance could also vary. Limitations arise from the complex nature of the survey design and having only four available survey procedures in SAS (PROC SURVEYFREQ, PROC SURVEYMEANS, PROC SURVEYREG, and PROC SURVEYLOGISTIC). Other procedures can be used, and in fact several other procedures in SAS allow for using sampling weights, but without accounting fully for survey design, some caveats apply to the results. If person weights are ignored and one tries to generalize sample findings to the entire population, total numbers, percentages, or means are inflated for the groups that are oversampled and underestimated for others. It is therefore highly undesirable to estimate population frequencies or means without using the procedures discussed above. In regression analysis, ignoring person weights leads to biased coefficient estimates. If sampling strata and cluster variables are ignored, means and coefficient estimates are unaffected, but standard error (or population variance) may be underestimated; that is, the reliability of an estimate may be overestimated. For example, when comparing one estimated population 3
4 mean to another, the difference may appear to be statistically significant when it is not (Machlin, S., Yu, W., & Zodet, M., 2005). PROC SURVEYCORR For more in-depth exploratory data analysis, we want to look at correlations of potential predictor variables with target variables such as total expenditures, out of pocket expenditures, and insurance coverage status variables. However, there is no PROC SURVEYCORR in SAS, and the output from PROC CORR can be unwieldy when there are many variables to analyze. We need a way to tabulate the output for large numbers of variables in a readable format; one solution is to use SPSS for unweighted Pearson correlations. Another solution is to use a macro approach to extract r-squared values from multiple iterations of PROC SURVEYREG. First, the original continuous numeric age, income, and expenditure-related variables are analyzed in SAS using PROC CORR with person weights to find correlations with total expenditures: PROC CORR DATA=PQI.MEPS_2010 PLOTS=MATRIX RANK; VAR AGE10X WAGEP10X TTLP10X FAMINC10 POVLEV10 TOTSLF10 ERTEXP10 ERTOT10 RXEXP10 OPTEXP10 OPTOTV10 OBVEXP10 OBTOTV10 IPTEXP10 IPNGTD10; WITH TOTEXP10; Inpatient expenditures and length of stay are driving total expenditures with the strongest Pearson correlations. ER expenditures and Age are less strongly correlated, falling behind office-based visits, rx, and outpatient expenditures in importance. Wage income is still significant at the 95% level, with a negative correlation, but other income variables are not significantly correlated, as shown in the output below: Figure 3. An alternate approach which uses PROC SURVEYREG is described as follows. First, SQL dictionary tables are used to select the number of predictor variables in the dataset and to store it in a macro variable. Data set names are stored in dictionary tables using all caps. The number of predictor variables (nvar) is the number of iterations SAS will use in the DO LOOP later on in the program. PROC SQL; SELECT NVAR INTO :NVAR FROM DICTIONARY.TABLES WHERE LIBNAME='PQI' AND MEMNAME='MEPS_2010'; PROC CONTENTS is then used to obtain a list of the predictor variable names, which are then stored as macro variables using the PROC SQL SELECT INTO statement: PROC CONTENTS DATA=PQI.MEPS_2010 OUT=CONTENTS NOPRINT; PROC SQL NOPRINT; SELECT NAME INTO:VAR1-:VAR76 FROM WORK.CONTENTS; 4
5 Next, an empty table is created to store the output from PROC SURVEYREG as it is inserted one row at a time. PROC SQL; CREATE TABLE SURVEYCORR (PARAMETER CHAR(15),R_SQUARE CHAR(8),R NUM(8),PROBT NUM(8)); PROC SURVEYREG uses the survey design variables in its strata, cluster, and weight statements as do all the other survey procedures. It also has the option to include an ODS OUTPUT statement to store parameter estimates, fit statistics, and other information created when the model runs. The R-square value is extracted from the FitStatistics output using PROC SQL, while p-values and the sign of the estimated regression coefficient are extracted from the ParameterEstimates output. The square root function is used to get the correlation coefficient, combined with the sign of the regression coefficient to tell us the direction of the correlation (negative or positive) with the target variable. The target variable itself is input as a parameter when the macro is called, so the process can be re-run for multiple target variables of interest by changing the parameter, re-compiling, and re-running the macro. %MACRO CORR(TARGET=); PROC SURVEYREG DATA=PQI.MEPS_2010; STRATA VARSTR; CLUSTER VARPSU; MODEL &TARGET=&&VAR&I /SOLUTION; ODS OUTPUT PARAMETERESTIMATES=PARAMETER_EST FITSTATISTICS=FIT; PROC SQL; INSERT INTO SURVEYCORR SELECT PARAMETER,CVALUE1 AS R_SQUARE,SIGN(ESTIMATE)* SQRT(INPUT(CVALUE1,8.)) AS R,PROBT AS PVALUE FROM FIT,PARAMETER_EST WHERE LABEL1 = "R-SQUARE" AND PARAMETER = "&&VAR&I"; %MEND CORR; The process above will run for each predictor variable, inserting a new row in the table each time, once it is called within the following loop: %MACRO LOOP; %DO I=1 %TO &NVAR; %CORR(TARGET=PUBAT10X); %END; %MEND LOOP; After calling the macro, PROC SQL is used to format the results, sort by correlation size, and exclude the survey design variables from the tabulated output. 5
6 PROC SQL; CREATE TABLE PQI.SURVEYCORR AS SELECT PARAMETER,R_SQUARE,R FORMAT BEST6.4,PROBT AS PVALUE FORMAT PVALUE6.4,CASE WHEN PROBT <=0.05 THEN "YES" ELSE "NO" END AS SIGNIFICANT_95 FROM SURVEYCORR WHERE PARAMETER NOT IN ('DUPERSID','VARSTR','VARPSU','PERWT10F') ORDER BY ABS(R) DESC; The tabulated output for the top ten correlated variables is shown below. Table 1. Target Variable TOTEXP10 - r Values with Proc SurveyReg parameter r- square r p-value significance (95% C.L.) TOTEXP IPTEXP TOTEXP_HIGH IPNGTD OBVEXP RXEXP OBTOTV OPTEXP TOTSLF ADAPPT As expected, all expenditure/utilization variables have a significant positive correlation with total expenditures; again, the largest drivers of total expenditures are inpatient expenditures and length of stay, followed by officebased visit and prescription expenditures. CONCLUSIONS SAS Survey Procedures are useful for analyzing data with complex survey design variables, but only four survey procedures exist. PROC CORR can be used with person weights, but when desired, additional survey design variables can be used with PROC SURVEYREG to obtain correlation values. The iterative approach described above using the PROC SURVEYCORR macro provides tabled output for a large number of predictor values when PROC CORR s output is unsatisfactory. It should be noted that all variables used are numeric or have been reformatted as numeric variables prior to running the program; if categorical variables are included in the data set then the CLASS statement will have to be used in PROC SURVEYREG to designate those variables. REFERENCES: Carrington, W. J., Eltinge, J. L., & McCue, K. (2000). An Economist s Primer on Survey Samples. Working Paper no Suitland, MD: Center for Economic Studies, U.S. Bureau of the Census, October Retrieved from ftp://tigerline.census.gov/ces/wp/2000/ces-wp pdf January 15, Machlin, S., & Yu, W. (2005). MEPS Sample Persons In-Scope for Part of the Year: Identification and Analytic 6
7 Considerations. April Agency for Healthcare Research and Quality, Rockville, MD. Retrieved from /survey_comp/hc_survey/hc_sample.shtml Machlin, S., Yu, W., & Zodet, M. (2005). Computing Standard Errors for MEPS Estimates. January Agency for Healthcare Research and Quality, Rockville, Md. Retrieved from Medical Expenditure Panel Survey (MEPS). (2012). MEPS HC-138: 2010 Full Year Consolidated Data File. Rockville, MD: Agency for Healthcare Research and Quality (AHRQ), September Retrieved from September 27, Medical Expenditure Panel Survey (MEPS). (2012). MEPS HC-138: 2010 Full Year Consolidated Data Codebook. Rockville, MD: Agency for Healthcare Research and Quality (AHRQ), August 30, Retrieved from September 27, Medical Expenditure Panel Survey (MEPS). MEPS-HC Panel Design and Collection Process. Agency for Healthcare Research and Quality, Rockville, Md. Retrieved from Medical Expenditure Panel Survey (MEPS). Data Use Agreement. Agency for Healthcare Research and Quality, Rockville, Md. Retrieved from SAS Institute Inc.(2008). SAS/STAT 9.2 User s Guide. Chapter 14: Introduction to Survey Sampling and Analysis Procedures. Pp Cary, NC: SAS Institute Inc. Retrieved from on January 15, ACKNOWLEDGMENTS SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the author at: Jessica Hampton Work Phone: (860) Jessica.Hampton@gmail.com Web: * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * 7
Medical Expenditure Panel Survey. Household Component Statistical Estimation Issues. Copyright 2007, Steven R. Machlin,
Medical Expenditure Panel Survey Household Component Statistical Estimation Issues Overview Annual person-level estimates Overlapping panels Estimation variables Weights Variance Pooling multiple years
More informationNew SAS Procedures for Analysis of Sample Survey Data
New SAS Procedures for Analysis of Sample Survey Data Anthony An and Donna Watts, SAS Institute Inc, Cary, NC Abstract Researchers use sample surveys to obtain information on a wide variety of issues Many
More informationEstimates of Medical Expenditures from the Medical Expenditure Panel Survey: Gains in Precision from Combining Consecutive Years of Data
Estimates of Medical Expenditures from the Medical Expenditure Panel Survey: Gains in Precision from Combining Consecutive Years of Data Steven R. Machlin, Marc W. Zodet, and J. Alice Nixon, Center for
More informationis the bandwidth and controls the level of smoothing of the estimator, n is the sample size and
Paper PH100 Relationship between Total charges and Reimbursements in Outpatient Visits Using SAS GLIMMIX Chakib Battioui, University of Louisville, Louisville, KY ABSTRACT The purpose of this paper is
More informationEffects of Poststratification and Raking Adjustments on Precision of MEPS Estimates Sadeq R. Chowdhury
Effects of Poststratification and Raking Adjustments on Precision of MEPS Estimates Sadeq R. Chowdhury The opinions expressed in this presentation are those of the authors and do not reflect the official
More informationAn Evaluation of Nonresponse Adjustment Cells for the Household Component of the Medical Expenditure Panel Survey (MEPS) 1
An Evaluation of Nonresponse Adjustment Cells for the Household Component of the Medical Expenditure Panel Survey (MEPS) 1 David Kashihara, Trena M. Ezzati-Rice, Lap-Ming Wun, Robert Baskin Agency for
More information:R195.1 :A doi: /j.issn
1, 2,3* (1., 300070; 2., 100850; 3., 100029 * :,E -mail:lphu812@sina.com), ; ; ; ; :R195.1 :A doi:10.11886 /j.issn.1007-3256.2017.05.004 1, 2,3* (1.,,, 300070, ; 2.,, 100850, ; 3., 100029, * :, - : 812@.
More informationStatistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron
Statistical Models of Stocks and Bonds Zachary D Easterling: Department of Economics The University of Akron Abstract One of the key ideas in monetary economics is that the prices of investments tend to
More informationRECOMMENDATIONS AND PRACTICAL EXAMPLES FOR USING WEIGHTING
EXECUTIVE SUMMARY RECOMMENDATIONS AND PRACTICAL EXAMPLES FOR USING WEIGHTING February 2008 Sandra PLAZA Eric GRAF Correspondence to: Panel Suisse de Ménages, FORS, Université de Lausanne, Bâtiment Vidy,
More informationPredictive Modeling Cross Selling of Home Loans to Credit Card Customers
PAKDD COMPETITION 2007 Predictive Modeling Cross Selling of Home Loans to Credit Card Customers Hualin Wang 1 Amy Yu 1 Kaixia Zhang 1 800 Tech Center Drive Gahanna, Ohio 43230, USA April 11, 2007 1 Outline
More informationCalculating the Probabilities of Member Engagement
Calculating the Probabilities of Member Engagement by Larry J. Seibert, Ph.D. Binary logistic regression is a regression technique that is used to calculate the probability of an outcome when there are
More informationEXAMPLE 6: WORKING WITH WEIGHTS AND COMPLEX SURVEY DESIGN
EXAMPLE 6: WORKING WITH WEIGHTS AND COMPLEX SURVEY DESIGN EXAMPLE RESEARCH QUESTION(S): How does the average pay vary across different countries, sex and ethnic groups in the UK? How does remittance behaviour
More informationHealth Status, Health Insurance, and Health Services Utilization: 2001
Health Status, Health Insurance, and Health Services Utilization: 2001 Household Economic Studies Issued February 2006 P70-106 This report presents health service utilization rates by economic and demographic
More informationCHAPTER 6 DATA ANALYSIS AND INTERPRETATION
208 CHAPTER 6 DATA ANALYSIS AND INTERPRETATION Sr. No. Content Page No. 6.1 Introduction 212 6.2 Reliability and Normality of Data 212 6.3 Descriptive Analysis 213 6.4 Cross Tabulation 218 6.5 Chi Square
More informationTechnical Documentation for Household Demographics Projection
Technical Documentation for Household Demographics Projection REMI Household Forecast is a tool to complement the PI+ demographic model by providing comprehensive forecasts of a variety of household characteristics.
More informationEXST7015: Multiple Regression from Snedecor & Cochran (1967) RAW DATA LISTING
Multiple (Linear) Regression Introductory example Page 1 1 options ps=256 ls=132 nocenter nodate nonumber; 3 DATA ONE; 4 TITLE1 ''; 5 INPUT X1 X2 X3 Y; 6 **** LABEL Y ='Plant available phosphorus' 7 X1='Inorganic
More informationOut-of-Pocket Health Care Spending And The Rural Underinsured. December 2005
Out-of-Pocket Health Care Spending And The Rural Underinsured December 2005 Out-of-Pocket Health Care Spending And The Rural Underinsured December 2005 Maine Rural Health Research Center Working Paper
More informationbook 2014/5/6 15:21 page 261 #285
book 2014/5/6 15:21 page 261 #285 Chapter 10 Simulation Simulations provide a powerful way to answer questions and explore properties of statistical estimators and procedures. In this chapter, we will
More informationAnomalies under Jackknife Variance Estimation Incorporating Rao-Shao Adjustment in the Medical Expenditure Panel Survey - Insurance Component 1
Anomalies under Jackknife Variance Estimation Incorporating Rao-Shao Adjustment in the Medical Expenditure Panel Survey - Insurance Component 1 Robert M. Baskin 1, Matthew S. Thompson 2 1 Agency for Healthcare
More informationTo be two or not be two, that is a LOGISTIC question
MWSUG 2016 - Paper AA18 To be two or not be two, that is a LOGISTIC question Robert G. Downer, Grand Valley State University, Allendale, MI ABSTRACT A binary response is very common in logistic regression
More informationData Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing C. Olivia Rud, President, OptiMine Consulting, West Chester, PA ABSTRACT Data Mining is a new term for the
More informationTable of Contents. New to the Second Edition... Chapter 1: Introduction : Social Research...
iii Table of Contents Preface... xiii Purpose... xiii Outline of Chapters... xiv New to the Second Edition... xvii Acknowledgements... xviii Chapter 1: Introduction... 1 1.1: Social Research... 1 Introduction...
More informationSAS/STAT 14.3 User s Guide The FREQ Procedure
SAS/STAT 14.3 User s Guide The FREQ Procedure This document is an individual chapter from SAS/STAT 14.3 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute Inc.
More informationLap-Ming Wun and Trena M. Ezzati-Rice and Robert Baskin and Janet Greenblatt and Marc Zodet and Frank Potter and Nuria Diaz-Tena and Mourad Touzani
Using Propensity Scores to Adjust Weights to Compensate for Dwelling Unit Level Nonresponse in the Medical Expenditure Panel Survey Lap-Ming Wun and Trena M. Ezzati-Rice and Robert Baskin and Janet Greenblatt
More informationIncorporating a Finite Population Correction into the Variance Estimation of a National Business Survey
Incorporating a Finite Population Correction into the Variance Estimation of a National Business Survey Sadeq Chowdhury, AHRQ David Kashihara, AHRQ Matthew Thompson, U.S. Census Bureau FCSM 2018 Disclaimer
More informationSTRATEGIES FOR THE ANALYSIS OF IMPUTED DATA IN A SAMPLE SURVEY
STRATEGIES FOR THE ANALYSIS OF IMPUTED DATA IN A SAMPLE SURVEY James M. Lepkowski. Sharon A. Stehouwer. and J. Richard Landis The University of Mic6igan The National Medical Care Utilization and Expenditure
More informationSAS/STAT 14.2 User s Guide. The FREQ Procedure
SAS/STAT 14.2 User s Guide The FREQ Procedure This document is an individual chapter from SAS/STAT 14.2 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute Inc.
More informationThe SURVEYLOGISTIC Procedure (Book Excerpt)
SAS/STAT 9.22 User s Guide The SURVEYLOGISTIC Procedure (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.22 User s Guide. The correct bibliographic citation for the
More informationMaximum Likelihood Estimation
Maximum Likelihood Estimation EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #6 EPSY 905: Maximum Likelihood In This Lecture The basics of maximum likelihood estimation Ø The engine that
More informationSAS Simple Linear Regression Example
SAS Simple Linear Regression Example This handout gives examples of how to use SAS to generate a simple linear regression plot, check the correlation between two variables, fit a simple linear regression
More information2019 Colorado Health Access Survey (CHAS) Survey Administrator Request for Proposal (RFP) April 2018
Page 1 of 10 2019 Colorado Health Access Survey (CHAS) Survey Administrator Request for Proposal (RFP) April 2018 The Colorado Health Access Survey the CHAS is the premier source of information on health
More informationSTATISTICAL BRIEF #172
Medical Expenditure Panel Survey STATISTICAL BRIEF #172 Agency for Healthcare Research and Quality June 27 Health Insurance Status of Children in America, First Half 1996 26: Estimates for the U.S. Civilian
More informationXLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING
XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING INTRODUCTION XLSTAT makes accessible to anyone a powerful, complete and user-friendly data analysis and statistical solution. Accessibility to
More informationDetermining Probability Estimates From Logistic Regression Results Vartanian: SW 541
Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541 In determining logistic regression results, you will generally be given the odds ratio in the SPSS or SAS output. However,
More informationUsing New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS)
Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds INTRODUCTION Multicategory Logit
More informationRising risk: Maximizing the odds for care management
Rising risk: Maximizing the odds for care management Ksenia Whittal, FSA, MAAA Abigail Caldwell, FSA, MAAA Most healthcare organizations already know which members are currently costly, but what about
More informationApplying Logistics Regression to Forecast Annual Organizational Retirements
SESUG Paper SD-137-2017 Applying Logistics Regression to Forecast Annual Organizational Retirements Alan Dunham, Greybeard Solutions, LLC ABSTRACT This paper briefly discusses the labor economics research
More informationHEALTH AND RETIREMENT STUDY Prescription Drug Study Final Release V1.0, November 2008 (Sensitive Health Data) Data Description and Usage
HEALTH AND RETIREMENT STUDY 2005 Prescription Drug Study Final Release V1.0, (Sensitive Health Data) Data Description and Usage To the researcher: This data set is intended for exclusive use by you under
More informationUniversity of Texas at Dallas School of Management. Investment Management Spring Estimation of Systematic and Factor Risks (Due April 1)
University of Texas at Dallas School of Management Finance 6310 Professor Day Investment Management Spring 2008 Estimation of Systematic and Factor Risks (Due April 1) This assignment requires you to perform
More informationPoststratification with PROC SURVEYMEANS
Poststratification with PROC SURVEYMEANS Overview When a population can be partitioned into homogeneous groups and there is significant heterogeneity between those groups, stratified sampling can substantially
More informationProceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001
Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001 A COMPARISON OF TWO METHODS TO ADJUST WEIGHTS FOR NON-RESPONSE: PROPENSITY MODELING AND WEIGHTING CLASS ADJUSTMENTS
More informationSAS/STAT 14.1 User s Guide. The LATTICE Procedure
SAS/STAT 14.1 User s Guide The LATTICE Procedure This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute
More informationPresented at the 2012 SCEA/ISPA Joint Annual Conference and Training Workshop -
Applying the Pareto Principle to Distribution Assignment in Cost Risk and Uncertainty Analysis James Glenn, Computer Sciences Corporation Christian Smart, Missile Defense Agency Hetal Patel, Missile Defense
More informationSurvey Sampling, Fall, 2006, Columbia University Homework assignments (2 Sept 2006)
Survey Sampling, Fall, 2006, Columbia University Homework assignments (2 Sept 2006) Assignment 1, due lecture 3 at the beginning of class 1. Lohr 1.1 2. Lohr 1.2 3. Lohr 1.3 4. Download data from the CBS
More informationBuilding Better Credit Scores using Reject Inference and SAS
ABSTRACT Building Better Credit Scores using Reject Inference and SAS Steve Fleming, Clarity Services Inc. Although acquisition credit scoring models are used to screen all applicants, the data available
More informationRelationship Between Household Nonresponse, Demographics, and Unemployment Rate in the Current Population Survey.
Relationship Between Household Nonresponse, Demographics, and Unemployment Rate in the Current Population Survey. John Dixon, Bureau of Labor Statistics, Room 4915, 2 Massachusetts Ave., NE, Washington,
More information1 PEW RESEARCH CENTER
1 Methodology This report is drawn from a survey conducted as part of the American Trends Panel (ATP), a nationally representative panel of randomly selected U.S. adults living in households recruited
More informationApplications of Data Analysis (EC969) Simonetta Longhi and Alita Nandi (ISER) Contact: slonghi and
Applications of Data Analysis (EC969) Simonetta Longhi and Alita Nandi (ISER) Contact: slonghi and anandi; @essex.ac.uk Week 2 Lecture 1: Sampling (I) Constructing Sampling distributions and estimating
More informationMATH 143: Introduction to Probability and Statistics Worksheet for Tues., Dec. 7: What procedure?
MATH 143: Introduction to Probability and Statistics Worksheet for Tues., Dec. 7: What procedure? For each numbered problem, identify (if possible) the following: (a) the variable(s) and variable type(s)
More informationLecture 21: Logit Models for Multinomial Responses Continued
Lecture 21: Logit Models for Multinomial Responses Continued Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University
More informationMarital Disruption and the Risk of Loosing Health Insurance Coverage. Extended Abstract. James B. Kirby. Agency for Healthcare Research and Quality
Marital Disruption and the Risk of Loosing Health Insurance Coverage Extended Abstract James B. Kirby Agency for Healthcare Research and Quality jkirby@ahrq.gov Health insurance coverage in the United
More informationMultinomial and ordinal logistic regression using PROC LOGISTIC Peter L. Flom Peter Flom Consulting, LLC
ABSTRACT Multinomial and ordinal logistic regression using PROC LOGISTIC Peter L. Flom Peter Flom Consulting, LLC Logistic regression may be useful when we are trying to model a categorical dependent variable
More informationPer Capita Housing Starts: Forecasting and the Effects of Interest Rate
1 David I. Goodman The University of Idaho Economics 351 Professor Ismail H. Genc March 13th, 2003 Per Capita Housing Starts: Forecasting and the Effects of Interest Rate Abstract This study examines the
More informationDesign Issues for a Longitudinal Employer Health Insurance Survey to Facilitate Analysis of Policy Changes
Design Issues for a Longitudinal Employer Health Insurance Survey to Facilitate Analysis of Policy Changes Steven Machlin David Kashihara 2018 FCSM Conference 1 Disclaimer The views expressed in this presentation
More informationList of figures. I General information 1
List of figures Preface xix xxi I General information 1 1 Introduction 7 1.1 What is this book about?........................ 7 1.2 Which models are considered?...................... 8 1.3 Whom is this
More informationIntro to GLM Day 2: GLM and Maximum Likelihood
Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the
More informationThe SAS System 11:03 Monday, November 11,
The SAS System 11:3 Monday, November 11, 213 1 The CONTENTS Procedure Data Set Name BIO.AUTO_PREMIUMS Observations 5 Member Type DATA Variables 3 Engine V9 Indexes Created Monday, November 11, 213 11:4:19
More informationLectures 04, 05, 06: Sample weights
Lectures 04, 05, 06: Sample weights Ernesto F. L. Amaral September 12 19, 2017 Advanced Methods of Social Research (SOCI 420) Sources: Stata Help & General Social Survey Codebook. Using sample weights
More informationproc genmod; model malform/total = alcohol / dist=bin link=identity obstats; title 'Table 2.7'; title2 'Identity Link';
BIOS 6244 Analysis of Categorical Data Assignment 5 s 1. Consider Exercise 4.4, p. 98. (i) Write the SAS code, including the DATA step, to fit the linear probability model and the logit model to the data
More informationMultiple Regression and Logistic Regression II. Dajiang 525 Apr
Multiple Regression and Logistic Regression II Dajiang Liu @PHS 525 Apr-19-2016 Materials from Last Time Multiple regression model: Include multiple predictors in the model = + + + + How to interpret the
More informationLogit Models for Binary Data
Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response
More informationThe FREQ Procedure (Book Excerpt)
SAS/STAT 9.22 User s Guide The FREQ Procedure (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.22 User s Guide. The correct bibliographic citation for the complete
More informationThe Influence of Bureau Scores, Customized Scores and Judgmental Review on the Bank Underwriting
The Influence of Bureau Scores, Customized Scores and Judgmental Review on the Bank Underwriting Decision-Making Process Authors M. Cary Collins, Keith D. Harvey and Peter J. Nigro Abstract In recent years
More informationEconometrics is. The estimation of relationships suggested by economic theory
Econometrics is Econometrics is The estimation of relationships suggested by economic theory Econometrics is The estimation of relationships suggested by economic theory The application of mathematical
More informationINFLUENCE OF CAPITAL BUDGETING TECHNIQUESON THE FINANCIAL PERFORMANCE OF COMPANIES LISTED AT THE RWANDA STOCK EXCHANGE
INFLUENCE OF CAPITAL BUDGETING TECHNIQUESON THE FINANCIAL PERFORMANCE OF COMPANIES LISTED AT THE RWANDA STOCK EXCHANGE Liliane Gasana Jomo Kenyatta University of Agriculture and Technology, Rwanda Dr.
More informationStrategies for Assessing Health Plan Performance on Chronic Diseases: Selecting Performance Indicators and Applying Health-Based Risk Adjustment
Strategies for Assessing Health Plan Performance on Chronic Diseases: Selecting Performance Indicators and Applying Health-Based Risk Adjustment Appendix I Performance Results Overview In this section,
More informationQuantitative Methods
THE ASSOCIATION OF BUSINESS EXECUTIVES DIPLOMA PART 2 QM Quantitative Methods afternoon 26 May 2004 1 Time allowed: 3 hours. 2 Answer any FOUR questions. 3 All questions carry 25 marks. Marks for subdivisions
More informationKeywords Akiake Information criterion, Automobile, Bonus-Malus, Exponential family, Linear regression, Residuals, Scaled deviance. I.
Application of the Generalized Linear Models in Actuarial Framework BY MURWAN H. M. A. SIDDIG School of Mathematics, Faculty of Engineering Physical Science, The University of Manchester, Oxford Road,
More informationHEALTH AND RETIREMENT STUDY Prescription Drug Study Final Release V1.0, March 2011 (Sensitive Health Data) Data Description and Usage
HEALTH AND RETIREMENT STUDY 2007 Prescription Drug Study Final Release V1.0, (Sensitive Health Data) Data Description and Usage To the researcher: This data set is intended for exclusive use by you under
More informationANALYSIS USING STRUCTURAL EQUATION MODELING (SEM)
CHAPTER V ANALYSIS USING STRUCTURAL EQUATION MODELING (SEM) 5.1 Nature of SEM The model grows out of and serves purposes similar to multiple regression, but in a more powerful way which takes into account
More informationLloyds TSB. Derek Hull, John Adam & Alastair Jones
Forecasting Bad Debt by ARIMA Models with Multiple Transfer Functions using a Selection Process for many Candidate Variables Lloyds TSB Derek Hull, John Adam & Alastair Jones INTRODUCTION: No statistical
More informationSubject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018
` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.
More informationWesVar Analysis Example Replication C7
WesVar Analysis Example Replication C7 WesVar 5.1 is primarily a point and click application and though a text file of commands can be used in the WesVar (V5.1) batch processing environment, all examples
More informationAudit Sampling: Steering in the Right Direction
Audit Sampling: Steering in the Right Direction Jason McGlamery Director Audit Sampling Ryan, LLC Dallas, TX Jason.McGlamery@ryan.com Brad Tomlinson Senior Manager (non-attorney professional) Zaino Hall
More informationWhat is the Mortgage Shopping Experience of Today s Homebuyer? Lessons from Recent Fannie Mae Acquisitions
What is the Mortgage Shopping Experience of Today s Homebuyer? Lessons from Recent Fannie Mae Acquisitions Qiang Cai and Sarah Shahdad, Economic & Strategic Research Published 4/13/2015 Prospective homebuyers
More informationContents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii)
Contents (ix) Contents Preface... (vii) CHAPTER 1 An Overview of Statistical Applications 1.1 Introduction... 1 1. Probability Functions and Statistics... 1..1 Discrete versus Continuous Functions... 1..
More informationPredicting the Probability of Being a Smoker: A Probit Analysis
Predicting the Probability of Being a Smoker: A Probit Analysis Department of Economics Florida State University Tallahassee, FL 32306-2180 Abstract This paper explains the probability of being a smoker,
More informationIntroduction to Survey Weights for National Adult Tobacco Survey. Sean Hu, MD., MS., DrPH. Office on Smoking and Health
Introduction to Survey Weights for 2009-2010 National Adult Tobacco Survey Sean Hu, MD., MS., DrPH Office on Smoking and Health Presented to Webinar January 18, 2012 National Center for Chronic Disease
More informationPASS Sample Size Software
Chapter 850 Introduction Cox proportional hazards regression models the relationship between the hazard function λ( t X ) time and k covariates using the following formula λ log λ ( t X ) ( t) 0 = β1 X1
More informationThe coverage of young children in demographic surveys
Statistical Journal of the IAOS 33 (2017) 321 333 321 DOI 10.3233/SJI-170376 IOS Press The coverage of young children in demographic surveys Eric B. Jensen and Howard R. Hogan U.S. Census Bureau, Washington,
More informationQuantitative Techniques Term 2
Quantitative Techniques Term 2 Laboratory 7 2 March 2006 Overview The objective of this lab is to: Estimate a cost function for a panel of firms; Calculate returns to scale; Introduce the command cluster
More informationWeb Appendix for: Medicare Part D: Are Insurers Gaming the Low Income Subsidy Design? Francesco Decarolis (Boston University)
Web Appendix for: Medicare Part D: Are Insurers Gaming the Low Income Subsidy Design? 1) Data Francesco Decarolis (Boston University) The dataset was assembled from data made publicly available by CMS
More information101: MICRO ECONOMIC ANALYSIS
101: MICRO ECONOMIC ANALYSIS Unit I: Consumer Behaviour: Theory of consumer Behaviour, Theory of Demand, Recent Development of Demand Theory, Producer Behaviour: Theory of Production, Theory of Cost, Production
More informationIntroduction to Current Population Survey (CPS) Hsueh-Sheng Wu Center for Family and Demographic Research November 14, 2016
Introduction to Current Population Survey (CPS) Hsueh-Sheng Wu Center for Family and Demographic Research November 14, 2016 1 Introduction Outline What is special about the CPS? Navigate CPS website Survey
More informationCHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES
Examples: Monte Carlo Simulation Studies CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Monte Carlo simulation studies are often used for methodological investigations of the performance of statistical
More informationBooth School of Business, University of Chicago Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay. Solutions to Midterm
Booth School of Business, University of Chicago Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay Solutions to Midterm Problem A: (30 pts) Answer briefly the following questions. Each question has
More informationMeasuring the Cost of Employment: Work-Related Expenses in the Supplemental Poverty Measure. No. 279 SEHSD No
THE SURVEY OF INCOME AND PROGRAM PARTICIPATION Measuring the Cost of Employment: Work-Related in the Supplemental Poverty Measure Revised November 13, 2017 No. 279 SEHSD No. 2017-43 Abinash Mohanty Ashley
More informationHealth Information Technology and Management
Health Information Technology and Management CHAPTER 11 Health Statistics, Research, and Quality Improvement Pretest (True/False) Children s asthma care is an example of one of the core measure sets for
More informationThe FREQ Procedure. Table of Sex by Gym Sex(Sex) Gym(Gym) No Yes Total Male Female Total
Jenn Selensky gathered data from students in an introduction to psychology course. The data are weights, sex/gender, and whether or not the student worked-out in the gym. Here is the output from a 2 x
More informationTopic 30: Random Effects Modeling
Topic 30: Random Effects Modeling Outline One-way random effects model Data Model Inference Data for one-way random effects model Y, the response variable Factor with levels i = 1 to r Y ij is the j th
More informationThe American Panel Survey. Study Description and Technical Report Public Release 1 November 2013
The American Panel Survey Study Description and Technical Report Public Release 1 November 2013 Contents 1. Introduction 2. Basic Design: Address-Based Sampling 3. Stratification 4. Mailing Size 5. Design
More informationPRMIA Exam 8002 PRM Certification - Exam II: Mathematical Foundations of Risk Measurement Version: 6.0 [ Total Questions: 132 ]
s@lm@n PRMIA Exam 8002 PRM Certification - Exam II: Mathematical Foundations of Risk Measurement Version: 6.0 [ Total Questions: 132 ] Question No : 1 A 2-step binomial tree is used to value an American
More informationAppendix A: Detailed Methodology and Statistical Methods
Appendix A: Detailed Methodology and Statistical Methods I. Detailed Methodology Research Design AARP s 2003 multicultural project focuses on volunteerism and charitable giving. One broad goal of the project
More informationCHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA
Examples: Mixture Modeling With Longitudinal Data CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA Mixture modeling refers to modeling with categorical latent variables that represent subpopulations
More informationILO-IPEC Interactive Sampling Tools No. 7
ILO-IPEC Interactive Sampling Tools No. 7 Version 1 December 2014 International Programme on the Elimination of Child Labour (IPEC) Fundamental Principles and Rights at Work (FPRW) Branch Governance and
More informationWe will use an example which will result in a paired t test regarding the labor force participation rate for women in the 60 s and 70 s.
Now let s review methods for one quantitative variable. We will use an example which will result in a paired t test regarding the labor force participation rate for women in the 60 s and 70 s. 17 The labor
More informationCHAPTER V. PRESENTATION OF RESULTS
CHAPTER V. PRESENTATION OF RESULTS This study is designed to develop a conceptual model that describes the relationship between personal financial wellness and worker job productivity. A part of the model
More informationPhd Program in Transportation. Transport Demand Modeling. Session 11
Phd Program in Transportation Transport Demand Modeling João de Abreu e Silva Session 11 Binary and Ordered Choice Models Phd in Transportation / Transport Demand Modelling 1/26 Heterocedasticity Homoscedasticity
More informationLecture 3: Factor models in modern portfolio choice
Lecture 3: Factor models in modern portfolio choice Prof. Massimo Guidolin Portfolio Management Spring 2016 Overview The inputs of portfolio problems Using the single index model Multi-index models Portfolio
More informationWesVar uses repeated replication variance estimation methods exclusively and as a result does not offer the Taylor Series Linearization approach.
CHAPTER 9 ANALYSIS EXAMPLES REPLICATION WesVar 4.3 GENERAL NOTES ABOUT ANALYSIS EXAMPLES REPLICATION These examples are intended to provide guidance on how to use the commands/procedures for analysis of
More information