Applications of Data Analysis (EC969) Simonetta Longhi and Alita Nandi (ISER) Contact: slonghi and
|
|
- Philip Newton
- 5 years ago
- Views:
Transcription
1 Applications of Data Analysis (EC969) Simonetta Longhi and Alita Nandi (ISER) Contact: slonghi and Week 2 Lecture 1: Sampling (I) Constructing Sampling distributions and estimating their characteristics Example: estimating mean number of children among women (II) Computing unbiased estimates with correct standard errors Example: estimating mean pay/wage in UK Input datasets: Week2Lecure1.dta Do file in Week2Lecure1_DoFile.pdf New commands ci, test, tabstat, aweight, pweight, ciplot, svyset, svydes, svy: mean, estat, 1 W e e k 2 L e c t u r e 1
2 (I) Constructing Sampling distributions and estimating their characteristics Example: estimating mean number of children among women Suppose there is a population of 6 women with children. The distribution of children is shown in Table 1. We are interested in estimating the average number of children of these women and so would like to draw a sample and estimate this number from that sample. In this exercise, we will compare the characteristics of the sampling distribution of the sample mean children of two different sampling plans or sample designs. In the first sampling plan we will draw a sample of 2 women from the population of 6 women. In the second one, we will draw a sample of 3 women. Table 1: Distribution of children across a population of 6 women Women Total no. of children Total 30 Population Mean = 30/6 = 5 Sampling Plan 1 Table 2: All possible size 2 samples and corresponding sample averages Sample No. Women in the sample Mean number of children per woman in the sample (d) 1 1, , , , , , , , , , , , , , , W e e k 2 L e c t u r e 1
3 Table 3: Sampling distribution of d d Frequency (f) Total 15 di fi i 1 Expected value of d, d 9 = 5 f 9 i 1 ( di i 1 Standard error of d = 9 9 i 1 i d ) f i 2 f i =1.21 Bias = 0 MSE=Bias + Sampling Variance = Bias + (Standard Error^2) = 0 + (1.21^2) = 1.47 RMSE=Square root of MSE = 1.21 Sampling Plan 2 Table 4: All possible size 3 samples and corresponding sample averages Fill in the blanks Sample No. Women in the sample Mean number of children per woman in the sample (d) 1 1,2, ,2, ,2, ,2, ,3, ,3, ,3, ,4, ,4, W e e k 2 L e c t u r e 1
4 Compute the sampling distribution of d Calculate the expected value and standard deviation of the sampling distribution of d Calculate the bias, standard error, MSE and RMSE of d How does this sampling plan 2 compare with sampling plan 1 in terms of bias and standard error of d? (II) Computing unbiased estimates with correct standard errors Example: estimating mean pay/wage in UK Input dataset: Week2Lecure1.dta Do file in Week2Lecure1_DoFile.pdf We have provided a dataset Week2Lecure1.dta from the 15 th wave (corresponds to year 2005) of the British Household Panel Survey (BHPS) see previous section on how to create it. Before doing any analysis and estimation using survey data ask yourself these questions: What is the population of interest, i.e., the population that you want the results of your analysis based on the survey sample to generalize to? What is the survey design? Specifically is it a clustered, stratified sample? And is there unequal selection probabilities? Is there nonresponse (the answer is almost always yes!)? Are weights provided which account for unequal selection probabilities and/or non-response? The BHPS The original BHPS is a clustered, stratified sample but with an almost equal probability sampling design. This was designed to be representative of Great Britain in 1990 south of the Caledonian Canal. So, the original sample included households in England, Wales and Scotland and they all had (almost) equal selection probabilities. In later waves over-samples or boosts from Wales, Scotland, and Northern Ireland were added to the original sample, i.e., proportion of households who are from Wales, Scotland, and Northern Ireland in the BHPS sample is much higher than in the UK population. In other words, sample units from the four countries had unequal selection probabilities. While the Scottish and Welsh boosts 4 W e e k 2 L e c t u r e 1
5 had a similar clustered, stratified design as the original sample, the Northern Ireland boost was a simple random sample. Variables are provided which help identify the primary sampling unit and strata from which the sample household was drawn. Weights are provided which account for the unequal selection probability and non-response. In this example we will only look at cross-sectional respondent weights, i.e., weights for respondents that account for unequal selection probability, non-response (at the household and individual level) and post-stratification. In other words, weighted estimates using cross-section respondent weight for wave 15 will provide unbiased estimates of the corresponding parameter for the UK population in The BHPS is conducted primarily by face-to-face interviews. Some respondents refuse to participate but opt for a telephone interview or a proxy interview (someone from their household answers on their behalf). Note respondent cross-sectional weight is zero for proxy and telephone respondents and missing for dead and out-of-scopes. Inspecting the data Open Week2Lecure1.dta First inspect the different variables, particularly find out which variables represent wages (wage), response (ivfio), weights (xrwtuk1), stratification (strata) and clustering (psu) variables. Let s see what these variables look like: What are the variable names, value labels, their mean, standard deviation, frequency distribution? Are there any missing values? What are the weights for proxy and telephone respondents? Examine weights and the sample identifier, memorig Look at distributions of cross-sectional respondent weight and see how they vary by sample. Notes: tabstat xrwtuk1, stat(mean min max sd) by(memorig) longstub nototal tabstat displays summary statistics for a series of numeric variables (specified right after this) in one table, possibly broken down by a second variable. stat() requires the list of statistics to be displayed for each of the variables specified by() requires variable by which the table is to be broken down. longstub specifies that the left stub of the table be made wider so that it can include names of the statistics or variables in addition to the categories of by(). nototal does not report overall statistics; use with by(). Why are the weights, on average, higher for the Original sample than for the country boosts? 5 W e e k 2 L e c t u r e 1
6 Examine the variable of interest, wage Why is this missing for some people? Hint: Use ivfio, employed Do weights matter: are weighted estimates likely to be different from unweighted estimates? Yes, if there is variation across observations in weights. This variation in weights is more prominent with the introduction of extension samples (Wales, etc) as there are large differences in selection probabilities. Stata allows for four types of weights: pweight, aweight, fweight and iweight. pweight & aweight are the ones that we will be using. See Stata Manual for more explanation. PWEIGHT are probability or sampling weights, i.e., it is the inverse of the probability that the observation is included in the sample. As the BHPS weights are probability weights the Stata weight command that we should ALWAYS use is pweight. However, Stata does not allow pweight for certain commands such as summarize, it only allows aweight ( The estimated mean and standard deviation using pweight & aweight are the same, but not the standard error (& confidence interval). If Stata will not allow pweight and you have to use aweight be careful about its interpretation. aweight represents analytical weights which are inversely proportional to the variance of the observation. An example of analytical weights: If the observations are averages, then the number of observations used to compute the averages would be analytical weights. (See [U] Weighted estimation in the Stata Manual) To compute mean and standard deviation use summarize, table, tabstat summ wage To compute the mean, standard error and confidence intervals use ci ci wage Compute unweighted mean, standard errors and confidence interval for wage. To compute weighted mean, standard errors, confidence interval and standard deviation for wage but without correcting for clustering and stratification, there are two options: First you could use summarize and ci with the option for weights. But for these commands Stata only allows you to use aweight option which means the weights will be treated as analytical weights. This will produce the weighted mean estimate as when using pweight which treats the weights as probability weights but will produce different estimates of 6 W e e k 2 L e c t u r e 1
7 standard errors. As the BHPS weights are not analytical weights but probability weights this is not the best choice (see note above and Stata Help). summ wage [aweight = xrwtuk1] ci wage [aweight = xrwtuk1] Compute weighted (but without correcting for clustering and stratification) mean, standard errors, confidence interval and standard deviation for wage using summarize and ci. Stata does not allow pweight with summarize and ci and if you do use Stata will give an error message and the program will stop running. The part in this box is optional To see what happens if you use pweight instead, type summ wage [pweight = xrwtuk1] To see what happens if you use pweight instead but to prevent Stata from stopping, type capture noisily summ wage [pweight = xrwtuk1] capture tells Stata to not show the error message and to continue running the program inspite of the error noisily tells Stata to show the output Together capture noisily asks Stata to show the error message but to continue running the program The second option is using Stata s svy suite of commands. Here you tell Stata what the survey design is and then Stata computes the correct estimates taking the survey design into account. The other advantage of using this option is that for this Stata does treat the weights as probability weights. As the BHPS weights are probability weights (as will be the weights in almost all such micro-panel surveys) this will produce the correct estimates of standard errors. To do this you need to first inform Stata about the survey design. For this part of the exercise, we will ignore the clustering and stratification aspect of the survey and just focus on the weights. svyset [pweight = xrwtuk1] And then to compute the weighted means, standard error and confidence intervals svy: mean wage If you want to produce estimates of the sample standard deviation estat sd svyset instructs Stata that the dataset is a complex survey data. The different features of this complex survey dataset are given by the commands pweight, strata and psu. All these options need not be specified. Once we have told Stata what the survey design is then whatever commands we type in the format svy: command, Stata will carry out the command after taking into account the structure of the dataset. 7 W e e k 2 L e c t u r e 1
8 estat displays scalar- and matrix-valued statistics after estimation; it complements predict, which calculates variables after estimation. Exactly what statistics estat can calculate depends on the previous estimation command. Compute weighted (but without correcting for clustering and stratification) mean, standard errors, confidence interval and standard deviation for wage, treating the BHPS weights as probability weights. What will happen if you use aweight instead of pweight in the above command? Next we now want to take into account the complete survey design, i.e., we want to compute estimates of mean, standard errors and confidence interval of wage that corrects for clustering and stratification in addition to unequal selection probability and non-response. To do this we will again need to use the svy suite of commands but this time specify the strata and psu variables in addition to the weight variable. To do this you need to first clear Stata s memory of any previous svy instructions svyset, clear Next inform Stata about the survey design variables svyset [pweight = xrwtuk1], psu(psu) strata(strata) Then compute the weighted means etc. as before svy: mean wage Compute weighted mean, standard errors, confidence interval and standard deviation for wage after correcting for clustering and stratification and treating the BHPS weights as probability weights. This returns mean income, but does not return standard error or confidence interval: Find out why? Hint: Use svydes which describes the structure of the survey data. svydes You will find that there is a stratum (-8) with just 1 unit (psu) within it. Which sample is that? tab memorig if strata==-8 NB The values of psu and strata for all cases in Northern Ireland is "-8" because the NI sample is a simple random sample, i.e., no clustering or stratification. Stata cannot compute correct standard errors if a part of the sample has a different sampling design. So, exclude the Northern Ireland sample from the analysis svy: mean wage if memorig ~= 7 8 W e e k 2 L e c t u r e 1
9 Computing estimates of mean wages in the different countries of UK Next, we would like to compare the average hourly wage of the four countries of UK and for that we would need to compute weighted mean wage for the different countries and test the difference. Look at distributions of cross-section respondent weights and see how these vary by the four countries There are two ways to compute the estimates for sub-samples: either use subpop or over commands. Using if statement to estimate weighted means of sub populations will result in incorrect standard error estimations (see Stata Survey Data Reference Manual Release 11, pp 53). To use subpop command option svy: mean var, subpop (varname) This asks Stata to compute estimates for the ONE single subpopulation identified by varname. The subpopulation is defined by the observations for which varname!=0. Typically, varname =1 defines the subpopulation, and varname =0 indicates observations not belonging to the subpopulation. For observations whose subpopulation status is uncertain, varname should be set to a missing value; such observations are dropped from the estimation sample. Alternatively, an if condition can be used with varname svy: mean var, subpop (if varname=x) To use over command option: svy: mean var, over (varname) This asks Stata to compute the estimates for ALL categorical values of the categorical variable varname. You can use more than one variable in varname, separated by space Now you have the tools to compute estimates of mean wage in each country. Remember to avoid complications because of Northern Ireland and missing region/country variables eliminate those cases from the sample. drop if memorig == 7 drop if country==. Compute the unweighted mean wage for each country Estimate mean wage for each country separately using the if statement option. What happens? Estimate mean wage for each country by using the svypop command option Estimate mean wage for the four countries by using the over command option Are the estimates obtained by the two methods (subpop and over) exactly the same? subpop and over can be used to compute estimates for multiple subpopulations. Here are a couple of tasks to illustrate that: Estimate mean wage for men and women in the four countries by using the over command option 9 W e e k 2 L e c t u r e 1
10 Estimate mean wage for men and women in the four countries by using the subpop command option You can also test the differences in these estimated means across the four countries. Note the test command should follow immediately after the estimation. svy: mean wage, over(country) test [wage]england = [wage]scotland = [wage]wales Test if men earn higher wages than women in England, Scotland and Wales. Design Effect (deff): Is the ratio of the variance of a statistic based on the actual sample design to the variance of this statistic had the sample design been a SRS (simple random sample) of the same size. In other words, it indicates by how much the variance is inflated due to the sampling design. deft is the square root of deff. Compute design effects, design factor and effective sample size: quietly svy: mean wage estat effects, deff deft [Optional] Plot the weighted mean and the confidence interval. ciplot shows means and confidence intervals. Means are shown by point symbols and intervals by capped bars. ci is used for the calculations. If it is not already installed then use findit to find it and then install it ciplot paygu, by(country) saving(graph1, replace) ciplot paygu [aw=xrwtuk1], by(country) saving(graph2, replace) 10 W e e k 2 L e c t u r e 1
11 [Optional] To estimate mean wage in all four countries of UK, i.e., including Northern Ireland We had asked you to drop Northern Ireland from the dataset because it has a different sample design and Stata cannot handle data with mixed sample design one part being clustered and stratified and the other a simple random sample. It has been suggested that Stata can be tricked into believing that the SRS part of the sample is also a clustered stratified sample simply by using the unique household identifier (whid) as the psu variable (wpsu). In other words, each such psu has just one household observation. Remember the current dataset does not include the Northern Ireland sub-sample as we have dropped it earlier.so, open Week2Lecure1.dta again and this time instead of dropping the Northern Ireland sub-sample, replace the value of wpsu for the Northern Ireland sub-sample with the whid. Then follow the earlier steps to estimate mean wage in all four countries. [Optional] How to create dataset Week2Lecure1.dta We have provided these datasets, but if you wanted to create these yourself, here is a guide to do that. See Week2_dataprep_DoFile.pdf which contains the corresponding do file for this. 1. Get information on strata and primary sampling unit from mhhsamp.dta (you can use any wave from onwards wave 12/wave L) 2. Get information about the household that was asked in the household questionnaire from mhhresp.dta: number of children of different ages in the household (use the same wave that you have used for step 1) 3. Get information about the individual that was asked in the individual questionnaire from mindresp.dta: individual level respondent weight, interview outcome, education, marital status, number of own children in the household, wages, work hours, employment status, health problem, region of residence (use the same wave that you have used for step 1) 4. Get some fixed info from xwavedat.dta: race, sex, sample origin and date of birth In addition to these variables always remember to include the appropriate unique identifiers in each of the datasets pid, hid & pno. As a general rule we have dropped the wave prefix for each of these files. This is because it makes it easier to use the same program for a different wave data. This is not necessary, just convenient. 5. Merge all these datasets sequentially. Finally, keep only those observations present in all datasets. Points to remember about merging: Datasets being merged should be sorted on the variable or variables that are being used to merge these Check _merge to see how many cases were available in both, how many in only one _merge is created by Stata at every merge and so if you don t drop _merge or rename it to something else after each merge, Stata will produce an error message saying _merge already exists and will not allow you to perform merge until you have dropped _merge or renamed it. 6. Create the following variables 11 W e e k 2 L e c t u r e 1
12 (i) Usual hourly wage rate (does NOT include overtime pay) using PAYGU (Usual gross pay per month: current job) and JBHRS (hours expected to work in a normal week). Drop cases that have negative values for PAYGU or JBHRS. Also drop cases for which JBHRS=0 which means no expected working hours (ii) Create a 0-1 dummy variable that takes on 1 if currently employed using JBHAS (did paid work last week) and JBOFF (no work last week but has job) (iii) Create a variable for number of young children (defined as being < 5years old) in the household (we are not differentiating between other s and own children) using NCH02 NCH34. There were some discrepancies in the data some households had more number of young children than NCHILD, the number of own children in the household. So, restrict the number of young children to the number of own children in the household (iv) Create 0-1 dummies for country of residence using REGION. Also create a categorical variable to represent the country of residence. (v) Create a 0-1 dummy variable that takes on the value of 1 if the person s ethnicity is white and 0 otherwise 7. Sample restriction for Week2Lecure1.dta (i) Restrict the sample to those who are not self-employed (ii) Drop those for whom if EMPLOYED is missing (iii) Drop when wage is missing even though the person is employed and interviewed face-toface 12 W e e k 2 L e c t u r e 1
Lectures 04, 05, 06: Sample weights
Lectures 04, 05, 06: Sample weights Ernesto F. L. Amaral September 12 19, 2017 Advanced Methods of Social Research (SOCI 420) Sources: Stata Help & General Social Survey Codebook. Using sample weights
More informationEXAMPLE 6: WORKING WITH WEIGHTS AND COMPLEX SURVEY DESIGN
EXAMPLE 6: WORKING WITH WEIGHTS AND COMPLEX SURVEY DESIGN EXAMPLE RESEARCH QUESTION(S): How does the average pay vary across different countries, sex and ethnic groups in the UK? How does remittance behaviour
More informationSurvey Sampling, Fall, 2006, Columbia University Homework assignments (2 Sept 2006)
Survey Sampling, Fall, 2006, Columbia University Homework assignments (2 Sept 2006) Assignment 1, due lecture 3 at the beginning of class 1. Lohr 1.1 2. Lohr 1.2 3. Lohr 1.3 4. Download data from the CBS
More informationDescription Quick start Menu Syntax Options Remarks and examples Acknowledgment Also see
Title stata.com collapse Make dataset of summary statistics Description Quick start Menu Syntax Options Remarks and examples Acknowledgment Also see Description collapse converts the dataset in memory
More informationMedical Expenditure Panel Survey. Household Component Statistical Estimation Issues. Copyright 2007, Steven R. Machlin,
Medical Expenditure Panel Survey Household Component Statistical Estimation Issues Overview Annual person-level estimates Overlapping panels Estimation variables Weights Variance Pooling multiple years
More informationThe British Household Panel Survey (BHPS) and its successor, Understanding Society (US)
The British Household Panel Survey (BHPS) and its successor, Understanding Society (US) Professor Karl Taylor Department of Economics 25 th March 2014 OUTLINE 1. Background BHPS 2. Sampling 3. Waves and
More informationChapter 6 Part 3 October 21, Bootstrapping
Chapter 6 Part 3 October 21, 2008 Bootstrapping From the internet: The bootstrap involves repeated re-estimation of a parameter using random samples with replacement from the original data. Because the
More informationNational Statistics Opinions and Lifestyle Survey Technical Report January 2013
UK Data Archive Study Number 7388 Opinions and Lifestyle Survey, Well-Being Module, January, February, March and April, 2013 National Statistics Opinions and Lifestyle Survey Technical Report January 2013
More informationCLS Cohort. Studies. Centre for Longitudinal. Studies CLS. Nonresponse Weight Adjustments Using Multiple Imputation for the UK Millennium Cohort Study
CLS CLS Cohort Studies Working Paper 2010/6 Centre for Longitudinal Studies Nonresponse Weight Adjustments Using Multiple Imputation for the UK Millennium Cohort Study John W. McDonald Sosthenes C. Ketende
More informationFinal Quality report for the Swedish EU-SILC. The longitudinal component
1(33) Final Quality report for the Swedish EU-SILC The 2005 2006-2007-2008 longitudinal component Statistics Sweden December 2010-12-27 2(33) Contents 1. Common Longitudinal European Union indicators based
More informationFinal Quality report for the Swedish EU-SILC. The longitudinal component. (Version 2)
1(32) Final Quality report for the Swedish EU-SILC The 2004 2005 2006-2007 longitudinal component (Version 2) Statistics Sweden December 2009 2(32) Contents 1. Common Longitudinal European Union indicators
More informationConsumer Research: overdrafts and APR. Technical Report. December 2018
Consumer Research: overdrafts and APR. Technical Report December 2018 TECHNICAL REPORT 1. Introduction This technical report relates to research on overdrafts and APR published in the technical annex to
More informationReminders. Quiz today - please bring a calculator I ll post the next HW by Saturday (last HW!)
Reminders Quiz today - please bring a calculator I ll post the next HW by Saturday (last HW!) 1 Warm Up Chat with your neighbor. What is the Central Limit Theorem? Why do we care about it? What s the (long)
More informationChanges to work and income around state pension age
Changes to work and income around state pension age Analysis of the English Longitudinal Study of Ageing Authors: Jenny Chanfreau, Matt Barnes and Carl Cullinane Date: December 2013 Prepared for: Age UK
More informationRandom Group Variance Adjustments When Hot Deck Imputation Is Used to Compensate for Nonresponse 1
Random Group Variance Adjustments When Hot Deck Imputation Is Used to Compensate for Nonresponse 1 Richard A Moore, Jr., U.S. Census Bureau, Washington, DC 20233 Abstract The 2002 Survey of Business Owners
More information9. Methodology Shaun Scholes National Centre for Social Research Kate Cox National Centre for Social Research
9. Methodology Shaun Scholes National Centre for Social Research Kate Cox National Centre for Social Research Carli Lessof National Centre for Social Research This chapter presents a summary of the survey
More informationPART B Details of ICT collections
PART B Details of ICT collections Name of collection: Household Use of Information and Communication Technology 2006 Survey Nature of collection If possible, use the classification of collection types
More informationThe Serbia 2013 Enterprise Surveys Data Set
I. Introduction The Serbia 2013 Enterprise Surveys Data Set 1. This document provides additional information on the data collected in Serbia between January 2013 and August 2013 as part of the fifth round
More informationShifting our focus. We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?
Probability Introduction Shifting our focus We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why? What is Probability? Probability is used
More informationThe Armenia 2013 Enterprise Surveys Data Set
I. Introduction The Armenia 2013 Enterprise Surveys Data Set 1. This document provides additional information on the data collected in Armenia between November 2012 and July 2013 as part of the fifth round
More informationThe Macedonia 2013 Enterprise Surveys Data Set
I. Introduction The Macedonia 2013 Enterprise Surveys Data Set 1. This document provides additional information on the data collected in Macedonia between November 2012 and May 2013 as part of the fifth
More informationNew SAS Procedures for Analysis of Sample Survey Data
New SAS Procedures for Analysis of Sample Survey Data Anthony An and Donna Watts, SAS Institute Inc, Cary, NC Abstract Researchers use sample surveys to obtain information on a wide variety of issues Many
More informationSouth Africa - National Income Dynamics Study , Wave 2
Microdata Library - National Income Dynamics Study 2010-2011, Wave 2 Southern Africa Labour and Development Research Unit - University of Cape Town Report generated on: August 31, 2016 Visit our data catalog
More informationNBER WORKING PAPER SERIES MAKING SENSE OF THE LABOR MARKET HEIGHT PREMIUM: EVIDENCE FROM THE BRITISH HOUSEHOLD PANEL SURVEY
NBER WORKING PAPER SERIES MAKING SENSE OF THE LABOR MARKET HEIGHT PREMIUM: EVIDENCE FROM THE BRITISH HOUSEHOLD PANEL SURVEY Anne Case Christina Paxson Mahnaz Islam Working Paper 14007 http://www.nber.org/papers/w14007
More informationYour State Pension Choice Pension now or extra pension later: A guide to State Pension Deferral
Your State Pension Choice Pension now or extra pension later: A guide to State Pension Deferral Introduction About this guide When you are coming up to State Pension age, you need to decide when you want
More informationNational Statistics Opinions and Lifestyle Survey Technical Report. February 2013
UK Data Archive Study Number 7555 - Opinions and Lifestyle Survey, Transport Issues Module, February - April 2013 National Statistics Opinions and Lifestyle Survey Technical Report 1. The sample February
More informationCYPRUS FINAL QUALITY REPORT
CYPRUS FINAL QUALITY REPORT STATISTICS ON INCOME AND LIVING CONDITIONS 2008 CONTENTS Page PREFACE... 6 1. COMMON LONGITUDINAL EUROPEAN UNION INDICATORS 1.1. Common longitudinal EU indicators based on the
More informationYou created this PDF from an application that is not licensed to print to novapdf printer (http://www.novapdf.com)
Monday October 3 10:11:57 2011 Page 1 (R) / / / / / / / / / / / / Statistics/Data Analysis Education Box and save these files in a local folder. name:
More informationGamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
More informationCommunity Survey on ICT usage in households and by individuals 2010 Metadata / Quality report
HH -p1 EU T H I S P L A C E C A N B E U S E D T O P L A C E T H E N S I N A M E A N D L O G O Community Survey on ICT usage in households and by 2010 Metadata / Quality report Please read this first!!!
More informationDE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS STA 105-M (BASIC STATISTICS) READ THE INSTRUCTIONS BELOW VERY CAREFULLY.
DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 003 MOCK EXAMINATIONS STA 105-M (BASIC STATISTICS) Time: hours READ THE INSTRUCTIONS BELOW VERY CAREFULLY. Do not open this question paper until you have been told
More informationThe following content is provided under a Creative Commons license. Your support
MITOCW Recitation 6 The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free. To make
More informationGuide to managing your workforce
For scheme administrators Guide to managing your workforce For schemes using contractual enrolment Workplace pensions CONTENTS Introduction... 4 View workforce... 4 Searching and filtering... 4 Identifying
More informationIntermediate Quality Report for the Swedish EU-SILC, The 2007 cross-sectional component
STATISTISKA CENTRALBYRÅN 1(22) Intermediate Quality Report for the Swedish EU-SILC, The 2007 cross-sectional component Statistics Sweden December 2008 STATISTISKA CENTRALBYRÅN 2(22) Contents page 1. Common
More informationCYPRUS FINAL QUALITY REPORT
CYPRUS FINAL QUALITY REPORT STATISTICS ON INCOME AND LIVING CONDITIONS 2010 CONTENTS Page PREFACE... 6 1. COMMON LONGITUDINAL EUROPEAN UNION INDICATORS 1.1. Common longitudinal EU indicators based on the
More informationCYPRUS FINAL QUALITY REPORT
CYPRUS FINAL QUALITY REPORT STATISTICS ON INCOME AND LIVING CONDITIONS 2009 CONTENTS Page PREFACE... 6 1. COMMON LONGITUDINAL EUROPEAN UNION INDICATORS 1.1. Common longitudinal EU indicators based on the
More informationStratification Analysis. Summarizing an Output Variable by a Grouping Input Variable
Stratification Analysis Summarizing an Output Variable by a Grouping Input Variable 1 Topics I. Stratification Analysis II. Stratification Analysis Tools Stratification Tables Bar Graphs / Pie Charts III.
More informationClaim form for Winter Fuel Payment for past winters 1998/99, 1999/00, 2000/01, 2001/02, 2002/03 and 2003/04
Winter Fuel Payment If you get in touch with us, please tell us this reference number Our phone number is Code Number Ext If you have a textphone, you can call on Code Number Date Claim form for Winter
More informationUniversal Credit: further information for families
GOV.UK Search Home Universal Credit: further information for families Department for Work & Pensions Guidance Universal Credit: further information for families Updated 30 November 2017 Contents 1. What
More informationLab#3 Probability
36-220 Lab#3 Probability Week of September 19, 2005 Please write your name below, tear off this front page and give it to a teaching assistant as you leave the lab. It will be a record of your participation
More informationIncorporating a Finite Population Correction into the Variance Estimation of a National Business Survey
Incorporating a Finite Population Correction into the Variance Estimation of a National Business Survey Sadeq Chowdhury, AHRQ David Kashihara, AHRQ Matthew Thompson, U.S. Census Bureau FCSM 2018 Disclaimer
More informationEcon 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213.
Econ 371 Problem Set #4 Answer Sheet 6.2 This question asks you to use the results from column (1) in the table on page 213. a. The first part of this question asks whether workers with college degrees
More informationLIS Self Teaching Package Stata version. Part II: Gender, employment, and wages
LIS Self Teaching Package 2018 Stata version Part II: Gender, employment, and wages 0 1 Part II Gender, employment, and wages Overall Plan and Structure of the Exercise The exercises in Part I demonstrated
More informationSampling & Confidence Intervals
Sampling & Confidence Intervals Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 24/10/2017 Principles of Sampling Often, it is not practical to measure every subject in a population.
More informationChapter 23: accuracy of averages
Chapter 23: accuracy of averages Context: previous chapters...................................................... 2 Context: previous chapters...................................................... 3 Context:
More informationSupplementary materials
Supplementary materials Appendix 1. Additional estimation results Table S.1. Two sided t tests for differences in means between women who have engaged in transactional sex with UN personnel and those who
More informationInternet use and attitudes Metrics Bulletin
Internet use and attitudes 2014 Metrics Bulletin Research Document Publication date: 7 August 2014 Contents Section Page 1 Introduction 3 2 Internet reach: 2014 9 3 Internet breadth of use 10 4 Internet
More informationConover Test of Variances (Simulation)
Chapter 561 Conover Test of Variances (Simulation) Introduction This procedure analyzes the power and significance level of the Conover homogeneity test. This test is used to test whether two or more population
More informationPlanning Sample Size for Randomized Evaluations Esther Duflo J-PAL
Planning Sample Size for Randomized Evaluations Esther Duflo J-PAL povertyactionlab.org Planning Sample Size for Randomized Evaluations General question: How large does the sample need to be to credibly
More informationThe use of linked administrative data to tackle non response and attrition in longitudinal studies
The use of linked administrative data to tackle non response and attrition in longitudinal studies Andrew Ledger & James Halse Department for Children, Schools & Families (UK) Andrew.Ledger@dcsf.gsi.gov.uk
More informationWILL QUESTIONNAIRE. Section 1: Your details. Client 1 Client 2. Your title: Your full name (include middle names): Have you ever used any other names?
WILL QUESTIONNAIRE This is our standard Will Questionnaire. It s long because it has to cover everybody. You don't need to fill in all the sections though - just the ones that apply to your circumstances.
More informationWhen to fill in form APSS227
Protection of your lifetime allowance fixed protection Notes to help you fill in form APSS227 Application for protection of your lifetime allowance fixed protection. When to fill in form APSS227 You should
More informationThe coverage of young children in demographic surveys
Statistical Journal of the IAOS 33 (2017) 321 333 321 DOI 10.3233/SJI-170376 IOS Press The coverage of young children in demographic surveys Eric B. Jensen and Howard R. Hogan U.S. Census Bureau, Washington,
More informationEXAMPLE 4: DISTRIBUTING HOUSEHOLD-LEVEL INFORMATION TO RESPONDENTS
EXAMPLE 4: DISTRIBUTING HOUSEHOLD-LEVEL INFORMATION TO RESPONDENTS EXAMPLE RESEARCH QUESTION(S): What are the flows into and out of poverty from one year to the next? What explains the probability that
More informationUser guide for employers not using our system for assessment
For scheme administrators User guide for employers not using our system for assessment Workplace pensions CONTENTS Welcome... 6 Getting started... 8 The dashboard... 9 Import data... 10 How to import a
More informationThe American Panel Survey. Study Description and Technical Report Public Release 1 November 2013
The American Panel Survey Study Description and Technical Report Public Release 1 November 2013 Contents 1. Introduction 2. Basic Design: Address-Based Sampling 3. Stratification 4. Mailing Size 5. Design
More informationNew Construction Program Participating Owner Survey
1/30/2009 New Construction Program Participating Owner Survey The purpose of the Participating Owner Survey was to gather information from building owners who participated in the NYSERDA New Construction
More informationRussia Longitudinal Monitoring Survey (RLMS) Sample Attrition, Replenishment, and Weighting in Rounds V-VII
Russia Longitudinal Monitoring Survey (RLMS) Sample Attrition, Replenishment, and Weighting in Rounds V-VII Steven G. Heeringa, Director Survey Design and Analysis Unit Institute for Social Research, University
More informationDoes Growth make us Happier? A New Look at the Easterlin Paradox
Does Growth make us Happier? A New Look at the Easterlin Paradox Felix FitzRoy School of Economics and Finance University of St Andrews St Andrews, KY16 8QX, UK Michael Nolan* Centre for Economic Policy
More informationAllison notes there are two conditions for using fixed effects methods.
Panel Data 3: Conditional Logit/ Fixed Effects Logit Models Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 2, 2017 These notes borrow very heavily, sometimes
More informationVALIDATING MORTALITY ASCERTAINMENT IN THE HEALTH AND RETIREMENT STUDY. November 3, David R. Weir Survey Research Center University of Michigan
VALIDATING MORTALITY ASCERTAINMENT IN THE HEALTH AND RETIREMENT STUDY November 3, 2016 David R. Weir Survey Research Center University of Michigan This research is supported by the National Institute on
More informationSTRATEGIES FOR THE ANALYSIS OF IMPUTED DATA IN A SAMPLE SURVEY
STRATEGIES FOR THE ANALYSIS OF IMPUTED DATA IN A SAMPLE SURVEY James M. Lepkowski. Sharon A. Stehouwer. and J. Richard Landis The University of Mic6igan The National Medical Care Utilization and Expenditure
More informationFinal Quality Report for the Swedish EU-SILC
Final Quality Report for the Swedish EU-SILC The 2006 2007 2008 2009 longitudinal component Statistics Sweden 2011-12-22 1 Table of contents 1. Common longitudinal European Union indicators... 3 2. Accuracy...
More informationCSC Advanced Scientific Programming, Spring Descriptive Statistics
CSC 223 - Advanced Scientific Programming, Spring 2018 Descriptive Statistics Overview Statistics is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions.
More informationProf. Thistleton MAT 505 Introduction to Probability Lecture 3
Sections from Text and MIT Video Lecture: Sections 2.1 through 2.5 http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-041-probabilistic-systemsanalysis-and-applied-probability-fall-2010/video-lectures/lecture-1-probability-models-and-axioms/
More informationmpi A Stata command for the Alkire-Foster methodology Christoph Jindra 9 November 2015 OPHI Seminar Series - Michaelmas 2015
mpi A Stata command for the Alkire-Foster methodology Christoph Jindra OPHI Seminar Series - Michaelmas 2015 9 November 2015 Christoph Jindra (Research Officer) 9 November 2015 1 / 30 Outline What and
More informationA Stratified Sampling Plan for Billing Accuracy in Healthcare Systems
A Stratified Sampling Plan for Billing Accuracy in Healthcare Systems Jirachai Buddhakulsomsiri Parthana Parthanadee Swatantra Kachhal Department of Industrial and Manufacturing Systems Engineering The
More informationSPSS I: Menu Basics Practice Exercises Target Software & Version: SPSS V Last Updated on January 17, 2007 Created by Jennifer Ortman
SPSS I: Menu Basics Practice Exercises Target Software & Version: SPSS V. 14.02 Last Updated on January 17, 2007 Created by Jennifer Ortman PRACTICE EXERCISES Exercise A Obtain descriptive statistics (mean,
More informationSummarising Data. Summarising Data. Examples of Types of Data. Types of Data
Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017
More informationHomework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a
Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Announcements: There are some office hour changes for Nov 5, 8, 9 on website Week 5 quiz begins after class today and ends at
More informationNational Child Development Study and 1970 British Cohort Study Technical Report:
National Child Development Study and 1970 British Cohort Study Technical Report: Changes in the NCDS and BCS70 Populations and Samples over Time 1st Edition October 2004 By Ian Plewis, Lisa Calderwood,
More informationAP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE
AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,
More informationSTEP Survey Weighting Procedures Summary (Based on The World Bank Weight Requirement) Lao PDR. October 11, 2013
October 11, 2013 STEP Survey Weighting Procedures Summary (Based on The World Bank Weight Requirement) Lao PDR October 11, 2013 2 October 11, 2013 Table of Contents 1 Survey Design Overview... 1 2 Data
More informationThe Ethiopia 2011 Enterprise Surveys Data Set
The Ethiopia 2011 Enterprise Surveys Data Set I. Introduction 1. This document provides additional information on the data collected in Ethiopia between July 2011 and July 2012 as part of the Africa Enterprise
More informationAMS7: WEEK 4. CLASS 3
AMS7: WEEK 4. CLASS 3 Sampling distributions and estimators. Central Limit Theorem Normal Approximation to the Binomial Distribution Friday April 24th, 2015 Sampling distributions and estimators REMEMBER:
More information1) The Effect of Recent Tax Changes on Taxable Income
1) The Effect of Recent Tax Changes on Taxable Income In the most recent issue of the Journal of Policy Analysis and Management, Bradley Heim published a paper called The Effect of Recent Tax Changes on
More informationYour Name (Please print) Did you agree to take the optional portion of the final exam Yes No. Directions
Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No (Your online answer will be used to verify your response.) Directions There are two parts to the final exam.
More informationtm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}
PS 4 Monday August 16 01:00:42 2010 Page 1 tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} log: C:\web\PS4log.smcl log type: smcl opened on:
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow
More informationIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics 17.871 Types of Variables ~Nominal (Quantitative) Nominal (Qualitative) categorical Ordinal Interval or ratio Describing data Moment Non-mean based measure Center
More informationHomework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82
Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections
More informationFrom the date of your certificate you will be legally recognised in your acquired gender.
Benefits and Pensions note How getting a full Gender Recognition Certificate may affect National Insurance, pensions and other social security benefits for applicants and their spouses or civil partners.
More informationhhid marst age1 age2 sex1 sex2
The first step in the process is to select a topic that you will work on. There are 7 primary topics, and 5 secondary dimensions that you may choose from. Each team may have up to 4 people. All of the
More informationGuide for Investigators. The American Panel Survey (TAPS)
Draft (to be updated in January) Guide for Investigators The American Panel Survey (TAPS) Weidenbaum Center Washington University Steven S. Smith, Director About The American Panel Survey (TAPS) TAPS is
More informationAppendix for Incidence, Salience and Spillovers: The Direct and Indirect Effects of Tax Credits on Wages
Appendix for Incidence, Salience and Spillovers: The Direct and Indirect Effects of Tax Credits on Wages Table A.1. Parameters of Family Credit and WFTC ( per week) April 1999 (FC) October 1999 (WFTC)
More informationSIMULATION OF ELECTRICITY MARKETS
SIMULATION OF ELECTRICITY MARKETS MONTE CARLO METHODS Lectures 15-18 in EG2050 System Planning Mikael Amelin 1 COURSE OBJECTIVES To pass the course, the students should show that they are able to - apply
More informationName: 1. Use the data from the following table to answer the questions that follow: (10 points)
Economics 345 Mid-Term Exam October 8, 2003 Name: Directions: You have the full period (7:20-10:00) to do this exam, though I suspect it won t take that long for most students. You may consult any materials,
More informationTwo-Sample T-Test for Non-Inferiority
Chapter 198 Two-Sample T-Test for Non-Inferiority Introduction This procedure provides reports for making inference about the non-inferiority of a treatment mean compared to a control mean from data taken
More informationWe will use an example which will result in a paired t test regarding the labor force participation rate for women in the 60 s and 70 s.
Now let s review methods for one quantitative variable. We will use an example which will result in a paired t test regarding the labor force participation rate for women in the 60 s and 70 s. 17 The labor
More informationDesign of a Multi-Stage Stratified Sample for Poverty and Welfare Monitoring with Multiple Objectives
Policy Research Working Paper 7989 WPS7989 Design of a Multi-Stage Stratified Sample for Poverty and Welfare Monitoring with Multiple Objectives A Bangladesh Case Study Faizuddin Ahmed Dipankar Roy Monica
More informationB003 Applied Economics Exercises
B003 Applied Economics Exercises Spring 2005 Starred exercises are to be completed and handed in in advance of classes. Unstarred exercises are to be completed during classes. Ex 3.1 Ex 4.1 Ex 5.1 to be
More informationChapter 6 Part 6. Confidence Intervals chi square distribution binomial distribution
Chapter 6 Part 6 Confidence Intervals chi square distribution binomial distribution October 8, 008 Brief review of what we covered last time. In order to get a confidence interval for the population mean
More informationFinal Exam - section 1. Thursday, December hours, 30 minutes
Econometrics, ECON312 San Francisco State University Michael Bar Fall 2013 Final Exam - section 1 Thursday, December 19 1 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.
More informationProblem max points points scored Total 120. Do all 6 problems.
Solutions to (modified) practice exam 4 Statistics 224 Practice exam 4 FINAL Your Name Friday 12/21/07 Professor Michael Iltis (Lecture 2) Discussion section (circle yours) : section: 321 (3:30 pm M) 322
More informationQUALITY REPORT ESSPROS CORE SYSTEM MEMBER STATE: REFERENCE YEAR: 2015
QUALITY REPORT ESSPROS CORE SYSTEM (according to Commission Regulation (EC) No 1322/2007 of 12 November 2007, published in OJ L294, 13.11.2007, p.5) MEMBER STATE: UK REFERENCE YEAR: 2015 REPORT ISSUED:
More informationAnomalies under Jackknife Variance Estimation Incorporating Rao-Shao Adjustment in the Medical Expenditure Panel Survey - Insurance Component 1
Anomalies under Jackknife Variance Estimation Incorporating Rao-Shao Adjustment in the Medical Expenditure Panel Survey - Insurance Component 1 Robert M. Baskin 1, Matthew S. Thompson 2 1 Agency for Healthcare
More informationFINAL EXAM STAT 5201 Spring 2011
FINAL EXAM STAT 5201 Spring 2011 Due in Room 313 Ford Hall Friday May 13 at 3:45 PM Please deliver to the office staff of the School of Statistics READ BEFORE STARTING You must work alone and may discuss
More informationLabor Force Participation and the Wage Gap Detailed Notes and Code Econometrics 113 Spring 2014
Labor Force Participation and the Wage Gap Detailed Notes and Code Econometrics 113 Spring 2014 In class, Lecture 11, we used a new dataset to examine labor force participation and wages across groups.
More informationIncreases to minimum contributions
Increases to minimum contributions We re here to help you along the way For people, not profit Here s what we can offer to help you with the increases to minimum contributions This guide to tell you what
More informationAdvancing Methodology on Measuring Asset Ownership from a Gender Perspective
Advancing Methodology on Measuring Asset Ownership from a Gender Perspective Technical Meeting on the UN Methodological Guidelines on the Production of Statistics on Asset Ownership from a Gender Perspective
More information