Tests of significance for between year comparisons

Similar documents
Lecture 2 INTERVAL ESTIMATION II

housing Assessment of the impact of Warm Front on decent homes for private sector vulnerable households Housing Research Summary Introduction

THE DEMAND FOR SOCIAL RENTED HOUSING A REVIEW OF DATA SOURCES AND SUPPORTING CASE STUDY ANALYSIS

GPCO 453: Quantitative Methods I Review: Hypothesis Testing

Backtesting. Introduction

ESTIMATING THE COSTS AND BENEFITS OF GREATER COMPLIANCE WITH PROPERTY CONDITION STANDARDS

Analytical Finance 1 Seminar Monte-Carlo application for Value-at-Risk on a portfolio of Options, Futures and Equities

Market Conditions Review

12/1/2017. Chapter. Copyright 2009 by The McGraw-Hill Companies, Inc. 8B-2

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

INTRODUCTION TO PORTFOLIO ANALYSIS. Dimensions of Portfolio Performance

Supplemental Appendix for Cost Pass-Through to Higher Ethanol Blends at the Pump: Evidence from Minnesota Gas Station Data.

Statistical Tables Compiled by Alan J. Terry

Backtesting. Introduction

EUROPOP2003: Draft Population Projections - FRANCE -

7.1 Comparing Two Population Means: Independent Sampling

UK Government Construction Pipeline August 2015

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1

8.1 Estimation of the Mean and Proportion

MS-E2114 Investment Science Exercise 10/2016, Solutions

Pensioners Incomes Series: An analysis of trends in Pensioner Incomes: 1994/ /16

Chapter 1 A Brief History of Risk and Return

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

1 Introduction 1. 3 Confidence interval for proportion p 6

Does my beta look big in this?

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

THE GROWING IMPORTANCE OF GROUP STRUCTURE ARRANGEMENTS IN THE HOUSING ASSOCIATION SECTOR IN ENGLAND

Secondary analysis of lowincome working households in the private rented sector

ANNEXES. to the. COMMISSION DELEGATED REGULATION (EU) No.../...

ERM (Part 1) Measurement and Modeling of Depedencies in Economic Capital. PAK Study Manual

Brussels, XXX [ ](2016) XXX draft. ANNEXES 1 to 4 ANNEXES

Statistics for Business and Economics

Downside Risk: Implications for Financial Management Robert Engle NYU Stern School of Business Carlos III, May 24,2004

ECE 295: Lecture 03 Estimation and Confidence Interval

Economic aspects of Scottish independence: public spending and revenue

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Chapter 6 - Continuous Probability Distributions

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Salary Guide US Firms. London - Private Practice

Field Guide to Internal Models under the Basel Committee s Fundamental review of the trading book framework

The LWS database: user guide

appstats5.notebook September 07, 2016 Chapter 5

Estimates of Medical Expenditures from the Medical Expenditure Panel Survey: Gains in Precision from Combining Consecutive Years of Data

Confidence Intervals. σ unknown, small samples The t-statistic /22

Data Distributions and Normality

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

West London Alliance: Housing Stock Projections. Prepared for: Mr D McCulloch London Borough of Hillingdon for the West London Alliance.

GN47: Stochastic Modelling of Economic Risks in Life Insurance

Confidence Intervals for Large Sample Proportions

1 Inferential Statistic

National Grid Gas Distribution MOD0186 Report (Sep-16) UK GAS DISTRIBUTION

Contents. 1 Introduction. Math 321 Chapter 5 Confidence Intervals. 1 Introduction 1

Detailed calculation of out of London Living wage: method, rationale, data sources and figures for the 2010/11 calculation.

Two Hours. Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER. 22 January :00 16:00

Wolverhampton CCG GP Services Budget NHS England West Midlands

Measurement of Market Risk

Great Britain Day Visits Survey

CIVL Confidence Intervals

χ 2 distributions and confidence intervals for population variance

The VaR Measure. Chapter 8. Risk Management and Financial Institutions, Chapter 8, Copyright John C. Hull

Introduction to Algorithmic Trading Strategies Lecture 8

Improving Timeliness and Quality of SILC Data through Sampling Design, Weighting and Variance Estimation

Quick start guide for the healthcare costing standards. Quick start guide for the standards >

IEOR E4602: Quantitative Risk Management

Advanced Financial Modeling. Unit 2

KAZAKHSTAN STOCK EXCHANGE JSC

Interim Results 6 months ended 30 September The Lincoln Centre, London

Statistical Intervals (One sample) (Chs )

High Speed Rail (Preparation) Act 2013 Expenditure Report 1 April March Moving Britain Ahead

Data Revisions and Macroecomics DR. ANA BEATRIZ GALVAO WARWICK BUSINESS SCHOOL UNIVERSITY OF WARWICK SEP, 2016

Top$Incomes$in$Malaysia$1947$to$the$Present$ (With$a$Note$on$the$Straits$Settlements$1916$to$1921)$ $ $ Anthony'B.'Atkinson' ' ' December'2013$ '

MTH6154 Financial Mathematics I Stochastic Interest Rates

30 June 2018 Forecast Common Equity 19.6% (19.2% F) 18.0% 19.5% (18.6% F) 17.8% (17.7% F) 15.6% 28 Feb 2016 Actual. 30 June 2016 Forecast

FNCE 4030 Fall 2012 Roberto Caccia, Ph.D. Midterm_2a (2-Nov-2012) Your name:

The Long-Run Equity Risk Premium

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

June 11, Dynamic Programming( Weighted Interval Scheduling)

Risk Management. BIS Capital Adequacy Ratio. Sep. 30, 2007 Sep. 30, Sumitomo Trust and Banking 2007 Interim Report 39

Chen-wei Chiu ECON 424 Eric Zivot July 17, Lab 4. Part I Descriptive Statistics. I. Univariate Graphical Analysis 1. Separate & Same Graph

1. Datsenka Dog Insurance Company has developed the following mortality table for dogs: l x

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Appendix for The Long-Run Risks Model and Aggregate Asset Prices: An Empirical Assessment

I. Return Calculations (20 pts, 4 points each)

A Rising Tide Lifts All Boats? IT growth in the US over the last 30 years

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Model Construction & Forecast Based Portfolio Allocation:

MATH 143: Introduction to Probability and Statistics Worksheet for Tues., Dec. 7: What procedure?

Financial Risk Management and Governance Other VaR methods. Prof. Hugues Pirotte

Chapter 1. Bond Pricing (continued)

Dynamic Risk Modelling

Sampling Design Report: Oxford Internet Survey 2003

Market Risk and Model Risk for a Financial Institution Writing Options

Mortgages and Home Finance: Conduct of Business Sourcebook. Chapter 10A. MCD Annual Percentage Rate of Charge

Construction of daily hedonic housing indexes for apartments in Sweden

A Study on the Risk Regulation of Financial Investment Market Based on Quantitative

Audience Insight Report

Kevin Dowd, Measuring Market Risk, 2nd Edition

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

PRIIPs Flow diagram for the risk and reward calculations in the PRIIPs KID 1. Introduction

Transcription:

UK Data Archive Study Number 7040 - English Housing Survey, 2010-2011: Household Data Tests of significance for between year comparisons The purpose of this note is to summarise the two main tests of significance that can be used to compare survey results between years and to identify what needs to be done if we are comparing overlapping multi-year data sets. There are two main significance tests, comparing individual survey estimates confidence intervals and testing the confidence interval of the difference in percentages. In the EHCS the first of these methods is normally used, both for within year comparisons between subsets of the data and for between year comparisons. Samples are rolled together to obtain 2-year aggregates with larger samples and therefore we measure the change between two year averages rather than annual change between each individual year. We need to agree which is the correct approach we should be using for between year comparisons. Three examples are considered using the different methods. Example 1: Percentage of stock non-decent for all tenures. Percentage stock non-decent 33.3 31.2 29.2 Sample Size 17,532 16,648 16,502 Confidence interval (+/-) 0.70 0.70 0.69 Lower Confidence Interval 32.6.5 28.5 Upper Confidence Interval 34.0 31.9 29.9 Using a 2-tailed 95% confidence interval we can say that the percentage of stock non-decent for all tenures is lower in 2002/03 than in 2001. Also it shows that the percentage of stock non-decent for all tenures is lower in 2003/04 than in 2002/03. This does not take into account overlapping samples, surveyor variability etc. 34 33 32 31 29 28 27 26 All Tenures Survey Estimate To calculate the standard error of a difference in percentages between two years, we use the following formula: Where p 1 is the percentage in year 1 and p 2 is percentage in year 2 and where n 1 and n 2 are the sample sizes in year 1 and 2 respectively. In the above example, the difference in percentage non-decent between 2001 and 02/03 is 2.15%. Using the above formula the 95% confidence interval for the difference in percentages is between 1.16% and 3.14%. Because this interval does not span zero we can conclude that the two means are significantly different.

The difference in percentage non-decent between 02/03 and 03/04 is 1.95%. Using the between 0.97% and 2.94%. As the interval does not span zero we can conclude that the two means are significantly different. However this method does not take into account overlapping samples. Standard errors of differences from overlapping multi-year data sets We can take account of overlapping samples as follows. In the above example 2003 is common to both the 2002/03 and 2003/04 data sets. If the two year datasets are of similar size we can also assume that the variances of the estimates from them are the same, say ν. Then: Var (p 1 p 2 ) = 2ν (1-r) where r is the degree of overlap between the two datasets. In our example r is approximately 0.5 so: Var (p 1 p 2 ) = 2ν (0.5) = ν = Var (p 1 ) Therefore SE (p 1 p 2 ) = Var (p 1 ) = SE (p 1 ) If there is no overlap between the years than r is 0, so: Var (p 1 p 2 ) = Var (p 1 ) and SE (p 1 p 2 ) = x SE(p 1 ) Therefore the reduction in SE (p 1 p 2 ) due to the overlapping year = ( I.e. when the overlapping year is taken into account the standard error is reduced by around 29%. In this example: Difference between percentages in 02/03 and 03/04 = -1.95 Standard error of the difference = 0.50 Revised standard error of the difference = 0.36 Applying the reduction in standard errors to the above example, then the difference between the two percentages is between 1.25% and 2.65% using a 95% confidence interval. As the interval does not span zero, we would still conclude that the two means are significantly different. This method can be tested using a slightly different method (comparing a normal test statistic against normal tables), which is covered in the annex.

Example 2: Percentage of stock non-decent for private sector in South East. Percentage stock non-decent 29.5 28.6 26.3 Sample Size 1110 1870 1988 Confidence interval (+/-) 2.68 2.05 1.94 Lower Confidence Interval 26.8 26.6 24.4 Upper Confidence Interval 32.2.7 28.2 Using a 2-tailed 95% confidence interval we cannot say that the percentage of stock non-decent in private sector in South East is lower in 2002/03 than in 2001. There is also no evidence to suggest the percentage of stock non-decent in private sector in South East is lower in 2003/04 than in 2002/03 or in 2001 at the 95% confidence level. This does not take into account overlapping samples, surveyor variability etc. 25 20 Private Sector South East Survey Estimates In the above example, the difference in percentage non-decent between 2001 and 02/03 is 0.88%. Using the above formula the 95% confidence interval for the difference in percentages is between -2.50% and 4.26%. As this interval spans zero we can conclude that the two means are not significantly different. The difference in percentage non-decent between 02/03 and 03/04 is 2.%. Using the between -0.47% and 5.17%. As this interval spans zero we can conclude that the two means are not significantly different. However this method does not take into account overlapping samples. Taking into account the common year which is 2003, the standard error reduces to (100-29)% = 71% of what it would be if these two-year datasets were not overlapping. In this example: Difference between percentages in 02/03 and 03/04 = -2. Standard error of the difference = 1.44 Revised standard error of the difference = 1.02 Applying the reduction in standard errors, the difference between the two percentages is between 0.36% and 4.34% using a 95% confidence interval. As the interval does not span zero we would conclude from this test that the two means are significantly different.

Example 3: Percentage of stock non-decent for Social Sector in West Midlands. Percentage stock non-decent 43.5 37.1 31.6 Sample Size 726 633 586 Confidence interval (+/-) 3.61 3.76 3.77 Lower Confidence Interval 39.9 33.3 27.9 Upper Confidence Interval 47.2 40.9.4 Using a 2-tailed 95% confidence interval, we cannot say that the percentage of stock non-decent in social sector stock non-decent in West Midlands is lower in 2002/03 than in 2001, nor that it is lower in 2003/04 than in 2002/03. However it is lower in 03/04 than in 2001 at the 95% confidence level. This does not take into account overlapping samples, surveyor variability etc. 50 45 40 25 Social Sector West Midlands Survey Estimates In this example, the difference in percentage non-decent between 2001 and 02/03 is 6.46%. Using the above formula the 95% confidence interval for the difference in percentages is between 1.23% and 11.67%. As this interval does not span zero we can conclude that the two means are significantly different. The difference in percentage non-decent between 02/03 and 03/04 is 5.46. Using the between 0.13% and 10.78%. As the interval does not span zero we can conclude that the two means are significantly different. However this method does not take into account overlapping samples. When comparing 02/03 and 03/04 then 2003 is a common year, so the standard error of the difference reduces to (100-29)% = 71% of what it would be if these two-year datasets were not overlapping. The table below shows the key statistics for our example: Difference between percentages in 02/03 and 03/04 = -5.46 Standard error of the difference = 2.72 Revised standard error of the difference = 1.92 Applying the reduction in standard errors, the difference between the two percentages is between 1.69% and 9.22% using a 95% confidence interval. As the interval does not span zero we would conclude that the two means are significantly different using this test, which is the opposite to our conclusion when testing the two sample estimates independently (see above).

Conclusion When choosing between the two main methods we have described, the key point is that we want to test for the change in the same variable over time e.g. percentage nondecent in London in 2002/03 against percentage non-decent in London in 2003/04. As the two time-points are related (not independent) then it is appropriate to use the second method as the two datasets are interrelated. The first method is better suited to comparing independent variables/indicators, e.g. percentage non-decent in London against percentage non-decent in North West. If a common year is involved between the two years being tested, then this should be accounted for, with the survey error of the difference being adjusted accordingly. In our examples, our conclusion is only different between the two tests in example 3, and as we have outlined that the second method is the one we should be using then we would conclude that the percentage of stock non-decent in social sector in West Midlands is significantly different between 02/03 and 03/04. Annex An alternative test statistic for a difference in percentages is: z 1 = which has a Normal distribution with mean =0 and standard error =1 Using example 1, between 02/03 and 03/04 the difference is -1.95, and the revised standard error of that difference of 0.36, gives a test statistic of 5.42. Using a 2 tail normal test at the 5% level gives a critical value of +/-1.96. Our test statistic is above the critical value, i.e. there is a significant difference between the two percentages. The advantage of this method is it allows you to calculate the probability of the difference being significant and it is easy to test at different levels of confidence, by looking up different values from Normal Tests. E.g. In example 2 (private sector in South East) between 02/03 and 03/04 the difference is 2., with a standard error of that difference of 1.02, which gives a test statistic of 2.. Using a 2 tail normal test at the 1% level gives a critical value of +/-2.576. Our test statistic is below the critical value, I.e. the evidence does not suggest a significant difference between the two percentages at the 1% level.