Multivariate Statistics Lecture Notes. Stephen Ansolabehere

Similar documents
Statistical Evidence and Inference

These notes essentially correspond to chapter 13 of the text.

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Mean-Variance Analysis

ECON Micro Foundations

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Introduction to Population Modeling

Characterization of the Optimum

EconS Micro Theory I 1 Recitation #9 - Monopoly

Microeconomics 3. Economics Programme, University of Copenhagen. Spring semester Lars Peter Østerdal. Week 17

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Lecture Notes: Basic Concepts in Option Pricing - The Black and Scholes Model (Continued)

Equilibrium Asset Returns

Markowitz portfolio theory

Models of Asset Pricing

Back to estimators...

Section 4.3 Objectives

Product Di erentiation: Exercises Part 1

Business Statistics 41000: Probability 3

Introducing nominal rigidities.

Monte Carlo probabilistic sensitivity analysis for patient level simulation models

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی

Chapter 1 Microeconomics of Consumer Theory

Expected Utility and Risk Aversion

The Simple Regression Model

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998

EconS Firm Optimization

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.

Mossin s Theorem for Upper-Limit Insurance Policies

10/1/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1

Population Economics Field Exam Spring This is a closed book examination. No written materials are allowed. You can use a calculator.

The Simple Regression Model

Consumption-Savings Decisions and State Pricing

Principles of Econometrics Mid-Term

8.1 Estimation of the Mean and Proportion

1 Supply and Demand. 1.1 Demand. Price. Quantity. These notes essentially correspond to chapter 2 of the text.

Econometrics and Economic Data

15. Multinomial Outcomes A. Colin Cameron Pravin K. Trivedi Copyright 2006

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Investment is one of the most important and volatile components of macroeconomic activity. In the short-run, the relationship between uncertainty and

Chapter 13 Return, Risk, and Security Market Line

Problem Set 1: Review of Mathematics; Aspects of the Business Cycle

Micro Theory I Assignment #5 - Answer key

Lecture 3: Factor models in modern portfolio choice

These notes essentially correspond to chapter 7 of the text.

Stochastic Budget Simulation

Problem Set # Public Economics

Advanced Industrial Organization I Identi cation of Demand Functions

Lecture 1: The Econometrics of Financial Returns

CS 237: Probability in Computing

Mixing Di usion and Jump Processes

EconS Micro Theory I 1 Recitation #7 - Competitive Markets

Chapter 5. Sampling Distributions

1.1 Some Apparently Simple Questions 0:2. q =p :

Note on Cost of Capital

Topic #1: Evaluating and Simplifying Algebraic Expressions

Regression. Lecture Notes VII

Question 1a 1b 1c 1d 1e 1f 2a 2b 2c 2d 3a 3b 3c 3d M ult:choice Points

Final Exam, section 1

Lecture Notes 1: Solow Growth Model

EconS Substitution E ects

CHAPTER 8 Estimating with Confidence

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Trade on Markets. Both consumers' initial endowments are represented bythesamepointintheedgeworthbox,since

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

2 Maximizing pro ts when marginal costs are increasing

1 Unemployment Insurance

STATE UNIVERSITY OF NEW YORK AT ALBANY Department of Economics. Ph. D. Comprehensive Examination: Macroeconomics Spring, 2013

Week 1 Quantitative Analysis of Financial Markets Basic Statistics A

Module 4: Point Estimation Statistics (OA3102)

CHAPTER III METHODOLOGY

Chapter 8. Introduction to Statistical Inference

Empirical Tests of Information Aggregation

HIGH-LOW METHOD. Key Terms and Concepts to Know

Risk and Return and Portfolio Theory

Market Risk Analysis Volume I

Introducing nominal rigidities. A static model.

Fuel-Switching Capability

Investments. Session 10. Managing Bond Portfolios. EPFL - Master in Financial Engineering Philip Valta. Spring 2010

Intro to GLM Day 2: GLM and Maximum Likelihood

5. COMPETITIVE MARKETS

CCAC ELEMENTARY ALGEBRA

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

1 Consumer Choice. 2 Consumer Preferences. 2.1 Properties of Consumer Preferences. These notes essentially correspond to chapter 4 of the text.

Approximating a multifactor di usion on a tree.

Cost of Capital (represents risk)

5.7 Probability Distributions and Variance

Topic 1: Basic Concepts in Finance. Slides

Question scores. Question 1a 1b 1c 1d 1e 2a 2b 2c 2d 2e 2f 3a 3b 3c 3d M ult:choice Points

2. Find the equilibrium price and quantity in this market.

The ratio of consumption to income, called the average propensity to consume, falls as income rises

Monetary Economics Measuring Asset Returns. Gerald P. Dwyer Fall 2015

Final Exam - section 1. Thursday, December hours, 30 minutes

WEB APPENDIX 8A 7.1 ( 8.9)

Log-linear Modeling Under Generalized Inverse Sampling Scheme

UNIT 4 MATHEMATICAL METHODS

Point Estimators. STATISTICS Lecture no. 10. Department of Econometrics FEM UO Brno office 69a, tel

Lecture Notes 1

Revision Lecture. MSc Finance: Theory of Finance I MSc Economics: Financial Economics I

Lecture 22. Survey Sampling: an Overview

Transcription:

Multivariate Statistics Lecture Notes Stephen Ansolabehere Spring 2004

TOPICS. The Basic Regression Model 2. Regression Model in Matrix Algebra 3. Estimation 4. Inference and Prediction 5. Logit and Probit 6. Panel Models 7. Structural Models and Simultaneous Equations

. The Basic Regression Model Multivariate regression is the primary statical tool for applied statistics in the social sciences. Consider two applications. Example. Capital Asset Pricing Model. In value an investment we assess the expected return and the risk associated with that investment. The risk is how much the stock may deviate from the expected return if we cash out the investment at some future date. The expected return is the average return. How risky is a stock? There are two views. First, there is the \overall" risk, which is measured as the variance of the stock. Second, there is the risk associated with the stock itself, and not with the greater market. It may be possible to reduce the overall risk by holding a diverse portfolio so we need to know how risky is a stock relative to the market. A simple model of this is captured in the regression of the rate of return to a given rm's stock on the rate of return of the market. Let Y t be the percentage change in the daily stock price of a rm; let X t be the percentage change in the market (e.g., Dow Jones Industrial Average); let ² t represent a random error term; and let t index date. Y t = + X t + ² t The coe±cient measures the risk relative to the market. If > the rm price is more volatile than the market; if < the rm price is less volatile than the market. The gure shows the CAPM regression for two arbitrarily chosen Fortune 500 rms { WalMart and General Motors. Example 2. Voting Technologies and Uncounted Votes. Votes are counted using at least 5 di erent technologies for tabulation in the United States { hand-counted paper, lever machines, punch cards (at the center of the controversial recount in Florida in 2000), optically scanned paper, and electronic voting machines. A standard measure for the percent of ballots uncounted is the percentage di erence between total ballots cast and the votes counted for any given o±ce, say president. This is termed the \residual vote." Other factors, such as demographics, also a ect the rate of uncounted ballots. The rate of residual votes ought to be independent of the technology used. For simplicity, suppose we wish to know whether punch cards are worse than alternative technologies, ignoring what exact alternative technology is used. Technologies used vary from county to county, sometimes town to town. Let j index counties. Let D j indicate whether a county uses punch cards; it equals if the county uses punchards and 0 otherwise. Let Y j represent the residual vote rate in each county, and let X j represent the socio-economic status of the county; let ² j represent a random error. Y j = + D j + 2X j + ² j Essentially, the regression is two parallel lines, both with slope 2, but one with intercept and the other with intercept +. Make a graph Y on X. 2

The coe±cient is the expected di erence between Punch Cards and other technologies, holding constant a county's socio-economic condition. Why hold X j constant? Without X j, the regression is mathematically identical to a di erence of means. One worry about the data, though, is that the counties with punch cards might di er systematically from the counties without punchcards and the factors that account for the di erence also a ect the residual vote. For example, rural counties have more administrative problems and thus higher residual vote rates than urban counties, but urban counties are more likely to use punch card equipment (as an accident of history). We might miss an important e ect of punch cards if we ignore county demographics. (Try to show this in a graph.) The coe±cient measures the di erence between punch card and non-punch card counties, holding other factors constant..a. Two Conceptions of Regression i. The functional relationship of Y to X ;X 2 ;:::;X k. Y i = + X ;i + 2X 2;i + 3X 3;i ::: + kx k;i + ² i ii. Conditional Mean The random variables Y;X ;X 2 ;:::;X k have joint density f(y;x ;x 2 ;:::;x k ). The conditional distribution f(yjx ;x 2 ;:::;x k ) can be characterized by its mean and conditional variance. We begin with the assumption that the conditional mean is a linear function of the values of the random variables and that the error variance is constant: E[YjX ;X 2 ;:::;X k ] = + X + 2X 2 + 3X 3 ::: + kx k V [YjX] = ¾ 2 ² Both assumptions are readily relaxed, as we will see later in the course. As a speci c case we may begin with the formula for the joint normal distribution. This is of historical interest, but I also nd that it helps make these ideas more explicit. For simplicity I will deal with the case of 2 variables. In fact, I will often work with just 2 variables in this course to develop the basic concept or intuition. where = ¾xy ¾ 2 x. f(y;x) = 2¼¾ y ¾ x q ( ½ 2 ) e 2( ½ 2 ) [(y ¹ y ¾y )2 +( x ¹x ¾x )2 2½( y ¹ y ¾y )(x ¹x ¾x )] f(yjx) = p 2¼¾y q ( ½ 2 ) e 2¾ 2 y ( ½2 ) [(y (¹y ¹x) x)2 ] ; 3

The conditional mean and variance are: E[YjX = x] = Z yf(yjx)dy = (¹ y ¹ x ) + x V [YjX = x] = ¾ 2 y ( ½2 ) To develop the core concepts we begin with the bivariate regression model. It is good to get your footing with a simple model and then build up. Here I will develop the basic properties of regression in the bivariate setting..b. Assumptions. Linearity y i = + X ;i +::: kx k;i + ² i 2. E[²] = 0. 3. Independence/Exogeneity: E[²X] = 0. 4. No Measurement Error in X. 5. Homoskedasticity: E[² i ] = ¾ 2 for all i. 6. No Autocorrelation: E[² i ² j ] = 0: The rst four assumptions a ect the bias of the estimates. Of particular concern is assumption 3, which means that there are no ommitted variables and the relationship between Y and X is not simultaneous. The last two assumptions concern e±ciency.. C. Estimation Estimation consists of deriving formulae for converting data into guesses about the unknown parameters of the model, i.e.,,, and ¾ 2 ². Method of Moments. We wish to nda and b, parameters of the estimated regression line also known as the predicted values: ^y = a + bx. De ne the residual as the vertical distance of any point y from the estimated regression line: e i = y i ^y i = y i a bx i. The residual can be thought of as a realization of the error term ². 4

The second and third assumptions are in fact theoretical moments describing the conditional distribution. Assume that those hold. That is, let us impose those restrictions on the residuals. nx e n i = 0 n nx i= i= (e i ¹e)(x i ¹x) = 0 Substituting the de nition of e i in each equation and solving for a and b yields: b = a = ¹y b¹x P ni= (x i ¹x)(y i ¹y) P ni= (x i ¹x) 2 Least Squares. Another way to derive estimates is to minimize the error variance to nd the \best tting" line. De ne the sum of squares as: S = nx ² 2 i= We wish to nd the values of and that minimize this function. Those values are determined by the rst derivatives. At a minimum both rst derivatives of the function with respect to the unknown parameters must equal 0. @S @ = 0 @S @ = 0 After some algebraic manipulations, we nd that least squares yields the following estimates: a = ¹y b¹x P ni= (x b = i ¹x)(y i ¹y) P ni= (x i ¹x) 2 Maximum Likelihood.. D. Inference Once we have taken data and calculated a and b and ^¾ ² 2, we would like to know how to use the estimates to draw inferences about theoretical conjectures and to make predictions. We will build on the basic concepts of sampling theory. 5

Sampling Theory. Recall from basic statistics that the average value of a random variable X in any data set or sample is thought of as an estimate or guess about the population mean.. The sample mean, ¹x, is expected to yield an unbiased estimate of the population mean, ¹. That is, E[¹x] = ¹. 2. From sample to sample or study to study, the mean varies at a rate of ¾ 2 =n, which is the variance of ¹x. The square root of the variabe of ¹x is called the standard error. 3. Because the mean is the sum of random variables, the distribution of the mean is normal with mean ¹ and variance ¾ 2 =n. This is an application of the Central Limit Theorem. 4. As n!, V (¹x)! and ¹x! ¹ in probability. This is called consistency of ¹x as an estimate of ¹. It is an application of the Law of Large Numbers. The same ideas apply in regression analysis, which is, afterall, the estimate of a conditional mean E[YjX]..D.i. Properties of Estimates We wish to know whether estimates are consistent or unbiased. Consistency means that as the number of cases becomes large the estimate becomes extremely close to the true value. This is the application of the Law of Large Numbers from probability theory to our statistical estimation problem. Unbiasedness means that the average value is equal to the true value. Both are statistical de nitions of the \correctness" or \accuracy" of the estimator. We also wish to know whether estimates are e±cient { that is, do they use the information available in a way the minimizes the variability or uncertainty about the estimate. Frequetly, in choosing the best statistical speci cation and model we must choose between a speci cation that is consistent (or unbiased) but ine±cient and one that is potentially inconsistent. We face this choice, for example, when we decide whether to include a variable or not in a regression analysis..d.ii. Con dence Intervals The degree of con dence that we have in an estimate will be expressed as the standard error, which measures the rate at which the estimates will vary from sample to sample, if we had the luxury of repeated sampling. The variance of the estimator (the square of the standard error) is the weighted average of the squared distance estimator from its mean (the true parameter value)..d.iii. Prediction. 6

Consider any speci c value of X, say x 0. We wish to know what value of y is predicted when X = x 0. This can be computed simply from the predicted values of the regression line Predictions are uncertain ^y 0 = a +bx 0 7