Homework Assignment Section 3

Similar documents
Homework Assignment Section 3

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron

Final Exam Suggested Solutions

This assignment contains 3 questions; there are 107 total points.

MA 1125 Lecture 14 - Expected Values. Wednesday, October 4, Objectives: Introduce expected values.

Monetary Economics Risk and Return, Part 2. Gerald P. Dwyer Fall 2015

WEB APPENDIX 8A 7.1 ( 8.9)

Stat 328, Summer 2005

Statistic Midterm. Spring This is a closed-book, closed-notes exam. You may use any calculator.

ST 350 Lecture Worksheet #33 Reiland

σ e, which will be large when prediction errors are Linear regression model

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998

Business Statistics Final Exam

Multiple regression - a brief introduction

CHAPTER 4 DATA ANALYSIS Data Hypothesis

Non-linearities in Simple Regression

Dummy Variables. 1. Example: Factors Affecting Monthly Earnings

Linear regression model

Stat3011: Solution of Midterm Exam One

Let us assume that we are measuring the yield of a crop plant on 5 different plots at 4 different observation times.

11/28/2018. Overview. Multiple Linear Regression Analysis. Multiple regression. Multiple regression. Multiple regression. Multiple regression

1. (35 points) Assume a farmer derives utility from Income in the following manner

Analysis of Variance in Matrix form

Economics 424/Applied Mathematics 540. Final Exam Solutions

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Business Statistics: A First Course

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Tests for the Difference Between Two Linear Regression Intercepts

STATISTICS 110/201, FALL 2017 Homework #5 Solutions Assigned Mon, November 6, Due Wed, November 15

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Regression and Simulation

Impact of Unemployment and GDP on Inflation: Imperial study of Pakistan s Economy

Section 0: Introduction and Review of Basic Concepts

Homework Solutions - Lecture 2 Part 2

Jaime Frade Dr. Niu Interest rate modeling

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

NEWCASTLE UNIVERSITY. School SEMESTER /2013 ACE2013. Statistics for Marketing and Management. Time allowed: 2 hours

Fall 2004 Social Sciences 7418 University of Wisconsin-Madison Problem Set 5 Answers

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay. Solutions to Midterm

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm

Project Risk Analysis and Management Exercises (Part II, Chapters 6, 7)

> attach(grocery) > boxplot(sales~discount, ylab="sales",xlab="discount")

Rand Final Pop 2. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

A STATISTICAL ANALYSIS OF GDP AND FINAL CONSUMPTION USING SIMPLE LINEAR REGRESSION. THE CASE OF ROMANIA

Statistics & Statistical Tests: Assumptions & Conclusions

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

STA258 Analysis of Variance

Econometric Methods for Valuation Analysis

Finance Practice Midterm #1 Solutions

Random Effects... and more about pigs G G G G G G G G G G G

starting on 5/1/1953 up until 2/1/2017.

General Business 706 Midterm #3 November 25, 1997

6 Multiple Regression

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Midterm

THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41202, Spring Quarter 2003, Mr. Ruey S. Tsay

CAPM (1) where λ = E[r e m ], re i = r i r f and r e m = r m r f are the stock i and market excess returns.

MBF2263 Portfolio Management. Lecture 8: Risk and Return in Capital Markets

Variance clustering. Two motivations, volatility clustering, and implied volatility

Example 1 of econometric analysis: the Market Model

AP Stats: 3B ~ Least Squares Regression and Residuals. Objectives:

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam

GGraph. Males Only. Premium. Experience. GGraph. Gender. 1 0: R 2 Linear = : R 2 Linear = Page 1

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Estimating Support Labor for a Production Program

Use of EVM Trends to Forecast Cost Risks 2011 ISPA/SCEA Conference, Albuquerque, NM

STA218 Analysis of Variance

When determining but for sales in a commercial damages case,

MODEL SELECTION CRITERIA IN R:

Honor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination.

Multiple linear regression

Econ 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213.

The instructions on this page also work for the TI-83 Plus and the TI-83 Plus Silver Edition.

Introduction to Population Modeling

Learning Objectives = = where X i is the i t h outcome of a decision, p i is the probability of the i t h

Tests for Two Variances

Demonstrate Approval of Loans by a Bank

Chapter 5. Forecasting. Learning Objectives

Business Statistics 41000: Probability 4

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

Copyright (C) 2001 David K. Levine This document is an open textbook; you can redistribute it and/or modify it under the terms of version 1 of the

Estimating Beta. The standard procedure for estimating betas is to regress stock returns (R j ) against market returns (R m ): R j = a + b R m

Cameron ECON 132 (Health Economics): FIRST MIDTERM EXAM (A) Fall 17

20135 Theory of Finance Part I Professor Massimo Guidolin

Presented at the 2003 SCEA-ISPA Joint Annual Conference and Training Workshop -

Chapter 8 Statistical Intervals for a Single Sample

SUMMARY OUTPUT. Regression Statistics Multiple R R Square Adjusted R Standard E Observation 5

Chapter 5. Asset Allocation - 1. Modern Portfolio Concepts

Copyrighted 2007 FINANCIAL VARIABLES EFFECT ON THE U.S. GROSS PRIVATE DOMESTIC INVESTMENT (GPDI)

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance

Converting to the Standard Normal rv: Exponential PDF and CDF for x 0 Chapter 7: expected value of x

The Norwegian State Equity Ownership

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

NOBEL COLLEGE Assignment Questions. NOBEL COLLEGE Assignment Questions

Chapter 6 Analyzing Accumulated Change: Integrals in Action

Ch. 8 Risk and Rates of Return. Return, Risk and Capital Market. Investment returns

State Ownership at the Oslo Stock Exchange. Bernt Arne Ødegaard

Transcription:

Homework Assignment Section 3 Tengyuan Liang Business Statistics Booth School of Business Problem 1 A company sets different prices for a particular stereo system in eight different regions of the country. The table below shows the numbers of units sold (in 1000s of units) and the corresponding prices (in hundreds of dollars). Sales 420 380 350 400 440 380 450 420 Price 5.5 6.0 6.5 6.0 5.0 6.5 4.5 5.0 (i) In Excel, regress sales on price and obtain the intercept and slope estimates. SUMMARY OUTPUT Regression Statistics Multiple R 0.937137027 R Square 0.878225806 Adjusted R Square 0.857930108 Standard Error 12.74227575 Observations 8 ANOVA df SS MS F Significance F Regression 1 7025.806452 7025.806452 43.27152318 0.000592135 Residual 6 974.1935484 162.3655914 Total 7 8000 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 420 5.5 420 644.516129 36.68873299 17.56714055 2.18343E-06 554.7420336 734.2902244 X Variable 380 1 6 380-42.58064516 6.473082556-6.578109392 0.000592135-58.41970755-26.74158277 350 6.5 350 400 6 400 440 5 440 380 6.5 380 450 4.5 450 420 5 420 (ii) Present a plot with the data and the regression line 480 460 440 420 Sales 400 380 360 340 320 300 3.5 4 4.5 5 5.5 6 6.5 7 Price 1

(iii) Based on this analysis, briefly describe your understanding of the relationship between sales and prices. 2

Problem 2: Match the Plots Below (Figure 1) are 4 different scatter plots of an outcome variable y versus predictor x followed by 4 four regression output summaries labeled A, B, C and D. Match the outputs with the plots. Figure 1: Scatter Plots B-Black, C-Red, A-Green, D-Blue. Note: you can look at the R 2, intercept and slope to decide how the outputs and plots match each other. 3

Regression A: Coefficients: Estimate Std. Error (Intercept) 7.03747 0.12302 (Slope) 2.18658 0.07801 --- Residual standard error: 1.226 R-Squared: 0.8891 Regression B: Coefficients: Estimate Std. Error (Intercept) 1.1491 0.1013 (Slope) 1.4896 0.0583 Residual standard error: 1.012 R-Squared: 0.8695 Regression C: Coefficients: Estimate Std. Error (Intercept) 1.2486 0.2053 (Slope) 1.5659 0.1119 Residual standard error: 2.052 R-Squared: 0.6666 Regression D: Coefficients: Estimate Std. Error (Intercept) 9.0225 0.0904 (Slope) 2.0718 0.0270 --- Residual standard error: 0.902 R-Squared: 0.9835 4

Problem 3 Suppose we are modeling house price as depending on house size. Price is measured in thousands of dollars and size is measured in thousands of square feet. Suppose our model is: P = 20 + 50 s + ɛ, ɛ N(0, 15 2 ). (a) Given you know that a house has size s = 1.6, give a 95% predictive interval for the price of the house. The point prediction is ˆP f = 20 + 50 1.6 = 100 The prediction interval is [100 ± 2 15] = [70; 130] (b) Given you know that a house has size s = 2.2, give a 95% predictive interval for the price. The point prediction is ˆP f = 20 + 50 2.2 = 130 The prediction interval is [130 ± 2 15] = [100; 160] (c) In our model the slope is 50. What are the units of this number? 1,000$ / 1,000 Sq. Feet = $/Sq. Feet (d) What are the units of the intercept 20? 1,000$ (same as P ) (e) What are the units of the the error standard deviation 15? 1,000$ (same as P ) (f) Suppose we change the units of price to dollars and size to square feet What would the values and units of the intercept, slope, and error standard deviation? Intercept: 20,000 $ Slope: 50 $/Sq. Feet error standard deviation: 15,000 $ (g) If we plug s = 1.6 into our model equation, P is a constant plus the normal random variables ɛ. Given s = 1.6, what is the distribution of P? When s = 1.6 the mean of house prices is 20 + 50 1.6 = 100. The error standard deviation is the same, 15. Therefore P s = 1.6 N(100, 15 2 ) 5

Problem 4: The Shock Absorber Data SUMMARY OUTPUT Regression Statistics Multiple R 0.9666 R Square 0.9344 Adjusted R Squar 0.9324 Standard Error 7.6697 Observations 35.0000 ANOVA e sense to choose the df after SS measurment MS F Significance F as Y and the before mea Regression 1.0000 27635.7568 27635.7568 469.7986 0.0000 Residual 33.0000 1941.2146 58.8247 Total 34.0000 29576.9714 Coefficients Standard Error t Stat P value Lower 95% Upper 95% Intercept 18.2259 23.8852 0.7631 0.4508 30.3690 66.8208 reboundb 0.9495 0.0438 21.6748 0.0000 0.8603 1.0386 rebounda 500 520 540 560 580 600 500 520 540 560 580 600 620 reboundb (a) We are trying to determine whether or not the before measurement is predictive of the after measurement. Therefore the dependent variable (Y ) should be the after measurement and the explanatory variable (X) the before measurement. line + noise. [0.8603;1.0386]. ted about using regression. (b) From the output above, we see that the 95% confidence interval for the slope is 6 f the problem is that we if we could use reboundb to predict rebo

(c) Is zero a plausible value for β 0? By looking at the 95% confidence interval we see that yes, it is. We can also conclude the same think by looking at the t stat = 0.7631 meaning that the distance between the estimate b 0 = 18.22 and the proposed value β0 o = 0 is only 0.7631 standard deviations... ie, not that far. The conclusion is that, with the information in hand, we CAN T reject the hypothesis that β 0 = 0 (d) What line wold represent equality between the before measurement and the after measurement? That would be a line with intercept equal to zero and slope equal to one. Test whether the intercept is equal to the value proposed by the shock maker. Is β 0 = 0 a plausible value for the intercept. Again as in item (c) yes Test whether the slope is equal to the value proposed by the shock maker. Is β 1 = 1 a plausible value for the slope. By inspecting the 95% confidence interval we see that yes, β 1 = 1 is a plausible hypothesis. (f) Supppose the before measurement is 550. What is the plug-in predictive interval given x-before=550. The plug-in predictive interval is 18.22 + 0.949 550 ± 2 7.67 = [524.83;555.51] What does this interval suggest about the shock maker s claim? It looks like the shock maker is correct, ie., with x-before=550 we can predict with 95% probability that the after measurement will be within acceptable bounds. 7

Problem 5 The data file for this question is available in the course website. Consider the regression model Apple t = α + βsp 500 t + ɛ t ɛ t N(0, σ 2 ) where Apple t represents the return on Apple Computers in month t and SP 500 t represents the return on the S&P 500 in month t. (a) What is the interpretation of β in terms of a measure of risk of the stock? β is a measure of the risk of the stock relative to the market. If the return on the market goes up (or down) by 1% then we expect the return on the stock to go up (or down), on average, by β%. We also call β a measure of systematic risk, i.e., the part of the variation of the stock associated with the market (the economy). (b) What is the interpretation of α? α is the average return the stock gets regardless of the market behavior... when the market is not moving the expected value of the return for the stock is α. We tend to think of this as the average return you tend to get on top of the market for the stock. (c) Plot Apple against SP 500. What graphical evidence is there of a relationship between Apple and SP 500? Does the relationship appear to be linear? Why or why not? b0 = 1.89 b1=0.965 Apple -15-5 0 5 10 15 20-15 -10-5 0 5 10 15 20 SP500 Yes, by looking at the plot it appears that Apple and SP 500 could be linearly related... 8

(d) Estimate β. What does this estimate tell you about the risk of Apple? Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 1.8967 0.8306 2.284 0.0257 * SP500 0.9659 0.1857 5.203 2.15e-06 *** --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 Residual standard error: 6.392 on 65 degrees of freedom Multiple R-squared: 0.294,Adjusted R-squared: 0.2831 F-statistic: 27.07 on 1 and 65 DF, p-value: 2.146e-06 The estimate of β is 0.965. This implies that Apple is less risky relative to the market as a 1% change in the market returns would result on a 0.96% change, on average, in Apple s return. (e) Is the estimate of β obtained in part (c) the actual value of β? Why or why not? No, it is an estimate and indeed our best guess about the true β. (f) Now consider the regression models Intel t = α + βsp 500 t + ɛ t ɛ t N(0, σ 2 ) where Intel t represents the return on Intel in month t, and Safeway t = α + βsp 500 t + ɛ t ɛ t N(0, σ 2 ) where Safeway t represents the return on Safeway in month t. How does the beta risk of the three companies compare? What do their α s tell you? The market risk of Safeway is the estimated to be the smallest with Intel as the second riskier and Apple as the stock with the highest market risk. (see results below). These are based on our estimates of the β s for each stock. 9

b0 = 0.57 b1=0.876 Intel -10-5 0 5 10 15-10 -5 0 5 10 15 SP500 b0 = 0.11 b1=0.861 Safeway -10 0 10 20-10 0 10 20 SP500 10

Problem 6: Presidential Election (Imagine this is October 2012) This question is based on an analysis presented in the New York Times blog FiveThirtyEight by Nate Silver... According to the blog post, the most accurate economic indicator to predict the results of the election for an incumbent president is the election year equivalent non-farm payroll growth (in number of jobs from Jan to Oct). In the blog they provided a picture summarizing the relationship between this economic indicator and the election results. The analysis was based on the last 16 elections where a sitting president was seeking re-election. The picture is displayed below: This week, The Department of Labor released the most recent employment numbers which says that the economy has added, on average, 100,000 non-farm jobs since January 2012. 11

Now, I am a gambling person and I need your help... Currently, on intrade.com, I can buy or sell a future contract on Obama s re-election for $64.7. This contract pays $100 if Obama wins (see below). 1. Based on the model presented in the New York Times blog and the numbers released by the Department of Labor how should I bet on the Intrade website (i.e., should I buy or sell the future contract)? Why? The model from the NYT blog predicts that Obama has a 50% chance of winning given the 100k jobs information. The futures market are trading a contract that pays 100 if Obama wins at 64.7. So, you sell short Basically you are getting payed 64.7 for a lottery where you think there is only a 50 expected value. If you dont believe me, think about it this way: how much are you willing the pay me to flip a coin such that if it lands heads I will give 100? If you say more than 50 please come to my office and lets play this game :) 2. Do you think the answer in the question above is a good idea? Why or why not? The NYT blog model only takes into account one factor (the best available economic indicator). The market is probably looking at every other aspect of the election... social issues, the 47% comment, Romney s hair, etc... A better regression model would have additional X variables to account for other important factors. The jobs variable is an important factor but doesn t tell all the story. 12