σ may be counterbalanced by a larger

Similar documents
CHAPTER 9 FUNCTIONAL FORMS OF REGRESSION MODELS

MgtOp 215 Chapter 13 Dr. Ahn

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

Which of the following provides the most reasonable approximation to the least squares regression line? (a) y=50+10x (b) Y=50+x (d) Y=1+50x

Module Contact: Dr P Moffatt, ECO Copyright of the University of East Anglia Version 2

Notes are not permitted in this examination. Do not turn over until you are told to do so by the Invigilator.

Tests for Two Correlations

Sampling Distributions of OLS Estimators of β 0 and β 1. Monte Carlo Simulations

Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 12

Evaluating Performance

Chapter 3 Descriptive Statistics: Numerical Measures Part B

Chapter 3 Student Lecture Notes 3-1

3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. Truncated standard normal distribution for a = 0.5, 0, and 0.5. CDS Mphil Econometrics

Spatial Variations in Covariates on Marriage and Marital Fertility: Geographically Weighted Regression Analyses in Japan

Midterm Exam. Use the end of month price data for the S&P 500 index in the table below to answer the following questions.

Simple Regression Theory II 2010 Samuel L. Baker

Measures of Spread IQR and Deviation. For exam X, calculate the mean, median and mode. For exam Y, calculate the mean, median and mode.

Data Mining Linear and Logistic Regression

Chapter 5 Student Lecture Notes 5-1

UNIVERSITY OF VICTORIA Midterm June 6, 2018 Solutions

A Comparison of Statistical Methods in Interrupted Time Series Analysis to Estimate an Intervention Effect

Transformation and Weighted Least Squares

4. Greek Letters, Value-at-Risk

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers

Tests for Two Ordered Categorical Variables

Risk and Return: The Security Markets Line

/ Computational Genomics. Normalization

Analysis of Variance and Design of Experiments-II

Elton, Gruber, Brown and Goetzmann. Modern Portfolio Theory and Investment Analysis, 7th Edition. Solutions to Text Problems: Chapter 4

Problem Set 6 Finance 1,

Basket options and implied correlations: a closed form approach

The Mack-Method and Analysis of Variability. Erasmus Gerigk

THE DEGREE OF COMPLIANCE BASED ON EXCISE DUTIES IN ROMANIA BETWEEN 2002 AND 2015

Linear Combinations of Random Variables and Sampling (100 points)

Interval Estimation for a Linear Function of. Variances of Nonnormal Distributions. that Utilize the Kurtosis

R Square Measure of Stock Synchronicity

Domestic Savings and International Capital Flows

Teaching Note on Factor Model with a View --- A tutorial. This version: May 15, Prepared by Zhi Da *

IMPACT OF STOCK CONTROL ON PROFIT MAXIMIZATION OF MANUFACTURING COMPANY. Keywords: Stock, Profit Maximization, Manufacturing Company, Nigeria.

Multifactor Term Structure Models

Random Variables. b 2.

Calibration Methods: Regression & Correlation. Calibration Methods: Regression & Correlation

Price Formation on Agricultural Land Markets A Microstructure Analysis

Graphical Methods for Survival Distribution Fitting

Mode is the value which occurs most frequency. The mode may not exist, and even if it does, it may not be unique.

2) In the medium-run/long-run, a decrease in the budget deficit will produce:

Testing for Omitted Variables

occurrence of a larger storm than our culvert or bridge is barely capable of handling? (what is The main question is: What is the possibility of

ASSESSING GOODNESS OF FIT OF GENERALIZED LINEAR MODELS TO SPARSE DATA USING HIGHER ORDER MOMENT CORRECTIONS

A Note on Robust Estimation of Repeat Sales Indexes with Serial Correlation in Asset Returns

Principles of Finance

Forecasts in Times of Crises

Does a Threshold Inflation Rate Exist? Quantile Inferences for Inflation and Its Variability

Consumption Based Asset Pricing

Likelihood Fits. Craig Blocker Brandeis August 23, 2004

The Analysis of Net Position Development and the Comparison with GDP Development for Selected Countries of European Union

Analysis of the Relationship between Managers Compensation and Earnings in Companies Listed in the Tehran Stock Exchange

Natural Resources Data Analysis Lecture Notes Brian R. Mitchell. IV. Week 4: A. Goodness of fit testing

THE RELATIONSHIP BETWEEN AVERAGE ASSET CORRELATION AND DEFAULT PROBABILITY

Heterogeneity in Expectations, Risk Tolerance, and Household Stock Shares

THE VOLATILITY OF EQUITY MUTUAL FUND RETURNS

Correlations and Copulas

Merton-model Approach to Valuing Correlation Products

Risk Reduction and Real Estate Portfolio Size

Forecasting and Stress Testing Credit Card Default using Dynamic Models

02_EBA2eSolutionsChapter2.pdf 02_EBA2e Case Soln Chapter2.pdf

CrimeStat Version 3.3 Update Notes:

Elements of Economic Analysis II Lecture VI: Industry Supply

THIS PAPER SHOULD NOT BE OPENED UNTIL PERMISSION HAS BEEN GIVEN BY THE INVIGILATOR.

ANOVA Procedures for Multiple Linear Regression Model with Non-normal Error Distribution: A Quantile Function Distribution Approach

ISyE 512 Chapter 9. CUSUM and EWMA Control Charts. Instructor: Prof. Kaibo Liu. Department of Industrial and Systems Engineering UW-Madison

A Bootstrap Confidence Limit for Process Capability Indices

Appendix - Normally Distributed Admissible Choices are Optimal

Information Flow and Recovering the. Estimating the Moments of. Normality of Asset Returns

Economics 330 Money and Banking Problem Set No. 3 Due Tuesday April 3, 2018 at the beginning of class

Real Exchange Rate Fluctuations, Wage Stickiness and Markup Adjustments

1 Omitted Variable Bias: Part I. 2 Omitted Variable Bias: Part II. The Baseline: SLR.1-4 hold, and our estimates are unbiased

Discounted Cash Flow (DCF) Analysis: What s Wrong With It And How To Fix It

Capability Analysis. Chapter 255. Introduction. Capability Analysis

A Comparative Study of Mean-Variance and Mean Gini Portfolio Selection Using VaR and CVaR

Market Opening and Stock Market Behavior: Taiwan s Experience

Loss Function Asymmetry and Forecast Optimality: Evidence from Individual Analysts' Forecasts

An Application of Alternative Weighting Matrix Collapsing Approaches for Improving Sample Estimates

Spurious Seasonal Patterns and Excess Smoothness in the BLS Local Area Unemployment Statistics

Clearing Notice SIX x-clear Ltd

arxiv: v1 [q-fin.pm] 13 Feb 2018

Survey of Math Test #3 Practice Questions Page 1 of 5

Competition in Hong Kong s banking industry

Explaining Movements of the Labor Share in the Korean Economy: Factor Substitution, Markups and Bargaining Power

A Test of Normality. Textbook Reference: Chapter 14.2 (eighth edition, pages 591 3; seventh edition, pages 624 6).

Asset Pricing When Returns Are Nonnormal: Fama-French Factors vs. Higher-order Systematic Co-Moments*

Quiz 2 Answers PART I

Physics 4A. Error Analysis or Experimental Uncertainty. Error

Asian basket options. in oil markets

Monetary Tightening Cycles and the Predictability of Economic Activity. by Tobias Adrian and Arturo Estrella * October 2006.

Self-controlled case series analyses: small sample performance

Statistical Inference for Risk-Adjusted Performance Measure. Miranda Lam

The well-known analyses of Balassa (1964) and Samuelson (1964) provide an

International ejournals

A Utilitarian Approach of the Rawls s Difference Principle

Transcription:

Questons CHAPTER 5: TWO-VARIABLE REGRESSION: INTERVAL ESTIMATION AND HYPOTHESIS TESTING 5.1 (a) True. The t test s based on varables wth a normal dstrbuton. Snce the estmators of β 1 and β are lnear combnatons of the error u, whch s assumed to be normally dstrbuted under CLRM, these estmators are also normally dstrbuted. (b) True. So long as E(u ) = 0, the OLS estmators are unbased. No probablstc assumptons are requred to establsh unbasedness. (c) True. In ths case the Eq. (1) n App. 3A, Sec. 3A.1, wll be absent. Ths topc s dscussed more fully n Chap. 6, Sec. 6.1. (d) True. The p value s the smallest level of sgnfcance at whch the null hypothess can be rejected. The terms level of sgnfcance and sze of the test are synonymous. (e) True. Ths follows from Eq. (1) of App. 3A, Sec. 3A.1. (f) False. All we can say s that the data at hand does not permt us to reject the null hypothess. (g) False. A larger σ may be counterbalanced by a larger s only f the latter s held constant, the statement can be true. x. It (h) False. The condtonal mean of a random varable depends on the values taken by another (condtonng) varable. Only f the two varables are ndependent, that the condtonal and uncondtonal means can be the same. () True. Ths s obvous from Eq. (3.1.7). (j) True. Refer of Eq. (3.5.). If X has no nfluence on Y, ˆβ wll be zero, n whch case y = uˆ. 33

5. ANOVA table for the Food Expendture n Inda Source of varaton SS df MSS Due to regresson (ESS) 13903 1 13903 Due to resdual (RSS) 36894 53 4470 TSS 375916 F = 13903 = 31.1013 wth df = 1 and 53, respectvely. 4470 Under the hypothess that there s no relatonshp between food expendture and total expendture, the p value of obtanng such an F value s almost zero, suggestng that one can strongly reject the null hypothess. 5.3 (a) se of the ntercept coeffcent s 6.153, so the t value under H 0 : β 1 = 0, s: 14.4773 =.353. Wth 3 degrees of freedom, the cutoff 6.153 for the 5% level of sgnfcance s.04 (usng 30 d.f. snce 3 s not n the table n the textbook s appendx), so the ntercept IS statstcally sgnfcant. (b) se of the slope coeffcent s 0.0003, so the t value under H 0 : β = 0, s: 0.00 = 6.8750. As noted n part a, the cutoff for the 5% level 0.0003 of sgnfcance s.04, so the slope IS statstcally sgnfcant. (c) The 95% confdence nterval for the true slope coeffcent would be: 0.00 ± (.04) ( 0.0003) 0.0015, 0.009. (d) If per capta ncome s $9000, the mean forecast value of cell phones demanded s 14.4773 + 0.00 (9000) = 34.773 per 100 persons. For the predcton confdence nterval, we frst need to compute var( Ŷ 0 )= σ 1 ( n + X 0 X ) x. var( Ŷ 0 )= 4.56 1 ( 9000 15819.865) + = 13.9785. Now the 34 1, 668, 91,885 confdence nterval s gven as 34

Pr Ŷ 0 t α se( Ŷ 0 ) Y 0 Ŷ0 + t α se ( Ŷ 0 ) = 1 α = Pr 34.773.04( 3.7388) Y 0 34.773+.04( 3.7388) = 0.95 6.647, 41.9119 5.4 Verbally, the hypothess states that there s no correlaton between the two varables. Therefore, f we can show that the covarance between the two varables s zero, then the correlaton must be zero. 5.5 (a) Use the t test to test the hypothess that the true slope coeffcent ˆ β 1 1.0598 1 s one. That s obtan: t = = = 0.81 se( ˆ β 0.078 ) For 38 df ths t value s not sgnfcant even at α = 10%. The concluson s that over the sample perod, IBM was not a volatle securty. 0.764 (b) Snce t = =.405, whch s sgnfcant at the two 0.3001 percent level of sgnfcance. But t has lttle economc meanng. Lterally nterpreted, the ntercept value of about 0.73 means that even f the market portfolo has zero return, the securty's return s 0.73 percent. 5.6 Under the normalty assumpton, ˆβ s normally dstrbuted. But snce a normally dstrbuted varable s contnuous, we know from probablty theory that the probablty that a contnuous random varable takes on a specfc value s zero. Therefore, t makes no dfference f the equalty s strong or weak. 5.7 Under the hypothess that β = 0, we obtan ˆ β ˆ β x ˆ β x t = = = se( ˆ β ˆ ) σ y (1 r ) ( n ) uˆ y (1 r ) because ˆ σ = =, from Eq.(3.5.10) ( n ) ( n ) ˆ β x ( n ) = y (1 r ) 35

But snce r x = ˆ β y r ( n ) ˆ β Thus, t = = (1 r) ˆ σ Emprcal Exercses, then x r = ˆ β, and x β r ( n ) t = F = = ˆ 1 r ˆ σ x y, from Eq. (5.9.1), from Eq.(3.5.6). 5.8 (a) There s a postve assocaton n the LFPR n 197 and 1968, whch s not surprsng n vew of the fact snce WW II there has been a steady ncrease n the LFPR of women. (b) Use the one-tal t test. 0.6560 1 t = = 1.754. For 17 df, the one-taled t value 0.1961 at α =5% s 1.740. Snce the estmated t value s sgnfcant, at ths level of sgnfcance, we can reject the hypothess that the true slope coeffcent s 1 or greater. (c) The mean LFPR s : 0.033 + 0.6560 (0.58) 0.5838. To establsh a 95% confdence nterval for ths forecast value, use the formula: 0.5838 ±.11(se of the mean forecast value), where.11 s the 5% crtcal t value for 17 df. To get the standard error of the forecast value, use Eq. (5.10.). But note that snce the authors do not gve the mean value of the LFPR of women n 1968, we cannot compute ths standard error. (d) Wthout the actual data, we wll not be able to answer ths queston because we need the values of the resduals to plot them and obtan the Normal Probablty Plot or to compute the value of the Jarque-Bera test. 36

5.9 (a) 45000 40000 35000 PAY 30000 5000 0000 15000 000 4000 6000 8000 10000 SPEND (b) Pay = 119.37 + 3.3076 Spend se = (1197.351) (0.3117) r = 0.6968; RSS =.65E+08 (c) If the spendng per pupl ncreases by a dollar, the average pay ncreases by about $3.31. The ntercept term has no vable economc meanng. (d) The 95% CI for β s: 3.3076 ± (0.3117) = (.684,3.931) Based on ths CI you wll not reject the null hypothess that the true slope coeffcent s 3. (e)the mean and ndvdual forecast values are the same, namely, 119.37 + 3.3076(5000) 8,667. The standard error of the mean forecast value, usng eq.(5.10.), s 50.5117 (dollars) and the standard error of the ndvdual forecast, usng Eq.(5.10.6), s 38.337. The confdence ntervals are: Mean Predcton: 8,667 ± (50.5117), that s, ( $7,66, $9,708) Indvdual Predcton: 8667 ± (38.337), that s, ($ 3,90, $33,43) As expected, the latter nterval s wder than the former. (f) 8 6 4 Seres: Resduals Sample 1 51 Observatons 51 Mean 9.13E-1 Medan -17.519 Maxmum 559.34 Mnmum -3847.976 Std. Dev. 301.414 Skewness 0.49916 Kurtoss.807557 0-4000 -000 0 000 4000 6000 Jarque-Bera.19673 Probablty 0.33349 The hstogram of the resduals can be approxmated by a normal curve. The Jarque-Bera statstc s.197 and ts p value s about 0.33. So, we do not reject the 37

normalty assumpton on the bass of ths test, assumng the sample sze of 51 observatons s reasonably large. 5.10 The ANOVA table for the busness sector s as follows: Source of Varaton SS df MSS Due to Regresson(ESS) 91915.537 1 91915.537 Due to resdual (RSS) 610.911 44 59.3391 Total(TSS) 9455.1748 The F value s 91914.537 = 1548.9657 59.3391 Under the null hypothess that there s no relatonshp between wages and productvty n the busness sector, ths F value follows the F dstrbuton wth 1 and 44 df n the numerator and denomnator, respectvely. The probablty of obtanng such an F value s 0.0000, that s, practcally zero. Thus, we can reject the null hypothess, whch should come as no surprse. (b) For the non-farm busness sector, the ANOVA table s as follows: Source of Varaton SS df MSS Due to regresson (ESS) 90303.3157 1 90303.3157 Due to resdual (RSS) 714.766 44 61.6991 Total 93018.0783 Under the null hypothess that the true slope coeffcent s s zero, the computed F value s: F = 90303.3157 1463.6071 61.6991 If the null hypothess were true, the probablty of obtanng such an F value s practcally zero, thus leadng to the rejecton of the the null hypothess. 38

5.11 (a) The plot shown below ndcates that the relatonshp between Impressons 100 90 80 70 60 50 40 30 0 10 0 0 100 AdExp 00 the two varables s nonlnear. Intally, as advertsng expendture ncreases, the number of mpressons retaned ncreases, but gradually they taper off. (b) As a result, t would be napproprate to ft a bvarate lnear regresson model to the data. At present we do not have the tools to ft an approprate model. As we wll show later, a model of the type: Y = β1 + β X + β3x + u may be approprate, where Y = mpressons retaned and X s advertsng expendture. Ths s an example of a quadratc regresson model. But note that ths model s stll lnear n the parameters. (c) The results of blndly usng a lnear model are as follows: Y =.163 + 0.3631 X se (7.089) (0.0971) r = 0.44 39

5.1 (a) U.S. CPI vs Canada CPI 50.0 00.0 150.0 100.0 50.0 0.0 60.0 80.0 100.0 10.0 140.0 160.0 180.0 00.0 Candan CPI The plot shows that the nflaton rates n the two countres generally move together. (b)& (c) The followng output s obtaned from EVews 3 statstcal package. Sample: 1980 005 Included observatons: 6 Varable Coeffcent Std. Error t-statstc Prob. C -8.5416 4.4795-1.9068 0.0686 ICAN 1.071 0.0316 33.9593 0.0000 R-squared 0.9796 F-statstc 1153.373 Adjusted R-squared 0.9788 Prob(F-statstc) 0.000000 As ths output shows, the relatonshp between the two varables s postve. One can easly reject the null hypothess that there s no relatonshp between the two varables, as the t value obtaned under 40

that hypothess s 33.9593, and the p value of obtanng such a t value s practcally zero. Although the two nflaton rates are postvely related, we cannot nfer causalty from ths fndng, for t must be nferred from some underlyng economc theory. Remember that regresson does not necessarly mply causaton. 5.13 (a) The two regressons are as follows: Goldprce t = 15.856 + 1.0384 CPI t se = (54.4685) (0.4038) t = (3.955) (.5718) r =0.1758 NYSEIndex t = -3444.990 + 50.97 CPI t se = (533.9663) (3.9584) t = (-6.4517) (1.7066) r = 0.839 (b) The Jarqu-Bera statstc for the gold prce equaton s 5.439 wth a p value 0.066. The JB statstc for the NYSEIndex equaton s 3.084 wth a p value 0.14. At the 5% level of sgnfcance, n both cases we do not reject the normalty assumpton. (c) Usng the usual t test procedure, we obtan: t = 1.0384 1 0.4038 = 0.0951 Snce ths t value does not exceed the crtcal t value of.04, we cannot reject the null hypothess. The true coeffcent s not statstcally dfferent from 1. (d) & (e) Usng the usual t test procedure, we obtan: t = 50.97 1 = 1.455 3.958 Snce ths t value exceeds the crtcal t value of.04, we reject the null hypothess. The estmated coeffcent s actually greater than 1. For ths sample perod, nvestment n the stock market probably was a hedge aganst nflaton. It certanly was a much better hedge aganst nflaton that nvestment n gold. 5.14 (a) None appears to be better than the others. All statstcal results are very smlar. Each slope coeffcent s statstcally sgnfcant at the 99% level of confdence. (b) The consstently hgh r s cannot be used n decdng whch monetary aggregate s best. However, ths does not suggest that t makes no dfference whch equaton to use. 41

(c) One cannot tell from the regresson results. But lately the Fed seems to be targetng the M measure. 5.15 Wrte the ndfference curve model as: 1 Y = β1( ) + β + u X Note that now β1 becomes the slope parameter and β the ntercept. But ths s stll a lnear regresson model, as the parameters are lnear (more on ths n Ch.6). The regresson results are as follows: Y ˆ = 3.87( 1 X ) + 1.1009 se =(1.599) (0.6817) r = 0.6935 The "slope" coeffcent s statstcally sgnfcant at the 9% confdence coeffcent. The margnal rate of substtuton (MRS) Y 1 of Y for X s: = 0.387. X X 5.16 (a) Let the model be: Y = β1 + β X + u where Y s the actual exchange rate and X the mpled PPP. If the PPP holds, one would expect the ntercept to be zero and the slope to be one. (b) The regresson results are as follows: Y ˆ = -33.0917 + 1.8147 X se = (6.9878) (0.074) t = (-1.6) (66.137) r = 0.991 To test the hypothess that β = 1, we use the t test, whch gves t = 1.8147 1 0.074 = 9.7336 Ths t value s hghly sgnfcant, leadng to the rejecton of the null hypothess. Actually, the slope coeffcent s s greater than 1. From the gven regresson, the reader can easly verfy that the ntercept coeffcent s not dfferent from zero, as the t value under the hypothess that the true ntercept s zero, s only -1.6. Note: Actually, we should be testng the (jont) hypothess that the ntercept s zero and the slope s 1 smultaneously. In Ch. 8, we wll show how ths s done. (c) Snce the Bg Max Index s "crude and hlarous" to begn wth, t probably doesn't matter. However, for the sample data, the 4

results do not support the theory. 5.17 (a) Lettng Y represent the male math score and X the female math score, we obtan the followng regresson: Yˆ = 198.737 + 0.6704X se = (1.875) (0.065) t = (15.435) (5.33) r = 0.9497 (b) The Jarque-Bera statstc s 1.1641 wth a p value of 0.5588. Therefore, asymptotcally we cannot reject the normalty assumpton. (c) t = 0.6704 1 = 1.4377. Therefore, wth 99% confdence we 0.065 can reject the hypothess that β = 1. (d) The ANOVA table s: Source of Varaton SS df MSS ESS 1605.916 1 1605.916 RSS 85.084 34.50 TSS 1691 35 Under the null hypothess that β = 0, the F value s 641.734, The p value of obtanng such an F value s almost zero, leadng to the rejecton of the null hypothess. 5.18 (a) The regresson results are as follows: Y ˆ = 13.778 + 0.750 X se = (33.74) (0.067) t = (3.937) (11.187) r = 0.786 (b) The Jarque-Bera statstcs s 1.1 wth a p value of 0.571. Therefore we can reject the null hypothess of non-normalty. (c) Under the null hypothess, we obtan: t = 0.750 1 0.067 = 3.7313. The crtcal t value at the 5% level s.04 (or -.04). Therefore, we can reject the null hypothess that the true slope coeffcent s 1. (d) The ESS, RSS, and TSS values are, respectvely, 1005.75 (l df), 73. (34 df), and 178.97 (35 df). Under the usual null 43

hypothess the F value s 15.156. The p value of such an F value s almost zero. Therefore, we can reject the null hypothess that there s no relatonshp between the two varables. 5.19 (a) CPI vs PPI (WPI) 50.0 00.0 150.0 100.0 50.0 0.0 80.0 90.0 100.0 110.0 10.0 130.0 140.0 150.0 160.0 170.0 PPI (WPI) The scattergram as well s shown n the above fgure. (b) Treat CPI as the regressand and WPI as the regressor. The CPI represents the prces pad by the consumers, whereas the WPI represents the prces pad by the producers. The former are usually a markup on the latter. (c) & (d) The followng output obtaned from Evews6 gves the necessary data. 44

Dependent Varable: CPI Method: Least Squares Sample: 1980 006 Included observatons: 7 CPI=C(1)+C()*PPI Coeffcent Std. Error t-statstc Prob. C(1) -81.01611 5.4946-14.75100 0.0000 C() 1.81760 0.044181 41.1400 0.0000 R-squared 0.985444 Mean dependent var 14.3963 Adjusted R-squared 0.98486 S.D. dependent var 34.67915 S.E. of regresson 4.6684 Akake nfo crteron 5.810804 Sum squared resd 455.1447 Schwarz crteron 5.90679 Log lkelhood -76.44585 Durbn-Watson stat 0.601660 The estmated t value of the slope coeffcent s 1.8176 under the null hypothess that there s no relatonshp between the two ndexes. The p value of obtanng such a t value s almost zero, suggestng the rejecton of the null hypothess. The hstogram and Jarque-Bera test based on the resduals from the precedng regresson are gven n the followng dagram. 45

Hstogram 9 8 7 6 5 4 Frequency 3 1 0-8.748351-5.308938-1.86955 1.569888 5.009301 More Bn The Jarqe-Bera statstc s 0.397 wth a p value 0.817. Therefore, we cannot reject the normalty assumpton. The hstogram also shows that the resduals are slghtly left-skewed, but not too far from symmetrc. 46

5.0 (a) There seems to be a general postve relatonshp between Smokng and Mortalty. Mortalty vs Smokng 180 160 140 10 100 80 60 40 0 0 60 70 80 90 100 110 10 130 140 150 Smokng Index (b) Y ˆ = -.8853 + 1.0875 X se = (3.0337) (0.09) t = (-0.153) (4.9) r = 0.5130 (c) The slope coeffcent has a t statstc of 4.9, whch ndcates a p value of almost 0. Therefore, we can reject the null hypothess and conclude that Smokng s related to Mortalty at the 5% level of sgnfcance. (d) The rskest occupatons seem to be Furnace forge foundry workers, Constructon workers, and Panters and decorators. One reason for why these occupatons are more rsky could be that they all work around toxc fumes and/or chemcals and therefore breathe n dangerous toxns frequently. (e) Unless there s a way to categorze the occupatons nto fewer groups, we cannot nclude them n the regresson analyss (ths wll be addressed later n the dscusson of dummy, or ndcator, varables n chapter 9). 47