Statistic Midterm. Spring This is a closed-book, closed-notes exam. You may use any calculator.

Similar documents
Business Statistics Final Exam

Section 0: Introduction and Review of Basic Concepts

Business Statistics Midterm Exam Fall 2013 Russell

Business Statistics Midterm Exam Fall 2013 Russell

Stat3011: Solution of Midterm Exam One

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Honor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination.

Homework Assignment Section 3

Random variables. Discrete random variables. Continuous random variables.

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

First Midterm Examination Econ 103, Statistics for Economists February 16th, 2016

Section 1.3: More Probability and Decisions: Linear Combinations and Continuous Random Variables

STA 103: Final Exam. Print clearly on this exam. Only correct solutions that can be read will be given credit.

Chapter 5. Sampling Distributions

Rand Final Pop 2. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Econ 250 Fall Due at November 16. Assignment 2: Binomial Distribution, Continuous Random Variables and Sampling

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

M3S1 - Binomial Distribution

Financial Econometrics Jeffrey R. Russell Midterm 2014

20135 Theory of Finance Part I Professor Massimo Guidolin

Discrete probability distributions

Midterm Exam. b. What are the continuously compounded returns for the two stocks?

Finance Practice Midterm #1 Solutions

Q-Center Math 1070 Exam #2 Review. November 8, 2016

Section 1.4: Learning from data

Final Exam Suggested Solutions

Business Statistics Fall Quarter 2015 Midterm Answer Key

STA Module 3B Discrete Random Variables

The binomial distribution p314

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Chapter 8 Statistical Intervals for a Single Sample

Homework Assigment 1. Nick Polson 41000: Business Statistics Booth School of Business. Due in Week 3

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Midterm

WEB APPENDIX 8A 7.1 ( 8.9)

Homework Assignment Section 3

Converting to the Standard Normal rv: Exponential PDF and CDF for x 0 Chapter 7: expected value of x

Chapter 16. Random Variables. Copyright 2010, 2007, 2004 Pearson Education, Inc.

The Assumption(s) of Normality

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Stat 328, Summer 2005

Department of Economics ECO 204 Microeconomic Theory for Commerce Test 2

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER

1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by. Cov(X, Y ) = E(X E(X))(Y E(Y ))

χ 2 distributions and confidence intervals for population variance

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Section Distributions of Random Variables

Analysis of Variance in Matrix form

Business Statistics 41000: Homework # 2

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Final Exam

CHAPTER 6: PORTFOLIO SELECTION


Economics 424/Applied Mathematics 540. Final Exam Solutions

σ e, which will be large when prediction errors are Linear regression model

Simple Random Sample

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

The Two-Sample Independent Sample t Test

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

ECON Introductory Econometrics. Lecture 1: Introduction and Review of Statistics

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2013, Mr. Ruey S. Tsay. Midterm

Problem max points points scored Total 120. Do all 6 problems.

For each of the questions 1-6, check one of the response alternatives A, B, C, D, E with a cross in the table below:

Chapter 3 Discrete Random Variables and Probability Distributions

EXERCISES FOR PRACTICE SESSION 2 OF STAT CAMP

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Strategy -1- Strategy

STT 315 Handout and Project on Correlation and Regression (Unit 11)

UNIVERSITY OF OSLO. Please make sure that your copy of the problem set is complete before you attempt to answer anything.

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Midterm

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

Confidence Intervals. σ unknown, small samples The t-statistic /22

Business Statistics 41000: Probability 4

(Practice Version) Midterm Exam 1

CHAPTER 2 Describing Data: Numerical

The Normal Probability Distribution

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay. Midterm

Let us assume that we are measuring the yield of a crop plant on 5 different plots at 4 different observation times.

Portfolio Risk Management and Linear Factor Models

Multiple linear regression

Statistical Intervals (One sample) (Chs )

Economics 483. Midterm Exam. 1. Consider the following monthly data for Microsoft stock over the period December 1995 through December 1996:

Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) Estimating Population Parameters

BIO5312 Biostatistics Lecture 5: Estimations

STAT 1220 FALL 2010 Common Final Exam December 10, 2010

STAT 3090 Test 2 - Version B Fall Student s Printed Name: PLEASE READ DIRECTIONS!!!!

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay. Solutions to Midterm

Web Extension: Continuous Distributions and Estimating Beta with a Calculator

Standard Deviation. 1 Motivation 1

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Midterm

Regression and Simulation

GGraph. Males Only. Premium. Experience. GGraph. Gender. 1 0: R 2 Linear = : R 2 Linear = Page 1

Lecture 3: Return vs Risk: Mean-Variance Analysis

Hedging and Regression. Hedging and Regression

The Norwegian State Equity Ownership

> attach(grocery) > boxplot(sales~discount, ylab="sales",xlab="discount")

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

The test has 13 questions. Answer any four. All questions carry equal (25) marks.

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay Solutions to Final Exam

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Transcription:

Statistic Midterm Spring 2018 This is a closed-book, closed-notes exam. You may use any calculator. Please answer all problems in the space provided on the exam. Read each question carefully and clearly present your answers. Honor Code Pledge: I pledge my honor that I have not violated the University Honor Code during this examination. Signed: Name: Here are some useful formulas: E(aX + by ) = ae(x) + be(y ) V ar(ax + by ) = a 2 V ar(x) + b 2 V ar(y ) + 2ab Cov(X, Y ) The standard error of X is defined as s X = variance of X. s 2 X n where s 2 X denotes the sample The standard error for the difference in the averages between groups a and b is defined as: s ( Xa X b ) = s 2 a n a + s2 b n b where s 2 a denotes the sample variance of group a and n a the number of observations in group a. The standard error for a proportion is defined by: sˆp = ˆp(1 ˆp) n 1

Problem 1 (10 points each) In recent years the NFL changed the extra point rule and moved the distance of the field goal attempt to 33 yards (as compared to the previous distance of 19 yards). A successful kick gives the team 1 extra point. Of course, teams have the option to try for 2 extra points by attempting a touchdown starting from the 2-yard line. Currently, the probability of successful kicks from 33 yards is 0.93 whereas the probability of getting a touchdown from the 2-yard line is 0.48. 1. As a general rule, over the course of a season, what should you do as a team? Go for 1 or 2 extra points? Justify your answer. Goal: maximizing expected points. 1 wp 0.93 X 1 = 0 wp 0.07 2 wp 0.48, X 2 = 0 wp 0.52. So E[X 1 ] = 0.93, E[X 2 ] = 0.96. Go for touchdown. 2. Now consider a situation where your team just scored a touchdown to tie the game. There is only 1 second left on the clock meaning that your extra point conversion is the last play of the game. What should you do now? Go for 1 or 2 extra points? Justify your answer and contrast to your answer to the previous question. Goal: maximizing winning probability. win wp 0.93 win wp 0.48 X 1 =, X 2 = tie wp 0.07 tie wp 0.52. So going for 1 extra point gives higher winning probability. 2

Problem 2 (10 points each) The breathalyzers carried by cops have a false positive rate of 5%. However, they are perfect at detecting drunk drivers, i.e., the false negative probability is zero. Previous studies have concluded that on a Saturday night in Chicago, 1 in 500 drivers are drunk. Mayor Emmanuel decided to implement a policy of no-refusal, random traffic stops on Saturday s. That means that random cars will be pulled over and their drivers tested. 1. On your way home on a Saturday night, your Uber driver gets randomly stopped. She tests positive for being drunk. Given this information, what is the probability she is drunk? By Bayes Rule, Joint Probability Drunk Not Drunk Test Positive 1 1/500 0.05 499/500 Test Negative 0 1/500 0.95 499/500 Pr(drunk positive) = Pr(drunk & positive) Pr(positive) = Pr(drunk) Pr(positive drunk) Pr(positive drunk) Pr(drunk) + Pr(positive not drunk) Pr(not drunk) = 1/500 100% 100% 1/500 + 5% 499/500 = 0.038 2. Based on the numbers provided and your computation in question (1), do you agree or disagree with the Mayor s decision? Why or why not? I disagree. Most of the positive results will be false positive. 3

Problem 3 (5 points each) A construction company needs to complete a project within 11 weeks, or they will incur significant cost overruns, including penalties due to the client. The manager of the company has assessed that the project will take between 10 and 14 weeks to complete. The manager has also estimated the probability of each possible outcome: Weeks to complete Probability 10 0.075 11 0.65 12 0.2 13 0.05 14 0.025 1. What is the probability of completing the project on time? Pr(X 11) = Pr(X = 10) + Pr(X = 11) = 0.725 2. What is the expected value of the time to complete the project? E[X] = 5 i=1 x i Pr(X = x i ) = 11.3 3. The company must pay a penalty of $5,000 for every additional week (past 11 weeks) that they work, plus a additional $50,000 penalty if the work takes 14 weeks to complete. What is the expected value of the penalty incurred? E[Y ] = 5000 0.2 + 10000 0.05 + 65000 0.025 = 3125 4. Suppose you re the engineer in charge of bidding for the projects in this company (i.e. estimating the total cost of the job, plus overhead, potential overrun costs and profit). How would you use the information from the previous question to price this job? The price needs to be at least (total cost + 3125). 4

Problem 4 (10 points each) I am trying to build a portfolio composed of SP500 and Bonds. Assume SP 500 N(11, 19 2 ) and Bonds N(4, 6 2 ). In addition, the covariance between SP500 and Bonds is -22.6. 1. Which portfolio is better: a 50-50 split between the two assets or a 60-40 split between SP500 and Bonds? Justify your criteria for comparison. X = 0.5 SP 500 + 0.5 Bonds E[X] = 0.5 11 + 0.5 4 = 7.5 V ar[x] = 0.5 2 19 2 + 0.5 2 6 2 2 0.5 0.5 22.6 = 87.95 Y = 0.6 SP 500 + 0.4 Bonds E[Y ] = 0.6 11 + 0.4 4 = 8.2 V ar[y ] = 0.6 2 19 2 + 0.4 2 6 2 2 0.6 0.4 22.6 = 124.87 Then, Sharpe Ratio: SR X = 0.7997, SR Y = 0.7338. So 50-50 split is preferable. Although 50-50 split has lower expected value, it has much smaller risk than 60-40 split does. 2. Which of the two portfolios considered in item (1) has the largest probability of delivering a negative return? ( X 7.5 Pr(X < 0) = Pr < 7.5 ) = Pr(N(0, 1) < 0.7997) 87.95 87.95 ( Y 8.2 Pr(Y < 0) = Pr < 8.2 ) = Pr(N(0, 1) < 0.7338) 124.87 124.87 So 60-40 split has a large negative probability, simply looking at the standard normal density and area under curve. 5

Problem 5 I am trying to get into the insurance business. As a starter, I am going to sell the 2019 Booth MBA class an unemployment insurance policy that will pay a student $100k if they don t get a job by the end of the program (by graduation time). From years and years of observing this program, I figured that the probability that a student does not get a job by the end of program is 0.5%. Based on this number I decided to sell each policy for $850. 1. From the perspective of my insurance business, is that a good price? Justify your answer. (5 points) Let X be the profit. 850 wp 99.5% X = 850 100000 wp 0.5% E[X] = 350, so the expected profit is positive.. 2. It turns out that I was able to sell this policy to 1,000 students in the 2019 class! What is my expected profit at graduation time? (5 points) Let X i has the same distribution as X for i = 1,..., 1000. Selling 1000 policies has expected profit E[Y ] = E [ 1000 ] X i = 350000. i=1 6

3. The standard deviation for the payout of one policy is approximately $7k. What is the probability (approximately) that I will lose money by graduation time? (hint: use the normal approximation... ). (10 points) Y = 1000 i=1 X i N(350000, 7000 2 1000), so Pr (Y < 0) = Pr ( Y 350000 70002 1000 < = Pr(N(0, 1) < 1.5811) = 0.0569 ) 350000 70002 1000 Note it is totally ok to leave the answer as Pr(N(0, 1) < 1.5811) or Pr(Z < 1.5811) 7

Problem 6 (10 points each) Amy, a former Booth MBA student, now the manager of Windy City Trading Strategies, is trying to convince you to invest in the fund she runs. Based on the last two years of monthly returns, Amy provides you with the following summary of the performance of the fund: Sample average Sample std. deviation 2% per month 2% per month 1. Based on these results, if you decide to invest in her fund, what is the probability you will lose money next month? Pr(X < 0) = Pr ( X 2% 2% < 1) = Pr(N(0, 1) < 1) = 0.1587 2. Before deciding to invest, you remember that Carlos told you that the monthly sharpe ratio (mean/std. dev) of the market is 0.4. Is the sharpe ratio of Amy s fund for sure better than that? (and for sure I mean, with 95% confidence). Hint: first build a 95% confidence interval for the mean of Amy s fund. s X = 2% 24 = 0.0041 So 95% confidence interval of the mean is [ X 2s X, X + 2s X] = [0.0118, 0.0282], and 95% confidence interval for Sharpe ratio is [0.59, 1.41]. So we should be fairly confident that Amy s fund performs better than the market, because 0.4 is smaller than [0.59, 1.41] 8

Problem 7 (20 points) Your cognitive capacity is significantly reduced when your smartphone is within reach, even if it s off. That s the claim from a new study from the McCombs School of Business at The University of Texas at Austin. Marketing Assistant Professor Adrian Ward conducted experiments with nearly 600 smartphone users in an attempt to measure, for the first time, how well people can complete tasks when they have their smartphones nearby even when they re not using them. In one experiment, the researchers asked study participants to sit at a computer and perform a task that required full concentration in order to succeed. The tests were geared to measure participants available cognitive capacity that is, the brain s ability to hold and process data at any given time. Before beginning, participants were randomly instructed to place their smartphones either on the desk face down (group 1; 200 participants), in their pockets (group 2; 200 participants), or in another room (group 3; 200 participants). All participants were instructed to turn their phones to silent. The researchers found that participants with their phones in another room significantly outperformed those with their phones on the desk, and they also slightly outperformed those participants who had kept their phones in the pocket. Based on the results from the experiment (listed below), do you agree with the statement from the above (bold) paragraph? Justify your answer. group 1 group 2 group 3 Successful tasks 90 130 150 Sample mean: X1 =.45, X2 =.65, X3 =.75, so X 3 X 1 = 0.2, and X 3 X 2 = 0.1 Standard error: s X3 X 1 = X3 (1 X 3 ) + X 1 (1 X 1 ) = 0.0466, s 200 200 X3 X 2 = X3 (1 X 3 ) + X 2 (1 X 2 ) = 200 200 0.0456. Confidence interval: For X 3 X 1, CI is [ X 3 X 1 1.96s X3 X 1, X 3 X 1 +1.96s X3 X 1 ], i.e. [0.1086, 0.2914]. Similarly, for X 3 X 2, CI is [0.0107, 0.1873]. The implication is that, for both experiment designs, the effect of putting phones in another room is significant. 9

Problem 8 (3 points each) Assume the model: Y = 5 2X + ε, ε N(0, 9 2 ) 1. What is expected value of Y if X = 1, i.e, E[Y X = 1]? (a) 5 (b) 3 - Correct (c) 4 (d) 6 2. What is the V ar[y X = 0]? (a) 9 (b) 81 - Correct (c) 3 (d) 6 3. What is the P r(y > 10), given X = 2? (a) 13% (b) 68% (c) 16% - Correct (d) 2.5% 4. What is the P r( 6 < Y < 21), given X = 1? (a) 5% (b) 16% (c) 81.5% - Correct (d) 34% 10

Problem 9 (5 points each) Y1-1.5-1.0-0.5 0.0 0.5 1.0 1.5 Y2 0 2 4 6-1 0 1 2 3 4 X -1 0 1 2 3 4 X Y3-6 -4-2 0 2 Y4-5 0 5 10-1 0 1 2 3 4 X -1 0 1 2 3 4 X In the above scatterplots, four different variables Y 1, Y 2, Y 3 and Y 4 were regressed onto the same X (in all four scatterplot we have the exact same n = 200 values for X). Carefully examine the plots and answer the questions below: 1. The estimates of the slope (b 1 ) for the four regressions are listed below. Which one belongs to the regression of Y 3 onto X? (a) -0.054 (b) 3.020 (c) 1.031 (d) -1.094 - Correct 11

2. The estimates of the slope (b 1 ) for the four regressions are listed below. Which one belongs to the regression of Y 1 into X? (a) -0.054 - Correct (b) 3.020 (c) 1.031 (d) -1.094 3. The estimates of the intercept (b 0 ) for the four regressions are listed below. Which one belongs to the regression of Y 4 into X? (a) -2.044 - Correct (b) 0.212 (c) 0.135 (d) 1.951 4. The standard errors of the regressions (s) are listed below. Which one belongs to Y 3? (a) 0.52 (b) 0.49 (c) 1.51 - Correct (d) 0.51 5. The R 2 of the regressions are listed below. Which one belongs to Y 2? (a) 37% (b) 81% - Correct (c) 97 % (d) 1 % 12

6. What is the correlation between Y 1 and X? (a) -0.12 - Correct (b) 0.91 (c) 0.97 (d) -0.60 7. Using all the information provided so far, give a rough approximation of the 95% prediction interval for Y 4 when X = 3. Approximately, E[Y 4 X = 3] = 7 and σ = 1, so CI = [5, 9]. 8. Give an approximation for P r(y 3 > 0 X = 2) Approximately, E[Y 3 X = 2] = 2 and σ = 1.5, so Y 3 X = 2 N( 2, 1.5). Pr(Y 3 > 0 X = 2) = ( Y3 + 2 Pr > 2 ) 1.5 1.5 = Pr(N(0, 1) > 2/1.5) = 0.0912 13

Problem 10 (5 points each) In trying to understand how good of an investor Warren Buffet is, I collected data on the annual returns (in percentage terms) of his entire portfolio and decided to compare it to the market (again, in percentage terms). In doing so I ran the following regression: W B = α + βmarket + ɛ ɛ N(0, σ 2 ) where W B refers to the annual returns on Buffet s portfolio and Market refers to the annual returns on the U.S. stock market. The result of this regression is in the table below: SUMMARY OUTPUT Regression Statistics Multiple R 0.628697 R Square 0.39526 Adjusted R 0.382113 Standard Er 11.4114 Observatio 48 ANOVA df SS MS F ignificance F Regression 1 3915.157401 3915.157 30.06571 1.71E-06 Residual 46 5990.121766 130.22 Total 47 9905.279167 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 14.94673 Market 0.524621 1. What is the expected value for the returns on Buffet s portfolio when the market is up 10%? E[W B Market = 10%] = 14.95 + 0.525 10 = 20.2 14

2. What is the standard deviation for the returns on Buffet s portfolio when the market is up 10%? ˆσ = 1 46 the table. i ê2 1 i = SSR = 130.22 = 11.41. This is also the Standard E in 46 3. Give a 99% prediction range for the returns on Buffet s portfolio when the market is up 10%? Approximately, X N(20, 11.4 2 ). Suppose the 99% confidence interval is [20 3 11.4, 20 + 3 11.4]. 4. Provide an interpretation for both the intercept and the slope of this regression. What do these quantities tell us about Warren Buffet s performance, relative to the U.S. stock market? Intercept: When the market return is zero, Warren Buffet has 14.95% return on average. Slope: When the market return is 1% higher, Warren Buffet has 0.5% higher return on average. The coefficient tells us on average Warren Buffet beats the market, because 14.96+ 0.52Market > Market (unless Market > 31.17, which is rare). 15