Honor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination.

Similar documents
STA 103: Final Exam. Print clearly on this exam. Only correct solutions that can be read will be given credit.

First Midterm Examination Econ 103, Statistics for Economists February 16th, 2016

Business Statistics 41000: Probability 4

Copyright 2005 Pearson Education, Inc. Slide 6-1

Business Statistics Midterm Exam Fall 2013 Russell

Midterm Exam. b. What are the continuously compounded returns for the two stocks?

1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by. Cov(X, Y ) = E(X E(X))(Y E(Y ))

Chapter 5. Sampling Distributions

Statistic Midterm. Spring This is a closed-book, closed-notes exam. You may use any calculator.

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

1 Describing Distributions with numbers

Data Distributions and Normality

Section Distributions of Random Variables

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Simple Random Sample

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1

1. Variability in estimates and CLT

Chapter 3 Discrete Random Variables and Probability Distributions

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Module 4: Probability

Some Characteristics of Data

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Describing Data: One Quantitative Variable

Business Statistics Midterm Exam Fall 2013 Russell

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Homework Assigment 1. Nick Polson 41000: Business Statistics Booth School of Business. Due in Week 3

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Economics 483. Midterm Exam. 1. Consider the following monthly data for Microsoft stock over the period December 1995 through December 1996:

2011 Pearson Education, Inc

Section 0: Introduction and Review of Basic Concepts

ECON 214 Elements of Statistics for Economists 2016/2017

Business Statistics Fall Quarter 2015 Midterm Answer Key

( ) P = = =

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

Section Distributions of Random Variables

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Chapter 7. Inferences about Population Variances


The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay Solutions to Final Exam

(# of die rolls that satisfy the criteria) (# of possible die rolls)

DATA SUMMARIZATION AND VISUALIZATION

CHAPTER 2 Describing Data: Numerical

THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41202, Spring Quarter 2008, Mr. Ruey S. Tsay. Solutions to Homework Assignment #1

STAT 201 Chapter 6. Distribution

Math Take Home Quiz on Chapter 2

Sampling and sampling distribution

Data Analysis and Statistical Methods Statistics 651

One sample z-test and t-test

3.1 Measures of Central Tendency

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Business Statistics 41000: Probability 3

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Chapter 6 Probability

Statistics and Probability

2 Exploring Univariate Data

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER

KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI

Introduction to R (2)

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

Section 1.3: More Probability and Decisions: Linear Combinations and Continuous Random Variables

Lecture 3: Return vs Risk: Mean-Variance Analysis

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

Financial Econometrics Jeffrey R. Russell Midterm 2014

χ 2 distributions and confidence intervals for population variance

Lecture 9 - Sampling Distributions and the CLT

Statistics & Statistical Tests: Assumptions & Conclusions

BIOL The Normal Distribution and the Central Limit Theorem

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Review: Population, sample, and sampling distributions

MVE051/MSG Lecture 7

Standard Normal, Inverse Normal and Sampling Distributions

Chapter 7: Point Estimation and Sampling Distributions

The Binomial Probability Distribution

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Math 140 Introductory Statistics. First midterm September

The Mathematics of Normality

Sampling Distributions

4.1 Probability Distributions

MAS187/AEF258. University of Newcastle upon Tyne

Elementary Statistics Lecture 5

Section The Sampling Distribution of a Sample Mean

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

ECON 214 Elements of Statistics for Economists

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Homework Assignment 3. Nick Polson 41000: Business Statistics Booth School of Business. Due in Week 5

Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES

The Binomial Distribution

Math 243 Lecture Notes

Probability and distributions

The Binomial Distribution

Numerical Descriptions of Data

Central Limit Theorem (cont d) 7/28/2006

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Lecture 8 - Sampling Distributions and the CLT

Transcription:

Name: OUTLINE SOLUTIONS University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas. 2. Throughout this paper, N(µ, σ 2 ) will denote a normal distribution with mean µ and variance σ 2. 3. This is a 2 hr exam. Honor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination. Signature: Problem A. True or False: Please Explain your answers in detail. Partial credit will be given (50 points) 1. If the sample covariance between two variables is one, then there must be a strong linear relationship between the variables False. Cov(X, Y ) = Corr(X, Y ) V ar(x) V ar(y ), so knowing Cov(X, Y ) = 1 doesn t inform you about the correlation 2. If X and Y are independent random variables, then V ar(2x Y ) = 2V ar(x) V ar(y ). False. V ar(2x Y ) = 4V ar(x) + V ar(y ) 1

3. For any normal random variable, X, we have P (µ σ < X < µ + σ) = 0.64. [Hint: You may use pnorm(1) = 0.841] False. pnorm(1) = 0.841 and 2 pnorm(1) 1 = 0.6828 > 0.64. 4. The p-value is the probability that the Null hypothesis is true. False. The p-value is the probability of the observing something more extreme than the observed sample assuming the null hypothesis is true. 5. Suppose that you toss a fair coin with probability 0.5 a head. The probability of getting five heads is a row is less than three percent. False. Given a fair coin with 50% probability for a head, 0.5 5 = 0.0312. > 0.03 6. Arsenal are playing Burnley at home in an English Premier League (EPL) game this weekend. They are favourites to win. They have a Poisson distribution for the number of goals they will score with a mean rate of 2.5 per game. Given this, the odds of Arsenal scoring at least two goals is greater than 50%. True. Here S P ois(2.5). We need P (S 2) = 1 P (S 1) = 1 0.2873 = 0.7127. In R we have ppois(1, 2.5) = 0.28729. 7. Suppose that the annual returns for Facebook stock are normally distributed with a mean of 15% and a standard deviation of 20%. The probability that Facebook has returns greater than 10% for next year is 60% 2

True. Let R denote returns. Convert to standard normal and compute ( R µ P > 0.1 µ ) = P (Z > 0.25) = 0.5987 σ σ where µ = 0.15 and σ = 0.2. 8. LeBron James makes 85% of his free throw attempts and 50% of his regular shots from the field (field goals). Suppose that each shot is independent of the others. He takes 20 field goals and 10 free throws in a typical game. He gets one point for each free throw and two points for each field goal assuming no 3-point shots. The number of points he expects to score in a game is 28.5. True. E(points) = 2 E(#F Gmade)+1 E(#F T made) = 2 20 0.5+1 10 0.85 = 28.5 This holds even if FG and FT are dependent. 9. Given a random sample of 2000 voters, 800 say they will vote for Hillary Clinton in the 2016 US Presidential Election. At the 95% level, I can reject the null hypothesis that Hillary has an evens chance of winning the election. ˆp (1 ˆp) True. The t-statistic is T = (ˆp 0.5)/ = 9.13 < where qnorm(0.025) = n 1.96. 10. The Central Limit Theorem states that the distribution of a sample mean is approximately Normal. True. The standard deviation of the sample mean is σ n. The CLT has (approximately) X N(µ, σ2 n ), where µ and σ2 are the population true value. 3

Problem B. (20 points) The Chicago Cubs are having a great season. So far they ve won 72 out of the 100 games played so far. You also have the expert opinion of Bob the sports analysis. He tells you that he thinks the Cubs will win. Historically his predictions have a 60% chance of coming true. Calculate the probability that the Cubs will win given Bob s prediction Suppose you now learn that it s a home game and that the Cubs win 60% of their games at Wrigley field. What s you updated probability that the Cubs will win their game? 1. First we have P (Win Bob) = 0.72 0.6 0.72 0.6 + 0.28 0.4 = 0.79 2. Learning the new information, we use the previous posterior as a prior for the next Bayes update P (Win home, Bob) = it s highly likely that the Cubs will win! 0.79 0.60 0.79 0.60 + 0.21 0.40 = 0.85 4

Problem C. (20 points) You are given the following R output for s data on Apple (AAPL) and Google s (GOOG) stock from January 2005 to the end of December2013. AAPL GOOG Frequency 0 100 300 500 Frequency 0 200 400 600 0.15 0.10 0.10 0.15 Figure 1: Apple vs Google > summary(aapl) Min. 1st Qu. Median Mean 3rd Qu. Max. -0.179200-0.010400 0.001304 0.001544 0.013820 0.139000 > summary(goog) Min. 1st Qu. Median Mean 3rd Qu. Max -0.1161000-0.0082710 0.0004700 0.0009843 0.0104900 0.1999000 > skewness(aapl) [1] -0.04116922 > skewness(goog) [1] 0.7518 > kurtosis(aapl) [1] 13.10311 > kurtosis(goog) [1] 7.454387 5

AAPL GOOG 0.15 0.00 0.10 0.10 0.05 0.15 Figure 2: Boxplots Describe clearly what you learn from the summary and skewness and kurtosis empirical data. From the box plots, we can see many extreme observations out of 1.5 IQR (interquarter-range) of the lower and upper quartiles, which are confirmed by the high kurtosis values. Both kurtosis > 3 and that s why we can also see their heavy tails in the histograms. By the skewness values, GOOG is positive skew (mean much higher than median) and AAPL is not significantly skew (mean close to median). Over this period use the return distribution and box plots to describe the statistical properties of the investment returns from Apple and Google. How would you make a relative comparison? The average of AAPL is 0.13% that is almost triple of GOOG 0.047%. AAPL almost doubles its kurtosis value of GOOG (13 v.s. 7.5) and there are more extreme values of AAPL than GOOG. 6

Problem D. (20 points) Amazon is test marketing a new package delivery system. It wants to see if same-day service is feasible for packages bought with Amazon prime. In a small study of a hundred thousand delivers they find the following times for delivery Deliveries Mean-Time (Hours) Standard Deviation-Time new system 80,000 4.5 2.1 old system 20,000 5.6 2.5 1. Find a 95% confidence interval for the decrease in delivery time. Follow the hint, the answer is (-1.14, -1.06). > (4.5-5.6)-1.96*sqrt(2.1^2/80000+2.5^2/20000) [1] -1.14 > (4.5-5.6)+1.96*sqrt(2.1^2/80000+2.5^2/20000) [1] -1.06 Hence, the 95% confidence interval is (-1.14, -1.06). 2. If they switch to the new system, what proportion of deliveries will be under 5 hours which is required to guarantee same day service. As we have a large sample size, we can assume a normal distribution with given sample mean and standard deviation, > pnorm(5,mean=4.5,sd=2.1) [1] 0.594 P (T 5) = 0.594 Hence, 59.4% of deliveries will be under 5 hours. [Hint: a 95% confidence interval for a difference in means µ 1 µ 2 is given by ( x 1 x 2 ) ± s 1.96 2 1 n 1 + s2 2 n 2 ] 7