This is a open-book exam. Assigned: Friday November 27th 2009 at 16:00. Due: Monday November 30th 2009 before 10:00.

Size: px
Start display at page:

Download "This is a open-book exam. Assigned: Friday November 27th 2009 at 16:00. Due: Monday November 30th 2009 before 10:00."

Transcription

1 University of Iceland School of Engineering and Sciences Department of Industrial Engineering, Mechanical Engineering and Computer Science IÐN106F Industrial Statistics II - Bayesian Data Analysis Fall 2009 Take-home Exam This is a open-book exam. Assigned: Friday November 27th 2009 at 16:00. Due: Monday November 30th 2009 before 10:00. There is a total of five problems each problem weighing 20%. Within a problem each scenario weighs the same. Note that the number of scenarios is not the same between problems. It is assumed that students have access to Matlab or R or S+ but of course other computer programs can be used. Programs written for Matlab, R, S+ or other software should be indicated in the appendix. 1

2 1. (20%) The number of live births in Iceland from 1991 to 2008 can be found in the file birth_vs_total.txt (column 2) along with year (column 1) and total number of Icelanders ( 10 3 ) (column 3). Denote the number of births by y t, t = 1,..., T, T = 18, where t = 1 denotes the year 1991 and t = 18 denotes the year The total number of Icelanders is denoted by x t, t = 1,..., T. Assume a Poisson model for live births in Iceland of the form y i Poisson(x i θ), t = 1,..., T where θ is an unknown parameter representing the rate of birth per thousand Icelanders. Let p(θ) denote a noninformative prior for θ. Here a gamma distribution is assumed for θ, that is, p(θ) = Gamma(θ α = 0.001, β = 0.001). (a) Evaluate the posterior distribution of θ. (b) Evaluate the adequacy of the model by computing a Bayesian p-value for the following discrepancy measure T(y, θ) = T(y) = 1 T T 2 (y t x tˆθ) t=1 where T t=1 ˆθ = y t T t=1 x. t Is the proposed Poisson model adequate? If not, discuss what extensions could be made. 2

3 2. (20%) The data set in the file precip_rvk_1_2.txt contains measurements on the annual maximum daily precipitation (mm per 24 hours, from 9:00 AM to 9:00 AM the next day) recorded at two nearby station in Reykjavík over the years 1926 to (a) Plot a normal probability plot of the logarithm of the data for each station and evaluate the lognormal assumption for the data. (b) Assume the data from each station follow a lognormal distribution, that is, the logarithm of the data follow a normal distribution with mean µ i and variance σ 2 i, i = 1, 2. Use the noninformative prior p(µ i, σi 2) σ 2 i, i = 1, 2, and draw a sample of size L = 10 4 after burn-in within each of four chains from the joint posterior distribution of µ i and σi 2, i = 1, 2, by using the Gibbs sampler. Based on this sample compute the posterior mean, standard deviation, 2.5%, 25%, 50%, 75% and 97.5% percentiles for µ i and σ i, i = 1, 2. (c) Assume the data from both stations follow the same lognormal distribution with mean µ 0 and variance σ 2 0. Use the noninformative prior p(µ 0, σ 2 0 ) σ 2 0, and draw a sample of size L = 10 4 after burn-in within each of four chains from the joint posterior distribution of µ 0 and σ 2 0 by using the Gibbs sampler. Based on this sample compute the posterior mean, standard deviation, 2.5%, 25%, 50%, 75% and 97.5% percentiles for µ 0 and σ 0. (d) Assume the data from each station follow a lognormal distribution with mean µ i, i = 1, 2, and joint variance σ 2 0. Use the noninformative prior p(µ 1, µ 2, σ 2 0) σ 2 0 and draw a sample of size L = 10 4 after burn-in within each of four chains from the joint posterior distribution of µ i, i = 1, 2, and σ0 2 by using the Gibbs sampler. Based on this sample compute the posterior mean, standard deviation, 2.5%, 25%, 50%, 75% and 97.5% percentiles for µ i, i = 1, 2, and σ 0. (e) Compute DIC for the three models introduced in (b), (c) and (d). Which of these three models should be preferred according to DIC? 3

4 3. (20%) The median values of owner-occupied homes were collected for each of 506 areas of Boston along with thirteen other variables for the purpose of predicting housing values in other areas of Boston. The dependent variable is y i = log(the median value of owner-occupied homes in area i (in $1000)). The explanatory variables are x i,2 = per capita crime rate by town in area i x i,3 = percentage of residential land zoned for lots over 25,000 sq.ft. in area i x i,4 = percentage of non-retail business acres per town in area i x i,5 = Charles River dummy variable in area i (= 1 if tract bounds river; 0 otherwise) x i,6 = nitric oxides concentration in area i (parts per 10 million) x i,7 = average number of rooms per dwelling in area i x i,8 = percentage of owner-occupied units built prior to 1940 in area i x i,9 = weighted distances to five Boston employment centres in area i x i,10 = index of accessibility to radial highways in area i x i,11 = full-value property-tax rate per $10,000 in area i x i,12 = pupil-teacher ratio by town in area i x i,13 = 1000(B 0.63) 2 where B is the proportion of blacks by town in area i x i,14 = percentage of the population with lower status in area i x i,15 = (x i,3 x 3 ) 2 x i,16 = (x i,14 x 14 ) 2 where x 3 and x 14 are the sample means of x 3 and x 14, respectively. 4

5 The file boston_housing.data contains this data set with columns (x 2, x 3,..., x 14, z). Note that for this analysis the logarithm of z is needed (y = log(z)) and the variables x 15 and x 16 need to be created from x 3 and x 14. The following linear model is proposed 16 E(y i β, σ 2, X) = β j x ij, for i = 1,..., 506, with x i1 = 1 for all i. We further assume that the y s are independent, normally distributed and have equal variance, that is var(y i β, σ 2, X) = σ 2, for all i. j=1 (a) Plot y versus x 2, x 3,..., x 16, a total of 15 figures. Which explanatory variables show a clear relationship with y, and which don t? (b) Create a table with the 15 best models according to DIC where the table has columns showing DIC, the total number of parameters, and which explanatory variables are in the model. Based on the table and your knowledge of the problem, select one model for these data. This model will be used below. Use the Matlab routine dic_normal_models.m to compute all possible models, see course s web-page. To support your decision compute point estimates (posterior mean) and 95% marginal posterior intervals for the parameters β and σ in the full model, that is, the model which uses all the variables. Use L = (c) Draw a normal probability plot of the standardized residuals. Do the standardized residuals appear to follow a normal distribution? Draw the standardized residuals versus the predicted y, that is X ˆβ, and also versus all of the fifteen explanatory variables. Does the variance appear to be fixed when plotted against these variables? (d) Compute point estimates (posterior mean) and 95% marginal posterior intervals for the parameters in the final model selected in (b), that is, β and σ. Use L =

6 (e) Interpret the parameters in the model, that is, explain the effect of each explanatory variable on the median value of owner-occupied homes. Take into account the log-transformation (what is the expected value of z = exp(y) and how does it change when x i is increased by one unit?). (f) Compute a prediction and a 95% posterior predictive interval for an area in Boston that has the following explanatory variables; x 2 = 4.3, x 3 = 41, x 4 = 8.4, x 5 = 0, x 6 = 0.71, x 7 = 6.7, x 8 = 39, x 9 = 2.1, x 10 = 7, x 11 = 383, x 12 = 17.8, x 13 = 350, x 14 = 15. Take into account the log-transformation. 6

7 4. (20%) The sum of exponentially distributed random variables follows the Erlang distribution. Given that v j follows an exponential distribution with mean 1/θ, θ > 0, j = 1,..., r, then y = r j=1 follows an Erlang distribution with parameters r and θ. The parameter r is an integer which is usually known while θ is usually unknown. The Erlang distribution is a special case of the gamma distribution with α = r and β = θ. v j The density of a random variable y that follows the Erlang distribution is given by p(y r, θ) = θr y r 1 e θy, y > 0 (r 1)! and the mean and variance are E(y) = r/θ and var(y) = r/θ 2, respectively. (a) Assume n independent observations, y i, i = 1,..., n, follow an Erlang distribution with r = 20 and some unknown θ. Assume that the prior distribution is a gamma distribution with parameters α = and β = 0.001, that is p(θ) = Gamma(θ α = 0.001, β = 0.001) Find the general form of the posterior distribution of θ. (b) Evaluate the numerical values of the parameters of the posterior distribution using the data set in erlang.dat which contains n = 100 observed values. (c) Assume that it is not known whether r is equal to 18, 19, 20, 21 or 22. Further, assume that Compute the probability Pr(r = s) = 1, s = 18, 19, 20, 21, Pr(r = s y), 7

8 for s = 18, 19, 20, 21, 22, where y is a vector containing all the data. Which of these five values is most likely? (Hint: Pr(r = s y) can be evaluated analytically via integration of a gamma density. When computing Pr(r = s y), it is better to take the logarithm first, compute its value and then take the exponent.) 8

9 5. (20%) The binomial distribution is usually parameterized with n and θ and the sampling distribution of y is given by p(y n, θ) = Bin(y n, θ) = ( ) n θ y (1 θ) n y, y y {0, 1,..., n}. The binimial distribution can also be parameterized with κ = log(θ) log(1 θ), and θ and (1 θ) as function of κ are θ = eκ eκ, and 1 θ = 1 + e κ. In that case the sampling distribution of y is given by p(y n, κ) = Bin(y n, κ) = ( ) ( ) n e κ y ( ) n y 1 = y 1 + e κ 1 + e κ ( ) n e κy y (1 + e κ ) n. If the uniform distribution is assumed as a prior distribution for θ then the posterior distribution of θ is a beta distribution with α = y + 1 and β = n y + 1. If θ is transformed to κ then the posterior density of κ is given by p(κ y) = (n + 1)! y!(n y)! e κ(y+1) (1 + e κ ) (n+2). (a) Find the normal approximation to the posterior density of κ. (b) Draw the exact posterior density of κ and the approximated normal posterior density of κ on the same graph when n = 30 and y = 21. 9

Conjugate Models. Patrick Lam

Conjugate Models. Patrick Lam Conjugate Models Patrick Lam Outline Conjugate Models What is Conjugacy? The Beta-Binomial Model The Normal Model Normal Model with Unknown Mean, Known Variance Normal Model with Known Mean, Unknown Variance

More information

Non-informative Priors Multiparameter Models

Non-informative Priors Multiparameter Models Non-informative Priors Multiparameter Models Statistics 220 Spring 2005 Copyright c 2005 by Mark E. Irwin Prior Types Informative vs Non-informative There has been a desire for a prior distributions that

More information

Outline. Review Continuation of exercises from last time

Outline. Review Continuation of exercises from last time Bayesian Models II Outline Review Continuation of exercises from last time 2 Review of terms from last time Probability density function aka pdf or density Likelihood function aka likelihood Conditional

More information

Bayesian course - problem set 3 (lecture 4)

Bayesian course - problem set 3 (lecture 4) Bayesian course - problem set 3 (lecture 4) Ben Lambert November 14, 2016 1 Ticked off Imagine once again that you are investigating the occurrence of Lyme disease in the UK. This is a vector-borne disease

More information

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION INSTITUTE AND FACULTY OF ACTUARIES Curriculum 2019 SPECIMEN EXAMINATION Subject CS1A Actuarial Statistics Time allowed: Three hours and fifteen minutes INSTRUCTIONS TO THE CANDIDATE 1. Enter all the candidate

More information

What was in the last lecture?

What was in the last lecture? What was in the last lecture? Normal distribution A continuous rv with bell-shaped density curve The pdf is given by f(x) = 1 2πσ e (x µ)2 2σ 2, < x < If X N(µ, σ 2 ), E(X) = µ and V (X) = σ 2 Standard

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 31 : Estimation Sections 7.1 Statistical Inference Bayesian Methods: 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods: 7.5 Maximum Likelihood

More information

Commonly Used Distributions

Commonly Used Distributions Chapter 4: Commonly Used Distributions 1 Introduction Statistical inference involves drawing a sample from a population and analyzing the sample data to learn about the population. We often have some knowledge

More information

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS Questions 1-307 have been taken from the previous set of Exam C sample questions. Questions no longer relevant

More information

Continuous Distributions

Continuous Distributions Quantitative Methods 2013 Continuous Distributions 1 The most important probability distribution in statistics is the normal distribution. Carl Friedrich Gauss (1777 1855) Normal curve A normal distribution

More information

Describing Uncertain Variables

Describing Uncertain Variables Describing Uncertain Variables L7 Uncertainty in Variables Uncertainty in concepts and models Uncertainty in variables Lack of precision Lack of knowledge Variability in space/time Describing Uncertainty

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 40 Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: Chapter 7 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods:

More information

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence continuous rv Let X be a continuous rv. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that for any two numbers a and b with a b, P(a X b) = b a f (x)dx.

More information

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is: **BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,

More information

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior (5) Multi-parameter models - Summarizing the posterior Models with more than one parameter Thus far we have studied single-parameter models, but most analyses have several parameters For example, consider

More information

1 Bayesian Bias Correction Model

1 Bayesian Bias Correction Model 1 Bayesian Bias Correction Model Assuming that n iid samples {X 1,...,X n }, were collected from a normal population with mean µ and variance σ 2. The model likelihood has the form, P( X µ, σ 2, T n >

More information

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial Lecture 23 STAT 225 Introduction to Probability Models April 4, 2014 approximation Whitney Huang Purdue University 23.1 Agenda 1 approximation 2 approximation 23.2 Characteristics of the random variable:

More information

Chapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.

Chapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc. 1 3.1 Describing Variation Stem-and-Leaf Display Easy to find percentiles of the data; see page 69 2 Plot of Data in Time Order Marginal plot produced by MINITAB Also called a run chart 3 Histograms Useful

More information

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018 ` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.

More information

4-2 Probability Distributions and Probability Density Functions. Figure 4-2 Probability determined from the area under f(x).

4-2 Probability Distributions and Probability Density Functions. Figure 4-2 Probability determined from the area under f(x). 4-2 Probability Distributions and Probability Density Functions Figure 4-2 Probability determined from the area under f(x). 4-2 Probability Distributions and Probability Density Functions Definition 4-2

More information

MATH 3200 Exam 3 Dr. Syring

MATH 3200 Exam 3 Dr. Syring . Suppose n eligible voters are polled (randomly sampled) from a population of size N. The poll asks voters whether they support or do not support increasing local taxes to fund public parks. Let M be

More information

Bayesian Normal Stuff

Bayesian Normal Stuff Bayesian Normal Stuff - Set-up of the basic model of a normally distributed random variable with unknown mean and variance (a two-parameter model). - Discuss philosophies of prior selection - Implementation

More information

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise. Math 224 Q Exam 3A Fall 217 Tues Dec 12 Version A Problem 1. Let X be the continuous random variable defined by the following pdf: { 1 x/2 when x 2, f(x) otherwise. (a) Compute the mean µ E[X]. E[X] x

More information

CS340 Machine learning Bayesian statistics 3

CS340 Machine learning Bayesian statistics 3 CS340 Machine learning Bayesian statistics 3 1 Outline Conjugate analysis of µ and σ 2 Bayesian model selection Summarizing the posterior 2 Unknown mean and precision The likelihood function is p(d µ,λ)

More information

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions ELE 525: Random Processes in Information Systems Hisashi Kobayashi Department of Electrical Engineering

More information

Model 0: We start with a linear regression model: log Y t = β 0 + β 1 (t 1980) + ε, with ε N(0,

Model 0: We start with a linear regression model: log Y t = β 0 + β 1 (t 1980) + ε, with ε N(0, Stat 534: Fall 2017. Introduction to the BUGS language and rjags Installation: download and install JAGS. You will find the executables on Sourceforge. You must have JAGS installed prior to installing

More information

Homework Problems Stat 479

Homework Problems Stat 479 Chapter 10 91. * A random sample, X1, X2,, Xn, is drawn from a distribution with a mean of 2/3 and a variance of 1/18. ˆ = (X1 + X2 + + Xn)/(n-1) is the estimator of the distribution mean θ. Find MSE(

More information

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

Exam 2 Spring 2015 Statistics for Applications 4/9/2015 18.443 Exam 2 Spring 2015 Statistics for Applications 4/9/2015 1. True or False (and state why). (a). The significance level of a statistical test is not equal to the probability that the null hypothesis

More information

Practice Exam 1. Loss Amount Number of Losses

Practice Exam 1. Loss Amount Number of Losses Practice Exam 1 1. You are given the following data on loss sizes: An ogive is used as a model for loss sizes. Determine the fitted median. Loss Amount Number of Losses 0 1000 5 1000 5000 4 5000 10000

More information

Confidence Intervals Introduction

Confidence Intervals Introduction Confidence Intervals Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample mean X is a point estimate of the population mean μ

More information

1. You are given the following information about a stationary AR(2) model:

1. You are given the following information about a stationary AR(2) model: Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4

More information

Chapter 3 Common Families of Distributions. Definition 3.4.1: A family of pmfs or pdfs is called exponential family if it can be expressed as

Chapter 3 Common Families of Distributions. Definition 3.4.1: A family of pmfs or pdfs is called exponential family if it can be expressed as Lecture 0 on BST 63: Statistical Theory I Kui Zhang, 09/9/008 Review for the previous lecture Definition: Several continuous distributions, including uniform, gamma, normal, Beta, Cauchy, double exponential

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.

More information

Actuarial Society of India EXAMINATIONS

Actuarial Society of India EXAMINATIONS Actuarial Society of India EXAMINATIONS 7 th June 005 Subject CT6 Statistical Models Time allowed: Three Hours (0.30 am 3.30 pm) INSTRUCTIONS TO THE CANDIDATES. Do not write your name anywhere on the answer

More information

Econometric Methods for Valuation Analysis

Econometric Methods for Valuation Analysis Econometric Methods for Valuation Analysis Margarita Genius Dept of Economics M. Genius (Univ. of Crete) Econometric Methods for Valuation Analysis Cagliari, 2017 1 / 26 Correlation Analysis Simple Regression

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics March 12, 2018 CS 361: Probability & Statistics Inference Binomial likelihood: Example Suppose we have a coin with an unknown probability of heads. We flip the coin 10 times and observe 2 heads. What can

More information

Objective Bayesian Analysis for Heteroscedastic Regression

Objective Bayesian Analysis for Heteroscedastic Regression Analysis for Heteroscedastic Regression & Esther Salazar Universidade Federal do Rio de Janeiro Colóquio Inter-institucional: Modelos Estocásticos e Aplicações 2009 Collaborators: Marco Ferreira and Thais

More information

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial. Lecture 21,22, 23 Text: A Course in Probability by Weiss 8.5 STAT 225 Introduction to Probability Models March 31, 2014 Standard Sums of Whitney Huang Purdue University 21,22, 23.1 Agenda 1 2 Standard

More information

Appendix A. Selecting and Using Probability Distributions. In this appendix

Appendix A. Selecting and Using Probability Distributions. In this appendix Appendix A Selecting and Using Probability Distributions In this appendix Understanding probability distributions Selecting a probability distribution Using basic distributions Using continuous distributions

More information

# generate data num.obs <- 100 y <- rnorm(num.obs,mean = theta.true, sd = sqrt(sigma.sq.true))

# generate data num.obs <- 100 y <- rnorm(num.obs,mean = theta.true, sd = sqrt(sigma.sq.true)) Posterior Sampling from Normal Now we seek to create draws from the joint posterior distribution and the marginal posterior distributions and Note the marginal posterior distributions would be used to

More information

Applied Statistics I

Applied Statistics I Applied Statistics I Liang Zhang Department of Mathematics, University of Utah July 14, 2008 Liang Zhang (UofU) Applied Statistics I July 14, 2008 1 / 18 Point Estimation Liang Zhang (UofU) Applied Statistics

More information

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions. UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions. Random Variables 2 A random variable X is a numerical (integer, real, complex, vector etc.) summary of the outcome of the random experiment.

More information

Exam M Fall 2005 PRELIMINARY ANSWER KEY

Exam M Fall 2005 PRELIMINARY ANSWER KEY Exam M Fall 005 PRELIMINARY ANSWER KEY Question # Answer Question # Answer 1 C 1 E C B 3 C 3 E 4 D 4 E 5 C 5 C 6 B 6 E 7 A 7 E 8 D 8 D 9 B 9 A 10 A 30 D 11 A 31 A 1 A 3 A 13 D 33 B 14 C 34 C 15 A 35 A

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions. ME3620 Theory of Engineering Experimentation Chapter III. Random Variables and Probability Distributions Chapter III 1 3.2 Random Variables In an experiment, a measurement is usually denoted by a variable

More information

Chapter 5: Statistical Inference (in General)

Chapter 5: Statistical Inference (in General) Chapter 5: Statistical Inference (in General) Shiwen Shen University of South Carolina 2016 Fall Section 003 1 / 17 Motivation In chapter 3, we learn the discrete probability distributions, including Bernoulli,

More information

Data Analytics (CS40003) Practice Set IV (Topic: Probability and Sampling Distribution)

Data Analytics (CS40003) Practice Set IV (Topic: Probability and Sampling Distribution) Data Analytics (CS40003) Practice Set IV (Topic: Probability and Sampling Distribution) I. Concept Questions 1. Give an example of a random variable in the context of Drawing a card from a deck of cards.

More information

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties Posterior Inference Example. Consider a binomial model where we have a posterior distribution for the probability term, θ. Suppose we want to make inferences about the log-odds γ = log ( θ 1 θ), where

More information

Confidence Intervals for an Exponential Lifetime Percentile

Confidence Intervals for an Exponential Lifetime Percentile Chapter 407 Confidence Intervals for an Exponential Lifetime Percentile Introduction This routine calculates the number of events needed to obtain a specified width of a confidence interval for a percentile

More information

Continuous random variables

Continuous random variables Continuous random variables probability density function (f(x)) the probability distribution function of a continuous random variable (analogous to the probability mass function for a discrete random variable),

More information

MVE051/MSG Lecture 7

MVE051/MSG Lecture 7 MVE051/MSG810 2017 Lecture 7 Petter Mostad Chalmers November 20, 2017 The purpose of collecting and analyzing data Purpose: To build and select models for parts of the real world (which can be used for

More information

Chapter 7 - Lecture 1 General concepts and criteria

Chapter 7 - Lecture 1 General concepts and criteria Chapter 7 - Lecture 1 General concepts and criteria January 29th, 2010 Best estimator Mean Square error Unbiased estimators Example Unbiased estimators not unique Special case MVUE Bootstrap General Question

More information

Intro. Econometrics Fall 2015

Intro. Econometrics Fall 2015 ECO 5350 Prof. Tom Fomby Intro. Econometrics Fall 2015 MIDTERM EXAM TAKE-HOME PART KEY Assignment of Points: Q5.5 (2, 2, 3, 3) = 10 Q5.9 (2, 3, 2, 3) = 10 Q5.15 (2, 3, 3) = 8 Q5.18 (3, 3) = 6 Total = 34

More information

Chapter 7: Point Estimation and Sampling Distributions

Chapter 7: Point Estimation and Sampling Distributions Chapter 7: Point Estimation and Sampling Distributions Seungchul Baek Department of Statistics, University of South Carolina STAT 509: Statistics for Engineers 1 / 20 Motivation In chapter 3, we learned

More information

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples 1.3 Regime switching models A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples (or regimes). If the dates, the

More information

SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data

SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu September 5, 2015

More information

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom Review for Final Exam 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom THANK YOU!!!! JON!! PETER!! RUTHI!! ERIKA!! ALL OF YOU!!!! Probability Counting Sets Inclusion-exclusion principle Rule of product

More information

Common one-parameter models

Common one-parameter models Common one-parameter models In this section we will explore common one-parameter models, including: 1. Binomial data with beta prior on the probability 2. Poisson data with gamma prior on the rate 3. Gaussian

More information

Chapter 4 Probability Distributions

Chapter 4 Probability Distributions Slide 1 Chapter 4 Probability Distributions Slide 2 4-1 Overview 4-2 Random Variables 4-3 Binomial Probability Distributions 4-4 Mean, Variance, and Standard Deviation for the Binomial Distribution 4-5

More information

EE641 Digital Image Processing II: Purdue University VISE - October 29,

EE641 Digital Image Processing II: Purdue University VISE - October 29, EE64 Digital Image Processing II: Purdue University VISE - October 9, 004 The EM Algorithm. Suffient Statistics and Exponential Distributions Let p(y θ) be a family of density functions parameterized by

More information

Statistical estimation

Statistical estimation Statistical estimation Statistical modelling: theory and practice Gilles Guillot gigu@dtu.dk September 3, 2013 Gilles Guillot (gigu@dtu.dk) Estimation September 3, 2013 1 / 27 1 Introductory example 2

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions Frequentist Methods: 7.5 Maximum Likelihood Estimators

More information

Back to estimators...

Back to estimators... Back to estimators... So far, we have: Identified estimators for common parameters Discussed the sampling distributions of estimators Introduced ways to judge the goodness of an estimator (bias, MSE, etc.)

More information

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices Bachelier Finance Society Meeting Toronto 2010 Henley Business School at Reading Contact Author : d.ledermann@icmacentre.ac.uk Alexander

More information

1. Statistical problems - a) Distribution is known. b) Distribution is unknown.

1. Statistical problems - a) Distribution is known. b) Distribution is unknown. Probability February 5, 2013 Debdeep Pati Estimation 1. Statistical problems - a) Distribution is known. b) Distribution is unknown. 2. When Distribution is known, then we can have either i) Parameters

More information

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Confidence Intervals for the Difference Between Two Means with Tolerance Probability Chapter 47 Confidence Intervals for the Difference Between Two Means with Tolerance Probability Introduction This procedure calculates the sample size necessary to achieve a specified distance from the

More information

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley Outline: 1) Review of Variation & Error 2) Binomial Distributions 3) The Normal Distribution 4) Defining the Mean of a population Goals:

More information

5.3 Interval Estimation

5.3 Interval Estimation 5.3 Interval Estimation Ulrich Hoensch Wednesday, March 13, 2013 Confidence Intervals Definition Let θ be an (unknown) population parameter. A confidence interval with confidence level C is an interval

More information

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics. ENM 207 Lecture 12 Some Useful Continuous Distributions Normal Distribution The most important continuous probability distribution in entire field of statistics. Its graph, called the normal curve, is

More information

Mathematics of Finance Final Preparation December 19. To be thoroughly prepared for the final exam, you should

Mathematics of Finance Final Preparation December 19. To be thoroughly prepared for the final exam, you should Mathematics of Finance Final Preparation December 19 To be thoroughly prepared for the final exam, you should 1. know how to do the homework problems. 2. be able to provide (correct and complete!) definitions

More information

CS 237: Probability in Computing

CS 237: Probability in Computing CS 237: Probability in Computing Wayne Snyder Computer Science Department Boston University Lecture 12: Continuous Distributions Uniform Distribution Normal Distribution (motivation) Discrete vs Continuous

More information

Point Estimation. Principle of Unbiased Estimation. When choosing among several different estimators of θ, select one that is unbiased.

Point Estimation. Principle of Unbiased Estimation. When choosing among several different estimators of θ, select one that is unbiased. Point Estimation Point Estimation Definition A point estimate of a parameter θ is a single number that can be regarded as a sensible value for θ. A point estimate is obtained by selecting a suitable statistic

More information

Construction and behavior of Multinomial Markov random field models

Construction and behavior of Multinomial Markov random field models Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2010 Construction and behavior of Multinomial Markov random field models Kim Mueller Iowa State University Follow

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Random Variables Handout. Xavier Vilà

Random Variables Handout. Xavier Vilà Random Variables Handout Xavier Vilà Course 2004-2005 1 Discrete Random Variables. 1.1 Introduction 1.1.1 Definition of Random Variable A random variable X is a function that maps each possible outcome

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

Module 2 caa-global.org

Module 2 caa-global.org Certified Actuarial Analyst Resource Guide 2 Module 2 2017 caa-global.org Contents Welcome to Module 2 3 The Certified Actuarial Analyst qualification 4 The syllabus for the Module 2 exam 5 Assessment

More information

Distribution of state of nature: Main problem

Distribution of state of nature: Main problem State of nature concept Monte Carlo Simulation II Advanced Herd Management Anders Ringgaard Kristensen The hyper distribution: An infinite population of flocks each having its own state of nature defining

More information

Rules and Models 1 investigates the internal measurement approach for operational risk capital

Rules and Models 1 investigates the internal measurement approach for operational risk capital Carol Alexander 2 Rules and Models Rules and Models 1 investigates the internal measurement approach for operational risk capital 1 There is a view that the new Basel Accord is being defined by a committee

More information

Machine Learning for Quantitative Finance

Machine Learning for Quantitative Finance Machine Learning for Quantitative Finance Fast derivative pricing Sofie Reyners Joint work with Jan De Spiegeleer, Dilip Madan and Wim Schoutens Derivative pricing is time-consuming... Vanilla option pricing

More information

Chapter 4 Random Variables & Probability. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Chapter 4 Random Variables & Probability. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Chapter 4.5, 6, 8 Probability for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random variable =

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

M.Sc. ACTUARIAL SCIENCE. Term-End Examination

M.Sc. ACTUARIAL SCIENCE. Term-End Examination No. of Printed Pages : 15 LMJA-010 (F2F) M.Sc. ACTUARIAL SCIENCE Term-End Examination O CD December, 2011 MIA-010 (F2F) : STATISTICAL METHOD Time : 3 hours Maximum Marks : 100 SECTION - A Attempt any five

More information

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions

Probability Theory and Simulation Methods. April 9th, Lecture 20: Special distributions April 9th, 2018 Lecture 20: Special distributions Week 1 Chapter 1: Axioms of probability Week 2 Chapter 3: Conditional probability and independence Week 4 Chapters 4, 6: Random variables Week 9 Chapter

More information

DYNAMIC ECONOMETRIC MODELS Vol. 8 Nicolaus Copernicus University Toruń Mateusz Pipień Cracow University of Economics

DYNAMIC ECONOMETRIC MODELS Vol. 8 Nicolaus Copernicus University Toruń Mateusz Pipień Cracow University of Economics DYNAMIC ECONOMETRIC MODELS Vol. 8 Nicolaus Copernicus University Toruń 2008 Mateusz Pipień Cracow University of Economics On the Use of the Family of Beta Distributions in Testing Tradeoff Between Risk

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

PhD Qualifier Examination

PhD Qualifier Examination PhD Qualifier Examination Department of Agricultural Economics May 29, 2013 Instructions The exam consists of six questions. You must answer all questions. If you need an assumption to complete a question,

More information

Stochastic Loss Reserving with Bayesian MCMC Models Revised March 31

Stochastic Loss Reserving with Bayesian MCMC Models Revised March 31 w w w. I C A 2 0 1 4. o r g Stochastic Loss Reserving with Bayesian MCMC Models Revised March 31 Glenn Meyers FCAS, MAAA, CERA, Ph.D. April 2, 2014 The CAS Loss Reserve Database Created by Meyers and Shi

More information

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. Summary of the previous lecture Moments of a distribubon Measures of

More information

Sampling Distribution

Sampling Distribution MAT 2379 (Spring 2012) Sampling Distribution Definition : Let X 1,..., X n be a collection of random variables. We say that they are identically distributed if they have a common distribution. Definition

More information

1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by. Cov(X, Y ) = E(X E(X))(Y E(Y ))

1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by. Cov(X, Y ) = E(X E(X))(Y E(Y )) Correlation & Estimation - Class 7 January 28, 2014 Debdeep Pati Association between two variables 1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by Cov(X, Y ) = E(X E(X))(Y

More information

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions Chapter 4 Probability Distributions 4-1 Overview 4-2 Random Variables 4-3 Binomial Probability Distributions 4-4 Mean, Variance, and Standard Deviation for the Binomial Distribution 4-5 The Poisson Distribution

More information

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS Copyright 2008 by the Society of Actuaries and the Casualty Actuarial Society

More information

Earnings Inequality and the Minimum Wage: Evidence from Brazil

Earnings Inequality and the Minimum Wage: Evidence from Brazil Earnings Inequality and the Minimum Wage: Evidence from Brazil Niklas Engbom June 16, 2016 Christian Moser World Bank-Bank of Spain Conference This project Shed light on drivers of earnings inequality

More information

Lecture 10: Point Estimation

Lecture 10: Point Estimation Lecture 10: Point Estimation MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 31 Basic Concepts of Point Estimation A point estimate of a parameter θ,

More information

Lecture 3: Probability Distributions (cont d)

Lecture 3: Probability Distributions (cont d) EAS31116/B9036: Statistics in Earth & Atmospheric Sciences Lecture 3: Probability Distributions (cont d) Instructor: Prof. Johnny Luo www.sci.ccny.cuny.edu/~luo Dates Topic Reading (Based on the 2 nd Edition

More information

Final Exam Suggested Solutions

Final Exam Suggested Solutions University of Washington Fall 003 Department of Economics Eric Zivot Economics 483 Final Exam Suggested Solutions This is a closed book and closed note exam. However, you are allowed one page of handwritten

More information

Dealing with forecast uncertainty in inventory models

Dealing with forecast uncertainty in inventory models Dealing with forecast uncertainty in inventory models 19th IIF workshop on Supply Chain Forecasting for Operations Lancaster University Dennis Prak Supervisor: Prof. R.H. Teunter June 29, 2016 Dennis Prak

More information

3 ˆθ B = X 1 + X 2 + X 3. 7 a) Find the Bias, Variance and MSE of each estimator. Which estimator is the best according

3 ˆθ B = X 1 + X 2 + X 3. 7 a) Find the Bias, Variance and MSE of each estimator. Which estimator is the best according STAT 345 Spring 2018 Homework 9 - Point Estimation Name: Please adhere to the homework rules as given in the Syllabus. 1. Mean Squared Error. Suppose that X 1, X 2 and X 3 are independent random variables

More information

1. For two independent lives now age 30 and 34, you are given:

1. For two independent lives now age 30 and 34, you are given: Society of Actuaries Course 3 Exam Fall 2003 **BEGINNING OF EXAMINATION** 1. For two independent lives now age 30 and 34, you are given: x q x 30 0.1 31 0.2 32 0.3 33 0.4 34 0.5 35 0.6 36 0.7 37 0.8 Calculate

More information

Further Application of Confidence Limits to Quantile Measures for the Lognormal Distribution using the MATLAB Program

Further Application of Confidence Limits to Quantile Measures for the Lognormal Distribution using the MATLAB Program Further Application of Confidence Limits to Quantile Measures for the Lognormal Distribution using the MATLAB Program Introduction In the prior discussion as posted on the Petrocenter website, mean and

More information