Lecture III. 1. common parametric models 2. model fitting 2a. moment matching 2b. maximum likelihood 3. hypothesis testing 3a. p-values 3b.
|
|
- Flora Porter
- 5 years ago
- Views:
Transcription
1 Lecture III 1. common parametric models 2. model fitting 2a. moment matching 2b. maximum likelihood 3. hypothesis testing 3a. p-values 3b. simulation
2 Parameters Parameters are knobs that control the amount of probability we assign to particular events. In the last homework, λ was a parameter. Parameters let us assign probabilities to many events (even an infinity of them) using just a few numbers. Simplicity, simplicity, simplicity!
3 Bernoulli In the Bernoulli coin flip model, the only parameter is the probability of a success. p x (1 p) 1 x, x {0, 1} A success could be defined as: 1. a win or a loss in a sporting event 2. a profit or a loss in a fiscal year 3. a male or female newborn...
4 Binomial When we go on to consider the sum of n Bernoulli trials, we get a binomial model, which has parameters n and p. Pr(X = x) = n p x (1 p) n x x X {0, 1,,n} Could be used to model votes in a population, season win-loss record, number of students who get an A in a class...
5 Binomial Pr(X = x) = n p x (1 p) n x x X {0, 1,,n} > n <- 10 > x <- seq(0,n,by=1) > px <- dbinom(x,n,0.3) > barplot(px,names=x)
6 Binomial Pr(X = x) = n p x (1 p) n x x X {0, 1,,n} > n <- 10 > x <- seq(0,n,by=1) > px <- dbinom(x,n,0.8) > barplot(px,names=x)
7 Binomial Pr(X = x) = n p x (1 p) n x x X {0, 1,,n} > n <- 55 > x <- seq(0,n,by=1) > px <- dbinom(x,n,0.8) > barplot(px,names=x)
8 Poisson For large n we might consider a model which allows arbitrarily large integers. Pr(X = x) = λx x! exp ( λ) E(X) =λ Var(X) =λ Could be used to model number of infections in a large population (epidemiology), number of web hits in a given day (e-commerce) or counts of a certain word in a document (text analysis). What purposed does the exponent serve in this expression?
9 Poisson Pr(X = x) = λx x! exp ( λ) > lambda <- 1 > x <- seq(0,15,by=1) > px <- dpois(x,lambda) > barplot(px,names=x)
10 Poisson Pr(X = x) = λx x! exp ( λ) > lambda <- 3 > x <- seq(0,15,by=1) > px <- dpois(x,lambda) > barplot(px,names=x)
11 Poisson Pr(X = x) = λx x! exp ( λ) > lambda <- 20 > x <- seq(0,50,by=1) > px <- dpois(x,lambda) > barplot(px,names=x)
12 Normal distribution The renown of this distribution is well-deserved, as it arises in a variety of different contexts. We will see this connections in a moment. f(x) = 1 2πσ exp 1 2 (x µ) 2 σ 2 We can use this distribution to model log-returns of stocks, heights of individuals in the general population, or astronomical measurements taken by different individuals at different times, etc.
13 Normal distribution The parameters of the normal distribution are easy to interpret, because they give the mean and variance of the distribution, respectively. The expected value and the dispersion. f(x) = 1 2πσ exp 1 2 (x µ) 2 σ 2 E(X) =µ Var(X) =σ 2 E(X) = Var(X) = xf(x)dx (x E(X)) 2 f(x)dx Just remember, integrals are basically sums.
14 Normal distribution f(x) = 1 2πσ exp 1 2 (x µ) 2 σ 2 fx > mu <- -5 > sigma <- 1 > x<- seq(-10,10,by=0.1) > fx <- dnorm(x,mu,sigma) > plot(x,fx,type='l',bty='n') x
15 Normal distribution f(x) = 1 2πσ exp 1 2 (x µ) 2 σ 2 fx x > mu <- 1 > sigma <- 2.5 > x<- seq(-10,10,by=0.1) > fx <- dnorm(x,mu,sigma) > plot(x,fx,type='l',bty='n')
16 Central limit theorems As n gets large, the binomial distribution starts to look like a normal distribution with parameters N(np, np(1 p)) p = 0.3, n = 10
17 Central limit theorems As n gets large, the binomial distribution starts to look like a normal distribution with parameters N(np, np(1 p)) p = 0.3, n = 20
18 Central limit theorems As n gets large, the binomial distribution starts to look like a normal distribution with parameters N(np, np(1 p)) p = 0.3, n = 100
19 Central limit theorems One can also approximate the Poisson distribution using a normal distribution, again by making the mean and variance match
20 Central limit theorems A CLT is a theoretical result which says that sums and averages of random variables with finite mean and variance come to be normally distributed as the number of terms in the sum grows large. The binomial and Poisson approximations are a special case of this. Recall that a binomial random variable is a sum of independent Bernoulli random variables. The Poisson distribution can be similarly characterized.
21 Quincunx
22 Quincunx
23 Linear combinations of normals A fact about the normal distribution which is related to the central limit theorem is that weighted sums of normal random variables are still normally distributed, just with different parameters. X N(µ, σ 2 ) Y = α + βx Y N(α + βµ,β 2 σ 2 )
24 Linear combinations of normals In the case of two independent random variates we have: X 1 N(µ 1,v 1 ) X 2 N(µ 2,v 2 ) Y = α + β 1 X 1 + β 2 X 2 Y N(α + β 1 µ 1 + β 2 µ 2,β1v β 2 v 2 ) Intuitively, this works because the normal distribution is uniquely characterized by its first and second moment (e.g., its mean and variance), and...
25 Mean and variance of linear combinations RVs... in general, mean and variance of linear combinations may be calculated as Y = α + β 1 X 1 + β 2 X 2 Var(Y ) = Var(α + β 1 X 1 + β 2 X 2 ) = Var(α) + Var(β 1 X 1 ) + Var(β 2 X 2 ) = β1var(x 2 1 )+β2var(x 2 2 ) E(Y )=E(α + β 1 X 1 + β 2 X 2 ) =E(α)+E(β 1 X 1 )+E(β 2 X 2 ) = α + β 1 E(X 1 )+β 2 E(X 2 )
26 Exponential distribution For strictly positive quantities, an exponential distribution may be appropriate. fx f(x) = exp ( x/λ) λ E(X) =λ Var(X) =λ 2 Look familiar? x
27 Exponential distribution We have been reporting the probability density functions for these distributions, but we could as well have reported the cumulative distribution functions. F (x) =1 exp x λ Fx What is the inverse of this? x
28 Using the CDF Recall that CDFs allow us to compute the probability assigned to intervals. Fx x
29 Using the CDF In terms of the pdf, this is the area under the curve on that interval. I think the CDF is easier to think about -- remember the darts! fx x
30 t distribution Recall the emerging markets data had a decent proportion of astronomical profits/losses. The normal distribution does not have this feature (shown in green for comparison). f(x) = Γ( ν+1 2 ) νπγ( ν 2 ) 1+ (x µ)2 ν ν+1 2 fx x
31 Cauchy distribution With nu=1 we have the Cauchy distribution. The mean of this distribution doesn t exist! Practically, this means that if you collected a bunch of realizations of Cauchy random variables, the sample mean would never settle down no matter how many samples you collect. Cumulative mean Number of samples
32 Model fitting So far we ve considered the forward direction, where we specify a probability distribution from which we can calculate the probability of various events. There are two additional components to modeling real data. First is to figure out what to set the parameter values at, based on the observed data. This step is called estimation or model fitting. Last is the uncertainty assessment part, we try to characterize how confident we are in our estimates. This can be done with techniques called hypothesis testing and confidence intervals.
33 Moment matching One intuitive approach to model fitting is to select the parameter so that certain features of the observed data match up with features of the parametric distribution. Specifically, we can pick the parameter so that the population mean is equal to the observed sample mean. So in the case of a normal model, we might set ˆµ = x n ˆσ2 = n j=1 (x j x n ) 2 n
34 Sample mean vs population mean These are not the same thing! A random variable is a curve that maps intervals in the domain to numbers between zero and one. The population mean is an attribute of this curve. The sample mean is a function of random draws generated according to the chance process described by the curve. So the sample mean itself is a random variable! At least until we observe it. Once we observe it, its data.
35 Sample mean of normal RV Recall that the sum of independent normal random variables is also a normal random variable. This lets us figure out what the distribution of the sample mean of normal random variables is. X j N(µ, σ 2 ), j =1,...,n X n N µ, σ2 n Notice that its distribution depends on the parameter values of the data random variables.
36 Statistics This is true in general of statistics of random variables --- that they are themselves random variables. Their distributions are called the sampling distribution of the statistic. What is a statistic? Any function of a random variable or variables. Can you think of some examples? If you start with a Bernoulli model, what statistic has a binomial sampling distribution?
37 Example: Win-Loss The Bulls had a record at the end of the season. If we model each of their games as a Bernoulli random variable with probability p of winning, then 72 wins can be considered a draw from a binomial distribution with parameters p and n=82. The expected value of a binomial random variable is np, so we set np = 72, giving ˆp = 72 82
38 Example: Word Count If we go to major online news outlets (CNN, Fox, ABC, Slate, Washington Post, NY Times, etc), we can do a search for Romney ˆλ =
39 Example: Word Count If we go to major online news outlets (CNN, Fox, ABC, Slate, Washington Post, NY Times, etc), we can do a search for Romney ˆλ =4.66
40 Example: Word Count If we go to major online news outlets (CNN, Fox, ABC, Slate, Washington Post, NY Times, etc), we can do a search for Romney ˆλ =4.66 Is this a good fit? ˆλ 2 = 10.35
41 Example: Daily Stock Prices What if we look at the daily change in stock returns measured in log dollars? Consider Apple. Differenced log close price of Apple Inc. Density Change in Log Price
42 Example: Daily Stock Prices There are probably more unusual events than the normal model would suggest. Change in log close Log dollars
43 Example: Daily Stock Prices There are probably more unusual events than the normal model would suggest. Change in log close Log dollars
44 Example: Daily Stock Prices Normal Q-Q Plot Sample Quantiles Theoretical Quantiles
45 Maximum Likelihood Pia is 31 years old, single, outspoken, and smart. She was a philosophy major. When a student, she was an ardent supporter of Native American rights, and she picketed a department store that had no facilities for nursing mothers. Rank the following statements in order of probability from 1 (most probable) to 6 (least probable). a) Pia is an active feminist. b) Pia is a bank teller. c) Pia works in a small bookstore. d) Pia is a bank teller and an active feminist. e) Pia is a bank teller and an active feminist who takes yoga classes. f) Pia works in a small bookstore and is an active feminist who takes yoga classes.
46 Maximum Likelihood Principle: Pick the model (from some circumscribed class) that makes the data look best in the sense of being most likely under that model. In many cases, moment matching and maximum likelihood estimates coincide. This is true in the case of the Normal distribution, for example.
47 Maximum Likelihood Consider again the word count data. We can compute and plot the likelihood as a function of lambda: Why do/can we take logarithms here? Why is the log-likelihood written as a sum? Log-Likelihood λ > lam <- seq(0.2,10,by=0.01) > b<-rep(0,length(lam)) > for (j in 1:length(lam)){ + b[j] <- sum(dpois(c,lam[j],log =TRUE))} > plot(lam,b,xlab='lambda',ylab='log-likelihood',type='l') > lines(c(mean(c),mean(c)),c(-350,-10),col='red',lwd=3)
48 The Logic of Hypothesis Testing We keep hinting at the idea that we want to gauge our confidence in our parameter estimates. We begin with a special case where we have an a priori parameter value of interest. This parameter setting is called a null hypothesis, because it often encodes the idea that nothing interesting is going on or that the data could have happened by chance. The approach is this: consider the sampling distribution of a statistic of the data if the null hypothesis were true; if the observed statistic is unlikely under that distribution we reject the null hypothesis, otherwise we fail to reject.
49 p-values The approach is this: consider the sampling distribution of a statistic of the data if the null hypothesis were true; if the observed statistic is unlikely under that distribution we reject the null hypothesis, otherwise we fail to reject. The way we measure unlikeliness is via a tail probability: the probability (under the null) of drawing a statistic as-or-more extreme as the one we actually saw.
50 Example: difference of two means Do I wake up earlier on Monday than I do on Friday? To answer this question, imagine I collected the time I woke up on 20 consecutive weeks. If every Friday time was later than that week s Monday time, what would we conclude? In general, some Fridays I will rise later, but some I will rise earlier. The question is how can I decide if this effect is real, or just happened by chance?
51 Example: difference of two means Assume for simplicity that the time I wake up on Fridays is given by as F N(µ F, 100 min) and that my Monday wake up time is randomly distributed as M N(µ M, 100 min) then the sample mean of the difference between my wake up times is X i=1 (M i F i ), X N µ M µ F, 200 min 20
52 Example: difference of two means X i=1 (M i F i ), X N µ M µ F, 200 min 20 How can we use this information to decide if there is a significant difference? Here is one way to judge: we want a rule that if we follow it will lead us to incorrectly reject the null only rarely. H 0 : µ M µ F =0 H A : µ M µ F =0
53 Example: difference of two means X i=1 (M i F i ), X N µ M µ F, H 0 : µ M µ F =0 H A : µ M µ F =0 200 min 20 Rule: reject the null hypothesis if x min 20 or x min 20
54 Example: difference of two means Rule: reject the null hypothesis if x 1.96 Why does this work? 200 min 20 or x min 20 fx x 1 - (pnorm(1.96*sqrt(10),0,sqrt(10))- pnorm(-1.96*sqrt(10),0,sqrt(10))) = 5%
55 Example: difference of two means we want a rule that if we follow it will lead us to incorrectly reject the null only rarely In the previous example we followed the conventional 5% significance level. There is nothing in principle special about 5%. Can anyone think of a problem with this approach? We will come back to this point in a minute.
56 Simulation This example -- and indeed, much of recorded statistical methodology -- relies on mathematical derivations of sampling distributions for particular statistics. Modern computing technology has made hypothesis testing much easier, by allowing simulating from the null distribution. Consider solving the same problem as before by simulation in R: > xnull <- rnorm(10000,0,sqrt(10)) > quantile(xnull,0.025) > quantile(xnull,0.975)
57 Permutation tests Sometimes we can use the logic of hypothesis testing without making parametric assumptions. For example, if it were really the case that Mondays and Fridays were equivalent in terms of my wake-up times, then we would expect that shuffling the observed difference between the average wake-up time would not luck much different if we shuffled the days up. > tf <- c(-20, 10, 2, 3, -2, -1, 1, 6,20,-1,-1,0,0,0,1,2,5,5,4,30) > tm <- c(-17,0,5,0,-1,0,5,-20,5,5,3,3,-1,2,23,11,13,6,-4,-14) Would the parametric model say about the null hypothesis in this case?
58 Permutation tests Monday wake up times Frequency Minutes before or after alarm Friday wake up times Frequency Minutes before or after alarm
59 Permutation tests Difference in wake up times Frequency Minutes before or after alarm Does the data look to support the null hypothesis?
60 Permutation tests > tall <- c(tm,tf) > inds<-1:40 > reps< > m<-rep(0,reps) > for (j in 1:10000){ + idx <- sample(inds,20) + idx2 <- inds[setdiff(inds,idx)] + m[j] <- mean(tall[idx2]-tall[idx])} Null distribution Frequency Difference in wake-up times
Random Variables Handout. Xavier Vilà
Random Variables Handout Xavier Vilà Course 2004-2005 1 Discrete Random Variables. 1.1 Introduction 1.1.1 Definition of Random Variable A random variable X is a function that maps each possible outcome
More informationcontinuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence
continuous rv Let X be a continuous rv. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that for any two numbers a and b with a b, P(a X b) = b a f (x)dx.
More informationLecture 2. Probability Distributions Theophanis Tsandilas
Lecture 2 Probability Distributions Theophanis Tsandilas Comment on measures of dispersion Why do common measures of dispersion (variance and standard deviation) use sums of squares: nx (x i ˆµ) 2 i=1
More informationCommonly Used Distributions
Chapter 4: Commonly Used Distributions 1 Introduction Statistical inference involves drawing a sample from a population and analyzing the sample data to learn about the population. We often have some knowledge
More information(Practice Version) Midterm Exam 1
EECS 126 Probability and Random Processes University of California, Berkeley: Fall 2014 Kannan Ramchandran September 19, 2014 (Practice Version) Midterm Exam 1 Last name First name SID Rules. DO NOT open
More informationLecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial
Lecture 23 STAT 225 Introduction to Probability Models April 4, 2014 approximation Whitney Huang Purdue University 23.1 Agenda 1 approximation 2 approximation 23.2 Characteristics of the random variable:
More informationHomework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a
Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Announcements: There are some office hour changes for Nov 5, 8, 9 on website Week 5 quiz begins after class today and ends at
More informationME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.
ME3620 Theory of Engineering Experimentation Chapter III. Random Variables and Probability Distributions Chapter III 1 3.2 Random Variables In an experiment, a measurement is usually denoted by a variable
More informationChapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi
Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized
More informationChapter 5. Statistical inference for Parametric Models
Chapter 5. Statistical inference for Parametric Models Outline Overview Parameter estimation Method of moments How good are method of moments estimates? Interval estimation Statistical Inference for Parametric
More informationSTA 6166 Fall 2007 Web-based Course. Notes 10: Probability Models
STA 6166 Fall 2007 Web-based Course 1 Notes 10: Probability Models We first saw the normal model as a useful model for the distribution of some quantitative variables. We ve also seen that if we make a
More information4-1. Chapter 4. Commonly Used Distributions by The McGraw-Hill Companies, Inc. All rights reserved.
4-1 Chapter 4 Commonly Used Distributions 2014 by The Companies, Inc. All rights reserved. Section 4.1: The Bernoulli Distribution 4-2 We use the Bernoulli distribution when we have an experiment which
More informationUQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.
UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions. Random Variables 2 A random variable X is a numerical (integer, real, complex, vector etc.) summary of the outcome of the random experiment.
More informationProbability. An intro for calculus students P= Figure 1: A normal integral
Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided
More informationReview for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom
Review for Final Exam 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom THANK YOU!!!! JON!! PETER!! RUTHI!! ERIKA!! ALL OF YOU!!!! Probability Counting Sets Inclusion-exclusion principle Rule of product
More informationChapter 4 and 5 Note Guide: Probability Distributions
Chapter 4 and 5 Note Guide: Probability Distributions Probability Distributions for a Discrete Random Variable A discrete probability distribution function has two characteristics: Each probability is
More informationStatistics and Probability
Statistics and Probability Continuous RVs (Normal); Confidence Intervals Outline Continuous random variables Normal distribution CLT Point estimation Confidence intervals http://www.isrec.isb-sib.ch/~darlene/geneve/
More informationCase Study: Heavy-Tailed Distribution and Reinsurance Rate-making
Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making May 30, 2016 The purpose of this case study is to give a brief introduction to a heavy-tailed distribution and its distinct behaviors in
More informationBusiness Statistics 41000: Probability 4
Business Statistics 41000: Probability 4 Drew D. Creal University of Chicago, Booth School of Business February 14 and 15, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office:
More informationThe Bernoulli distribution
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationChapter 5. Sampling Distributions
Lecture notes, Lang Wu, UBC 1 Chapter 5. Sampling Distributions 5.1. Introduction In statistical inference, we attempt to estimate an unknown population characteristic, such as the population mean, µ,
More informationMATH 3200 Exam 3 Dr. Syring
. Suppose n eligible voters are polled (randomly sampled) from a population of size N. The poll asks voters whether they support or do not support increasing local taxes to fund public parks. Let M be
More informationNormal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.
Lecture 21,22, 23 Text: A Course in Probability by Weiss 8.5 STAT 225 Introduction to Probability Models March 31, 2014 Standard Sums of Whitney Huang Purdue University 21,22, 23.1 Agenda 1 2 Standard
More informationProbability Models.S2 Discrete Random Variables
Probability Models.S2 Discrete Random Variables Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Results of an experiment involving uncertainty are described by one or more random
More informationA useful modeling tricks.
.7 Joint models for more than two outcomes We saw that we could write joint models for a pair of variables by specifying the joint probabilities over all pairs of outcomes. In principal, we could do this
More informationDiscrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)
3 Discrete Random Variables and Probability Distributions Stat 4570/5570 Based on Devore s book (Ed 8) Random Variables We can associate each single outcome of an experiment with a real number: We refer
More information6. Continous Distributions
6. Continous Distributions Chris Piech and Mehran Sahami May 17 So far, all random variables we have seen have been discrete. In all the cases we have seen in CS19 this meant that our RVs could only take
More informationStatistical Tables Compiled by Alan J. Terry
Statistical Tables Compiled by Alan J. Terry School of Science and Sport University of the West of Scotland Paisley, Scotland Contents Table 1: Cumulative binomial probabilities Page 1 Table 2: Cumulative
More informationLecture 10: Point Estimation
Lecture 10: Point Estimation MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 31 Basic Concepts of Point Estimation A point estimate of a parameter θ,
More informationReview. Binomial random variable
Review Discrete RV s: prob y fctn: p(x) = Pr(X = x) cdf: F(x) = Pr(X x) E(X) = x x p(x) SD(X) = E { (X - E X) 2 } Binomial(n,p): no. successes in n indep. trials where Pr(success) = p in each trial If
More informationUnit 5: Sampling Distributions of Statistics
Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate
More informationUnit 5: Sampling Distributions of Statistics
Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate
More informationIntroduction to Probability and Inference HSSP Summer 2017, Instructor: Alexandra Ding July 19, 2017
Introduction to Probability and Inference HSSP Summer 2017, Instructor: Alexandra Ding July 19, 2017 Please fill out the attendance sheet! Suggestions Box: Feedback and suggestions are important to the
More informationThe Binomial Distribution
The Binomial Distribution January 31, 2018 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The
More informationMAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw
MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment
More informationBIOL The Normal Distribution and the Central Limit Theorem
BIOL 300 - The Normal Distribution and the Central Limit Theorem In the first week of the course, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are
More informationVersion A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.
Math 224 Q Exam 3A Fall 217 Tues Dec 12 Version A Problem 1. Let X be the continuous random variable defined by the following pdf: { 1 x/2 when x 2, f(x) otherwise. (a) Compute the mean µ E[X]. E[X] x
More informationThe Binomial Distribution
The Binomial Distribution January 31, 2019 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The
More informationINDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.
INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. Summary of the previous lecture Moments of a distribubon Measures of
More informationECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 10: Continuous RV Families. Prof. Vince Calhoun
ECE 340 Probabilistic Methods in Engineering M/W 3-4:15 Lecture 10: Continuous RV Families Prof. Vince Calhoun 1 Reading This class: Section 4.4-4.5 Next class: Section 4.6-4.7 2 Homework 3.9, 3.49, 4.5,
More informationTwo hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER
Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.
More information6. Genetics examples: Hardy-Weinberg Equilibrium
PBCB 206 (Fall 2006) Instructor: Fei Zou email: fzou@bios.unc.edu office: 3107D McGavran-Greenberg Hall Lecture 4 Topics for Lecture 4 1. Parametric models and estimating parameters from data 2. Method
More informationDeriving the Black-Scholes Equation and Basic Mathematical Finance
Deriving the Black-Scholes Equation and Basic Mathematical Finance Nikita Filippov June, 7 Introduction In the 97 s Fischer Black and Myron Scholes published a model which would attempt to tackle the issue
More informationElementary Statistics Lecture 5
Elementary Statistics Lecture 5 Sampling Distributions Chong Ma Department of Statistics University of South Carolina Chong Ma (Statistics, USC) STAT 201 Elementary Statistics 1 / 24 Outline 1 Introduction
More informationE509A: Principle of Biostatistics. GY Zou
E509A: Principle of Biostatistics (Week 2: Probability and Distributions) GY Zou gzou@robarts.ca Reporting of continuous data If approximately symmetric, use mean (SD), e.g., Antibody titers ranged from
More informationMLLunsford 1. Activity: Central Limit Theorem Theory and Computations
MLLunsford 1 Activity: Central Limit Theorem Theory and Computations Concepts: The Central Limit Theorem; computations using the Central Limit Theorem. Prerequisites: The student should be familiar with
More informationSome Characteristics of Data
Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key
More informationThe Normal Distribution
Will Monroe CS 09 The Normal Distribution Lecture Notes # July 9, 207 Based on a chapter by Chris Piech The single most important random variable type is the normal a.k.a. Gaussian) random variable, parametrized
More informationWelcome to Stat 410!
Welcome to Stat 410! Personnel Instructor: Liang, Feng TA: Gan, Gary (Lingrui) Instructors/TAs from two other sessions Websites: Piazza and Compass Homework When, where and how to submit your homework
More informationMath489/889 Stochastic Processes and Advanced Mathematical Finance Homework 5
Math489/889 Stochastic Processes and Advanced Mathematical Finance Homework 5 Steve Dunbar Due Fri, October 9, 7. Calculate the m.g.f. of the random variable with uniform distribution on [, ] and then
More informationWeek 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals
Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :
More informationSTA258H5. Al Nosedal and Alison Weir. Winter Al Nosedal and Alison Weir STA258H5 Winter / 41
STA258H5 Al Nosedal and Alison Weir Winter 2017 Al Nosedal and Alison Weir STA258H5 Winter 2017 1 / 41 NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION. Al Nosedal and Alison Weir STA258H5 Winter 2017
More informationExam 2 Spring 2015 Statistics for Applications 4/9/2015
18.443 Exam 2 Spring 2015 Statistics for Applications 4/9/2015 1. True or False (and state why). (a). The significance level of a statistical test is not equal to the probability that the null hypothesis
More informationSTAT Chapter 7: Central Limit Theorem
STAT 251 - Chapter 7: Central Limit Theorem In this chapter we will introduce the most important theorem in statistics; the central limit theorem. What have we seen so far? First, we saw that for an i.i.d
More informationStatistics & Flood Frequency Chapter 3. Dr. Philip B. Bedient
Statistics & Flood Frequency Chapter 3 Dr. Philip B. Bedient Predicting FLOODS Flood Frequency Analysis n Statistical Methods to evaluate probability exceeding a particular outcome - P (X >20,000 cfs)
More informationDefinition 9.1 A point estimate is any function T (X 1,..., X n ) of a random sample. We often write an estimator of the parameter θ as ˆθ.
9 Point estimation 9.1 Rationale behind point estimation When sampling from a population described by a pdf f(x θ) or probability function P [X = x θ] knowledge of θ gives knowledge of the entire population.
More informationINSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION
INSTITUTE AND FACULTY OF ACTUARIES Curriculum 2019 SPECIMEN EXAMINATION Subject CS1A Actuarial Statistics Time allowed: Three hours and fifteen minutes INSTRUCTIONS TO THE CANDIDATE 1. Enter all the candidate
More informationSYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data
SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu September 5, 2015
More informationProbability Theory. Probability and Statistics for Data Science CSE594 - Spring 2016
Probability Theory Probability and Statistics for Data Science CSE594 - Spring 2016 What is Probability? 2 What is Probability? Examples outcome of flipping a coin (seminal example) amount of snowfall
More informationStatistics 6 th Edition
Statistics 6 th Edition Chapter 5 Discrete Probability Distributions Chap 5-1 Definitions Random Variables Random Variables Discrete Random Variable Continuous Random Variable Ch. 5 Ch. 6 Chap 5-2 Discrete
More informationIEOR 165 Lecture 1 Probability Review
IEOR 165 Lecture 1 Probability Review 1 Definitions in Probability and Their Consequences 1.1 Defining Probability A probability space (Ω, F, P) consists of three elements: A sample space Ω is the set
More informationHomework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82
Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections
More informationBusiness Statistics 41000: Probability 3
Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404
More informationConverting to the Standard Normal rv: Exponential PDF and CDF for x 0 Chapter 7: expected value of x
Key Formula Sheet ASU ECN 22 ASWCC Chapter : no key formulas Chapter 2: Relative Frequency=freq of the class/n Approx Class Width: =(largest value-smallest value) /number of classes Chapter 3: sample and
More informationMVE051/MSG Lecture 7
MVE051/MSG810 2017 Lecture 7 Petter Mostad Chalmers November 20, 2017 The purpose of collecting and analyzing data Purpose: To build and select models for parts of the real world (which can be used for
More information5. In fact, any function of a random variable is also a random variable
Random Variables - Class 11 October 14, 2012 Debdeep Pati 1 Random variables 1.1 Expectation of a function of a random variable 1. Expectation of a function of a random variable 2. We know E(X) = x xp(x)
More informationLecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions
Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions ELE 525: Random Processes in Information Systems Hisashi Kobayashi Department of Electrical Engineering
More informationStatistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015
Statistical Analysis of Data from the Stock Markets UiO-STK4510 Autumn 2015 Sampling Conventions We observe the price process S of some stock (or stock index) at times ft i g i=0,...,n, we denote it by
More informationBernoulli and Binomial Distributions
Bernoulli and Binomial Distributions Bernoulli Distribution a flipped coin turns up either heads or tails an item on an assembly line is either defective or not defective a piece of fruit is either damaged
More informationINF FALL NATURAL LANGUAGE PROCESSING. Jan Tore Lønning, Lecture 3, 1.9
INF5830 015 FALL NATURAL LANGUAGE PROCESSING Jan Tore Lønning, Lecture 3, 1.9 Today: More statistics Binomial distribution Continuous random variables/distributions Normal distribution Sampling and sampling
More informationLecture Notes 6. Assume F belongs to a family of distributions, (e.g. F is Normal), indexed by some parameter θ.
Sufficient Statistics Lecture Notes 6 Sufficiency Data reduction in terms of a particular statistic can be thought of as a partition of the sample space X. Definition T is sufficient for θ if the conditional
More informationBinomial Approximation and Joint Distributions Chris Piech CS109, Stanford University
Binomial Approximation and Joint Distributions Chris Piech CS109, Stanford University Four Prototypical Trajectories Review The Normal Distribution X is a Normal Random Variable: X ~ N(µ, s 2 ) Probability
More informationChapter 7: Point Estimation and Sampling Distributions
Chapter 7: Point Estimation and Sampling Distributions Seungchul Baek Department of Statistics, University of South Carolina STAT 509: Statistics for Engineers 1 / 20 Motivation In chapter 3, we learned
More informationHomework Assignments
Homework Assignments Week 1 (p. 57) #4.1, 4., 4.3 Week (pp 58 6) #4.5, 4.6, 4.8(a), 4.13, 4.0, 4.6(b), 4.8, 4.31, 4.34 Week 3 (pp 15 19) #1.9, 1.1, 1.13, 1.15, 1.18 (pp 9 31) #.,.6,.9 Week 4 (pp 36 37)
More informationSection 1.3: More Probability and Decisions: Linear Combinations and Continuous Random Variables
Section 1.3: More Probability and Decisions: Linear Combinations and Continuous Random Variables Jared S. Murray The University of Texas at Austin McCombs School of Business OpenIntro Statistics, Chapters
More informationLecture Data Science
Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Statistics Foundations JProf. Dr. Claudia Wagner Learning Goals How to describe sample data? What is mode/median/mean?
More informationIntro to Likelihood. Gov 2001 Section. February 2, Gov 2001 Section () Intro to Likelihood February 2, / 44
Intro to Likelihood Gov 2001 Section February 2, 2012 Gov 2001 Section () Intro to Likelihood February 2, 2012 1 / 44 Outline 1 Replication Paper 2 An R Note on the Homework 3 Probability Distributions
More informationNormal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem
1.1.2 Normal distribution 1.1.3 Approimating binomial distribution by normal 2.1 Central Limit Theorem Prof. Tesler Math 283 Fall 216 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 1
More informationFinancial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR
Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Nelson Mark University of Notre Dame Fall 2017 September 11, 2017 Introduction
More informationEngineering Statistics ECIV 2305
Engineering Statistics ECIV 2305 Section 5.3 Approximating Distributions with the Normal Distribution Introduction A very useful property of the normal distribution is that it provides good approximations
More informationProbability and Statistics
Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 3: PARAMETRIC FAMILIES OF UNIVARIATE DISTRIBUTIONS 1 Why do we need distributions?
More informationBasic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, Abstract
Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, 2013 Abstract Introduct the normal distribution. Introduce basic notions of uncertainty, probability, events,
More informationChapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables
Chapter 5 Continuous Random Variables and Probability Distributions 5.1 Continuous Random Variables 1 2CHAPTER 5. CONTINUOUS RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS Probability Distributions Probability
More informationSection 0: Introduction and Review of Basic Concepts
Section 0: Introduction and Review of Basic Concepts Carlos M. Carvalho The University of Texas McCombs School of Business mccombs.utexas.edu/faculty/carlos.carvalho/teaching 1 Getting Started Syllabus
More informationAMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4
AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Summer 2014 1 / 26 Sampling Distributions!!!!!!
More informationStatistics for Managers Using Microsoft Excel 7 th Edition
Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 5 Discrete Probability Distributions Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap 5-1 Learning
More informationLecture 3: Probability Distributions (cont d)
EAS31116/B9036: Statistics in Earth & Atmospheric Sciences Lecture 3: Probability Distributions (cont d) Instructor: Prof. Johnny Luo www.sci.ccny.cuny.edu/~luo Dates Topic Reading (Based on the 2 nd Edition
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial
More informationProbability and distributions
2 Probability and distributions The concepts of randomness and probability are central to statistics. It is an empirical fact that most experiments and investigations are not perfectly reproducible. The
More informationMATH 264 Problem Homework I
MATH Problem Homework I Due to December 9, 00@:0 PROBLEMS & SOLUTIONS. A student answers a multiple-choice examination question that offers four possible answers. Suppose that the probability that the
More informationMA : Introductory Probability
MA 320-001: Introductory Probability David Murrugarra Department of Mathematics, University of Kentucky http://www.math.uky.edu/~dmu228/ma320/ Spring 2017 David Murrugarra (University of Kentucky) MA 320:
More informationChapter 7: Estimation Sections
1 / 40 Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: Chapter 7 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods:
More informationSimulation Wrap-up, Statistics COS 323
Simulation Wrap-up, Statistics COS 323 Today Simulation Re-cap Statistics Variance and confidence intervals for simulations Simulation wrap-up FYI: No class or office hours Thursday Simulation wrap-up
More informationECON 214 Elements of Statistics for Economists 2016/2017
ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and
More informationBusiness Statistics Fall Quarter 2015 Midterm Answer Key
Business Statistics 4000 Fall Quarter 205 Midterm Answer Key Name (print please): Open book (but be courteous to your neighbors). No internet. A simple calculator may be helpful for computing numerical
More informationSTAT 241/251 - Chapter 7: Central Limit Theorem
STAT 241/251 - Chapter 7: Central Limit Theorem In this chapter we will introduce the most important theorem in statistics; the central limit theorem. What have we seen so far? First, we saw that for an
More informationProblems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:
Math 224 Fall 207 Homework 5 Drew Armstrong Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Section 3., Exercises 3, 0. Section 3.3, Exercises 2, 3, 0,.
More informationStochastic Components of Models
Stochastic Components of Models Gov 2001 Section February 5, 2014 Gov 2001 Section Stochastic Components of Models February 5, 2014 1 / 41 Outline 1 Replication Paper and other logistics 2 Data Generation
More informationBack to estimators...
Back to estimators... So far, we have: Identified estimators for common parameters Discussed the sampling distributions of estimators Introduced ways to judge the goodness of an estimator (bias, MSE, etc.)
More information4.3 Normal distribution
43 Normal distribution Prof Tesler Math 186 Winter 216 Prof Tesler 43 Normal distribution Math 186 / Winter 216 1 / 4 Normal distribution aka Bell curve and Gaussian distribution The normal distribution
More information