Chapter 6 Part 6. Confidence Intervals chi square distribution binomial distribution
|
|
- Clarence Watkins
- 5 years ago
- Views:
Transcription
1 Chapter 6 Part 6 Confidence Intervals chi square distribution binomial distribution October 8, 008
2 Brief review of what we covered last time. In order to get a confidence interval for the population mean ( μ ) we get a confidence interval for the sample mean. In order to obtain this confidence interval we used the sampling distribution of means whose distribution is represented by the random variable X σ. We discovered that if you know the population variance ( ), then X μ σ n follows the N(0,1) distribution. We started with z < X μ < z σ n α / 1 ( α/ ) 1 ( α/ ) cuts off an area equal to (where z 1 ( α / ) ) This allowed us to get the following is the cutoff on the N(0, 1) curve that σ n σ n Pr X z1 ( α/ ) < μ < X + z1 ( α/ ) = 1 confidence interval is α σ z n z σ, + 1 (a/ ) 1 (a/ ) n X α. Then the (1 - )%. We changed from probability to confidence when we switched from (a random variable) to (a sample estimate of the population mean). For samples of size n, these confidence intervals are all the same length and are all symmetric about, the sample mean. If we don t know the population variance, then we ll need to estimate the population s X μ s n variance using the sample variance ( ). We can then get that follows the t distribution with degrees of freedom = get the (1- )% confidence interval to be α n 1 s t n t s, + n 1,1 ( α/ ) n 1,1 ( α/ ) n. Using the same process as above, we. Page --
3 n 1 z n 1 The subscript for the cutoff t contains an that the didn t have. The indicates which distribution to use. These confidence intervals are all centered about t t 1,1 α, but they are not all of the same length (length = ) because as the n ( / ) sample standard deviation changes, the length changes. The length of the confidence intervals that use the normal distribution will be shorter than the intervals using the t distribution because. α α If = 0.05, then = 0.05 t t > z n 1,1 ( α/ ) 1 ( α/ ). di invttail(, 0.05) di invnormal(0.975) s n 4.30 = t df = 0975, = z So, for eample, the distribution with degrees of freedom (i.e. = 3) has a cutoff for the area of.5% that is over twice the size of the cutoff for the.5% area using the N(0, 1). As the degrees of freedom get larger the.5% cutoff for the t distribution will get closer to The need for a confidence interval about the variance came up in the contet of trying to find a way to measure variability, say in a quality control situation. We already know that we can estimate the variance of the population by the variance of a sample. But that is going to give us just a point estimate. We would like to be able to get a confidence interval. The confidence interval for the variance uses neither the t n nor the normal distribution. n 1 What is known is that the sample variance s multiplied by follows the chi σ square ( χ ) distribution. The graph on the right below (the chi square curve with 5 degrees of freedom) shows that we will have to calculate the two cutoffs separately rather than having something like and We will associate the cutoff on the left (0.83 in the graph on the right below) with and the cutoff on the right (1.83 in the graph below) with χ n 1,1 ( α / ). χ n 1, ( α / ) I am using S to represent the random variable and s as the sample estimate. Page -3-
4 Chi square distributions with various degrees of freedom.8 g().6.4 df = 1 df =. df = 5 df = 10 df = So we get that Pr χ n ( n 1) S < σ < χ 1, α/ n 11, ( α/ ) = 1 α You want σ in the middle, the steps to accomplish that are ( n 1) S 1) Divide all three terms by. This is a positive number because a n 1> 0 S > 0 sample of size 1 is not interesting so and (technically the sample variance could be zero but that would mean that all of the elements of the sample are the same, again not a very interesting case). 1 1 > > 0 b> a a b ) If, then. This allows you to flip each of the 3 terms so that we get σ in the middle with the result below. Pr ( ) n 1 S ( n 1 < < ) S σ χ χ n 11, ( α/ ) n 1, α/ = 1 α Page -4-
5 α ( n 1) s, ( n 1 ) s So that the (1 - )% confidence interval is. χn 11, ( α/ ) χn 1, α/ To get the confidence interval for the standard deviation, you just take the square root of each of the endpoints of the interval above (see Cheat Sheet). The confidence intervals for the variance are not symmetric about the variance. As you pick different samples of size n, you get different sample variances and hence the intervals are also not all the same length. Stata commands: chi(df,) provides the cumulative chi-squared distribution with df degrees of freedom invchi(df,p) provides the inverse of chi(). If chi(df,) = p, then invchi(df,p) =. chitail(df,) provides the upper-tail cumulative chi-squared with df degrees of freedom invchitail(df,p) provides the inverse of chitail(). If chitail(df,) = p, then invchitail(df,p) =. Page -5-
6 When we were mapping the square of the N(0, 1) curve onto a chi square distribution with 1 degree of freedom, we saw that the 1.96 on the N(0,1) curve mapped into 1.96 which is equal to Because the two ends of the normal distribution both map into the upper tail of the chi square, we have that 3.84 cuts off the upper 5% of the one degree of freedom chi square distribution. The first commands below go with the upper (right) part of the chi square distribution.. di chitail(1,3.84) (where 1 = df, 3.84 = cutoff and the area is to the right of 3.84). di invchitail(1,0.05) (where 1 = df, 0.05 = probability = area to the right of 3.84) The commands below go with the lower (left) part of the chi square distribution. If 3.84 cuts off 0.05 in the upper tail of a chi square with 1 degree of freedom, then it cuts off 0.95 in the lower portion of the curve.. di chi(1,3.84) (where 1 = df, 3.84 = cutoff and the area is to the left of 3.84). di invchi(1,0.95) (where 1 = df, 0.95 = probability = area to the left of 3.84) To find the 95% confidence interval for the variance of the blood lead concentration α (s =.41, with n = 98), we have α = 005. = and that 1 ( α / ) = Since n= 98, df= n - 1 = 97. So we need to calculate See the graph below. χ and χ 97, , We can consider the area to the left each of the two cutoffs.. di invchi(97,0.05) di invchi(97,0.975) Page -6-
7 Or we can use the area to the right of each of the two cutoffs.. di invchitail(97,0.05) di invchitail(97,0.975) ( n 1) s, ( n 1 ) s χ χ 97, , = 97(. 41) 97(. 41), di (97*.41)/ di (97*.41)/ So the 95% CI for.41 is (17.3, 30.34). The interval is not symmetric about.41.. di di Page -7-
8 If the gold standard that the labs were trying to achieve was 15, then.41 is statistically different from 15 (at the 0.05 level) since 15 is not in the interval. All of the variances in the interval are considered to be statistically similar and are considered to be statistically different from every variance outside the interval. If we wanted the confidence interval for the standard deviation (4.73) of the blood lead distribution, then all we have to do is take the square root of the endpoints for the confidence interval for the variance to get the endpoints for the confidence interval for the standard deviation.. di sqrt( ) di sqrt( ) So the confidence interval for the standard deviation 4.73 is (4.15, 5.51). Notice that this interval is not symmetric about di di The dofile below generates the chi square density with 0 degrees of freedom. clear *the chi square distribution is a special case of a Gamma Distribution. *this dofile generates a chi square density with df = 0. *to get graph of the density function, *use a connected twoway graph with = and y = y. set obs 100 gen = sum(1) *you can change the df to whatever you like. It has to be an integer. gen df = 0 gen y = gammaden(df/,,0,) twoway (connected y, msymbol(none)),/* */title(chi square distribution with 0 degrees of freedom) Below I give you a dofile that will calculate the confidence interval for the variance. Stata does not provide you with commands that will calculate the confidence for you. Page -8-
9 The dofile below generates a confidence interval for the variance. You have to change n, ssq and alpha to fit your situation. clear set obs 1 *n is the sample size on which the variance is calculated *The data here is on page 7 of the Chapter 6 Part 6 handout. gen n = 98 *ssq is the variance gen ssq =.41 *alpha is such that (1 - alpha)*100 is the % for the CI gen alpha = 0.05 gen df = n - 1 gen aover = alpha/ gen oneminus = 1 - aover gen chill = invchi(df,oneminus) gen chiul = invchi(df,aover) gen LB = (df*ssq)/chill gen UB = (df*ssq)/chiul * the command below says list the upper & lower bounds listed in the first record list LB UB in 1 Binomial distribution with n = 6 and p = f() It is going to be a little tricky to get a confidence interval for p the probability of success. You can see that we can t get cutoffs as we have done for the other distributions because you can t find a point that just cuts off.5% say. So let us first go back and see what we know about the binomial distribution. Page -9-
10 Binomial Distribution 1) The binomial distribution has two parameters, p = the probability of a success on a single trial and n = the number of trials. ) The n trials have to be independent and each of the trials has to have the same probability of success. 3) The binomial distribution is a discrete distribution which is defined for the integers 0, 1,,..., n. 4) Like the chi square distribution, the binomial distribution is not symmetric (ecept when p = 0.5). 5) The mean of the binomial distribution is np and the variance is npq or np(1 - p). 6) For the normal distribution the point estimates (as opposed to interval estimation) for the population mean and variance are (the sample mean) and s (the sample variance). For the binomial distribution what is the point estimate or sample estimate for the population parameter p (the probability of success)? First to distinguish between the population parameter and the sample statistic we will use a hat on the sample statistic. So = the number of successes/the total number of trials (i.e. the proportion of successes). [Aside: You will frequently see denoted as ]. $μ 7) For the normal distribution the average of the means of all of the random samples from a given population gives us the mean of that population. The average of the variances of all of the random samples gives us the variance of the population. It is also true that the average of the proportion of successes for each of the random samples is also the population proportion. So is an unbiased estimator of the population proportion p. $p $p Eample: Occupational Health (Rosner page 87) The proportion of deaths due to lung cancer in males aged in England and Wales during the period was 1%. Suppose that of 0 deaths that occur among male workers in this age group who have worked at least one year in a chemical plant, 5 are due to lung cancer. Is there a difference between the proportion of deaths due to lung cancer in this plant and the proportion in the general population? Page -10-
11 One way of approaching this problem is to get a confidence interval for the sample estimate of the proportion of deaths due to lung cancer and see if 1% is or is not in the interval. So first we need the sample estimate of the population value (i.e.1%). Success in this 5 case is death due to lung cancer and the number of trials n is 0. So p $ = = 05. or 0 this could also be epressed as 5%. How do we get a confidence interval for the sample estimate 0.5. Well the theoretical background for the confidence interval for a binomial distribution is not as simple as it was for the confidence intervals for the sample means and variances of normal distributions. This is in part because the binomial distribution is discrete. So we can t just find say 0.05 in each of the tails and use the cutoffs as the endpoints for the confidence interval. We can t do this because the probability mass is only at the integers and is not necessarily in neat little batches that add up to 0.05 (see graph page 9). Let us first use Stata to get the interval and then I ll eplain what Stata is doing. I will use the immediate command because we don t have a data set, we just have the population and sample values. Immediate command for variable distributed as binomial cii #obs #succ [, ciib_options] cii is epecting the number of observations (0 for this eample) and number of successes (5).. cii Binomial Eact -- Variable Obs Mean Std. Err. [95% Conf. Interval] When we use Stata to get the CI for 0.5 (5/0) we get p 1 = 0.09 (the left-hand endpoint of the CI) and p = 0.49 (the right-hand endpoint of the confidence interval). The mean listed in the Stata results above is not the mean of the binomial distribution with n = 0 and p = 0.5. In the Stata output the value listed under the mean is actually $p (0.5). Had we been Page -11-
12 estimating the mean of a population 0.5 would have been the sample mean ( this output is sort of one version of the output fits all. The value called the standard error (SE) in the Stata output is ). So pq n = = This is the SD of the sampling distribution for the probability of successes. But the SD of the sampling distribution is called the standard error (this works the same way for the normal distribution). How is a binomial confidence interval defined: An eact 100% (1 - α) confidence interval for the binomial parameter p that is always valid is given by (p 1, p ) where p 1, p satisfy the equations Pr(X $ X ~ binomial with parameter p 1 ) = α/ = n k= ( n) k p ( 1 p ) k 1 1 n k Pr(X # X ~ binomial with parameter p ) = α/ = n k p k p n k ( 1 ) k = 0 Where n is the number of trials (0 in this case) and is the number of successes (5 in this case). I ll show you in a graph what is essentially happening but first I ll show you that the values Stata came up with satisfy the definition given above.. cii Binomial Eact -- Variable Obs Mean Std. Err. [95% Conf. Interval] p = p = Page -1-
13 k (. )( ) k k = 5 0 k = Stata binomial(0,4, ) = binomial(0,4, ) = 4 0 j ( ) ( ) j j = 0 0 j Stata 9: Binomial(0,5, ) = = 0 j 0 j ( ) ( ) j = 5 0 j. di Binomial(0,5, ) k (. )( ) k k = 0 0 k = binomial(0,6, ) = di binomial(0,5, ) Notice that what we are finding are two binomial distributions that have the same number of trials (0) and successes (5) as our sample. However, the p s are different. Page -13-
14 Occupational Health (Rosner page 87) binomials that give endpoints for 95% CI for 0.5 f() p = 0.5 p1 = p = The binomial distribution with probability = p 1 (i.e. the circles) is such that its upper tail (i.e. from 5 up, including 5) is equal to 0.05 (or α/). The binomial distribution with probability = p (i.e. the s) is such that its lower tail (i.e. from 5 down, including 5) is equal to The red triangles above give the original binomial distribution (i.e. n = 0 and success = 5 so = 0.5). $p The green circles give the binomial with n = 0 and p 1 = Note that although p 1 is the left-hand endpoint of the CI, we obtain α/ = 0.05 from the upper tail of the green (circles) distribution (i.e. the sum is from 5 to 0). The weights at each of 5, 6,..., 0 added together equal 0.05 (i.e. the sum of the bars). The distribution with p 1 = is to the left of the original distribution (i.e. the red triangles). The blue s give the binomial with n = 0 and p = Note that although p is the right-hand endpoint of the CI, we obtain α/ = 0.05 from the lower tail of the blue ( s) Page -14-
15 distribution (i.e. the sum is from 0 to 5). The weights at each of 0, 1,, 3, 4 and 5 added together equal The distribution with is to the right of the original distribution. p = Notice that we know the binomials are in the correct order because p 1 < $p < p so 0p 1 < 0 $p < 0p and 0p 1, 0 $p and 0p are the respective means for the three distributions. Below is the dofile used to obtain the graph above. The */ and /* marks in the graph command are a way of continuing the command onto more than one line. clear *binomial3.do set obs 1 *note that fof is the binomial density f()with n = 0 and p = 0.5 *fofp1 is the binomial density with n = 0 and p = *fofp is the binomial density with n = 0 and p = gen = sum(1) - 1 gen fof = binomial(0,,0.5) - binomial(0,-1,0.5) label variable fof "p = 0.5" gen fofp1 = binomial(0,, ) - binomial(0,-1, ) label variable fofp1 "p1 = " gen fofp = binomial(0,, ) - binomial(0,-1, ) label variable fofp "p = " twoway (scatter fof, mcolor(red) msymbol(triangle)) (scatter fofp1, mcolor(mint) msymbol(circle)) /* */ (scatter fofp, mcolor(blue) msymbol(lg)), ytitle(f()) line(5) /* */ title(eample 4: Original binomial (p = 0.5)) /* */ subtitle(with binomials that give endpoints for 95% CI) /* */ legend(on rows(1)) scheme(s1color) plotregion(lwidth(none)) Stata 9 version gen fof = Binomial(0,,0.5) - Binomial(0,+1,0.5) In order to use the non-immediate form, I created a little file with 0 observations and two variables outcome (5 coded as 1 and 15 coded as zero) and newoutcome (5 coded as and 15 coded as 1). You can see below that when I coded the variable newoutcome as 1/ rather than 0/1, you just don t get an answer. In order to use the Stata command your variable must be coded 1 for success (here a cancer death) and 0 for failure (here a non-cancer death). The need for this coding is a Stata quirk. Page -15-
16 . tab outcome outcome Freq. Percent Cum not CA CA Total label list outlbl: 0 not CA 1 CA. ci outcome,binomial -- Binomial Eact -- Variable Obs Mean Std. Err. [95% Conf. Interval] outcome gen newoutcome = outcome + 1. tab newoutcome newoutcome Freq. Percent Cum Total ci newoutcome,binomial -- Binomial Eact -- Variable Obs Mean Std. Err. [95% Conf. Interval] See addition on the net page. Page -16-
17 Amendment to Chapter 6 Part 6 handout. Below is what I hope is a clearer eplanation of Problem 5.66 than was in the earlier version of the handout. So let us go back to Rosner s Problem 5.66: In Problem 5.65 we found that for 17 year old boys the probability of having elevated diastolic BP (i.e. DBP $ 90) was or 1.05% (the population p or population percent). This means if the population percent was true for the sample of 000, we would epect approimately 0 (000*0.0105) successes (i.e. 0 boys with elevated DBP). Twenty-five is the number of successes (boys with elevated DBP) in the sample of 000. So (the sample estimate of the population p) = 5/000 = Below $p we see that the 95% CI for (i.e 95% CI for $p ) is (0.008, 0.018). Notice that (the population percentage of successes) is in the 95% confidence interval about the sample estimate ( = 0.015) of p. This means that the sample estimate $p of p (0.015) and the population value of p (0.0105) are considered to be similar. Or we might say that it is reasonable to believe that the sample of 000 with 5 successes could have been drawn from the population with p (probability of success) = cii 000 5,level(95) -- Binomial Eact -- Variable Obs Mean Std. Err. [95% Conf. Interval] return list scalars: r(ub) = r(lb) = r(se) = r(mean) =.015 r(n) = 000. di r(ub) - r(lb) = length of the interval Page -17-
18 . cii 000 5,level(90) -- Binomial Eact -- Variable Obs Mean Std. Err. [90% Conf. Interval] di r(ub) - r(lb) Once you know what Stata is calling the upper bound and lower bound you do not have to use the return list command.. cii 000 5,level(85) -- Binomial Eact -- Variable Obs Mean Std. Err. [85% Conf. Interval] di r(ub) - r(lb) We can see that the 85% CI has the shortest length and the 95% CI has the longest length. So the more certain you are, the longer the interval. Page -18-
Chapter 6 Part 3 October 21, Bootstrapping
Chapter 6 Part 3 October 21, 2008 Bootstrapping From the internet: The bootstrap involves repeated re-estimation of a parameter using random samples with replacement from the original data. Because the
More informationStatistical Intervals (One sample) (Chs )
7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and
More informationNormal Probability Distributions
Normal Probability Distributions Properties of Normal Distributions The most important probability distribution in statistics is the normal distribution. Normal curve A normal distribution is a continuous
More informationStatistics for Business and Economics
Statistics for Business and Economics Chapter 7 Estimation: Single Population Copyright 010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-1 Confidence Intervals Contents of this chapter: Confidence
More information. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:
Statistics Sample Exam 3 Solution Chapters 6 & 7: Normal Probability Distributions & Estimates 1. What percent of normally distributed data value lie within 2 standard deviations to either side of the
More informationLESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY
LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY 1 THIS WEEK S PLAN Part I: Theory + Practice ( Interval Estimation ) Part II: Theory + Practice ( Interval Estimation ) z-based Confidence Intervals for a Population
More informationStatistics Class 15 3/21/2012
Statistics Class 15 3/21/2012 Quiz 1. Cans of regular Pepsi are labeled to indicate that they contain 12 oz. Data Set 17 in Appendix B lists measured amounts for a sample of Pepsi cans. The same statistics
More informationChapter 7. Sampling Distributions
Chapter 7 Sampling Distributions Section 7.1 Sampling Distributions and the Central Limit Theorem Sampling Distributions Sampling distribution The probability distribution of a sample statistic. Formed
More informationStatistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage
7 Statistical Intervals Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to
More informationChapter 5 Normal Probability Distributions
Chapter 5 Normal Probability Distributions Section 5-1 Introduction to Normal Distributions and the Standard Normal Distribution A The normal distribution is the most important of the continuous probability
More informationClass 13. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 13 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 017 by D.B. Rowe 1 Agenda: Recap Chapter 6.3 6.5 Lecture Chapter 7.1 7. Review Chapter 5 for Eam 3.
More informationChapter 8 Estimation
Chapter 8 Estimation There are two important forms of statistical inference: estimation (Confidence Intervals) Hypothesis Testing Statistical Inference drawing conclusions about populations based on samples
More informationChapter 8 Statistical Intervals for a Single Sample
Chapter 8 Statistical Intervals for a Single Sample Part 1: Confidence intervals (CI) for population mean µ Section 8-1: CI for µ when σ 2 known & drawing from normal distribution Section 8-1.2: Sample
More informationχ 2 distributions and confidence intervals for population variance
χ 2 distributions and confidence intervals for population variance Let Z be a standard Normal random variable, i.e., Z N(0, 1). Define Y = Z 2. Y is a non-negative random variable. Its distribution is
More informationParameter Estimation II
Parameter Estimation II ELEC 41 PROF. SIRIPONG POTISUK Estimating μ With Unnown σ This is often true in practice. When the sample is large and σ is unnown, the sampling distribution is approimately normal
More informationStatistical Tables Compiled by Alan J. Terry
Statistical Tables Compiled by Alan J. Terry School of Science and Sport University of the West of Scotland Paisley, Scotland Contents Table 1: Cumulative binomial probabilities Page 1 Table 2: Cumulative
More informationKey Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions
SGSB Workshop: Using Statistical Data to Make Decisions Module 2: The Logic of Statistical Inference Dr. Tom Ilvento January 2006 Dr. Mugdim Pašić Key Objectives Understand the logic of statistical inference
More informationSampling & populations
Sampling & populations Sample proportions Sampling distribution - small populations Sampling distribution - large populations Sampling distribution - normal distribution approximation Mean & variance of
More informationA.REPRESENTATION OF DATA
A.REPRESENTATION OF DATA (a) GRAPHS : PART I Q: Why do we need a graph paper? Ans: You need graph paper to draw: (i) Histogram (ii) Cumulative Frequency Curve (iii) Frequency Polygon (iv) Box-and-Whisker
More informationLecture 2. Probability Distributions Theophanis Tsandilas
Lecture 2 Probability Distributions Theophanis Tsandilas Comment on measures of dispersion Why do common measures of dispersion (variance and standard deviation) use sums of squares: nx (x i ˆµ) 2 i=1
More informationStatistics 431 Spring 2007 P. Shaman. Preliminaries
Statistics 4 Spring 007 P. Shaman The Binomial Distribution Preliminaries A binomial experiment is defined by the following conditions: A sequence of n trials is conducted, with each trial having two possible
More informationTests for One Variance
Chapter 65 Introduction Occasionally, researchers are interested in the estimation of the variance (or standard deviation) rather than the mean. This module calculates the sample size and performs power
More informationCH 5 Normal Probability Distributions Properties of the Normal Distribution
Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend
More informationThe Binomial Distribution
The Binomial Distribution January 31, 2019 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The
More informationBasic Procedure for Histograms
Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that
More informationtm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}
PS 4 Monday August 16 01:00:42 2010 Page 1 tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} log: C:\web\PS4log.smcl log type: smcl opened on:
More informationChapter 6 Confidence Intervals
Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) VOCABULARY: Point Estimate A value for a parameter. The most point estimate of the population parameter is the
More informationCIVL Learning Objectives. Definitions. Discrete Distributions
CIVL 3103 Discrete Distributions Learning Objectives Define discrete distributions, and identify common distributions applicable to engineering problems. Identify the appropriate distribution (i.e. binomial,
More informationRandom Variables and Probability Distributions
Chapter 3 Random Variables and Probability Distributions Chapter Three Random Variables and Probability Distributions 3. Introduction An event is defined as the possible outcome of an experiment. In engineering
More informationConfidence Intervals for an Exponential Lifetime Percentile
Chapter 407 Confidence Intervals for an Exponential Lifetime Percentile Introduction This routine calculates the number of events needed to obtain a specified width of a confidence interval for a percentile
More informationConfidence Intervals. σ unknown, small samples The t-statistic /22
Confidence Intervals σ unknown, small samples The t-statistic 1 /22 Homework Read Sec 7-3. Discussion Question pg 365 Do Ex 7-3 1-4, 6, 9, 12, 14, 15, 17 2/22 Objective find the confidence interval for
More informationChapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) Estimating Population Parameters
Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) Estimating Population Parameters VOCABULARY: Point Estimate a value for a parameter. The most point estimate
More informationVersion A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.
Math 224 Q Exam 3A Fall 217 Tues Dec 12 Version A Problem 1. Let X be the continuous random variable defined by the following pdf: { 1 x/2 when x 2, f(x) otherwise. (a) Compute the mean µ E[X]. E[X] x
More informationThe Binomial Distribution
The Binomial Distribution January 31, 2018 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The
More informationPROBABILITY DISTRIBUTIONS
CHAPTER 3 PROBABILITY DISTRIBUTIONS Page Contents 3.1 Introduction to Probability Distributions 51 3.2 The Normal Distribution 56 3.3 The Binomial Distribution 60 3.4 The Poisson Distribution 64 Exercise
More informationHomework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82
Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections
More informationChapter 7 1. Random Variables
Chapter 7 1 Random Variables random variable numerical variable whose value depends on the outcome of a chance experiment - discrete if its possible values are isolated points on a number line - continuous
More informationSTAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.
STAT 515 -- Chapter 5: Continuous Distributions Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. Continuous distributions typically are represented by
More informationClass 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 16 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 7. - 7.3 Lecture Chapter 8.1-8. Review Chapter 6. Problem Solving
More informationx is a random variable which is a numerical description of the outcome of an experiment.
Chapter 5 Discrete Probability Distributions Random Variables is a random variable which is a numerical description of the outcome of an eperiment. Discrete: If the possible values change by steps or jumps.
More informationProbability. An intro for calculus students P= Figure 1: A normal integral
Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided
More informationThe topics in this section are related and necessary topics for both course objectives.
2.5 Probability Distributions The topics in this section are related and necessary topics for both course objectives. A probability distribution indicates how the probabilities are distributed for outcomes
More informationBIO5312 Biostatistics Lecture 5: Estimations
BIO5312 Biostatistics Lecture 5: Estimations Yujin Chung September 27th, 2016 Fall 2016 Yujin Chung Lec5: Estimations Fall 2016 1/34 Recap Yujin Chung Lec5: Estimations Fall 2016 2/34 Today s lecture and
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial
More informationUseful Probability Distributions
Useful Probability Distributions Standard Normal Distribution Binomial Multinomial Hypergeometric Poisson Beta Binomial Student s t Beta Gamma Dirichlet Multivariate Normal and Correlation Standard Normal
More informationWeek 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4
Week 7 Oğuz Gezmiş Texas A& M University Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4 Oğuz Gezmiş (TAMU) Topics in Contemporary Mathematics II Week7 1 / 19
More informationStatistical Methods in Practice STAT/MATH 3379
Statistical Methods in Practice STAT/MATH 3379 Dr. A. B. W. Manage Associate Professor of Mathematics & Statistics Department of Mathematics & Statistics Sam Houston State University Overview 6.1 Discrete
More informationMAKING SENSE OF DATA Essentials series
MAKING SENSE OF DATA Essentials series THE NORMAL DISTRIBUTION Copyright by City of Bradford MDC Prerequisites Descriptive statistics Charts and graphs The normal distribution Surveys and sampling Correlation
More informationEstimating parameters 5.3 Confidence Intervals 5.4 Sample Variance
Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance Prof. Tesler Math 186 Winter 2017 Prof. Tesler Ch. 5: Confidence Intervals, Sample Variance Math 186 / Winter 2017 1 / 29 Estimating parameters
More informationContents. The Binomial Distribution. The Binomial Distribution The Normal Approximation to the Binomial Left hander example
Contents The Binomial Distribution The Normal Approximation to the Binomial Left hander example The Binomial Distribution When you flip a coin there are only two possible outcomes - heads or tails. This
More informationConfidence Intervals Introduction
Confidence Intervals Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample mean X is a point estimate of the population mean μ
More informationChapter 7 Sampling Distributions and Point Estimation of Parameters
Chapter 7 Sampling Distributions and Point Estimation of Parameters Part 1: Sampling Distributions, the Central Limit Theorem, Point Estimation & Estimators Sections 7-1 to 7-2 1 / 25 Statistical Inferences
More informationBusiness Statistics 41000: Probability 3
Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404
More informationWeek 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals
Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :
More informationProbability Theory. Mohamed I. Riffi. Islamic University of Gaza
Probability Theory Mohamed I. Riffi Islamic University of Gaza Table of contents 1. Chapter 2 Discrete Distributions The binomial distribution 1 Chapter 2 Discrete Distributions Bernoulli trials and the
More informationEstimation. Focus Points 10/11/2011. Estimating p in the Binomial Distribution. Section 7.3
Estimation 7 Copyright Cengage Learning. All rights reserved. Section 7.3 Estimating p in the Binomial Distribution Copyright Cengage Learning. All rights reserved. Focus Points Compute the maximal length
More informationIn a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation
Name In a binomial experiment of n trials, where p = probability of success and q = probability of failure mean variance standard deviation µ = n p σ = n p q σ = n p q Notation X ~ B(n, p) The probability
More informationSection 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution
Section 7.6 Application of the Normal Distribution A random variable that may take on infinitely many values is called a continuous random variable. A continuous probability distribution is defined by
More informationDiscrete Random Variables
Discrete Random Variables In this chapter, we introduce a new concept that of a random variable or RV. A random variable is a model to help us describe the state of the world around us. Roughly, a RV can
More informationChapter 7. Inferences about Population Variances
Chapter 7. Inferences about Population Variances Introduction () The variability of a population s values is as important as the population mean. Hypothetical distribution of E. coli concentrations from
More information1 Inferential Statistic
1 Inferential Statistic Population versus Sample, parameter versus statistic A population is the set of all individuals the researcher intends to learn about. A sample is a subset of the population and
More informationChapter 6. The Normal Probability Distributions
Chapter 6 The Normal Probability Distributions 1 Chapter 6 Overview Introduction 6-1 Normal Probability Distributions 6-2 The Standard Normal Distribution 6-3 Applications of the Normal Distribution 6-5
More informationStatistics 13 Elementary Statistics
Statistics 13 Elementary Statistics Summer Session I 2012 Lecture Notes 5: Estimation with Confidence intervals 1 Our goal is to estimate the value of an unknown population parameter, such as a population
More informationMATH 264 Problem Homework I
MATH Problem Homework I Due to December 9, 00@:0 PROBLEMS & SOLUTIONS. A student answers a multiple-choice examination question that offers four possible answers. Suppose that the probability that the
More informationLecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1
Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Section 7.4-1 Chapter 7 Estimates and Sample Sizes 7-1 Review and Preview 7- Estimating a Population
More informationDiscrete Probability Distribution
1 Discrete Probability Distribution Key Definitions Discrete Random Variable: Has a countable number of values. This means that each data point is distinct and separate. Continuous Random Variable: Has
More informationContinuous Distributions
Quantitative Methods 2013 Continuous Distributions 1 The most important probability distribution in statistics is the normal distribution. Carl Friedrich Gauss (1777 1855) Normal curve A normal distribution
More informationDiploma in Business Administration Part 2. Quantitative Methods. Examiner s Suggested Answers
Cumulative frequency Diploma in Business Administration Part Quantitative Methods Examiner s Suggested Answers Question 1 Cumulative Frequency Curve 1 9 8 7 6 5 4 3 1 5 1 15 5 3 35 4 45 Weeks 1 (b) x f
More informationCentral Limit Theorem (cont d) 7/28/2006
Central Limit Theorem (cont d) 7/28/2006 Central Limit Theorem for Binomial Distributions Theorem. For the binomial distribution b(n, p, j) we have lim npq b(n, p, np + x npq ) = φ(x), n where φ(x) is
More informationSampling Distributions and the Central Limit Theorem
Sampling Distributions and the Central Limit Theorem February 18 Data distributions and sampling distributions So far, we have discussed the distribution of data (i.e. of random variables in our sample,
More informationStatistics for Business and Economics: Random Variables:Continuous
Statistics for Business and Economics: Random Variables:Continuous STT 315: Section 107 Acknowledgement: I d like to thank Dr. Ashoke Sinha for allowing me to use and edit the slides. Murray Bourne (interactive
More informationSTAT Chapter 7: Confidence Intervals
STAT 515 -- Chapter 7: Confidence Intervals With a point estimate, we used a single number to estimate a parameter. We can also use a set of numbers to serve as reasonable estimates for the parameter.
More informationCommonly Used Distributions
Chapter 4: Commonly Used Distributions 1 Introduction Statistical inference involves drawing a sample from a population and analyzing the sample data to learn about the population. We often have some knowledge
More information1. Statistical problems - a) Distribution is known. b) Distribution is unknown.
Probability February 5, 2013 Debdeep Pati Estimation 1. Statistical problems - a) Distribution is known. b) Distribution is unknown. 2. When Distribution is known, then we can have either i) Parameters
More informationClass 11. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 11 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 2017 by D.B. Rowe 1 Agenda: Recap Chapter 5.3 continued Lecture 6.1-6.2 Go over Eam 2. 2 5: Probability
More informationChapter 4. The Normal Distribution
Chapter 4 The Normal Distribution 1 Chapter 4 Overview Introduction 4-1 Normal Distributions 4-2 Applications of the Normal Distribution 4-3 The Central Limit Theorem 4-4 The Normal Approximation to the
More informationcontinuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence
continuous rv Let X be a continuous rv. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that for any two numbers a and b with a b, P(a X b) = b a f (x)dx.
More informationConfidence Intervals for the Difference Between Two Means with Tolerance Probability
Chapter 47 Confidence Intervals for the Difference Between Two Means with Tolerance Probability Introduction This procedure calculates the sample size necessary to achieve a specified distance from the
More informationYou created this PDF from an application that is not licensed to print to novapdf printer (http://www.novapdf.com)
Monday October 3 10:11:57 2011 Page 1 (R) / / / / / / / / / / / / Statistics/Data Analysis Education Box and save these files in a local folder. name:
More informationChapter 11 Part 6. Correlation Continued. LOWESS Regression
Chapter 11 Part 6 Correlation Continued LOWESS Regression February 17, 2009 Goal: To review the properties of the correlation coefficient. To introduce you to the various tools that can be used to decide
More informationLecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean)
Statistics 16_est_parameters.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean) Some Common Sense Assumptions for Interval Estimates
More informationDiploma Part 2. Quantitative Methods. Examiner s Suggested Answers
Diploma Part 2 Quantitative Methods Examiner s Suggested Answers Question 1 (a) The binomial distribution may be used in an experiment in which there are only two defined outcomes in any particular trial
More informationTOPIC: PROBABILITY DISTRIBUTIONS
TOPIC: PROBABILITY DISTRIBUTIONS There are two types of random variables: A Discrete random variable can take on only specified, distinct values. A Continuous random variable can take on any value within
More informationUnit 5: Sampling Distributions of Statistics
Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate
More informationUnit 5: Sampling Distributions of Statistics
Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate
More information(# of die rolls that satisfy the criteria) (# of possible die rolls)
BMI 713: Computational Statistics for Biomedical Sciences Assignment 2 1 Random variables and distributions 1. Assume that a die is fair, i.e. if the die is rolled once, the probability of getting each
More informationChapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.
Chapter 14 : Statistical Inference 1 Chapter 14 : Introduction to Statistical Inference Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Data x
More informationNormal distribution Approximating binomial distribution by normal 2.10 Central Limit Theorem
1.1.2 Normal distribution 1.1.3 Approimating binomial distribution by normal 2.1 Central Limit Theorem Prof. Tesler Math 283 Fall 216 Prof. Tesler 1.1.2-3, 2.1 Normal distribution Math 283 / Fall 216 1
More informationMATH 3200 Exam 3 Dr. Syring
. Suppose n eligible voters are polled (randomly sampled) from a population of size N. The poll asks voters whether they support or do not support increasing local taxes to fund public parks. Let M be
More informationBernoulli and Binomial Distributions
Bernoulli and Binomial Distributions Bernoulli Distribution a flipped coin turns up either heads or tails an item on an assembly line is either defective or not defective a piece of fruit is either damaged
More informationStatistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)
Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x
More information4.1 Probability Distributions
Probability and Statistics Mrs. Leahy Chapter 4: Discrete Probability Distribution ALWAYS KEEP IN MIND: The Probability of an event is ALWAYS between: and!!!! 4.1 Probability Distributions Random Variables
More informationData Distributions and Normality
Data Distributions and Normality Definition (Non)Parametric Parametric statistics assume that data come from a normal distribution, and make inferences about parameters of that distribution. These statistical
More informationE509A: Principle of Biostatistics. GY Zou
E509A: Principle of Biostatistics (Week 2: Probability and Distributions) GY Zou gzou@robarts.ca Reporting of continuous data If approximately symmetric, use mean (SD), e.g., Antibody titers ranged from
More informationThe normal distribution is a theoretical model derived mathematically and not empirically.
Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.
More informationMATH 10 INTRODUCTORY STATISTICS
MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. It is Time for Homework Again! ( ω `) Please hand in your homework. Third homework will be posted on the website,
More informationLecture 2 INTERVAL ESTIMATION II
Lecture 2 INTERVAL ESTIMATION II Recap Population of interest - want to say something about the population mean µ perhaps Take a random sample... Recap When our random sample follows a normal distribution,
More informationThe Normal Probability Distribution
1 The Normal Probability Distribution Key Definitions Probability Density Function: An equation used to compute probabilities for continuous random variables where the output value is greater than zero
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao The binomial: mean and variance Recall that the number of successes out of n, denoted
More informationCS 237: Probability in Computing
CS 237: Probability in Computing Wayne Snyder Computer Science Department Boston University Lecture 12: Continuous Distributions Uniform Distribution Normal Distribution (motivation) Discrete vs Continuous
More information