BIOL The Normal Distribution and the Central Limit Theorem

Size: px
Start display at page:

Download "BIOL The Normal Distribution and the Central Limit Theorem"

Transcription

1 BIOL The Normal Distribution and the Central Limit Theorem In the first week of the course, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good summaries when the histogram (or distribution) is symmetric and unimodal. When it is not symmetric, we may want to use the median and IQR as summaries, although for most of the course, we will deal with things that are approximately symmetric and unimodal. Understanding the idea of what a Standard Deviation is, is very important as almost all statistical methods rely on this, and we will see it come up again and again throughout the course (in all of statistics, actually). Recall: The SD can be thought of as a measure of the average deviation from the mean. You ve spent your whole life being told that you can t compare apples and oranges; well were going to present a method for doing just that! Example: I m at a crucial point in my life where I m trying to decide what to do with it; teach or research? For my Masters I graduated with a grade of 87%, and the mean Master s grade is 83% with a standard deviation of 5%. For my course evaluations I have a mean rating of 4.65 (out of 5), and the mean evaluation score is typically 3.5 with a standard deviation of 0.4. Which one am I better at? (Note: These are made up numbers!) If we just look at the values it is hard to compare the two. 87 is 4 larger than 83, and 4.65 is only 1.15 larger than 3.5, but...87 is a lot bigger than 4.65 and you can only go 1.5 over the average of 3.5, and 17 over the average grade of 83 so...how do we compare the two? The answer is to use the Standard Deviation as a measuring stick, as it summarizes the average deviation from the mean. Essentially, we will want to find out how far each one is from its respective mean, in terms of its average deviation from the mean. 1

2 The Masters grade is = 4% above the mean grade, and... 4% in terms of standard deviation, it is = 0.8 standard deviations above the mean. SD=5% The evaluation score is = 1.15 above the mean score, and in terms of standard deviation, it is SD=0.4 mean. = standard deviations above the In terms of each of their own respective means and average deviation from the mean (or SD), the evaluation scores are much higher above their own mean than the Masters grade. In your every day life you are essentially using statistical tools to make decisions, without even knowing it... In my opinion (so don t generalize this to every statistician), statistics is simply a discipline that tries to take the way a person thinks about things and makes logical decisions based on what they observe in every day life, and formalize these into a set of objective rules. Adding or Multiplying each value by a constant: 1. Adding a constant (shifting) If we add a constant (c) to each observation in the data then: The measures of center (mean, median, midrange) will all have the constant (c) added to them, and so will the Quartiles. The measures of spread (variance, SD, range, IQR) will all remain the same. 2. Multiplying by a constant (scaling) If we multiply each observation by a constant (c), then: The measures of center (mean, median, midrange), and the measures of spread (SD, range, IQR) and the Quartiles will all be multiplied by the constant (c). The variance will be multiplied by c 2 In short, adding changes center, but not spread. Multiplying changes the spread and center. Multiplying by a constant is how we change measurement units (eg) Kg to lbs. 2

3 Standardizing (Z-scores): Question: How can we compare observations that were measured on different scales or from two different distributions? Answer: By summarizing how far away each of the observations is from the mean, in terms of its standard deviation (or average deviation from the mean)! The Z-score summarizes how far a given observation (x i ) is from its mean (µ), in terms of it s SD (σ). Z-score (Z)= Z = x i µ σ difference between observation and mean Standard deviation Exercise: A flight from Vancouver to Toronto usually takes 4.5 hours with a SD of 15 minutes. If my last flight took 4 hours and 10 minutes, how far is this from the mean in standard units? When we Standardize, we are adding (actually subtracting) a constant from every observation, and then multiplying (actually dividing) every observation by a constant...check rules on last page If we let M = x i x, then the mean of M is x x = 0, and the SD of M is unchanged. If we now let Z = M, then the mean of Z is the mean of M times the constant, which SD equals 0. The SD of Z is the SD of M times the constant, which is SD = 1. SD So, Z-scores have a mean of 0 and a SD of 1. A positive Z-score means that the observation is above the mean, and a negative z-score means its below it. The farther an observation is from the mean, the larger the Z-score will be in absolute value. 3

4 The Normal Distribution (Bell Curve): This is where we take a small step into the theoretical world of statistics. Many types of data one collects have a distribution that is bell shaped and roughly symmetric, and the Normal Distribution is appropriate for summarizing these (note that we are dealing with only quantitative variables here). (eg) weight, IQ scores,... Characteristics of Normal Model: 1. It is bell-shaped, unimodal, and perfectly symmetric about the mean (µ x ). 2. The spread of the distribution is determined by the standard deviation (σ). 3. This model is denoted by: N(µ, σ 2 ), where µ=mean, σ 2 =Variance, and σ is the SD. 4. The total area under the curve is 100% (just as the total area of the bars for a histogram is 100%) Theoretical Normal Models Porbability (%) N(2, 36) N( 4, 9) N(2, 4) Values f(x) = 1 e (x µx)2 2σ 2 2πσ 2 4

5 Notes: For the Normal Distribution, we use (µ x ) for the mean instead of ( x), and (σ) for the SD instead of (s), why??? The ( x) and (s) are Statistics or sample estimates; numerical summaries of the observed data. (sample) The (µ x ) and (σ) are Parameters or population parameters; that specify the theoretical model. (population) 5

6 Standardized Values (for the Normal Model: ) Z = x µ x σ When we standardize an observation from a Normal Distribution, the Z-score is N(0, 1). What we do is we use a theoretical Normal Distribution to describe the distribution of an observed variable. One must check the histogram to make sure that such a model is appropriate (symmetric and unimodal). We take the observed estimates of the mean and SD, and if a Normal Distribution seems appropriate, then we use the Normal Distribution (with the same mean and SD to approximate the observed data. We then standardize the value(s) of interest, so that we can use a Standard Normal variable (N(0, 1)). We can then answer questions such as: What proportion of males have weights above 190lbs? How many between 210 and 220? and so on... The Rule: Approximately 68% of the data will be within +/ 1 SD of the mean. Approximately 95% of the data will be within +/ 2 SDs of the mean. Approximately 99.7% of the data will be within +/ 3 SDs of the mean. (eg) if a class has a mean grade of 70% and a SD of 5%, then approximately 68% of students will receive grades between 65-75%, approx. 95% will receive grades between 60-80%, and 99.7% between 55-85%. Let s Draw a Picture: 6

7 Finding Percentages Under the Normal Model: 1. Draw a Normal Model and label where the mean is. Then shade the area of interest. 2. Standardize the x-value(s) that are at the boundaries of the area of interest. 3. Use the Normal Table (provided online) to find the area of the shaded region. I have posted a different Normal table than the textbook on the course website, which I will use in lectures. Example: What is the area (probability) below a Z-score of Z = 1.5? What is the area (probability) between Z-scores of and 1.21? Summary: 1. We estimate the mean and SD for our observed data. 2. Check if a Normal Model is appropriate (symmetric, unimodal) 3. If it is, then we standardize the values of interest. 4. Use the Normal Table to find the percentages we are interested in. (the Normal Model is HUGE in statistics, so make sure to practice many of these problems) 7

8 Exercises: 1. Suppose that math SAT scores follow the normal model. The past results of the math SAT exams show that males and females have mean scores of 500 and 455 and standard deviations of 100 and 120, respectively. Steve and Nikki took the math SAT exam, and they both scored 620. (a) Compare their scores using the z-score. (b) What percentage of males score over 600 on the math SAT test? (c) What percentage of females score between 255 and 555 on the math SAT test? 2. Find the area under the Normal Model for the following Z-scores. (a) smaller than (b) bigger than (c) bigger than 2.15 (d) between 0 and 1.18 (e) between and 1.62 (f) smaller than Find the z-scores corresponding to the following percentiles: (a) 50 th (b) 70 th (c) 15 th 4. The Mercury missions from NASA allowed no astronauts to be taller than 180.3cm. The mean height of males is 175.6cm with a standard deviation of 7.1cm. (a) What percentage of males would be ineligible to be astronauts, based on height alone? (b) Find the interquartile range for the height of males. 8

9 ** So far, we have talked about examining a single observation. In the last section we discussed how to work out probabilities when our variable follows a normal distribution. In previous sections, we discussed how to work out probabilities for things that follow a binomial or poission distribution. But, we only talked about examining single observations. Often, we take a sample of (n) observations, and we are interested in the mean or the sum of the n observations. (eg) We may be interested in taking a sample of 10 new floursecent lightbulbs and making some statements about the mean lifetime of the lightbulbs, or maybe even the sum of the lifetime of the 10 bulbs. Maybe we ve developed a new drug, and we want to make some statements about the mean decrease in bloodpressure due to the drug. So, how do we deal with these? Well, the Central Limit Theorem (CLT) makes things easy for us. This is probably the single most important theorem in statistics. Central Limit Theorem This is the fundamental theorem of statistics, and was first proved by Pierre-Simon LaPlace. The CLT says that if our random sample (x 1, x 2,..., x n ) comes from any particular distribution (may not be Normal, but all from the same distribution) with a mean of µ and a variance of σ 2, then when n is large enough, the sample mean x and the sample sum S approximately follow a normal distribution. We also require that the sample observations are independent and random. (ie) If X i Are Independent observations from any distribution with mean µ and variance σ 2, and a large n then... X N(µ, σ2 n ) S N(nµ, nσ 2 ) 9

10 It is not so intuitive at first, but even if the distribution we are sampling from is skewed or even bimodal, the sampling distribution of the mean will be normally distributed. This result is very important and is used extensively throughout statistics, as it tells us that no matter what distribution our random sample (the x i s) come from, that the sample mean X and the sample sum S follow a normal distribution as long as a few assumptions are met. The CLT is an assymptotic result, meaning that when n =, X and S are normally distributed, but when n <, X and S are only approximately normally distributed. This raises the question, when is n large enough? There is no quick answer to this. The more symmetric the distribution of the x i s the smaller n can be. A generally accepted rule is n 25 First let s draw a picture (and explain) what we mean by this, then we will take a look at a simulation example, and then we will discuss the CLT for proportions, the binomial and Poisson distributions. Picture: 10

11 Instead of using math, let s try and convince ourselves that this is true using some intuition and simulations. Simulating The Sampling Distribution of a Mean: Below is a picture of histograms of some simulated rolls of dice. I will talk about these in class. Note: I used dice as the example, as these cover many areas. (ie) A dice can lead to proportions, it can be binomial, and it is an ordinal variable, which is similar to a quantitative/continuous variable Rolls of 1 Die The Mean of Rolls of 2 Dice The Mean of Rolls of 3 Dice Density Density Density die die die3 The Mean of Rolls of 5 Dice The Mean of Rolls of 25 Dice Density Density die die25 We can see that as the sample size increases (1, 2, 3, 5, 25), the sampling distribution of the mean begins to look like a normal model. 11

12 Note: For now, we answer questions like: If the true mean and standard deviation are some known value, then what is the probability of getting certain estimated values? Later, we will use these ideas for testing hypotheses. ** The CLT for continuous variables Example: Suppose we will sample and test 25 lightbulbs to measure their lifetimes. Suppose we know that each of the lightbulbs has a mean lifetime of 1000 hours and a standard deviation of 1000 hours, and follows an exponential distribution (a continuous distribution we have not talked about). (a) What is the probability that the sample mean of the 25 lightbulbs will be more than 1100 hours? (b) What is the probability that the sample sum of the 25 lightbulbs will be more than 23,000? (c) What range of lifetimes would be approximately 95% sure that X will be in? Center this interval at the true mean. ** Now let s start talking about proportions. The CLT and Proportions If we are dealing with something that is categorical/discrete, we often end up examining a proportion. Here, we will see how we can use the normal distribution to approximate the sampling distribution of a proportion. By sampling distribution we mean the following. If we knew the true proportion in the population, then what type of sample proportions are likely to arise in our sample. We use these ideas to test hypotheses later. It turns out that if the true population proportion is p then... p(1 p) σˆp = n 12

13 This follows a normal distribution as long as: np and n(1 p) are both 10 Examples: 1. Suppose I will flip a fair coin 100 times. (a) What is the probability that I get 60% or more heads? (we can and will also answer this question treating it as a binomial) (b) What range of the precentage of heads should I expect to get 95% of the time? Center this interval around the true proportion. 2. There is what is known as the basic strategy for playing Blackjack. Playing this strategy gives the player a 45% chance of winning a given hand. If you play 50 hands, what is the probability that you win more than 50% of the hands (and win $)? 13

14 Normal Approximation to the Binomial Recall: That if X BIN(n, p), then µ x = np and σ 2 x = np(1 p) When n is large and p is not too close to 0 or 1, then... BIN(n, p) N(np, np(1-p)) The rule-of-thumb for this approximation to work is that min{np, n(1 p)} 10 Continuity Correction: Because we are approximating a discrete distribution with a continuous distribution, we must make a continuity correction. (ie) In the discrete case, P(X x) P(X > x) P (X = k) = P (k 0.5 X k + 0.5) P (a X b) = P (a 0.5 X b + 0.5) P (a < X < b) = P (a X b 0.5) P (X < a) = P (X a 0.5) P (X a) = P (X a + 0.5) P (X > a) = P (X a + 0.5) P (X a) = P (X a 0.5) Note: The continuity correction makes little difference when n is large. Examples: 1. Suppose I will flip a fair coin 100 times. (a) What is the probability that I get 60 or more heads? (b) What range of the number of heads should I expect to get 95% of the time? 2. Basic strategy for playing Blackjack. Playing this strategy gives the player a 45% chance of winning a given hand. If you play 50 hands, what is the probability that you win more than half of the hands (and win $)? 14

15 Normal Approximation to the Poisson Recall: That if X POISSON(λ ), then µ x = λ and σ 2 x = λ When λ is large, then... POISSON(λ ) N(λ, λ ) The rule-of-thumb for this approximation to work is that λ 20 Like in the Binomial case, here we are approximating a discrete distribution using a continuous distribution, so we must use the same continuity correction as in the Binomial case. Example: 1. Recall the example from the section on Poisson processes. We were monitoring the number of earthquakes in California over 6.7, and there were an average of 1.5 per year. (a) What is the probability of having more than 28 large earthquakes in the next 15 years? 15

16 Examples: 1. You have designed a new sattelite that is planned to orbit in space for the next 150 years. It is set up in the following way. It has 25 battery packs. One powers the sattelite, and when it burns out the next battery takes over, and when that one burns out the next takes over, and so on. The lifetime of each battery follows an exponential distribution with a mean of 8 years and a standard deviation of 8 years. What is the probability that the sattelite runs out of batteries before the 150 years is up? 2. A standard bottle of beer advertises that it contains 341mL of beer. In fact, the machine that pours the beer into the bottle pours a mean amount of 343mL with a standard deviation of 2mL. The amount of beer poured follows a normal distribution. (a) What is the probability that a randomly selected bottle of beer is underfilled? (b) If you buy a two-four, what is the probability that no more than 4 bottles are underfilled? (c) If you buy a 6-pack, what is the probability that the average amount of liquid is less than 341mL? 3. An elevator has a limit of 10 people or 2000lbs. If 10 people get on the elevator what is the probability that they surpass the limit? Suppose that the weights of people follow a normal distribution with a mean of 170lbs and a standard deviation of 30lbs. 4. It is believed that 4% of children have a gene that may be linked to juvenille diabetes. Researchers are hoping to track 20 or more of these children (with the defect) for several years. they will test 732 newborn babies for the presence of this gene, and if the gene is present, they will track the child for several years. What is the probability that they find 20 or more subjects to be in the study? (you can answer in terms of a proportion, or as a binomial) 16

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good

More information

STAT Chapter 7: Central Limit Theorem

STAT Chapter 7: Central Limit Theorem STAT 251 - Chapter 7: Central Limit Theorem In this chapter we will introduce the most important theorem in statistics; the central limit theorem. What have we seen so far? First, we saw that for an i.i.d

More information

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good

More information

STAT 241/251 - Chapter 7: Central Limit Theorem

STAT 241/251 - Chapter 7: Central Limit Theorem STAT 241/251 - Chapter 7: Central Limit Theorem In this chapter we will introduce the most important theorem in statistics; the central limit theorem. What have we seen so far? First, we saw that for an

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Section The Sampling Distribution of a Sample Mean

Section The Sampling Distribution of a Sample Mean Section 5.2 - The Sampling Distribution of a Sample Mean Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin The Sampling Distribution of a Sample Mean Example: Quality control check of light

More information

Midterm Exam III Review

Midterm Exam III Review Midterm Exam III Review Dr. Joseph Brennan Math 148, BU Dr. Joseph Brennan (Math 148, BU) Midterm Exam III Review 1 / 25 Permutations and Combinations ORDER In order to count the number of possible ways

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

Part V - Chance Variability

Part V - Chance Variability Part V - Chance Variability Dr. Joseph Brennan Math 148, BU Dr. Joseph Brennan (Math 148, BU) Part V - Chance Variability 1 / 78 Law of Averages In Chapter 13 we discussed the Kerrich coin-tossing experiment.

More information

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1

8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions 8-1 8.2 The Standard Deviation as a Ruler Chapter 8 The Normal and Other Continuous Distributions For Example: On August 8, 2011, the Dow dropped 634.8 points, sending shock waves through the financial community.

More information

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.) Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop

More information

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted Figure 1: Math 223 Lecture Notes 4/1/04 Section 4.10 The normal distribution Recall that a continuous random variable X with probability distribution function f(x) = 1 µ)2 (x e 2σ 2πσ is said to have a

More information

Lecture 9. Probability Distributions. Outline. Outline

Lecture 9. Probability Distributions. Outline. Outline Outline Lecture 9 Probability Distributions 6-1 Introduction 6- Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7- Properties of the Normal Distribution

More information

STAT 201 Chapter 6. Distribution

STAT 201 Chapter 6. Distribution STAT 201 Chapter 6 Distribution 1 Random Variable We know variable Random Variable: a numerical measurement of the outcome of a random phenomena Capital letter refer to the random variable Lower case letters

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

Lecture 9. Probability Distributions

Lecture 9. Probability Distributions Lecture 9 Probability Distributions Outline 6-1 Introduction 6-2 Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7-2 Properties of the Normal Distribution

More information

Commonly Used Distributions

Commonly Used Distributions Chapter 4: Commonly Used Distributions 1 Introduction Statistical inference involves drawing a sample from a population and analyzing the sample data to learn about the population. We often have some knowledge

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

Normal Model (Part 1)

Normal Model (Part 1) Normal Model (Part 1) Formulas New Vocabulary The Standard Deviation as a Ruler The trick in comparing very different-looking values is to use standard deviations as our rulers. The standard deviation

More information

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations MLLunsford 1 Activity: Central Limit Theorem Theory and Computations Concepts: The Central Limit Theorem; computations using the Central Limit Theorem. Prerequisites: The student should be familiar with

More information

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Chapter 14: random variables p394 A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Consider the experiment of tossing a coin. Define a random variable

More information

The Normal Distribution

The Normal Distribution 5.1 Introduction to Normal Distributions and the Standard Normal Distribution Section Learning objectives: 1. How to interpret graphs of normal probability distributions 2. How to find areas under the

More information

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10 ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10 Fall 2011 Lecture 8 Part 2 (Fall 2011) Probability Distributions Lecture 8 Part 2 1 / 23 Normal Density Function f

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter7 Probability Distributions and Statistics Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number of boys in

More information

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Chapter 14: random variables p394 A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon. Consider the experiment of tossing a coin. Define a random variable

More information

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by Normal distribution The normal distribution is the most important distribution. It describes well the distribution of random variables that arise in practice, such as the heights or weights of people,

More information

Statistics and Probability

Statistics and Probability Statistics and Probability Continuous RVs (Normal); Confidence Intervals Outline Continuous random variables Normal distribution CLT Point estimation Confidence intervals http://www.isrec.isb-sib.ch/~darlene/geneve/

More information

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution Section 7.6 Application of the Normal Distribution A random variable that may take on infinitely many values is called a continuous random variable. A continuous probability distribution is defined by

More information

Chapter 7: Point Estimation and Sampling Distributions

Chapter 7: Point Estimation and Sampling Distributions Chapter 7: Point Estimation and Sampling Distributions Seungchul Baek Department of Statistics, University of South Carolina STAT 509: Statistics for Engineers 1 / 20 Motivation In chapter 3, we learned

More information

Business Statistics 41000: Probability 4

Business Statistics 41000: Probability 4 Business Statistics 41000: Probability 4 Drew D. Creal University of Chicago, Booth School of Business February 14 and 15, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office:

More information

1 Sampling Distributions

1 Sampling Distributions 1 Sampling Distributions 1.1 Statistics and Sampling Distributions When a random sample is selected the numerical descriptive measures calculated from such a sample are called statistics. These statistics

More information

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability. An intro for calculus students P= Figure 1: A normal integral Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided

More information

Chapter 7 Study Guide: The Central Limit Theorem

Chapter 7 Study Guide: The Central Limit Theorem Chapter 7 Study Guide: The Central Limit Theorem Introduction Why are we so concerned with means? Two reasons are that they give us a middle ground for comparison and they are easy to calculate. In this

More information

LECTURE 6 DISTRIBUTIONS

LECTURE 6 DISTRIBUTIONS LECTURE 6 DISTRIBUTIONS OVERVIEW Uniform Distribution Normal Distribution Random Variables Continuous Distributions MOST OF THE SLIDES ADOPTED FROM OPENINTRO STATS BOOK. NORMAL DISTRIBUTION Unimodal and

More information

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc.

The Standard Deviation as a Ruler and the Normal Model. Copyright 2009 Pearson Education, Inc. The Standard Deviation as a Ruler and the Normal Mol Copyright 2009 Pearson Education, Inc. The trick in comparing very different-looking values is to use standard viations as our rulers. The standard

More information

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4

AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4 AMS 7 Sampling Distributions, Central limit theorem, Confidence Intervals Lecture 4 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Summer 2014 1 / 26 Sampling Distributions!!!!!!

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number

More information

Lecture 6: Chapter 6

Lecture 6: Chapter 6 Lecture 6: Chapter 6 C C Moxley UAB Mathematics 3 October 16 6.1 Continuous Probability Distributions Last week, we discussed the binomial probability distribution, which was discrete. 6.1 Continuous Probability

More information

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables

Chapter 5. Continuous Random Variables and Probability Distributions. 5.1 Continuous Random Variables Chapter 5 Continuous Random Variables and Probability Distributions 5.1 Continuous Random Variables 1 2CHAPTER 5. CONTINUOUS RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS Probability Distributions Probability

More information

The Normal Probability Distribution

The Normal Probability Distribution 1 The Normal Probability Distribution Key Definitions Probability Density Function: An equation used to compute probabilities for continuous random variables where the output value is greater than zero

More information

Statistics 511 Supplemental Materials

Statistics 511 Supplemental Materials Gaussian (or Normal) Random Variable In this section we introduce the Gaussian Random Variable, which is more commonly referred to as the Normal Random Variable. This is a random variable that has a bellshaped

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

Chapter 6. The Normal Probability Distributions

Chapter 6. The Normal Probability Distributions Chapter 6 The Normal Probability Distributions 1 Chapter 6 Overview Introduction 6-1 Normal Probability Distributions 6-2 The Standard Normal Distribution 6-3 Applications of the Normal Distribution 6-5

More information

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS A random variable is the description of the outcome of an experiment in words. The verbal description of a random variable tells you how to find or calculate

More information

Chapter 15: Sampling distributions

Chapter 15: Sampling distributions =true true Chapter 15: Sampling distributions Objective (1) Get "big picture" view on drawing inferences from statistical studies. (2) Understand the concept of sampling distributions & sampling variability.

More information

Sampling Distributions For Counts and Proportions

Sampling Distributions For Counts and Proportions Sampling Distributions For Counts and Proportions IPS Chapter 5.1 2009 W. H. Freeman and Company Objectives (IPS Chapter 5.1) Sampling distributions for counts and proportions Binomial distributions for

More information

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment

More information

What was in the last lecture?

What was in the last lecture? What was in the last lecture? Normal distribution A continuous rv with bell-shaped density curve The pdf is given by f(x) = 1 2πσ e (x µ)2 2σ 2, < x < If X N(µ, σ 2 ), E(X) = µ and V (X) = σ 2 Standard

More information

7 THE CENTRAL LIMIT THEOREM

7 THE CENTRAL LIMIT THEOREM CHAPTER 7 THE CENTRAL LIMIT THEOREM 373 7 THE CENTRAL LIMIT THEOREM Figure 7.1 If you want to figure out the distribution of the change people carry in their pockets, using the central limit theorem and

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

University of California, Los Angeles Department of Statistics. Normal distribution

University of California, Los Angeles Department of Statistics. Normal distribution University of California, Los Angeles Department of Statistics Statistics 110A Instructor: Nicolas Christou Normal distribution The normal distribution is the most important distribution. It describes

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions

More information

Sampling Distributions and the Central Limit Theorem

Sampling Distributions and the Central Limit Theorem Sampling Distributions and the Central Limit Theorem February 18 Data distributions and sampling distributions So far, we have discussed the distribution of data (i.e. of random variables in our sample,

More information

A.REPRESENTATION OF DATA

A.REPRESENTATION OF DATA A.REPRESENTATION OF DATA (a) GRAPHS : PART I Q: Why do we need a graph paper? Ans: You need graph paper to draw: (i) Histogram (ii) Cumulative Frequency Curve (iii) Frequency Polygon (iv) Box-and-Whisker

More information

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative STAT:10 Statistical Methods and Computing Normal Distributions Lecture 4 Feb. 6, 17 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowa.edu 1 2 Using density curves to describe the distribution of values of

More information

AMS7: WEEK 4. CLASS 3

AMS7: WEEK 4. CLASS 3 AMS7: WEEK 4. CLASS 3 Sampling distributions and estimators. Central Limit Theorem Normal Approximation to the Binomial Distribution Friday April 24th, 2015 Sampling distributions and estimators REMEMBER:

More information

AP Stats ~ Lesson 6B: Transforming and Combining Random variables

AP Stats ~ Lesson 6B: Transforming and Combining Random variables AP Stats ~ Lesson 6B: Transforming and Combining Random variables OBJECTIVES: DESCRIBE the effects of transforming a random variable by adding or subtracting a constant and multiplying or dividing by a

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

5.1 Personal Probability

5.1 Personal Probability 5. Probability Value Page 1 5.1 Personal Probability Although we think probability is something that is confined to math class, in the form of personal probability it is something we use to make decisions

More information

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr.

Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 6 Normal Probability Distribution QMIS 120. Dr. Department of Quantitative Methods & Information Systems Business Statistics Chapter 6 Normal Probability Distribution QMIS 120 Dr. Mohammad Zainal Chapter Goals After completing this chapter, you should

More information

Chapter 5. Sampling Distributions

Chapter 5. Sampling Distributions Lecture notes, Lang Wu, UBC 1 Chapter 5. Sampling Distributions 5.1. Introduction In statistical inference, we attempt to estimate an unknown population characteristic, such as the population mean, µ,

More information

Module 4: Probability

Module 4: Probability Module 4: Probability 1 / 22 Probability concepts in statistical inference Probability is a way of quantifying uncertainty associated with random events and is the basis for statistical inference. Inference

More information

Lecture 8 - Sampling Distributions and the CLT

Lecture 8 - Sampling Distributions and the CLT Lecture 8 - Sampling Distributions and the CLT Statistics 102 Kenneth K. Lopiano September 18, 2013 1 Basics Improvements 2 Variability of Estimates Activity Sampling distributions - via simulation Sampling

More information

Binomial Random Variable - The count X of successes in a binomial setting

Binomial Random Variable - The count X of successes in a binomial setting 6.3.1 Binomial Settings and Binomial Random Variables What do the following scenarios have in common? Toss a coin 5 times. Count the number of heads. Spin a roulette wheel 8 times. Record how many times

More information

Density curves. (James Madison University) February 4, / 20

Density curves. (James Madison University) February 4, / 20 Density curves Figure 6.2 p 230. A density curve is always on or above the horizontal axis, and has area exactly 1 underneath it. A density curve describes the overall pattern of a distribution. Example

More information

The normal distribution is a theoretical model derived mathematically and not empirically.

The normal distribution is a theoretical model derived mathematically and not empirically. Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial. Lecture 21,22, 23 Text: A Course in Probability by Weiss 8.5 STAT 225 Introduction to Probability Models March 31, 2014 Standard Sums of Whitney Huang Purdue University 21,22, 23.1 Agenda 1 2 Standard

More information

Introduction to Business Statistics QM 120 Chapter 6

Introduction to Business Statistics QM 120 Chapter 6 DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 6 Spring 2008 Chapter 6: Continuous Probability Distribution 2 When a RV x is discrete, we can

More information

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial Lecture 23 STAT 225 Introduction to Probability Models April 4, 2014 approximation Whitney Huang Purdue University 23.1 Agenda 1 approximation 2 approximation 23.2 Characteristics of the random variable:

More information

PROBABILITY DISTRIBUTIONS

PROBABILITY DISTRIBUTIONS CHAPTER 3 PROBABILITY DISTRIBUTIONS Page Contents 3.1 Introduction to Probability Distributions 51 3.2 The Normal Distribution 56 3.3 The Binomial Distribution 60 3.4 The Poisson Distribution 64 Exercise

More information

Statistics, Their Distributions, and the Central Limit Theorem

Statistics, Their Distributions, and the Central Limit Theorem Statistics, Their Distributions, and the Central Limit Theorem MATH 3342 Sections 5.3 and 5.4 Sample Means Suppose you sample from a popula0on 10 0mes. You record the following sample means: 10.1 9.5 9.6

More information

Chapter 9: Sampling Distributions

Chapter 9: Sampling Distributions Chapter 9: Sampling Distributions 9. Introduction This chapter connects the material in Chapters 4 through 8 (numerical descriptive statistics, sampling, and probability distributions, in particular) with

More information

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION We have examined discrete random variables, those random variables for which we can list the possible values. We will now look at continuous random variables.

More information

Review of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet...

Review of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet... Recap Review of commonly missed questions on the online quiz Lecture 7: ] Statistics 101 Mine Çetinkaya-Rundel OpenIntro quiz 2: questions 4 and 5 September 20, 2011 Statistics 101 (Mine Çetinkaya-Rundel)

More information

I. Standard Error II. Standard Error III. Standard Error 2.54

I. Standard Error II. Standard Error III. Standard Error 2.54 1) Original Population: Match the standard error (I, II, or III) with the correct sampling distribution (A, B, or C) and the correct sample size (1, 5, or 10) I. Standard Error 1.03 II. Standard Error

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution January 31, 2019 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The

More information

Section Distributions of Random Variables

Section Distributions of Random Variables Section 8.1 - Distributions of Random Variables Definition: A random variable is a rule that assigns a number to each outcome of an experiment. Example 1: Suppose we toss a coin three times. Then we could

More information

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables You are dealt a hand of 5 cards. Find the probability distribution table for the number of hearts. Graph

More information

Theoretical Foundations

Theoretical Foundations Theoretical Foundations Probabilities Monia Ranalli monia.ranalli@uniroma2.it Ranalli M. Theoretical Foundations - Probabilities 1 / 27 Objectives understand the probability basics quantify random phenomena

More information

Chapter 5: Statistical Inference (in General)

Chapter 5: Statistical Inference (in General) Chapter 5: Statistical Inference (in General) Shiwen Shen University of South Carolina 2016 Fall Section 003 1 / 17 Motivation In chapter 3, we learn the discrete probability distributions, including Bernoulli,

More information

FINAL REVIEW W/ANSWERS

FINAL REVIEW W/ANSWERS FINAL REVIEW W/ANSWERS ( 03/15/08 - Sharon Coates) Concepts to review before answering the questions: A population consists of the entire group of people or objects of interest to an investigator, while

More information

work to get full credit.

work to get full credit. Chapter 18 Review Name Date Period Write complete answers, using complete sentences where necessary.show your work to get full credit. MULTIPLE CHOICE. Choose the one alternative that best completes the

More information

Sampling Distribution Models. Copyright 2009 Pearson Education, Inc.

Sampling Distribution Models. Copyright 2009 Pearson Education, Inc. Sampling Distribution Mols Copyright 2009 Pearson Education, Inc. Rather than showing real repeated samples, imagine what would happen if we were to actually draw many samples. The histogram we d get if

More information

Math 243 Lecture Notes

Math 243 Lecture Notes Assume the average annual rainfall for in Portland is 36 inches per year with a standard deviation of 9 inches. Also assume that the average wind speed in Chicago is 10 mph with a standard deviation of

More information

The graph of a normal curve is symmetric with respect to the line x = µ, and has points of

The graph of a normal curve is symmetric with respect to the line x = µ, and has points of Stat 400, section 4.3 Normal Random Variables notes prepared by Tim Pilachowski Another often-useful probability density function is the normal density function, which graphs as the familiar bell-shaped

More information

Section Introduction to Normal Distributions

Section Introduction to Normal Distributions Section 6.1-6.2 Introduction to Normal Distributions 2012 Pearson Education, Inc. All rights reserved. 1 of 105 Section 6.1-6.2 Objectives Interpret graphs of normal probability distributions Find areas

More information

CHAPTER 6 Random Variables

CHAPTER 6 Random Variables CHAPTER 6 Random Variables 6.3 Binomial and Geometric Random Variables The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Binomial and Geometric Random

More information

Statistical Methods in Practice STAT/MATH 3379

Statistical Methods in Practice STAT/MATH 3379 Statistical Methods in Practice STAT/MATH 3379 Dr. A. B. W. Manage Associate Professor of Mathematics & Statistics Department of Mathematics & Statistics Sam Houston State University Overview 6.1 Discrete

More information

AP * Statistics Review

AP * Statistics Review AP * Statistics Review Normal Models and Sampling Distributions Teacher Packet AP* is a trademark of the College Entrance Examination Board. The College Entrance Examination Board was not involved in the

More information

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics Chapter 5 Student Lecture Notes 5-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 5 Discrete Probability Distributions QMIS 120 Dr. Mohammad Zainal Chapter Goals

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution January 31, 2018 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Exam Name The bar graph shows the number of tickets sold each week by the garden club for their annual flower show. ) During which week was the most number of tickets sold? ) A) Week B) Week C) Week 5

More information

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Chapter 7 Sampling Distributions and Point Estimation of Parameters Chapter 7 Sampling Distributions and Point Estimation of Parameters Part 1: Sampling Distributions, the Central Limit Theorem, Point Estimation & Estimators Sections 7-1 to 7-2 1 / 25 Statistical Inferences

More information

4.2 Bernoulli Trials and Binomial Distributions

4.2 Bernoulli Trials and Binomial Distributions Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan 4.2 Bernoulli Trials and Binomial Distributions A Bernoulli trial 1 is an experiment with exactly two outcomes: Success and

More information

8.1 Estimation of the Mean and Proportion

8.1 Estimation of the Mean and Proportion 8.1 Estimation of the Mean and Proportion Statistical inference enables us to make judgments about a population on the basis of sample information. The mean, standard deviation, and proportions of a population

More information

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Math 224 Fall 207 Homework 5 Drew Armstrong Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Section 3., Exercises 3, 0. Section 3.3, Exercises 2, 3, 0,.

More information

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic

More information

Chapter 7: Random Variables

Chapter 7: Random Variables Chapter 7: Random Variables 7.1 Discrete and Continuous Random Variables 7.2 Means and Variances of Random Variables 1 Introduction A random variable is a function that associates a unique numerical value

More information