Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny.

Size: px
Start display at page:

Download "Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny."

Transcription

1 Distributions September 17

2 Random variables Anything that can be measured or categorized is called a variable If the value that a variable takes on is subject to variability, then it the variable is a random variable We have already seen several random variables in the data that we have looked at so far: sex, class, age, reduction in FVC

3 Distributions Random variables Data and probability meet in the notion of a distribution A probability distribution applies the theory of probability to describe the behavior of a random variable A distribution describes the probability that a random variable will be observed to take on a specific value or fall within a specific range of values

4 Discrete distributions Categorical variables are said to have discrete probability distributions In a discrete distribution, variables can only take on a finite number of values Because they only take on a finite number of values, the distribution can describe the probability that each value will occur Examples: Random variable Possible outcomes Survival Yes, no # of copies of a genetic mutation 0,1,2 # of children a woman will have in her lifetime 0,1,2,... # of people in a sample who smoke 0,1,2,...,n

5 Continuous distributions Continuous variables can take on an infinite number of possible values Because of this, any particular value will have probability 0 So what does a continuous distribution describe? It describes the probability that a continuous random variable will fall within a certain range Examples Random variable Height Weight Cholesterol levels Survival time

6 Listing the ways Random variables The binomial coefficients When trying to figure out the probability of something, it is sometimes very helpful to list all the different ways that the random process can turn out If all the ways are equally likely, then each one has probability, where n is the total number of ways 1 n Thus, the probability of the event is the number of ways it can happen divided by n

7 Genetics example Random variables The binomial coefficients For example, the possible outcomes of an individual inheriting cystic fibrosis genes are CC Cc cc cc If all these possibilities are equally likely (as they would be if the individual s parents had one copy of each version of the gene), then the probability of having one copy of each version is 2/4

8 Coin example Random variables The binomial coefficients Another example where the outcomes are equally likely is flips of a coin Suppose we flip a coin three times; what is the probability that exactly one of the flips was heads? Possible outcomes: HHH HHT HT H HT T T HH T HT T T H T T T The probability is therefore 3/8

9 The binomial coefficients The binomial coefficients Counting the number of ways something can happen quickly becomes a hassle (imagine flipping a coin 100 times and counting the number of times 20 heads show up) Luckily, mathematicians long ago discovered that when there are two possible outcomes that occur n times, the number of ways of k of them happening is n! k!(n k)! The notation n! means to multiply n by all the positive numbers that come before it (e.g. 3! = 3 2 1)

10 The binomial coefficients Calculating the binomial coefficients For the coin example, we could have used the binomial coefficients instead of listing all the ways the flips could happen: 3! 1!(3 1)! = (1) = 3 Many calculators and computer programs (including SAS) have specific functions for calculating binomial coefficients, which we will explore in lab

11 The binomial coefficients When sequences are not equally likely Suppose we draw 3 balls, with replacement, from an urn that contains 10 balls: 2 red balls and 8 green balls What is the probability that we will draw two red balls? As before, there are three possible sequences: RRG, RGR, and GRR, but the sequences no longer have probability 1 8

12 The binomial coefficients When sequences are not equally likely (cont d) The probability of each sequence is = = Thus, the probability of drawing two red balls is = 9.6% 2 10

13 The binomial formula Random variables The binomial coefficients This line of reasoning can be summarized in the following formula: the probability that an event will occur k times out of n is n! k!(n k)! pk (1 p) n k In this formula, n is the number of trials, k is the number of times the event is to occur, and p is the probability that the event will occur on any particular trial

14 The binomial formula (cont d) The binomial coefficients This formula works for any random variable that counts the number of times an event occurs out of n trials, provided that the following assumptions are met: The number of trials n must be fixed in advance The probability that the event occurs, p, must be the same from trial to trial The trials must be independent If these assumptions are met, the random variable is said to follow a binomial distribution, or to be binomially distributed

15 Example Random variables The binomial coefficients According to the CDC, 22% of the adults in the United States smoke Suppose we sample 10 people; what is the probability that 5 of them will smoke? We can use the binomial formula, with 10! 5!(10 5)!.225 (1.22) 10 5 = 3.7%

16 Example (cont d) Random variables The binomial coefficients What is the probability that our sample will contain two or fewer smokers? We can add up probabilities from the binomial distribution: P (x 2) = P (x = 0) + P (x = 1) + P (x = 2) = = 61.7%

17 NHANES Random variables The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Every few years, the CDC conducts a huge survey of randomly chosen Americans called the National Health and Nutrition Examination Survey (NHANES) Hundreds of variables are measured on these individuals: Demographic variables like age, education, and income Physiological variables like height, weight, blood pressure, and cholesterol levels Dietary habits Disease status Lots more: everything from cavities to sexual behavior

18 A common histogram shape The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Histograms of infant mortality rates, heights, and cholesterol levels: Africa NHANES (adult women) NHANES (adult women) Frequency Frequency Frequency Infant mortality rate Height (inches) HDL cholesterol What do these histograms have in common?

19 The normal curve Random variables The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Mathematicians discovered long ago that the equation y = 1 2π e x2 /2 described the histograms of many random variables y x

20 Features of the normal curve The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution The normal curve is symmetric around x = 0 The normal curve is always positive The normal curve drops rapidly down to zero as x moves away from 0

21 Standardizing a variable The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Before we can see the normal curve in action, we must discuss standardization of variables To standardize a variable, we subtract the average and divide by the standard deviation: x std i = x i x s x Once you have standardized a variable, its mean will be 0 and its standard deviation will be 1 Standardized values tell you how many standard deviations away from the mean an observation is

22 The normal curve in action The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Africa NHANES (adult women) NHANES (adult women) Density Density Density Infant mortality rate (standard units) Height (standard units) HDL cholesterol (standard units) Note that the data has been standardized and that the vertical axis is now called density Data whose histogram looks like the normal curve are said to be normally distributed or to follow a normal distribution

23 Probabilities from the normal curve The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Probabilities are given by the area under the normal curve: Density x

24 The 68%/95% rule Random variables The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution This is where the 68%/95% rule of thumb that we discussed earlier comes from: P=68% P=95% P=100% Density Density Density x x x

25 Calculating probabilities The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution By knowing that the total area under the normal curve is 1, we can get a rough idea of the area under a curve by looking at a plot However, to get exact numbers, we will need a computer How much area is under this normal curve? is an extremely common question in statistics, and programmers have developed algorithms to answer this question very quickly The output from these algorithms is commonly collected into tables, which is what you will have to use for exams

26 The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Calculating the area under a normal curve, example 1 Find the area under the normal curve between 0 and 1 Density Density Density x x x.84.5 =.34

27 The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Calculating the area under a normal curve, example 2 Find the area under the normal curve above 1 Density Density Density x x x 1.84 =.16

28 The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Calculating the area under a normal curve, example 3 Find the area under the normal curve that lies outside -1 and 1 Density Density Density x x x 1 - ( ) =.32 Alternatively, we could have used symmetry: 2(.16)=.32

29 Calculating percentiles The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution A related question of interest is, What is the xth percentile of the normal curve? This is the opposite of the earlier question: instead of being given a value and asked to find the area to the left of the value, now we are told the area to the left and asked to find the value With a table, we can perform this inverse search by finding the probability in the body of the table, then looking to the margins to find the percentile associated with it

30 Calculating percentiles (cont d) The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution What is the 60th percentile of the normal curve? There is no.600 in the table, but there is a.599, which corresponds to 0.25 The real 60th percentile must lie between 0.25 and 0.26 (it s actually ) For this class, 0.25, 0.26, or anything in between is an acceptable answer How about the 10th percentile? The 10th percentile is -1.28

31 The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Calculating values such that a certain area lies within/outside them Find the number x such that the area outside x and x is equal to 10% Density Density Density x x x Our answer is therefore ±1.645 (the 5th/95th percentile)

32 Reconstructing a histogram The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution In week 2, we said that the mean and standard deviation provide a two-number summary of a histogram We can now make this observation a little more concrete Anything we could have learned from a histogram, we will now determine by approximating the real distribution of the data by the normal distribution This approach is called the normal approximation

33 NHANES adult women The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution The data set we will work with on these examples is the NHANES sample of the heights of 2,649 adult women The mean height is 63.5 inches The standard deviation of height is 2.75 inches

34 Estimating probabilities The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Suppose we want to estimate the percent of women who are under 5 feet tall 5 feet, or 60 inches, is 3.5/2.75=1.27 standard deviations below the mean Using the normal distribution, the probability of more than 1.27 standard deviations below the mean is P (x < 1.27) = 10.2% In the actual sample, 10.6% of women were under 5 feet tall

35 Estimating probabilities (cont d) The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Another example: suppose we want to estimate the percent of women who are between 5 3 and 5 6 (63 and 66 inches) These heights are 0.18 standard deviations below the mean and 0.91 standard deviations above the mean, respectively Using the normal distribution, the probability of falling in this region is 39.0% In the actual data set, 38.8% of women are between 5 3 and 5 6

36 Approximating percentiles The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Suppose instead that we wished to find the 75th percentile of these women s heights For the normal distribution, 0.67 is the 75th percentile The mean plus 0.67 standard deviations in height is inches For the actual data, the 75th percentile is inches

37 The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution The broad applicability of the normal approximation These examples are by no means special: the distribution of many random variables are very closely approximated by the normal distribution Indeed, this is why statisticians call it the normal distribution Other names for the normal distribution include the Gaussian distribution (after its inventor) and the bell curve (after its shape) For variables with approximately normal distributions, the mean and standard deviation essentially tell us everything about the data other summary statistics and graphics are redundant

38 Caution Random variables The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution Other variables, however, are not approximated by the normal distribution well, and give misleading or nonsensical results when you apply the normal approximation to them For example, the value 0 lies 1.63 standard deviations below the mean infant mortality rate for Europe The normal approximation therefore predicts a probability that 5.1% of the countries in Europe will have negative infant mortality rates

39 Caution (cont d) Random variables The normal curve Calculating probabilities and percentiles from the normal curve Approximating data with the normal distribution As another example, the normal distribution will always predict the median to lie 0 standard deviations above the mean i.e., it will always predict that the median equals the mean As we have seen, however, the mean and median can differ greatly when distributions are skewed For example, according to the U.S. census bureau, the mean income in the United States is $66,570, while the median income is $48,201

40 Data distributions Random variables Overview Example So far, we have discussed the distribution of data Certain random variables, like the number of males or smokers in our study, followed a binomial distribution Other random variables, like height, seemed to follow a normal distribution

41 Overview Example For the rest of the course, we will be concerned with the distribution of statistics For example: we go out, collect a sample of 10 people, measure their heights, and then take the average If we were to repeat this procedure, we would get a different average Thus, we can think about the average of our sample as a random variable Like any random variable, it will have a distribution This distribution is called a sampling distribution, to reflect the fact that it is subject to variability depending on the random sample

42 What s the point? Random variables Overview Example In practice, no one obtains sampling distributions directly Investigators do not collect 10 samples of 10 individuals and report 10 different means If they can afford to sample 100 people, they collect a single sample of 100 people and report a single mean So why do we study sampling distributions?

43 What s the point? (cont d) Overview Example The reason we study sampling distributions is to understand how, in theory, our summary statistic would be distributed If we understand the sampling distribution of the average height, then we will understand the amount of variability likely to be present in the actual, single average that we calculate Understanding sampling distributions is central to inference, and to answering the question: How accurate is my generalization to the population likely to be?

44 Seeing sampling distributions Overview Example As we said before, investigators do not collect multiple samples in order to look at their sampling distributions In order to see sampling distributions, we will have to do one of the following: Argue or prove that the statistic will have a certain distribution, like the normal or binomial distribution Conduct a computer simulation in which we do collect multiple samples; we can then summarize, graph, and describe the sampling distribution of our summary statistic

45 Crossover trials Random variables Overview Example For example, consider the placebo-controlled experiment from assignment 2 which tested whether a new drug for cystic fibrosis was effective at preventing deterioration of lung function I didn t tell you this at the time, but in reality, the trial featured a crossover design: each participant in the trial received both the drug and the placebo (at different times), crossing over to receive the other treatment halfway through the trial Like all well-designed crossover trials, the therapy (treatment/placebo) that each participant received first was chosen at random Furthermore, there was a washout period during the crossover between the two drug periods

46 Cystic fibrosis crossover trial Overview Example So, for our cystic fibrosis study, let s consider the statistic x = # of patients who did better on drug than placebo In the actual experiment, 11 patients did better on the drug than placebo (x = 11) But how variable is x? Could it have easily been 7, or 14, or even 5? This is why we study sampling distributions

47 Overview Example A ball and urn recreation of the experiment One way to get a handle on the variability of x is to recreate our experiment using balls and urns Consider placing 11 balls saying drug and 3 balls saying placebo into an urn We can recreate our experiment by drawing the 14 balls out again with replacement and counting the number of drug balls (x) On a computer, this experiment can be replicated 10,000 times in a fraction of a second

48 Results of the recreation Overview Example Pct x

49 Using the binomial distribution Overview Example Of course, this is sort of the long way around we could have gotten these percentages without a computer by recognizing that x satisfies all the conditions of the binomial distribution Here, we would say that x follows a binomial distribution with n = 14 and p = 11/14 This would give us the same percentages as before, to within the experimental error of the simulation Both are perfectly valid ways of exploring sampling distributions to get a sense of the variability of sample statistics We will discuss how to explore sampling distributions more explicitly in the coming weeks

Random variables The binomial distribution The normal distribution Other distributions. Distributions. Patrick Breheny.

Random variables The binomial distribution The normal distribution Other distributions. Distributions. Patrick Breheny. Distributions February 11 Random variables Anything that can be measured or categorized is called a variable If the value that a variable takes on is subject to variability, then it the variable is a random

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution Patrick Breheny February 16 Patrick Breheny STA 580: Biostatistics I 1/38 Random variables The Binomial Distribution Random variables The binomial coefficients The binomial distribution

More information

The Binomial Distribution

The Binomial Distribution Patrick Breheny February 21 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 1 / 16 So far, we have discussed the probability of single events In research, however, the data

More information

The Binomial Distribution

The Binomial Distribution Patrick Breheny September 13 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 16 Outcomes and summary statistics Random variables Distributions So far, we have discussed the

More information

Sampling Distributions and the Central Limit Theorem

Sampling Distributions and the Central Limit Theorem Sampling Distributions and the Central Limit Theorem February 18 Data distributions and sampling distributions So far, we have discussed the distribution of data (i.e. of random variables in our sample,

More information

The normal distribution is a theoretical model derived mathematically and not empirically.

The normal distribution is a theoretical model derived mathematically and not empirically. Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.

More information

Math 227 Elementary Statistics. Bluman 5 th edition

Math 227 Elementary Statistics. Bluman 5 th edition Math 227 Elementary Statistics Bluman 5 th edition CHAPTER 6 The Normal Distribution 2 Objectives Identify distributions as symmetrical or skewed. Identify the properties of the normal distribution. Find

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Patrick Breheny March 1 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 1 / 29 Kerrich s experiment Introduction The law of averages Mean and SD of

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

Lab #7. In previous lectures, we discussed factorials and binomial coefficients. Factorials can be calculated with:

Lab #7. In previous lectures, we discussed factorials and binomial coefficients. Factorials can be calculated with: Introduction to Biostatistics (171:161) Breheny Lab #7 In Lab #7, we are going to use R and SAS to calculate factorials, binomial coefficients, and probabilities from both the binomial and the normal distributions.

More information

11.5: Normal Distributions

11.5: Normal Distributions 11.5: Normal Distributions 11.5.1 Up to now, we ve dealt with discrete random variables, variables that take on only a finite (or countably infinite we didn t do these) number of values. A continuous random

More information

Sampling Distributions For Counts and Proportions

Sampling Distributions For Counts and Proportions Sampling Distributions For Counts and Proportions IPS Chapter 5.1 2009 W. H. Freeman and Company Objectives (IPS Chapter 5.1) Sampling distributions for counts and proportions Binomial distributions for

More information

Part V - Chance Variability

Part V - Chance Variability Part V - Chance Variability Dr. Joseph Brennan Math 148, BU Dr. Joseph Brennan (Math 148, BU) Part V - Chance Variability 1 / 78 Law of Averages In Chapter 13 we discussed the Kerrich coin-tossing experiment.

More information

The Normal Distribution

The Normal Distribution 5.1 Introduction to Normal Distributions and the Standard Normal Distribution Section Learning objectives: 1. How to interpret graphs of normal probability distributions 2. How to find areas under the

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Statistical Methods in Practice STAT/MATH 3379

Statistical Methods in Practice STAT/MATH 3379 Statistical Methods in Practice STAT/MATH 3379 Dr. A. B. W. Manage Associate Professor of Mathematics & Statistics Department of Mathematics & Statistics Sam Houston State University Overview 6.1 Discrete

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter7 Probability Distributions and Statistics Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number of boys in

More information

CHAPTER 6 Random Variables

CHAPTER 6 Random Variables CHAPTER 6 Random Variables 6.1 Discrete and Continuous Random Variables The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Discrete and Continuous Random

More information

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions Random Variables Examples: Random variable a variable (typically represented by x) that takes a numerical value by chance. Number of boys in a randomly selected family with three children. Possible values:

More information

Probability and distributions

Probability and distributions 2 Probability and distributions The concepts of randomness and probability are central to statistics. It is an empirical fact that most experiments and investigations are not perfectly reproducible. The

More information

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution? Distributions 1. What are distributions? When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution? In other words, if we have a large number of

More information

Chapter 6. The Normal Probability Distributions

Chapter 6. The Normal Probability Distributions Chapter 6 The Normal Probability Distributions 1 Chapter 6 Overview Introduction 6-1 Normal Probability Distributions 6-2 The Standard Normal Distribution 6-3 Applications of the Normal Distribution 6-5

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number

More information

x is a random variable which is a numerical description of the outcome of an experiment.

x is a random variable which is a numerical description of the outcome of an experiment. Chapter 5 Discrete Probability Distributions Random Variables is a random variable which is a numerical description of the outcome of an eperiment. Discrete: If the possible values change by steps or jumps.

More information

Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7

Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7 Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7 Lew Davidson (Dr.D.) Mallard Creek High School Lewis.Davidson@cms.k12.nc.us 704-786-0470 Probability & Sampling The Practice of Statistics

More information

Chapter 5 Probability Distributions. Section 5-2 Random Variables. Random Variable Probability Distribution. Discrete and Continuous Random Variables

Chapter 5 Probability Distributions. Section 5-2 Random Variables. Random Variable Probability Distribution. Discrete and Continuous Random Variables Chapter 5 Probability Distributions Section 5-2 Random Variables 5-2 Random Variables 5-3 Binomial Probability Distributions 5-4 Mean, Variance and Standard Deviation for the Binomial Distribution Random

More information

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution Section 7.6 Application of the Normal Distribution A random variable that may take on infinitely many values is called a continuous random variable. A continuous probability distribution is defined by

More information

Chapter 7. Random Variables

Chapter 7. Random Variables Chapter 7 Random Variables Making quantifiable meaning out of categorical data Toss three coins. What does the sample space consist of? HHH, HHT, HTH, HTT, TTT, TTH, THT, THH In statistics, we are most

More information

UNIT 4 NORMAL DISTRIBUTION: DEFINITION, CHARACTERISTICS AND PROPERTIES

UNIT 4 NORMAL DISTRIBUTION: DEFINITION, CHARACTERISTICS AND PROPERTIES f UNIT 4 NORMAL DISTRIBUTION: DEFINITION, CHARACTERISTICS AND PROPERTIES Normal Distribution: Definition, Characteristics and Properties Structure 4.1 Introduction 4.2 Objectives 4.3 Definitions of Probability

More information

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Chapter 7 Sampling Distributions and Point Estimation of Parameters Chapter 7 Sampling Distributions and Point Estimation of Parameters Part 1: Sampling Distributions, the Central Limit Theorem, Point Estimation & Estimators Sections 7-1 to 7-2 1 / 25 Statistical Inferences

More information

Chapter 15: Sampling distributions

Chapter 15: Sampling distributions =true true Chapter 15: Sampling distributions Objective (1) Get "big picture" view on drawing inferences from statistical studies. (2) Understand the concept of sampling distributions & sampling variability.

More information

Expected Value of a Random Variable

Expected Value of a Random Variable Knowledge Article: Probability and Statistics Expected Value of a Random Variable Expected Value of a Discrete Random Variable You're familiar with a simple mean, or average, of a set. The mean value of

More information

We use probability distributions to represent the distribution of a discrete random variable.

We use probability distributions to represent the distribution of a discrete random variable. Now we focus on discrete random variables. We will look at these in general, including calculating the mean and standard deviation. Then we will look more in depth at binomial random variables which are

More information

Binomial Random Variable - The count X of successes in a binomial setting

Binomial Random Variable - The count X of successes in a binomial setting 6.3.1 Binomial Settings and Binomial Random Variables What do the following scenarios have in common? Toss a coin 5 times. Count the number of heads. Spin a roulette wheel 8 times. Record how many times

More information

Lecture 9. Probability Distributions. Outline. Outline

Lecture 9. Probability Distributions. Outline. Outline Outline Lecture 9 Probability Distributions 6-1 Introduction 6- Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7- Properties of the Normal Distribution

More information

Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES

Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES Random Variables CHAPTER 6.3 BINOMIAL AND GEOMETRIC RANDOM VARIABLES Essential Question How can I determine whether the conditions for using binomial random variables are met? Binomial Settings When the

More information

Random Variables. 6.1 Discrete and Continuous Random Variables. Probability Distribution. Discrete Random Variables. Chapter 6, Section 1

Random Variables. 6.1 Discrete and Continuous Random Variables. Probability Distribution. Discrete Random Variables. Chapter 6, Section 1 6.1 Discrete and Continuous Random Variables Random Variables A random variable, usually written as X, is a variable whose possible values are numerical outcomes of a random phenomenon. There are two types

More information

Lecture 9. Probability Distributions

Lecture 9. Probability Distributions Lecture 9 Probability Distributions Outline 6-1 Introduction 6-2 Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7-2 Properties of the Normal Distribution

More information

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution?

When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution? Distributions 1. What are distributions? When we look at a random variable, such as Y, one of the first things we want to know, is what is it s distribution? In other words, if we have a large number of

More information

Probability. An intro for calculus students P= Figure 1: A normal integral

Probability. An intro for calculus students P= Figure 1: A normal integral Probability An intro for calculus students.8.6.4.2 P=.87 2 3 4 Figure : A normal integral Suppose we flip a coin 2 times; what is the probability that we get more than 2 heads? Suppose we roll a six-sided

More information

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions Announcements Unit2: Probabilityanddistributions 3. Normal and binomial distributions Sta 101 - Summer 2017 Duke University, Department of Statistical Science PS: Explain your reasoning + show your work

More information

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables You are dealt a hand of 5 cards. Find the probability distribution table for the number of hearts. Graph

More information

4: Probability. Notes: Range of possible probabilities: Probabilities can be no less than 0% and no more than 100% (of course).

4: Probability. Notes: Range of possible probabilities: Probabilities can be no less than 0% and no more than 100% (of course). 4: Probability What is probability? The probability of an event is its relative frequency (proportion) in the population. An event that happens half the time (such as a head showing up on the flip of a

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

Chapter 4. The Normal Distribution

Chapter 4. The Normal Distribution Chapter 4 The Normal Distribution 1 Chapter 4 Overview Introduction 4-1 Normal Distributions 4-2 Applications of the Normal Distribution 4-3 The Central Limit Theorem 4-4 The Normal Approximation to the

More information

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS A random variable is the description of the outcome of an experiment in words. The verbal description of a random variable tells you how to find or calculate

More information

The probability of having a very tall person in our sample. We look to see how this random variable is distributed.

The probability of having a very tall person in our sample. We look to see how this random variable is distributed. Distributions We're doing things a bit differently than in the text (it's very similar to BIOL 214/312 if you've had either of those courses). 1. What are distributions? When we look at a random variable,

More information

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions

Unit2: Probabilityanddistributions. 3. Normal and binomial distributions Announcements Unit2: Probabilityanddistributions 3. Normal and binomial distributions Sta 101 - Fall 2017 Duke University, Department of Statistical Science Formatting of problem set submissions: Bad:

More information

The topics in this section are related and necessary topics for both course objectives.

The topics in this section are related and necessary topics for both course objectives. 2.5 Probability Distributions The topics in this section are related and necessary topics for both course objectives. A probability distribution indicates how the probabilities are distributed for outcomes

More information

Prof. Thistleton MAT 505 Introduction to Probability Lecture 3

Prof. Thistleton MAT 505 Introduction to Probability Lecture 3 Sections from Text and MIT Video Lecture: Sections 2.1 through 2.5 http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-041-probabilistic-systemsanalysis-and-applied-probability-fall-2010/video-lectures/lecture-1-probability-models-and-axioms/

More information

Module 4: Probability

Module 4: Probability Module 4: Probability 1 / 22 Probability concepts in statistical inference Probability is a way of quantifying uncertainty associated with random events and is the basis for statistical inference. Inference

More information

Chapter 8: Binomial and Geometric Distributions

Chapter 8: Binomial and Geometric Distributions Chapter 8: Binomial and Geometric Distributions Section 8.1 Binomial Distributions The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Section 8.1 Binomial Distribution Learning Objectives

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean.

Lecture 3. Sampling distributions. Counts, Proportions, and sample mean. Lecture 3 Sampling distributions. Counts, Proportions, and sample mean. Statistical Inference: Uses data and summary statistics (mean, variances, proportions, slopes) to draw conclusions about a population

More information

BIOL The Normal Distribution and the Central Limit Theorem

BIOL The Normal Distribution and the Central Limit Theorem BIOL 300 - The Normal Distribution and the Central Limit Theorem In the first week of the course, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao The binomial: mean and variance Recall that the number of successes out of n, denoted

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

The Normal Distribution & Descriptive Statistics. Kin 304W Week 2: Jan 15, 2012

The Normal Distribution & Descriptive Statistics. Kin 304W Week 2: Jan 15, 2012 The Normal Distribution & Descriptive Statistics Kin 304W Week 2: Jan 15, 2012 1 Questionnaire Results I received 71 completed questionnaires. Thank you! Are you nervous about scientific writing? You re

More information

Statistics for Business and Economics: Random Variables:Continuous

Statistics for Business and Economics: Random Variables:Continuous Statistics for Business and Economics: Random Variables:Continuous STT 315: Section 107 Acknowledgement: I d like to thank Dr. Ashoke Sinha for allowing me to use and edit the slides. Murray Bourne (interactive

More information

Lecture 6: Chapter 6

Lecture 6: Chapter 6 Lecture 6: Chapter 6 C C Moxley UAB Mathematics 3 October 16 6.1 Continuous Probability Distributions Last week, we discussed the binomial probability distribution, which was discrete. 6.1 Continuous Probability

More information

Chapter 6: Random Variables

Chapter 6: Random Variables Chapter 6: Random Variables Section 6.1 Discrete and Continuous Random Variables The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 6 Random Variables 6.1 Discrete and Continuous

More information

Math 160 Professor Busken Chapter 5 Worksheets

Math 160 Professor Busken Chapter 5 Worksheets Math 160 Professor Busken Chapter 5 Worksheets Name: 1. Find the expected value. Suppose you play a Pick 4 Lotto where you pay 50 to select a sequence of four digits, such as 2118. If you select the same

More information

NOTES: Chapter 4 Describing Data

NOTES: Chapter 4 Describing Data NOTES: Chapter 4 Describing Data Intro to Statistics COLYER Spring 2017 Student Name: Page 2 Section 4.1 ~ What is Average? Objective: In this section you will understand the difference between the three

More information

MATH 446/546 Homework 1:

MATH 446/546 Homework 1: MATH 446/546 Homework 1: Due September 28th, 216 Please answer the following questions. Students should type there work. 1. At time t, a company has I units of inventory in stock. Customers demand the

More information

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,

More information

Chapter 8. Variables. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc.

Chapter 8. Variables. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Random Variables Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 8.1 What is a Random Variable? Random Variable: assigns a number to each outcome of a random circumstance, or,

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

The following content is provided under a Creative Commons license. Your support

The following content is provided under a Creative Commons license. Your support MITOCW Recitation 6 The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free. To make

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Week 2: Probability and Distributions) GY Zou gzou@robarts.ca Reporting of continuous data If approximately symmetric, use mean (SD), e.g., Antibody titers ranged from

More information

The Normal Probability Distribution

The Normal Probability Distribution 1 The Normal Probability Distribution Key Definitions Probability Density Function: An equation used to compute probabilities for continuous random variables where the output value is greater than zero

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

The Normal Distribution

The Normal Distribution Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we

More information

3. Probability Distributions and Sampling

3. Probability Distributions and Sampling 3. Probability Distributions and Sampling 3.1 Introduction: the US Presidential Race Appendix 2 shows a page from the Gallup WWW site. As you probably know, Gallup is an opinion poll company. The page

More information

Discrete Probability Distributions

Discrete Probability Distributions 90 Discrete Probability Distributions Discrete Probability Distributions C H A P T E R 6 Section 6.2 4Example 2 (pg. 00) Constructing a Binomial Probability Distribution In this example, 6% of the human

More information

Binomial distribution

Binomial distribution Binomial distribution Jon Michael Gran Department of Biostatistics, UiO MF9130 Introductory course in statistics Tuesday 24.05.2010 1 / 28 Overview Binomial distribution (Aalen chapter 4, Kirkwood and

More information

Topic 6 - Continuous Distributions I. Discrete RVs. Probability Density. Continuous RVs. Background Reading. Recall the discrete distributions

Topic 6 - Continuous Distributions I. Discrete RVs. Probability Density. Continuous RVs. Background Reading. Recall the discrete distributions Topic 6 - Continuous Distributions I Discrete RVs Recall the discrete distributions STAT 511 Professor Bruce Craig Binomial - X= number of successes (x =, 1,...,n) Geometric - X= number of trials (x =,...)

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Statistics 431 Spring 2007 P. Shaman. Preliminaries Statistics 4 Spring 007 P. Shaman The Binomial Distribution Preliminaries A binomial experiment is defined by the following conditions: A sequence of n trials is conducted, with each trial having two possible

More information

STAT 201 Chapter 6. Distribution

STAT 201 Chapter 6. Distribution STAT 201 Chapter 6 Distribution 1 Random Variable We know variable Random Variable: a numerical measurement of the outcome of a random phenomena Capital letter refer to the random variable Lower case letters

More information

The Binomial and Geometric Distributions. Chapter 8

The Binomial and Geometric Distributions. Chapter 8 The Binomial and Geometric Distributions Chapter 8 8.1 The Binomial Distribution A binomial experiment is statistical experiment that has the following properties: The experiment consists of n repeated

More information

Descriptive Statistics (Devore Chapter One)

Descriptive Statistics (Devore Chapter One) Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

2 General Notions 2.1 DATA Types of Data. Source: Frerichs, R.R. Rapid Surveys (unpublished), NOT FOR COMMERCIAL DISTRIBUTION

2 General Notions 2.1 DATA Types of Data. Source: Frerichs, R.R. Rapid Surveys (unpublished), NOT FOR COMMERCIAL DISTRIBUTION Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 2 General Notions 2.1 DATA What do you want to know? The answer when doing surveys begins first with the question,

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION In Inferential Statistic, ESTIMATION (i) (ii) is called the True Population Mean and is called the True Population Proportion. You must also remember that are not the only population parameters. There

More information

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative STAT:10 Statistical Methods and Computing Normal Distributions Lecture 4 Feb. 6, 17 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowa.edu 1 2 Using density curves to describe the distribution of values of

More information

Section The Sampling Distribution of a Sample Mean

Section The Sampling Distribution of a Sample Mean Section 5.2 - The Sampling Distribution of a Sample Mean Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin The Sampling Distribution of a Sample Mean Example: Quality control check of light

More information

Statistics and Probability

Statistics and Probability Statistics and Probability Continuous RVs (Normal); Confidence Intervals Outline Continuous random variables Normal distribution CLT Point estimation Confidence intervals http://www.isrec.isb-sib.ch/~darlene/geneve/

More information

A random variable is a (typically represented by ) that has a. value, determined by, A probability distribution is a that gives the

A random variable is a (typically represented by ) that has a. value, determined by, A probability distribution is a that gives the 5.2 RANDOM VARIABLES A random variable is a (typically represented by ) that has a value, determined by, for each of a. A probability distribution is a that gives the for each value of the. It is often

More information

CHAPTER 6 Random Variables

CHAPTER 6 Random Variables CHAPTER 6 Random Variables 6.3 Binomial and Geometric Random Variables The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Binomial and Geometric Random

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at

Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at Solutions for practice questions: Chapter 15, Probability Distributions If you find any errors, please let me know at mailto:msfrisbie@pfrisbie.com. 1. Let X represent the savings of a resident; X ~ N(3000,

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution January 31, 2019 Contents The Binomial Distribution The Normal Approximation to the Binomial The Binomial Hypothesis Test Computing Binomial Probabilities in R 30 Problems The

More information

CHAPTER 5 Sampling Distributions

CHAPTER 5 Sampling Distributions CHAPTER 5 Sampling Distributions 5.1 The possible values of p^ are 0, 1/3, 2/3, and 1. These correspond to getting 0 persons with lung cancer, 1 with lung cancer, 2 with lung cancer, and all 3 with lung

More information

Chapter Seven. The Normal Distribution

Chapter Seven. The Normal Distribution Chapter Seven The Normal Distribution 7-1 Introduction Many continuous variables have distributions that are bellshaped and are called approximately normally distributed variables, such as the heights

More information

Lecture Data Science

Lecture Data Science Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Statistics Foundations JProf. Dr. Claudia Wagner Learning Goals How to describe sample data? What is mode/median/mean?

More information

A.REPRESENTATION OF DATA

A.REPRESENTATION OF DATA A.REPRESENTATION OF DATA (a) GRAPHS : PART I Q: Why do we need a graph paper? Ans: You need graph paper to draw: (i) Histogram (ii) Cumulative Frequency Curve (iii) Frequency Polygon (iv) Box-and-Whisker

More information

Chapter 8 Estimation

Chapter 8 Estimation Chapter 8 Estimation There are two important forms of statistical inference: estimation (Confidence Intervals) Hypothesis Testing Statistical Inference drawing conclusions about populations based on samples

More information

Section Introduction to Normal Distributions

Section Introduction to Normal Distributions Section 6.1-6.2 Introduction to Normal Distributions 2012 Pearson Education, Inc. All rights reserved. 1 of 105 Section 6.1-6.2 Objectives Interpret graphs of normal probability distributions Find areas

More information

Sampling Distributions

Sampling Distributions Section 8.1 119 Sampling Distributions Section 8.1 C H A P T E R 8 4Example 2 (pg. 378) Sampling Distribution of the Sample Mean The heights of 3-year-old girls are normally distributed with μ=38.72 and

More information