Statistics 511 Additional Materials

Similar documents
Discrete Random Variables

Statistical Methods in Practice STAT/MATH 3379

Discrete Random Variables

Discrete Random Variables

Stat511 Additional Materials

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Discrete Probability Distribution

Expectations. Definition Let X be a discrete rv with set of possible values D and pmf p(x). The expected value or mean value of X, denoted by E(X ) or

STAT 111 Recitation 3

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

Probability Distributions

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

Chapter 3 - Lecture 3 Expected Values of Discrete Random Va

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

Section M Discrete Probability Distribution

Discrete Random Variables

Shifting our focus. We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?

A useful modeling tricks.

11.5: Normal Distributions

The Normal Probability Distribution

Chapter 3 - Lecture 5 The Binomial Probability Distribution

Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

Confidence Intervals and Sample Size

Statistics 6 th Edition

Statistics, Their Distributions, and the Central Limit Theorem

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

GETTING STARTED. To OPEN MINITAB: Click Start>Programs>Minitab14>Minitab14 or Click Minitab 14 on your Desktop

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

MATH 264 Problem Homework I

Learning Goals: * Determining the expected value from a probability distribution. * Applying the expected value formula to solve problems.

Uniform Probability Distribution. Continuous Random Variables &

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Section Random Variables and Histograms

Business Statistics 41000: Probability 4

2. Modeling Uncertainty

Lecture 9. Probability Distributions. Outline. Outline

The topics in this section are related and necessary topics for both course objectives.

Lecture 9. Probability Distributions

STA Module 3B Discrete Random Variables

Prob and Stats, Nov 7

Honors Statistics. Daily Agenda

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CS 237: Probability in Computing

Section Distributions of Random Variables

5.2 Random Variables, Probability Histograms and Probability Distributions

Expected Value of a Random Variable

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics

Section Introduction to Normal Distributions

Overview. Definitions. Definitions. Graphs. Chapter 5 Probability Distributions. probability distributions

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

4: Probability. What is probability? Random variables (RVs)

1 Describing Distributions with numbers

Introduction to Business Statistics QM 120 Chapter 6

Probability Distributions

Chapter 7. Sampling Distributions and the Central Limit Theorem

30 Wyner Statistics Fall 2013

continuous rv Note for a legitimate pdf, we have f (x) 0 and f (x)dx = 1. For a continuous rv, P(X = c) = c f (x)dx = 0, hence

7 THE CENTRAL LIMIT THEOREM

Chapter 9: Sampling Distributions

The normal distribution is a theoretical model derived mathematically and not empirically.

Chapter 7. Sampling Distributions and the Central Limit Theorem

Module 4: Probability

The Uniform Distribution

4 Random Variables and Distributions

variance risk Alice & Bob are gambling (again). X = Alice s gain per flip: E[X] = Time passes... Alice (yawning) says let s raise the stakes

Statistics 511 Supplemental Materials

Lecture 18 Section Mon, Feb 16, 2009

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Statistics vs. statistics

Chapter 4 Discrete Random variables

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x)

Section 8.1 Distributions of Random Variables

2 DESCRIPTIVE STATISTICS

Lecture 18 Section Mon, Sep 29, 2008

Statistics, Measures of Central Tendency I

When the observations of a quantitative random variable can take on only a finite number of values, or a countable number of values.

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

We use probability distributions to represent the distribution of a discrete random variable.

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

Chapter 4 Probability Distributions

Probability: Week 4. Kwonsang Lee. University of Pennsylvania February 13, 2015

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

The Central Limit Theorem for Sample Means (Averages)

AMS7: WEEK 4. CLASS 3

Chapter 16. Random Variables. Copyright 2010, 2007, 2004 Pearson Education, Inc.

8.1 Estimation of the Mean and Proportion

Theoretical Foundations

Chapter Seven: Confidence Intervals and Sample Size

STAT 111 Recitation 2

Chapter 9. Sampling Distributions. A sampling distribution is created by, as the name suggests, sampling.

Counting Basics. Venn diagrams

Part V - Chance Variability

Transcription:

Discrete Random Variables In this section, we introduce the concept of a random variable or RV. A random variable is a model to help us describe the state of the world around us. Roughly, a RV can be thought of as the value that is assigned to the outcome of an experiment. There are different RVs depending upon the type of quantity that we are trying to model. In this section we will introduce some random variables for experiments whose outcome is a whole number. Sample and Population notation Before we begin with random variables, it is important that we specify some new notation that differentiates the data analysis that we were doing earlier and what we will do throughout the rest of the course. We are going to differentiate between the population mean and the sample mean. The population mean is the mean or average of all the elements of the population. The sample mean is the average for a subset of that population. Throughout the rest of the course we will be explicit. Thus we need notation to demarcate between these two ideas. For a variable X, the population mean will be denoted by the Greek symbol mu with a subscript of x, µ x. For the sample mean, we have already seen that we will use the symbol x with a bar over it or x. We will do the same for the standard deviation. For a variable X, the population standard deviation will be the Greek letter sigma with a subscript of x, σ x. The sample standard deviation will be denoted by the letter s with a subscript of x, s x. Quantity Sample Statistic Population Parameter Mean x estimates µ x Standard Deviation s x estimates σ x Example Suppose that the population is made up of 25 students in a Stat 215 lab and the variable that we are interested in is the number of siblings for each student, call it Y. y:4,3,1,1,5,2,2,4,0,2,1,2,3,0,2,4,4,6,0,0,1,3,3,1,3 The population mean µ y is 2.28 and the population standard deviation is σ y = 1.646. If we take a sample from this population of 5 individuals, we might get the observed data 1,2,2,3,0. The mean of this sample is y =1.6 and the sample standard deviation is s y =1.14. If another samples is drawn then we might get the observed data 4, 1, 0, 2, 4. The mean of this sample is y =2.1 and the sample standard deviation is s y = 1.79 Page 1 of 6

Random Variables Definition: A random variable or RV is a variable whose value is determined by the outcome of an experiment. Examples: W = the number of tomatoes that a plant will produce Y = the inflation rate for the U.S. in the fourth quarter of this year Z = the number of people who vote in the next presidential election T = number of tablets in a bottle of aspirin K = number of potatoes in a 5 pound bag of potatoes B = time between phone calls for a computer helpline There are two types of random variables discrete and continuous. Definition: A random variable is called discrete if it can only take a countable number of values. In the list of RVs above, W, Z, T, and K are all discrete random variables. Definition: A random variable is called continuous if it can take values inside an interval. In the list of RVs above, Y, and B are continuous random variables. Definition: A probability distribution (sometimes simply called a distribution) is the mechanism that assigns probabilities to each value that the random variable can assume. Discrete Random Variables In this section we will focus on discrete random variables. Notation: For a discrete RV X, we will use as shorthand P(X=x) to represent the probability that the random variable X takes the value x. Often we will make this even shorter and simply write P(x). For a six-sided die with the values {1, 2, 3, 4, 5, 6} on its faces, let X be the value that appears on the upward face when the die is rolled. Then P(2) or P(X=2) represents the probability that X will be 2. Rules There are two rules for all discrete probability distributions: Page 2 of 6

1. 0 P(X=x) 1 for each value of x. 2. Σ P(X=x) = 1 These rules imply that each outcome has a probability between 0 and 1 and that if we add up the probabilities of all the possible outcomes their sum must be 1. We can use several methods for describing the distribution of a random variable X 1. Table For the table each possible value that the RV may take is listed and beside it is listed the probability associated with that value. y 2 3 4 P(Y=y) 0.3 0.2 0.5 From this table we can see that the probability that the RV Y takes the value 3 is 0.2 or P(Y=3) =0.2. We can also note that P(Y=2) = 0.3. 2. Line graph The second possible way to represent a distribution is through a line graph. For a line graph each possible value that the RV can take is listed on the x-axis. Vertical lines are drawn up from each possible value so that the height of those lines corresponds to the probability of that value. For the RV W w 2 3 4 5 P(W=w) 0.4 0.1 0.2 0.3 The line graph for this distribution of W looks like the following: P(W=w) 0.30 0.10 1 2 3 4 5 w Page 3 of 6

3. Formula Lastly we can use a formula to describe the distribution of a random variable. Under this method, if we know certain quantities, usually called parameters, then we can calculate the probability of a particular value for the RV. P(X=x) = v(1-v) x-1 for x = 1, 2, 3,... If we know what v is we can then calculate the probabilities for each value of x. In this example the parameter we would need to know would be v. Calculating combined probabilities We are often interested in calculating probabilities other than those of the simple events of the two preceding sections. There we focused on calculating the probability of a single value. There are two possibilities. First for something like P(X<3) we could add up probabilities of the simple events that make up this probability. P(X<3) = P(X=0)+P(X=1)+P(X=2). Each one of these would have to be calculated using the formula that was appropriate. It gets worse for something like P(X 15) = P(X=0) + P(X=1) + P(X=2) + + P(X=15). Theoretical Mean and Standard Deviation of a Discrete Probability Distribution One of the important properties of a probability distribution is: If we know that a discrete RV possess a particular probability distribution, we no longer need to estimate the theoretical mean µ x by calculating the sample mean x. We can calculate the exact value of µ x using the information in the probability distribution of X. Similarly, we no longer need to estimate the theoretical standard deviation σ x by calculating the sample standard deviation s x. We can calculate the exact value of σ x using the information in the probability distribution of X. Page 4 of 6

Formulas Theoretical Mean: µ x = Σx * p(x) Theoretical Standard Deviation: σ x = Σ(x µ) 2 p(x) or σ x = [Σx 2 p(x)] u 2 Suppose that RV X has the probability distribution X 1 2 3 4 P(X) 0.1 0.4 0.2 0.3 The theoretical mean is µ x = Σx * p(x) = (1*0.1) + (2*0.4) + (3*0.2) +*(4 *0.3) = 0.1+ 0.8 + 0.6 +1.2 = 2.7 The theoretical variance is σ x 2 = Σ(x µ) 2 * p(x) = (1 2.7) 2 *0.1+ (2 2.7) 2 *0.4 + (3 2.7) 2 *0.2 + (4 2.7) 2 *0.3 = 0.289 + 0.196 + 0.018 + 0.507 =1.01 The theoretical standard deviation is σ x = σ x 2 Page 5 of 6

= 1.01 =1.00498756 Note that we could have computed the theoretical variance as σ 2 x = [Σx 2 p(x)] u 2 = [1 2 *0.1+ 2 2 *0.4 + 3 2 * 0.2 + 4 2 * 0.3] 2.7 2 = [0.1+1.6 +1.8 + 4.8] 7.29 = 8.3 7.29 =1.01 Page 6 of 6