Shifting our focus. We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?

Similar documents
Chapter 3 - Lecture 5 The Binomial Probability Distribution

Part 1 In which we meet the law of averages. The Law of Averages. The Expected Value & The Standard Error. Where Are We Going?

Statistical Methods in Practice STAT/MATH 3379

Discrete Random Variables

Math 14 Lecture Notes Ch. 4.3

Chapter 3 - Lecture 3 Expected Values of Discrete Random Va

TOPIC: PROBABILITY DISTRIBUTIONS

The normal distribution is a theoretical model derived mathematically and not empirically.

Binomal and Geometric Distributions

MATH 264 Problem Homework I

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Probability: Week 4. Kwonsang Lee. University of Pennsylvania February 13, 2015

Discrete Probability Distribution

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

ECON 214 Elements of Statistics for Economists 2016/2017

Simple Random Sample

Lecture 9: Plinko Probabilities, Part III Random Variables, Expected Values and Variances

Chapter 5. Statistical inference for Parametric Models

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

Chapter 3 Discrete Random Variables and Probability Distributions

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Let X be the number that comes up on the next roll of the die.

Have you ever wondered whether it would be worth it to buy a lottery ticket every week, or pondered on questions such as If I were offered a choice

STA 6166 Fall 2007 Web-based Course. Notes 10: Probability Models

Expected Value of a Random Variable

Introduction to Statistics I

Mathematics of Randomness

MANAGEMENT PRINCIPLES AND STATISTICS (252 BE)

5.2 Random Variables, Probability Histograms and Probability Distributions

Statistics, Their Distributions, and the Central Limit Theorem

Random variables. Discrete random variables. Continuous random variables.

Section Distributions of Random Variables

MA 1125 Lecture 14 - Expected Values. Wednesday, October 4, Objectives: Introduce expected values.

Lecture 9. Probability Distributions. Outline. Outline

CHAPTER 6 Random Variables

VIDEO 1. A random variable is a quantity whose value depends on chance, for example, the outcome when a die is rolled.

Lecture 9. Probability Distributions

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

Review of the Topics for Midterm I

Binomial and Geometric Distributions

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

Discrete Random Variables

Section Distributions of Random Variables

Discrete Random Variables

Theoretical Foundations

Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes

Section Random Variables and Histograms

MA : Introductory Probability

Example 1: Identify the following random variables as discrete or continuous: a) Weight of a package. b) Number of students in a first-grade classroom

Lecture 7 Random Variables

Chapter 7. Random Variables

Class 13. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

STA Module 3B Discrete Random Variables

Midterm Exam III Review

Statistics 511 Additional Materials

***SECTION 8.1*** The Binomial Distributions

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 4: May 2, Abstract

2011 Pearson Education, Inc

Chapter 9. Sampling Distributions. A sampling distribution is created by, as the name suggests, sampling.

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Probability Distributions

Chapter 5: Statistical Inference (in General)

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

Business Statistics 41000: Probability 4

Confidence Intervals Introduction

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Simulation Lecture Notes and the Gentle Lentil Case

Learning Goals: * Determining the expected value from a probability distribution. * Applying the expected value formula to solve problems.

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

Chapter 7: Point Estimation and Sampling Distributions

Chapter 5 Discrete Probability Distributions. Random Variables Discrete Probability Distributions Expected Value and Variance

Discrete Probability Distributions

STT315 Chapter 4 Random Variables & Probability Distributions AM KM

HHH HHT HTH THH HTT THT TTH TTT

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

30 Wyner Statistics Fall 2013

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Discrete probability distributions

Binomial Random Variables. Binomial Random Variables

MAS187/AEF258. University of Newcastle upon Tyne

4.2 Bernoulli Trials and Binomial Distributions

ECON 214 Elements of Statistics for Economists 2016/2017

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

STOR Lecture 7. Random Variables - I

The Binomial Probability Distribution

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Elementary Statistics Lecture 5

The topics in this section are related and necessary topics for both course objectives.

CD Appendix F Hypergeometric Distribution

Chapter 17. The. Value Example. The Standard Error. Example The Short Cut. Classifying and Counting. Chapter 17. The.

Lecture 18 Section Mon, Feb 16, 2009

Sec$on 6.1: Discrete and Con.nuous Random Variables. Tuesday, November 14 th, 2017

Lecture 18 Section Mon, Sep 29, 2008

Linear Regression with One Regressor

STOR 155 Introductory Statistics (Chap 5) Lecture 14: Sampling Distributions for Counts and Proportions

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics

Distribution of the Sample Mean

Transcription:

Probability

Introduction

Shifting our focus We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?

What is Probability? Probability is used to quantify uncertainty about the future. Probability is a way to model randomness

Why Study Probability in a Statistics Course? Probability is the language with which we express and interpret assessment of uncertainty in a formal statistical analysis. We take a random sample from a population. Board: Statistical Cartoon (random sample).

3.2: Random Sampling

Random Sampling Most of the formal methods of statistical inference we will use in this class are based on the assumption that the individual units in the sample are sampled at random from the population of interest. (Ignore for the present that in practice, individuals are almost never sampled at random, in a very formal sense, from the population of interest.)

Simple Random Sampling Taking a simple random sample of size n is equivalent to the process of: 1 representing every individual from a population with a single ticket; 2 putting the tickets into large box; 3 mixing the tickets thoroughly; 4 drawing out n tickets without replacement.

Simple Random Sampling The defining characteristic of the process of simple random sampling is that every possible sample of size n has the same chance of being selected. In particular, this means that (a) every individual has the same chance of being included in the sample; and that (b) members of the sample are chosen independently of each other.

Other Random Sampling Strategies Stratified random sampling and cluster sampling are examples of random sampling processes that are not simple. Data analysis for these types of sampling strategies go beyond the scope of this course.

Inference from Samples to Populations Statistical inference involves making statements about populations on the basis of analysis of sampled data. The Simple random sampling model is useful because it allows precise mathematical description of the random distribution of the discrepancy between statistical estimates and population parameters. This is known as chance error due to random sampling. When using the random sampling model, it is important to ask what is the population to which the results will be generalized? The use statistical methods that assume random sampling on data that is not collected as a random sample is prone to sampling bias, in which individuals do not have the same chance of being sampled. Sampling bias can lead to incorrect statistical inferences because the sample is unrepresentative of the population in important ways.

Examples For you You should read the following Examples (p.75-76): Example 3.2 - Example 3.7 Further examples of sampling procedures and sampling bias.

3.3: Introduction to Probability

Outcome Space Definition The outcome space is the set of possible simple outcomes from a random experiment (often denoted by Ω). Example: In a single die roll, the set of possible outcomes is: Ω = {1, 2, 3, 4, 5, 6}

Events Definition An event is a set of possible outcomes. Example: In a single die roll, possible events include: A = the die roll is even B = the die roll is a 6 C = the die roll is 4 or less

Probability Definition The probability of an event E, denoted P {E}, is a numerical measure between 0 and 1 that represents the likelihood of the event E in some probability model. Probabilities assigned to events must follow a number of rules.

Probability Definition The probability of an event E, denoted P {E}, is a numerical measure between 0 and 1 that represents the likelihood of the event E in some probability model. Probabilities assigned to events must follow a number of rules. Note: 1 Probability is a numerical measure of the likelihood of an event. 2 Probabilities are always between 0 and 1, inclusive. 3 Notation: The probability of an event E is written P {E}

Probability Definition The probability of an event E, denoted P {E}, is a numerical measure between 0 and 1 that represents the likelihood of the event E in some probability model. Probabilities assigned to events must follow a number of rules. Note: 1 Probability is a numerical measure of the likelihood of an event. 2 Probabilities are always between 0 and 1, inclusive. 3 Notation: The probability of an event E is written P {E} Example: The probability P {the die roll is a 6} equals 1/6 under a probability model that gives equal probability to each possible result, but could be different under a different model.

Examples 1 If a fair coin is tossed, the probability of a head is P {Heads} = 0.5 2 If bucket contains 34 white balls and 66 red balls and a ball is drawn at random, the probability that the drawn ball is white is P {white} = 34/100 = 0.34

Probability Rules Non-negativity: For any event E, 0 P {E} 1. Outcome space: The probability of the event of all possible outcomes is 1. Complements: P {E c } = 1 P {E}.

3.7: Random Variables

Introduction to Random Variables Definition A random variable is a variable that depends on an outcome of a chance (random) experiment.

Examples Variable versus a Random Variable Examples: 1 Bret s height

Examples Variable versus a Random Variable Examples: 1 Bret s height not random

Examples Variable versus a Random Variable Examples: 1 Bret s height not random 2 Select a student in class at random and record the student s height

Examples Variable versus a Random Variable Examples: 1 Bret s height not random 2 Select a student in class at random and record the student s height random

Examples Variable versus a Random Variable Examples: 1 Bret s height not random 2 Select a student in class at random and record the student s height random 3 The total height of everyone in 331 SMI (right now)

Examples Variable versus a Random Variable Examples: 1 Bret s height not random 2 Select a student in class at random and record the student s height random 3 The total height of everyone in 331 SMI (right now) not random

Examples Variable versus a Random Variable Examples: 1 Bret s height not random 2 Select a student in class at random and record the student s height random 3 The total height of everyone in 331 SMI (right now) not random 4 Role a die and then record the number on its face

Examples Variable versus a Random Variable Examples: 1 Bret s height not random 2 Select a student in class at random and record the student s height random 3 The total height of everyone in 331 SMI (right now) not random 4 Role a die and then record the number on its face random

Random Variables A random variable is a rule that attaches a numerical value to a chance outcome. Associated with each possible value of the random variable is a probability, a number between 0 and 1 that represents the long-run relative frequency of observing the given value. The sum of the probabilities for all possible values is one.

Probability Distributions A probability distribution gives a menu for the random variable. It lists the possible values, and the probability of each value. To describe the distribution, it is sufficient to provide a list of all possible values and the probability associated with each value. The sum of these probabilities is one. Frequently (as with the binomial distribution), there is a formula that specifies the probability for each possible value.

Example The probability distribution for a Random variable X. Here is the menu: lists the possible values, and the probability of each value k 0 1 5 10 P {X = k} 0.1 0.5 0.1 0.3

Example Write down the probability distribution for the random variable which describes the following chance outcome. I roll a die and then record the number on its face.

The Mean (Expected Value) The mean or expected value of a random variable X is written as E(X). For discrete random variables, E(X) = x i IP{X = x i }, where the x i s are the values that the variable takes on and the sum is taken over all possible values. Notation: µ X = E(X) Note that the expected value of a random variable is a weighted average of the possible values of the random variable, weighted by the probabilities.

The Variance The variance of a random variable X is written as Var(X). For discrete random variables, Var(X) = E ( (X µ X ) 2) = (x i µ X ) 2 IP{X = x i }, where the x i s are the values that the variable takes on and the sum is taken over all possible values. Notation: σ 2 X = Var(X) The variance is a weighted average of the squared deviations between the possible values of the random variable and its mean. If a random variable has units, the units of the variance are those units squared, which is hard to interpret.

Alternate Variance Formula Var(X) = E(X 2 ) (E(X)) 2 This formula is often easier to compute by hand. It suggests three steps 1 Compute E(X 2 ) 2 Compute E(X) (then square this answer) 3 Substract the two quantities.

Standard Deviation We also define the standard deviation to be the square root of the variance, so it has the same units as the random variable. A notation is SD(X) = Var(X). Also, σ X = SD(X).

Standard Problem Consider a random variable X defined by the following distribution k 0 1 5 10 P {X = k} 0.1 0.5 0.1 0.3 1 Compute P(X 5) 2 Compute P(.5 X 6) 3 Compute E(X), Var(X), and SD(X)