Counting Basics. Venn diagrams

Similar documents
Week 7. Texas A& M University. Department of Mathematics Texas A& M University, College Station Section 3.2, 3.3 and 3.4

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

The normal distribution is a theoretical model derived mathematically and not empirically.

Statistics, Measures of Central Tendency I

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Theoretical Foundations

MAKING SENSE OF DATA Essentials series

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

CH 5 Normal Probability Distributions Properties of the Normal Distribution

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

2011 Pearson Education, Inc

The Normal Probability Distribution

Lecture 8. The Binomial Distribution. Binomial Distribution. Binomial Distribution. Probability Distributions: Normal and Binomial

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Probability Distribution Unit Review

Binomial Distribution. Normal Approximation to the Binomial

Statistics for Business and Economics

Basic Procedure for Histograms

ECON 214 Elements of Statistics for Economists 2016/2017

Math 14 Lecture Notes Ch The Normal Approximation to the Binomial Distribution. P (X ) = nc X p X q n X =

Lecture 9. Probability Distributions. Outline. Outline

Consider the following examples: ex: let X = tossing a coin three times and counting the number of heads

Lecture 9. Probability Distributions

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

Example. Chapter 8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables

Chapter 7 1. Random Variables

Chapter 8 Homework Solutions Compiled by Joe Kahlig. speed(x) freq 25 x < x < x < x < x < x < 55 5

Chapter 4 Probability Distributions

Probability & Sampling The Practice of Statistics 4e Mostly Chpts 5 7

The graph of a normal curve is symmetric with respect to the line x = µ, and has points of

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

Chapter 6. The Normal Probability Distributions

2017 Fall QMS102 Tip Sheet 2

Section Distributions of Random Variables

Binomial Random Variables. Binomial Random Variables

Review. What is the probability of throwing two 6s in a row with a fair die? a) b) c) d) 0.333

Random Variables and Probability Functions

11.5: Normal Distributions

The Binomial Probability Distribution

Section Distributions of Random Variables

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

Section Introduction to Normal Distributions

Midterm Exam III Review

Discrete Random Variables

Discrete Random Variables

Chapter 5 Discrete Probability Distributions. Random Variables Discrete Probability Distributions Expected Value and Variance

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

Statistical Methods in Practice STAT/MATH 3379

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

The topics in this section are related and necessary topics for both course objectives.

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

MidTerm 1) Find the following (round off to one decimal place):

ECON 214 Elements of Statistics for Economists

MANAGEMENT PRINCIPLES AND STATISTICS (252 BE)

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

5.2 Random Variables, Probability Histograms and Probability Distributions

Module 4: Probability

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Chapter 4. Section 4.1 Objectives. Random Variables. Random Variables. Chapter 4: Probability Distributions

UNIT 4 MATHEMATICAL METHODS

Chapter 4. The Normal Distribution

Review of the Topics for Midterm I

Chapter 4 Random Variables & Probability. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Probability is the tool used for anticipating what the distribution of data should look like under a given model.

Chapter 14 - Random Variables

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

AP Statistics Chapter 6 - Random Variables

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

Part V - Chance Variability

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Section Random Variables and Histograms

IOP 201-Q (Industrial Psychological Research) Tutorial 5

Discrete Probability Distribution

8.1 Binomial Distributions

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

Lecture 6: Chapter 6

The Normal Distribution

MATH 118 Class Notes For Chapter 5 By: Maan Omran

Statistics 6 th Edition

Math 227 Elementary Statistics. Bluman 5 th edition

Part 10: The Binomial Distribution

9 Expectation and Variance

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

4.2 Bernoulli Trials and Binomial Distributions

Probability Theory. Mohamed I. Riffi. Islamic University of Gaza

Data Analysis and Statistical Methods Statistics 651

Chapter 5: Probability

Chapter. Section 4.2. Chapter 4. Larson/Farber 5 th ed 1. Chapter Outline. Discrete Probability Distributions. Section 4.

Chapter ! Bell Shaped

Unit 04 Review. Probability Rules

STAT Chapter 5: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

MATH 104 CHAPTER 5 page 1 NORMAL DISTRIBUTION

Statistics vs. statistics

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

Distributions in Excel

CIVL Discrete Distributions

Transcription:

Counting Basics Sets Ways of specifying sets Union and intersection Universal set and complements Empty set and disjoint sets Venn diagrams Counting Inclusion-exclusion Multiplication principle Addition principle Tree diagrams

Some special cases Inclusion-Exclusion Principle can be used to count the number of elements in the complement of a set, and the number of elements in a disjoint union. A special case of the Multiplication principle leads to Permutations which count certain ordered n-tuples of elements. This then leads to formulas for counting ordered tuples where some of the elements are the same. Another application of the Multiplication principle leads to Combinations which count certain unordered n-tuples of elements.

Counting Slogans P(n, m) counts the number of ordered subsets of size m selected from a set of n elements. C(n, m) counts the number of unordered subsets of size m selected from a set of n elements.

Ordered and unordered partitions A set with n elements can be partitioned into k subsets of r 1, r 2,..., r k elements (where r 1 + r 2 + + r k = n ) and where the subsets are distinguished from one another in the following number of ways: ( ) n = r 1, r 2,..., r k n! r 1!r 2!... r k! A set of n elements can be partitioned into k unordered subsets of r elements each (kr = n) in the following number of ways: ( 1 n ) k! r, r,..., r = n! k! r! r! r! = n! k!(r!) k

Basic Probability Suppose we have a set U of all possible ways an experiment could work out, and that S is a subset of U consisting of the outcomes whose occurrence. we are interested in. Then the probability that any particular experiment yields an outcome in S is P(S) = n(s) n(u). Sometimes probabilities are empirical and sometimes they can be calculated. Sometimes some probabilities are given to you and you want to calculate others. A probability distribution assigns to each element in a finite set a number between 0 and 1 such that the sum of all the numbers is 1. Frequency: number of times something occurs; relative frequency: proportion of time it occurs

Probability formulae Most of counting theory applies to enable you to compute probabilities. E.G. P(E) + P(E ) = 1 or P(E ) = 1 P(E) Let E and F be events in a sample space S, then P(E F ) = P(E) + P(F ) P(E F ) E and F mutually exclusive implies that P(E F ) = P(E) + P(F ) Most of these rules can be visualized with Venn diagrams.

Conditional Probability and Independence P ( F E ) = n(h E) n(e) = P(F E) P(E) P(E)P ( F E ) = P(E F ) Two events F and E are said to be independent if P ( F E ) = P(F ) For independent events, E and F P(E F ) = P(E)P(F ) P(E F ) = P(E) + P(F ) P(E) P(F )

Conditional Probability and Bayes Tree Diagrams: Multiply probabilities along a path in a tree and add probabilities for different paths. The probabilities at every node must add up to be 1. Let E 1 and E 2 be mutually exclusive events (E 1 E 2 = ) whose union is the sample space, i.e. E 1 E 2 = S. Let F be an event in S for which P(F ) 0. Then Bayes Theorem: P ( E 1 F ) = P(E 1 F ) P(F ) = P(E 1 F ) P(E 1 F ) + P(E 2 F ) = P(E 1 )P ( F E1 ) P(E 1 )P ( F E1 ) + P(E2 )P ( F E2 ).

Conditional Probability There is a version of Bayes Theorem for more than two mutually exclusive events. Problems along these lines are best dealt with not with the general formula, but with a tree diagram, which always gives the same answer.

Charts, histograms... This is all about: Organizing data into meaningful groups and computing frequencies. Drawing histograms and graphs adhering to the equal area principle. Extracting frequencies from histograms.

Mean, median and mode The population mean of m numbers x 1, x 2,..., x m (the data for every member of a population of size m) is denoted by µ and is computed as follows: µ = x 1 + x 2 + + x m m The sample mean of the numbers x 1, x 2,..., x n (data for a sample of size n from the population) is denoted by x and is computed similarly: x = x 1 + x 2 + + x n n The sample mean is just the population mean of the sample space.

Mean, median and mode The population median is the middle number if you order the data by value. If the number of elements in the data set is even then you need to average two numbers. The sample median is exactly the same, just for a sample instead of for the whole population. The population mode is number which appears most often. There may be many numbers which are the mode. The sample mode is exactly the same, just for a sample instead of for the whole population.

In either case the standard deviation is the square root of the variance. Variance and standard deviation For a set of data {x 1, x 2,... x n } for a population of size n, we define the population variance, denoted by σ 2, to be the average squared distance from the mean, µ: σ 2 = (x 1 µ) 2 + (x 2 µ) 2 + + (x n µ) 2 n For a sample {x 1, x 2,... x n } from a larger population, with sample mean x, we define the sample variance, denoted by s 2, by s 2 = (x 1 x) 2 + (x 2 x) 2 + + (x n x) 2 n 1

Variance and standard deviation If a population data set (with n data points) has the values c 1, c 2,..., c m occurring with frequencies f 1, f 2,..., f m (so c 1 occurs f 1 times, etc.), then σ 2 = ( c1 µ ) 2 f1 + ( c 2 µ ) 2 f2 + + ( c m µ ) 2 fm n with a similar formula for s 2 (if the data is a sample).

is a Random Variable Random Variables We perform an experiment, and to each outcome we associate a numerical value (a subject s weight; the time it took until something happened, the number of times we were successful in some number of attempts). For each possible numerical value that could come up, we can ask what is the probability that value comes up? The resulting table of values and probabilities: Outcomes Probability X P(X) x 1 p 1 x 2 p 2. x n. p n

Expected Value of a Random Variable The expected value of a random variable is a measure of the average value the random variable takes, averaged over many repetitions. If X is a random variable with possible values x 1, x 2,..., x n and corresponding probabilities p 1, p 2,..., p n, then the expected value of X, denoted by E(X), is E(X) = x 1 p 1 + x 2 p 2 + + x n p n. Outcomes Probability Out. Prob. X P(X) X P(X) x 1 p 1 x 1 p 1 x 2 p 2 x 2 p 2... x n p n x n p n Sum = E(X)

Variance of a random variable If X is a random variable with values x 1, x 2,..., x n, corresponding probabilities p 1, p 2,..., p n, and expected value µ = E(X), then Variance = σ 2 (X) = p 1 (x 1 µ) 2 + p 2 (x 2 µ) 2 + + p n (x n µ) 2 and Standard Deviation = σ(x) = Variance.

Calculating via a table Variance = σ 2 (X) = p 1 (x 1 µ) 2 + p 2 (x 2 µ) 2 + + p n (x n µ) 2 Standard Deviation = σ(x) = Variance. x i p i x i p i (x i µ) (x i µ) 2 p i (x i µ) 2 x 1 p 1 x 1 p 1 (x 1 µ) (x 1 µ) 2 p 1 (x 1 µ) 2 x 2 p 2 x 2 p 2 (x 2 µ) (x 2 µ) 2 p 2 (x 2 µ) 2...... x n p n x n p n (x n µ) (x n µ) 2 p n (x n µ) 2 Sum = µ Sum = σ 2 (X)

Bernoulli experiments and binomial distribution A Bernoulli experiment is some fixed number n of repetitions of independent, identical trials, where in each trial what is measured is either Success (which happens with probability p) or failure (which happens with probability q = 1 p). If X is the number of successes then P(X = k) = C(n, k)p k q n k = ( ) n p k q n k k for k = 0, 1, 2,, n. The expected value of X is E(X) = np and the standard deviation of X is σ(x) = npq. X is a Binomial distribution with parameters n and p.

Normal distributions 1. All Normal Curves have the same general bell shape. 2. The curve is symmetric with respect to a vertical line that passes through the peak of the curve. 3. The curve is centered at the mean µ which coincides with the median and the mode and is located at the point beneath the peak of the curve. 4. The area under the curve is always 1. 5. The curve is completely determined by the mean µ and the standard deviation σ. For the same mean, µ, a smaller value of σ gives a taller and narrower curve, whereas a larger value of σ gives a flatter curve.

Standard Normal The standard normal curve has µ = 0 and σ = 1. For the standard normal curve, you can compute either by P(a Z b) P(a Z b) = P(Z b) P(Z a) and consulting a table for these probabilities (table will be provided), or by using the Normalcdf function on your calculator. For the Normal, there s no difference between P(Z b) and P(Z < b).

Standard Normal For a general normal with mean µ and standard deviation σ, the z-score of an observation a is z a = a µ σ For a general normal, the probability of seeing a value between a and b is the same as the probability that a standard normal takes a value between z a and z b z-scores allow you to convert non-standard normal distributions to the standard one; to compare two values from different normally distributed data sets; to find percentiles for a normal distribution.

Linear programing Main points: Given a collection of inequalities, identify the region of the plane where all the inequalities are satisfied simultaneously. In other words, given a finite set of constraints, find the feasible set (which may be empty). Find the coordinates of the corners of the feasible set Given a linear objective function, figure out where it is maximized/minimized on the feasible set the corners of the feasible set are key.

Game theory Main points: Construct payoff matrices for zero-sum and constant sum games. Find saddle points (if there are any), and figure out if games are strictly determined. Find the values of games, and figure out if they are fair. Find expected payoffs for mixed strategies. Use strategy lines the find optimal mixed strategies for players with 2 play options.