MATH 10 INTRODUCTORY STATISTICS

Similar documents
MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS

Value (x) probability Example A-2: Construct a histogram for population Ψ.

Midterm Exam III Review

MLLunsford 1. Activity: Central Limit Theorem Theory and Computations

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Sampling Distributions Chapter 18

The Binomial Distribution

The Binomial Distribution

Lecture 9. Probability Distributions

Name PID Section # (enrolled)

Chapter 7 - Lecture 1 General concepts and criteria

(Practice Version) Midterm Exam 1

Probability. An intro for calculus students P= Figure 1: A normal integral

Statistics Class 15 3/21/2012

Lecture 9. Probability Distributions. Outline. Outline

Confidence Intervals: Review

Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean)

AMS7: WEEK 4. CLASS 3

Shifting our focus. We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?

. (i) What is the probability that X is at most 8.75? =.875

Statistical Methods in Practice STAT/MATH 3379

The Central Limit Theorem

Statistics and Probability

Discrete Probability Distributions

Class 13. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

ECON 214 Elements of Statistics for Economists 2016/2017

4.2 Probability Distributions

MATH 264 Problem Homework I

Discrete Probability Distribution

Central Limit Theorem (cont d) 7/28/2006

Homework Assigment 1. Nick Polson 41000: Business Statistics Booth School of Business. Due in Week 3

Statistics vs. statistics

CH 5 Normal Probability Distributions Properties of the Normal Distribution

Discrete Mathematics for CS Spring 2008 David Wagner Final Exam

Business Statistics 41000: Probability 4

STA Module 3B Discrete Random Variables

ECE 295: Lecture 03 Estimation and Confidence Interval

UNIVERSITY OF VICTORIA Midterm June 2014 Solutions

Uniform Probability Distribution. Continuous Random Variables &

Chapter 5. Sampling Distributions

1) 3 points Which of the following is NOT a measure of central tendency? a) Median b) Mode c) Mean d) Range

Review of the Topics for Midterm I

Sampling Distribution

Probability Models.S2 Discrete Random Variables

Chapter 9. Sampling Distributions. A sampling distribution is created by, as the name suggests, sampling.

Lecture 22. Survey Sampling: an Overview

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Chapter 7. Sampling Distributions and the Central Limit Theorem

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

Statistics 6 th Edition

What was in the last lecture?

Confidence Intervals Introduction

Class 16. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Central Limit Theorem 11/08/2005

But suppose we want to find a particular value for y, at which the probability is, say, 0.90? In other words, we want to figure out the following:

Statistics 13 Elementary Statistics

Binomal and Geometric Distributions

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Chapter 7. Sampling Distributions

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

Math 140 Introductory Statistics. Next test on Oct 19th

4.1 Probability Distributions

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Statistics for Business and Economics

Discrete Random Variables

Discrete Random Variables

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Lecture 12: The Bootstrap

Midterm Test 1 (Sample) Student Name (PRINT):... Student Signature:... Use pencil, so that you can erase and rewrite if necessary.

Introduction to Probability and Inference HSSP Summer 2017, Instructor: Alexandra Ding July 19, 2017

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Binomial Random Variables. Binomial Random Variables

Making Sense of Cents

Math 140 Introductory Statistics

BIOL The Normal Distribution and the Central Limit Theorem

Chapter 9: Sampling Distributions

Statistical Intervals (One sample) (Chs )

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

BIO5312 Biostatistics Lecture 5: Estimations

Chapter 8 Homework Solutions Compiled by Joe Kahlig. speed(x) freq 25 x < x < x < x < x < x < 55 5

HOMEWORK: Due Mon 11/8, Chapter 9: #15, 25, 37, 44

A random variable (r. v.) is a variable whose value is a numerical outcome of a random phenomenon.

1. Confidence Intervals (cont.)

Lecture 9: Plinko Probabilities, Part III Random Variables, Expected Values and Variances

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

Applications of the Central Limit Theorem

Lecture 2. Probability Distributions Theophanis Tsandilas

MATH 112 Section 7.3: Understanding Chance

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

Chapter 7. Sampling Distributions and the Central Limit Theorem

Lecture 37 Sections 11.1, 11.2, Mon, Mar 31, Hampden-Sydney College. Independent Samples: Comparing Means. Robb T. Koether.

Chapter 5 Discrete Probability Distributions. Random Variables Discrete Probability Distributions Expected Value and Variance

ECON 214 Elements of Statistics for Economists

Transcription:

MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student.

It is Time for Homework Again! ( ω `) Please hand in your homework. Third homework will be posted on the website, under the homework tab. And also sent out via email. Short but requires a bit of reading. 30% weekly homework. Each homework might have different points assigned but carry the same weight.

Your other homework: read and understand the relevant chapters in the textbook. I cover key points or difficult parts during lecture. Textbook has much more exposition and worked examples with tricks and techniques. Lectures are concise/terse. You might think you understand something but actually don t. Everything in the relevant textbook chapters are fair game for the exam. :3 E.g. particular technique or usage.

Week 4 Midterm Week 5 woohoo Chapter 9 Sampling Distributions today s lecture Sampling distributions of the mean and p. Difference between means. Central Limit Theorem. Chapter 10 Estimation today s lecture Point, interval estimates. Bias, variability. Confidence interval!! for the mean, difference between means, proportions. t-distribution!! Chapter 11 Logic of Hypothesis Testing Chapter 8 Advanced Graphs when will we get to do this lol

The Monty Hall Problem 3 doors. 1 item behind each door, assigned randomly: Goats behind 2 doors. Expensive car behind remaining door. Game show host knows what is behind each door. After you picked a door, he opens one of the other two door with a goat. Then asks if you want to switch or stay with your initial choice.

Chapter 9, Section 6 Sampling Distribution of the Mean Sampling Distribution of the Mean has, Standard error, σ M = σ n. μ M = μ σ M 2 = σ2 n Central Limit Theorem!!! ᕕ( ᐛ )ᕗ If the population has finite mean μ, and finite non-zero variance σ 2, then the sampling distribution of the mean becomes better approximated by a normal distribution N(μ, σ2 ), as sample size n increases. n

Chapter 9, Section 6 Sampling Distribution of the Mean Central limit theorem works for any distribution with finite mean and finite nonzero variance.

Quick Sample Exam Question The incomes of people in country X has a distribution that looks like the one below, with population mean μ and variance σ 2. 1. If you take a large simple random sample of n incomes from country X, what a good approximation of the sampling distribution of the sample mean M? What are the mean and variance of this approximation? (1 pt)

Quick Sample Exam Question The incomes of people in country X has a distribution that looks like the one below, with population mean μ and variance σ 2. 1. If you take a large simple random sample of n incomes from country X, what a good approximation of the sampling distribution of the sample mean M? What are the mean and variance of this approximation? (1 pt) 2. Which theorem made the approximation in the previous question possible? (1 pt)

Quick Sample Exam Question The incomes of people in country X has a distribution that looks like the one below, with population mean μ and variance σ 2. 1. If you take a large simple random sample of n incomes from country X, what a good approximation of the sampling distribution of the sample mean M? What are the mean and variance of this approximation? (1 pt) 2. Which theorem made the approximation in the previous question possible? (1 pt) 3. If another researcher independently took another large simple random sample of n incomes from country X, what is the probability that his sample mean would be in the interval μ σ, μ + σ? (2 pts) n n

Chapter 9, Section 7 Difference Between Means Finally, we can use statistics to compare two populations. Suppose you have two simple random samples with size n 1 and n 2. Samples from population 1 and 2 respectively. Calculate their sample means M 1 and M 2. The difference has a sampling distribution with mean μ M1 M 2 = μ 1 μ 2.

Chapter 9, Section 7 Difference Between Means The difference has a sampling dist. with mean μ M1 M 2 = μ 1 μ 2. 2 And variance σ M1 M 2 = σ 2 M1 + σ 2 M1. σ 2 Mi = σ2, which is variance of the sampling dist. of M n i. i Since the sample means are independent (as random variables), the variance sum law was used to derive the variance. 2 σ M1 M 2 = σ 1 2 2 n 1 + σ 2 n 2

Chapter 9, Section 7 Difference Between Means The difference has a sampling dist. with mean μ M1 M 2 = μ 1 μ 2. 2 And variance σ M1 M 2 = σ 2 M1 + σ 2 M1 = σ 1 2 2 + σ 2. n 1 n 2 Standard error σ M1 M 2 = σ 1 2 + σ 2 2. n 1 n 2 This becomes much easier if the sample sizes and population variances are equal. σ M1 M 2 = σ2 n + σ2 n = 2σ2 n. exam trick: not factoring out the root 2

Chapter 9, Section 9 Sampling Distribution of p Population with N individuals. A proportion of them are of type A, and the rest are of type B. E.g. Type A = those who voted for candidate A, and type B = those who voted for candidate B. Take a simple random sample of size n. You can see this sample as an experiment with n trials and probability of success.

Chapter 9, Section 9 Sampling Distribution of p Take a simple random sample of size n. You can see this sample as an experiment with n trials and probability of success. The Binomial distribution modelling the distribution of the number of successes in this sample would have mean n. The standard deviation of the Binomial distribution modelling our sample is n (1 ).

Chapter 9, Section 9 Sampling Distribution of p Let p be the proportion of type A ( successes ) in your sample. This p has sampling distribution with mean. The standard deviation of the Binomial distribution modelling our sample is n (1 ). Standard error of p is σ p = (1 ) n. The sampling distribution is approximately normally distributed for large n.

Break time!! \o/ Break starts after I hand out the exercise. Yeah things are getting more complicated. You are getting a wall of text for your exercise. 12 minutes Circle is a timer that becomes blue. O_o (please ignore if it glitches)

Chapter 10, Section 4 Characteristics of Estimators Point estimate vs Interval estimate, for population parameters. Quantities calculated from a sample are known as statistics.

Chapter 10, Section 4 Characteristics of Estimators Point estimate vs Interval estimate, for population parameters. Quantities calculated from a sample are known as statistics. For point estimates Bias: is mean of sampling dist. equal to parameter? (expected value) Variability: standard error. (mean square error) Interesting aside: bias-variance tradeoff and shrinkage. (not in exam)

Chapter 10, Section 8 Confidence Interval for Mean FINALLY Confidence intervals are interval estimators. What are, for example, 95% confidence intervals? We want to estimate the population mean. We take a simple random sample. Use it to calculate interval [a, b]. If you repeat this procedure many times, 95% of the intervals we calculated contains the population mean.

Chapter 10, Section 8 Confidence Interval for Mean FINALLY Confidence intervals are interval estimators. What are, for example, 95% confidence intervals? We want to estimate the population mean. We take a simple random sample. Use it to calculate interval [a, b]. If you repeat this procedure many times, 95% of the intervals we calculated contains the population mean. Alternatively, this procedure has a 95% chance of a producing a interval that contains the mean.

Chapter 10, Section 8 Confidence Interval for Mean FINALLY We want to estimate the population mean. We take a simple random sample. Use it to calculate interval [a, b]. If you repeat this procedure many times, 95% of the Alternatively, this procedure has a 95% chance of a producing a interval that contains the mean. Important for exam: the interval is a random object. The pop. mean is NOT random. Get new, random interval every time you take a new sample. We cannot even say any given interval has a 95% chance of containing the mean (that probability is either zero or one).

Chapter 10, Section 8 Confidence Interval for Mean FINALLY How to construct a 95% confidence intervals? Reverse engineering. Take a simple random sample of size n, calculate sample mean M. Assuming we know the population variance σ 2. The sampling distribution of the sample mean can be approximated by a normal distribution with mean μ M = μ and variance σ M 2 = σ2 n. This means that 95% of the time, the sample mean will be within 2 standard errors of the population mean.

Chapter 10, Section 8 Confidence Interval for Mean FINALLY The sampling distribution of the sample mean can be approximated by a normal distribution with mean μ M = μ and variance σ M 2 = σ2 n. This means that 95% of the time, the sample mean will be within 1.96 standard errors of the population mean. Reverse engineering: turn this around and say 95% of the time, the population mean (fixed, non-random quantity) will end up within 1.96 standard errors of the mean of a simple random sample. The 95% confidence interval is [ M 1.96 σ M, M + 1.96 σ M ].

Chapter 10, Section 8 Confidence Interval for Mean FINALLY What if we want a general α% confidence interval? Repeat the same process. Then, the α% confidence interval is [ M Z α σ M, M + Z α σ M ]