Elementary Statistics and Inference. Elementary Statistics and Inference. Chapter 20 Chance Errors in Sampling (cont.) 22S:025 or 7P:025.

Similar documents
Estimating Proportions with Confidence

Basic formula for confidence intervals. Formulas for estimating population variance Normal Uniform Proportion

CHAPTER 8 Estimating with Confidence

Topic-7. Large Sample Estimation

Standard Deviations for Normal Sampling Distributions are: For proportions For means _

Lecture 5: Sampling Distribution

Math 124: Lecture for Week 10 of 17

Sampling Distributions and Estimation

Statistics for Economics & Business

Lecture 4: Probability (continued)

Chapter 8: Estimation of Mean & Proportion. Introduction

Chapter 8. Confidence Interval Estimation. Copyright 2015, 2012, 2009 Pearson Education, Inc. Chapter 8, Slide 1

Today: Finish Chapter 9 (Sections 9.6 to 9.8 and 9.9 Lesson 3)

Inferential Statistics and Probability a Holistic Approach. Inference Process. Inference Process. Chapter 8 Slides. Maurice Geraghty,

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

1. Find the area under the standard normal curve between z = 0 and z = 3. (a) (b) (c) (d)

point estimator a random variable (like P or X) whose values are used to estimate a population parameter

Sampling Distributions and Estimation

ii. Interval estimation:

A point estimate is the value of a statistic that estimates the value of a parameter.

BASIC STATISTICS ECOE 1323

Lecture 4: Parameter Estimation and Confidence Intervals. GENOME 560 Doug Fowler, GS

Chapter 8 Interval Estimation. Estimation Concepts. General Form of a Confidence Interval

Chapter 10 - Lecture 2 The independent two sample t-test and. confidence interval

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

1 Random Variables and Key Statistics

Sampling Distributions & Estimators

B = A x z

CHAPTER 8: CONFIDENCE INTERVAL ESTIMATES for Means and Proportions

Outline. Populations. Defs: A (finite) population is a (finite) set P of elements e. A variable is a function v : P IR. Population and Characteristics

2. Find the annual percentage yield (APY), to the nearest hundredth of a %, for an account with an APR of 12% with daily compounding.

1. Suppose X is a variable that follows the normal distribution with known standard deviation σ = 0.3 but unknown mean µ.

Fixed Income Securities

Chance Error in Sampling

The Idea of a Confidence Interval

A random variable is a variable whose value is a numerical outcome of a random phenomenon.

SCHOOL OF ACCOUNTING AND BUSINESS BSc. (APPLIED ACCOUNTING) GENERAL / SPECIAL DEGREE PROGRAMME

CHAPTER 2 PRICING OF BONDS

. (The calculated sample mean is symbolized by x.)

NOTES ON ESTIMATION AND CONFIDENCE INTERVALS. 1. Estimation

Confidence Intervals Introduction

Lecture 5 Point Es/mator and Sampling Distribu/on

Parametric Density Estimation: Maximum Likelihood Estimation

ISBN Copyright 2015 The Continental Press, Inc.

Chapter 17 Sampling Distribution Models

APPLIED STATISTICS Complementary Course of BSc Mathematics - IV Semester CUCBCSS Admn onwards Question Bank

Introduction to Probability and Statistics Chapter 7

Department of Mathematics, S.R.K.R. Engineering College, Bhimavaram, A.P., India 2

CHAPTER 8 CONFIDENCE INTERVALS

Variance and Standard Deviation (Tables) Lecture 10

Unbiased estimators Estimators

STRAND: FINANCE. Unit 3 Loans and Mortgages TEXT. Contents. Section. 3.1 Annual Percentage Rate (APR) 3.2 APR for Repayment of Loans

REINSURANCE ALLOCATING RISK

Fixed Income Securities

Revolving Credit Facility. Flexible Funds for Flexible Needs

CONSUMER PRICE INDEX

Overlapping Generations

5 Statistical Inference

MATH : EXAM 2 REVIEW. A = P 1 + AP R ) ny

An Empirical Study of the Behaviour of the Sample Kurtosis in Samples from Symmetric Stable Distributions

5. Best Unbiased Estimators

VETERINARY PATHOLOGIST EMPLOYER DEMOGRAPHIC SURVEY: ADDENDUM


These characteristics are expressed in terms of statistical properties which are estimated from the sample data.

SUPPLEMENTAL MATERIAL

ST 305: Exam 2 Fall 2014

Statistics for Business and Economics

A Note About Maximum Likelihood Estimator in Hypergeometric Distribution

Monetary Economics: Problem Set #5 Solutions

BIOSTATS 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval

Problem Set 1a - Oligopoly

Labour Force Survey in Belarus: determination of sample size, sample design, statistical weighting

REITInsight. In this month s REIT Insight:

This article is part of a series providing

1 Estimating the uncertainty attached to a sample mean: s 2 vs.

Forecasting bad debt losses using clustering algorithms and Markov chains

EVIDENCE ON THE DISTRIBUTIONAL EFFECTS OF A LAND VALUE TAX ON RESIDENTIAL HOUSEHOLDS. Elizabeth Plummer

Binomial Model. Stock Price Dynamics. The Key Idea Riskless Hedge

When you click on Unit V in your course, you will see a TO DO LIST to assist you in starting your course.

Structuring the Selling Employee/ Shareholder Transition Period Payments after a Closely Held Company Acquisition

DOWLING COLLEGE: School of Education Department of Educational Administration, Leadership, and Technology

MODIFICATION OF HOLT S MODEL EXEMPLIFIED BY THE TRANSPORT OF GOODS BY INLAND WATERWAYS TRANSPORT

The material in this chapter is motivated by Experiment 9.

Monopoly vs. Competition in Light of Extraction Norms. Abstract

Systematic and Complex Sampling!

Optimal Risk Classification and Underwriting Risk for Substandard Annuities

Working conditions in Korea: Survey highlights

4.5 Generalized likelihood ratio test

Chapter Four 1/15/2018. Learning Objectives. The Meaning of Interest Rates Future Value, Present Value, and Interest Rates Chapter 4, Part 1.

Risk Assessment for Project Plan Collapse

Section 3.3 Exercises Part A Simplify the following. 1. (3m 2 ) 5 2. x 7 x 11

International Journal of Management (IJM), ISSN (Print), ISSN (Online) Volume 1, Number 2, July - Aug (2010), IAEME

SIMPLE INTEREST, COMPOUND INTEREST INCLUDING ANNUITY

Online appendices from Counterparty Risk and Credit Value Adjustment a continuing challenge for global financial markets by Jon Gregory

Quantitative Analysis

Notes on Expected Revenue from Auctions

Introduction to Statistical Inference

Summary. Recap. Last Lecture. .1 If you know MLE of θ, can you also know MLE of τ(θ) for any function τ?

Journal of Statistical Software

Correlation possibly the most important and least understood topic in finance

Transcription:

Elemetary Statistics ad Iferece 22S:025 or 7P:025 Lecture 27 1 Elemetary Statistics ad Iferece 22S:025 or 7P:025 Chapter 20 2 D. The Correctio Factor - (page 367) 1992 Presidetial Campaig Texas 12.5 x 106 voters New Mexico 1.2 x 106 voters Radom Sample of 2,500 voters selected i New Mexica & 2,500 voters i Texas New Mexico samples 1 out of 500 Texas samples 1 out 5,000 3 1

It would seem the accuracy i New Mexico would be better because a greater percetage of the populatio is sampled! NOT SO The SE(%) is based o size of sample ot the size of the populatio. p(1 p) SE(%) = 100 where =sample size. 4 Results for SE(%) whe samplig with replacemet whe populatios are large. SE ( without replacemet) = correctio factor SE(with replacemet) umber of tickets i box - umber of draws SE(without replacemet) = SE umber of tickets i box -1 N SE(without replacemet) = p(1 p) N 1 Where N = size of populatio = sample size p = sample proportio 5 Example: Suppose N = 100,000 = 1,000 p =.50 SE(without replacemet) = = 100,000000-1,000 1000(.5)(.5) 99,999.99001 1000(.5)(.5) = (.994922) SE of the box The larger the populatio, the closer the correctio factor is to oe. 6 2

Note the correctio factor approaches 1 as size of the tickets i box (populatio) icreases. 7 I New Mexico & Texas sample NM TX 46% 1 54% 0 37% 1 63% 0 Democrat Democrat SD =.47.54.50 SD =.37.63.48 SE = (.50) SE = (.48) Because SE is early the same i both states, the estimated percet of democrats i a radom sample of 2,500 voters will be about the same, give or take about the same %. 8 Exercise Set C (page 370) #2, 3, 4, 5 #3. A survey orgaizatio wats to take a simple radom sample i order to estimate the percetage of people p who have see a certai televisio program. To keep the costs dow, they wat to take as small a sample as possible. But their cliet will oly tolerate chace errors of 1 percetage poit or so i the estimate. Should they use a sample of size 100, 2,500, or 10,000? You may assume the populatio to be very large; past experiece suggests the populatio percetage will be i the rage 20%-40%. 9 3

40% 1 60% 0 avg = p =.40 SD =.40.60 =.489 ~.50 10 Wat SE(%) ~1.00 p(1 p 1.00 = 100.40.60 1.00 = 100.40.60 1.00 = (10,000) = (.24)(10,000) ~ 2,400 Use sample of about 2,500 11 #5. 2 R 8 B p(r)=.20 Draw 4 marbles with replacemet, ad without replacemet. Fid SE(%) of red marbles draw. (a) With replacemet.2.8 SE(%) = 100 = 20% 4 12 4

(b) Without replacemet use correctio factor SE(%) = N SE(%) N 1 SE(%) = 10 4 (20) = (.816)(20) = 16.33% 10 1 Note: A big differece i SE(%) whe size of populatio is small. 13 E. The Gallup Poll They sample several thousad voters out of over 200 millio reaso? The size of the chace error (SE) i percet depeds maily o the absolute size of the sample, ad hardly at all o the size of the populatio from which the sample was selected. 14 For a sample of size 2,500 whe sample percet is betwee.40 ad.60, the p(1 p) SE(%) = 100.5.5 SE (%) = 100 = 1.00% 2,500 The chace error for a sample of size 2,500 from a populatio of over 200,000 millio would have a SE of percet aroud 1%. If sample percet is 50%, we would coclude results are accurate, give or take 1%. 15 5

F. Review Exercises (pp. 371-373) #1, 2, 3, 5, 11, 12 16 E(H) SE(H) E(% Hds) SE(% Hds) =2,500 1,250 25 50 1.0% =10,000 5,000 35 50.5% =1,000,000 500,000 500 50.05% 17 18 6

3. A group of 50,000 tax forms has a average gross icome of $37,000 with a SD of $20,000. Furthermore, 20% of the forms have a gross icome over $50,000. A group of 900 forms is chose at radom for audit. To estimate the chace that betwee 19% ad 21% of the forms chose for audit have gross icomes over $50,000, a box model is eeded. 19 a) Should the umber of tickets i the box be 900 or 50,000? b) Each ticket i the box shows a zero or a oe a gross icome c) True or false: The SE of the box is $20,000. d) True or false: The umber of draws is 900. e) Fid the chace (approximately) that betwee 19% ad 21% of the forms chose for audit have gross icomes over $50,000. 20 20% + 50k 80% - 50k = 900 E(% over 50k) = 20%.2.8 SE(% over 50k) = 100 = 1.33% 900 21 7

SE=1.33% 19 21 M=20% 19 20 21 20 Z = =.75 Z = = +. 75 1.33 1.33 % X M Z = SE Use Normal Curve table to fid percetage of scores betwee ±Z=.75. P (.75 Z.75) =.5467 or 54.67 ~ 55% 22 #11. (page 373) A uiversity has 25,000 studets, of whom 17,000 are udergraduates. The housig office takes a simple radom sample of 500 studets, 357 out of 500 are udergraduates. 17,000 1 8,000 0 17,000 p = =.68 E(umber) = 340 25,000 E(% of udergraduates) = 68% SE(% of udergraduates) = SE(%) = 2.09%.68.32 100 500 23 Determie the likelihood of obtaiig 357 or more udergraduates (71%) i a sample of 500 studets. SE=2.09 % M=68 71 Z 71 68 Z = = 1.44 ~1.45 2.09 24 8

85.29% -1.45 1.45 0 About 85.29% of scores i Normal Distributio are betwee ±Z=1.45. About 14.71% of scores are either less tha Z=-1.45 or greater tha Z=1.45. About 7.35% of scores are greater tha Z=1.45. Result: The probability (chace) of obtaiig 357 udergrads or more i a sample of 500 studets is about 7%. Z 25 9