INFERENTIAL STATISTICS REVISION

Similar documents
Making Sense of Cents

The Normal Probability Distribution

Section Introduction to Normal Distributions

Mr. Orchard s Math 141 WIR 8.5, 8.6, 5.1 Week 13

Section 6.5. The Central Limit Theorem

Mean GMM. Standard error

Copyright 2005 Pearson Education, Inc. Slide 6-1

The Normal Distribution

MA131 Lecture 8.2. The normal distribution curve can be considered as a probability distribution curve for normally distributed variables.

Estimation of the Mean and Proportion

Chapter Seven: Confidence Intervals and Sample Size

Central Limit Theorem

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Normal Probability Distributions

Chapter 7. Sampling Distributions

In a binomial experiment of n trials, where p = probability of success and q = probability of failure. mean variance standard deviation

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

7.1 Comparing Two Population Means: Independent Sampling

19. CONFIDENCE INTERVALS FOR THE MEAN; KNOWN VARIANCE

A continuous random variable is one that can theoretically take on any value on some line interval. We use f ( x)

Statistics for Managers Using Microsoft Excel 7 th Edition

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

6.1 Graphs of Normal Probability Distributions:

6 Central Limit Theorem. (Chs 6.4, 6.5)

Chapter Seven. The Normal Distribution

Section 3.4 The Normal Distribution

Chapter 5 Basic Probability

Chapter 6 Confidence Intervals

Chapter 6: The Normal Distribution

8.3 CI for μ, σ NOT known (old 8.4)

C.10 Exercises. Y* =!1 + Yz

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Lecture 8: Single Sample t test

Confidence Interval and Hypothesis Testing: Exercises and Solutions

AMS7: WEEK 4. CLASS 3

Midterm Test 1 (Sample) Student Name (PRINT):... Student Signature:... Use pencil, so that you can erase and rewrite if necessary.

ECON 214 Elements of Statistics for Economists

A LEVEL MATHEMATICS QUESTIONBANKS NORMAL DISTRIBUTION - BASIC

(# of die rolls that satisfy the criteria) (# of possible die rolls)

Chapter 7 Study Guide: The Central Limit Theorem

STATISTICS - CLUTCH CH.9: SAMPLING DISTRIBUTIONS: MEAN.

If the distribution of a random variable x is approximately normal, then

Since his score is positive, he s above average. Since his score is not close to zero, his score is unusual.

Uniform Probability Distribution. Continuous Random Variables &

Confidence Intervals and Sample Size

Math 14, Homework 7.1 p. 379 # 7, 9, 18, 20, 21, 23, 25, 26 Name

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

Elementary Statistics

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) Estimating Population Parameters

The Central Limit Theorem (Solutions) COR1-GB.1305 Statistics and Data Analysis

PREMIUM VERSION PREVIEW

5.1 Personal Probability

A) The first quartile B) The Median C) The third quartile D) None of the previous. 2. [3] If P (A) =.8, P (B) =.7, and P (A B) =.

MA131 Lecture 9.1. = µ = 25 and σ X P ( 90 < X < 100 ) = = /// σ X

Section 7.5 The Normal Distribution. Section 7.6 Application of the Normal Distribution

Chapter 7. Inferences about Population Variances

The Central Limit Theorem

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Abraham Baldwin Agricultural College Math 2000 Practice Test 3

6. THE BINOMIAL DISTRIBUTION

Fall 2011 Exam Score: /75. Exam 3

As you draw random samples of size n, as n increases, the sample means tend to be normally distributed.

Confidence Intervals: Review

Study Ch. 7.3, # 63 71

Using the Central Limit Theorem It is important for you to understand when to use the CLT. If you are being asked to find the probability of the

AP 9.2 Notes WEB.notebook February 04, Bellwork

WebAssign Math 3680 Homework 5 Devore Fall 2013 (Homework)

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

Chapter 6: The Normal Distribution

Chapter Four: Introduction To Inference 1/50

ECON 214 Elements of Statistics for Economists 2016/2017

Midterm Exam III Review

* Point estimate for P is: x n

d) Find the standard deviation of the random variable X.

Lecture 39 Section 11.5

ECO220Y Continuous Probability Distributions: Normal Readings: Chapter 9, section 9.10

Confidence Intervals for the Mean. When σ is known

Module 4: Probability

AP * Statistics Review

Expected Value of a Random Variable

Lecture 37 Sections 11.1, 11.2, Mon, Mar 31, Hampden-Sydney College. Independent Samples: Comparing Means. Robb T. Koether.

Distribution of the Sample Mean

The Central Limit Theorem for Sample Means (Averages)

Normal distribution. We say that a random variable X follows the normal distribution if the probability density function of X is given by

Estimation and Confidence Intervals

No, because np = 100(0.02) = 2. The value of np must be greater than or equal to 5 to use the normal approximation.

1. Variability in estimates and CLT

CHAPTER 6 Random Variables

2.) What is the set of outcomes that describes the event that at least one of the items selected is defective? {AD, DA, DD}

VARIABILITY: Range Variance Standard Deviation

CH 5 Normal Probability Distributions Properties of the Normal Distribution

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

Lecture 6: Chapter 6

Normal Sampling and Modelling

Determine whether the given procedure results in a binomial distribution. If not, state the reason why.

Transcription:

INFERENTIAL STATISTICS REVISION

PREMIUM VERSION PREVIEW WWW.MATHSPOINTS.IE/SIGN-UP/

2016 LCHL Paper 2 Question 9 (a) (i) Data on earnings were published for a particular country. The data showed that the annual income of people in full-time employment was normally distributed with a mean of 39 400 and a standard deviation of 12 920. The government intends to impose a new tax on incomes over 60 000. Find the percentage of full-time workers who will be liable for this tax, correct to one decimal place. Calculate the z score. x μ σ x is a data point μ is the population mean σ is the population standard deviation Quick Sketch 39,400 60000 39400 12920 1.5944 μ = 39,400 σ = 12,920 x = 60,000 60,000 The tables only give values to the left of z but: P Z 1 P Z z P z > 1.59 = 1 P z < 1.59 = 1 0.9441 = 0.0559 5.6% 0.0559

2016 LCHL Paper 2 Question 9 (a) (ii) The government will also provide a subsidy to the lowest 10 % of income earners. Find the level of income at which the government will stop paying the subsidy, correct to the nearest euro. 39,400 Instead of calculating the lowest 10% we calculate those above 90% as the curve is symmetrical and the tables have no percentages less than 50%. 1 0.1 = 0.90 1.28 x μ σ = 1.28 Find the z score that corresponds to 0.90 x μ σ 0.10 Quick Sketch x 39400 = 1.28 12920 x 39400 = 1.28 12920 x = 22862.4 Less than 90% is the z score 1.28 BUT greater than 90% is 1.28 (same as less than 10%). x = 22,862

2016 LCHL Paper 2 Question 9 (a) (iii) Some time later a research institute surveyed a sample of 1000 full-time workers, randomly selected, and found that the mean annual income of the sample was 38 280. Test the hypothesis, at the 5% level of significance, that the mean annual income of full-time workers has changed since the national data were published. State the null hypothesis and the alternative hypothesis. Give your conclusion in the context of the question. 15 marks 11 for the z score. Was acceptable to use Confidence Interval Method State the null and alternate hypothesis H 0 : μ = 39400, the mean annual income of full time workers is still 39,400 H 0 : μ 39400, the mean annual income of full time workers is NOT still 39,400 Calculate the test statistic x μ σ n x = 38,280 μ = 39,400 σ = 12,920 n = 1,000 38280 39400 12920 1000 2.74 2.74 < 1.96 Reject the null hypothesis. Sketch of Critical Regions Give conclusion in the context of the question. There is enough evidence to reject the claim that the mean annual income has remained the same. Marking scheme: The mean income has changed.

2016 LCHL Paper 2 Question 9 (b) The research institute surveyed 400 full-time farmers, randomly selected from all the full-time farmers in the country, and found that the mean income for the sample was 26 974 and the standard deviation was 5120. Assuming that annual farm income is normally distributed in this country, create a 95% confidence interval for the mean income of full-time farmers. 10 marks 95% Confidence Interval for a Means x 1.96 σ n < μ < x + 1.96 σ n 26974 1.96 5120 5120 < μ < 26974 + 1.96 400 400 26974 501.76 < μ < 26974 + 501.76 26472.24 < μ < 27475.76 Margin of Error (Mean) ±1.96 σ n Note: σ is in the tables but you must n remember the ±1.96 which is specific to a 95% level of confidence.

2016 LCHL Paper 2 Question 9 (c) It is known that data on farm size are not normally distributed. The research institute could take many large random samples of farm size and create a sampling distribution of the means of all these samples. Give one reason why they might do this. Central Limit Theorem Mean μ x = μ Standard Deviation σ x = σ n The distribution of sample means will be normally distributed. The mean of the samples will be equal to the mean of the population (for n > 30) and the standard deviation will be σ n. Must know the above!

2016 LCHL Paper 2 Question 9 (d) The research institute also carried out a survey into the use of agricultural land. n farmers were surveyed. If the margin of error of the survey was 4 5 %, find the value of n. Without more information this is the only formula we could use. Syllabus probably needs a little clarity here. Margin or Error 1 n 1 n = 0.045 2 1 = n 0.045 n = 493.827 The institute interviewed 494 farmers.

2015 LCHL Paper 2 Question 2 (a) A survey of 100 shoppers, randomly selected from a large number of Saturday supermarket shoppers, showed that the mean shopping spend was 90.45. The standard deviation of this sample was 20.73. Find a 95% confidence interval for the mean amount spent in a supermarket on that Saturday. 10 marks Confidence Interval for a Mean x 1.96 σ n < μ < x + 1.96 σ n 90.45 1.96 20.73 20.73 < μ < 90.45 + 1.96 100 100 90.45 4.06308 < μ < 90.45 + 4.06308 86.38692 < μ < 94.51308 Margin of Error (Mean) ±1.96 σ n Note: σ is in the tables but you must n remember the ±1.96 which is specific to a 95% level of confidence. We can be 95% confident that the mean amount spent was in the range 86.39 < μ < 94.51

2015 LCHL Paper 2 Question 2 (b) A supermarket has claimed that the mean amount spent by shoppers on a Saturday is 94. Based on the survey, test the supermarket s claim using a 5% level of significance. Clearly state your null hypothesis, your alternative hypothesis, and your conclusion. Full marks for Confidence Interval Method EVEN if you already used it for part (a). That would be 20 marks for doing the same thing twice!! State the null and alternate hypothesis H 0 : μ = 94, the mean amount spent by shoppers on a Saturday is 94 H 1 : μ 94, the mean amount spent by shoppers on a Saturday is NOT 94 Calculate the test statistic x μ σ n x = 90.45 μ = 90 σ = 20.73 n = 100 90.45 94 20.73 100 1.71 1.71 > 1.96 Fail to reject H 0 Sketch of Critical Regions Give conclusion in the context of the question. We do not reject the null hypothesis as 1.71 is in the fail to reject region. This means that there is not sufficient evidence to reject the claim that the mean amount spent by shoppers is as the supermarket claims. Marking scheme: Outlined no conclusion.

2015 LCHL Paper 2 Question 2 (c) Find the p value of the test you performed in part (b) above and explain what this value represents in the context of the question. p value = 2 P z > z 1 where z 1 is the test statistic The probability of getting a value z < 1.71 is got from the tables. P z < 1.71 = 1 P z < 1.71 = 1 0.9564 = 0.0436 The probability of getting a value > 1.71 is also 0.0436 p value is the sum of these probabilities. 2 0.0436 = 0.0872 Compare this to the level of significance 5% or 0.05 0.0872 > 0.05 This is greater evidence to fail to reject the null hypothesis. What does the p value represent: If the supermarkets claim is true then our sample result would be this far (a measure of 94 90.45 = 3.55) from the true claim 8.72% of the time. We are unwilling to reject it as this % is too high (i.e. greater than 5%). p-value is easy to calculate but interpreting it has caused difficulty.

PREMIUM VERSION PREVIEW WWW.MATHSPOINTS.IE/SIGN-UP/