Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Similar documents
Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Data Distributions and Normality

Two Populations Hypothesis Testing

Data Analysis. BCF106 Fundamentals of Cost Analysis

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

σ 2 : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Review: Population, sample, and sampling distributions

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

The normal distribution is a theoretical model derived mathematically and not empirically.

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Normal Probability Distributions

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1

1 Inferential Statistic

10/1/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

Terms & Characteristics

Statistical Intervals (One sample) (Chs )

7.1 Comparing Two Population Means: Independent Sampling

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Confidence Intervals. σ unknown, small samples The t-statistic /22

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Copyright 2005 Pearson Education, Inc. Slide 6-1

Chapter 7. Inferences about Population Variances

IOP 201-Q (Industrial Psychological Research) Tutorial 5

The "bell-shaped" curve, or normal curve, is a probability distribution that describes many real-life situations.

1) 3 points Which of the following is NOT a measure of central tendency? a) Median b) Mode c) Mean d) Range

Chapter 7. Confidence Intervals and Sample Sizes. Definition. Definition. Definition. Definition. Confidence Interval : CI. Point Estimate.

One sample z-test and t-test

1. Variability in estimates and CLT

Descriptive Analysis

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

Chapter 4 Variability

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Diploma in Business Administration Part 2. Quantitative Methods. Examiner s Suggested Answers

SLIDES. BY. John Loucks. St. Edward s University

Chapter 8 Estimation

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

Statistics for Business and Economics

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

Chapter Seven: Confidence Intervals and Sample Size

χ 2 distributions and confidence intervals for population variance

Chapter 8 Statistical Intervals for a Single Sample

8.3 CI for μ, σ NOT known (old 8.4)

Lecture 8: Single Sample t test

ECON 214 Elements of Statistics for Economists 2016/2017

Probability & Statistics

David Tenenbaum GEOG 090 UNC-CH Spring 2005

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

BIO5312 Biostatistics Lecture 5: Estimations

Lecture 35 Section Wed, Mar 26, 2008

ECON 214 Elements of Statistics for Economists

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

Tutorial 1. Review of Basic Statistics

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes. Standardizing normal distributions The Standard Normal Curve

2011 Pearson Education, Inc

Lecture 2. Probability Distributions Theophanis Tsandilas

Chapter ! Bell Shaped

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

If the distribution of a random variable x is approximately normal, then

1 Small Sample CI for a Population Mean µ

Chapter 7. Sampling Distributions

Statistics & Statistical Tests: Assumptions & Conclusions

Problem Set 4 Answer Key

Chapter 9. Sampling Distributions. A sampling distribution is created by, as the name suggests, sampling.

MATH 10 INTRODUCTORY STATISTICS

The t Test. Lecture 35 Section Robb T. Koether. Hampden-Sydney College. Mon, Oct 31, 2011

Introduction to Statistics I

KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI

Rationale. Learning about return and risk from the historical record and beta estimation. T Bills and Inflation

8.1 Estimation of the Mean and Proportion

Lecture 6: Confidence Intervals

Value (x) probability Example A-2: Construct a histogram for population Ψ.

Quantitative Analysis

Simple Descriptive Statistics

Two-Sample T-Test for Superiority by a Margin

STAT Chapter 7: Confidence Intervals

Moments and Measures of Skewness and Kurtosis

Statistics 431 Spring 2007 P. Shaman. Preliminaries

MgtOp S 215 Chapter 8 Dr. Ahn

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

Figure 1: 2πσ is said to have a normal distribution with mean µ and standard deviation σ. This is also denoted

Basic Procedure for Histograms

ASTM D02, Dec./ Orlando Statistics Seminar

Two-Sample T-Test for Non-Inferiority

Tests for One Variance

Learning Objectives for Ch. 7

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

On Performance of Confidence Interval Estimate of Mean for Skewed Populations: Evidence from Examples and Simulations

Chapter 7 Study Guide: The Central Limit Theorem

Statistics for Managers Using Microsoft Excel 7 th Edition

6.2 Normal Distribution. Normal Distributions

Statistics Class 15 3/21/2012

Theoretical Foundations

DESCRIBING DATA: MESURES OF LOCATION

1 Describing Distributions with numbers

Data Analysis and Statistical Methods Statistics 651

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

Transcription:

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD MAJOR POINTS Sampling distribution of the mean revisited Testing hypotheses: sigma known An example Testing hypotheses: sigma unknown An example Factors affecting the test Measuring the size of the effect Confidence intervals 2 REVIEW: HYPOTHESIS TESTING STEPS 1. State Null Hypothesis 2. Alternative Hypothesis 3. Decide on α (usually.05) 4. Decide on type of test (distribution; z, t, etc.) 5. Find critical value & state decision rule 6. Calculate test 7. Apply decision rule 3 1

SAMPLING DISTRIBUTIONS In reality, we take only one sample of a specific size (N) from a population and calculate a statistic of interest. Based upon this single statistic from a single sample, we want to know: How likely is it that I could get a sample statistic of this value from a population if the corresponding population parameter was 4 SAMPLING DISTRIBUTIONS BUT, in order to answer that question, we need to know what the entire range of values this statistic could be. How can we find this out? Draw all possible samples of size N from the population and calculate a sample statistic on each of these samples (Chapter 8) 5 Or we can calculate it SAMPLING DISTRIBUTIONS A distribution of all possible statistics calculated from all possible samples of size N drawn from a population is called a Sampling Distribution. Three things we want to know about any distribution? Central Tendency, Dispersion, Shape 6 2

AN EXAMPLE BACK TO IQPLUS Returning to our study of IQPLUS and its affect on IQ A group of 25 participants are given 30mg of IQPLUS everyday for ten days At the end of 10 days the 25 participants are given the Stanford-Binet intelligence test. 7 IQPLUS The mean IQ score of the 25 participants is 106 µ = 100, σ = 15 Is this increase large enough to conclude that IQPLUS was affective in increasing the participants IQ? 8 SAMPLING DISTRIBUTION OF THE MEAN Formal solution to example given in Chapter 8. We need to know what kinds of sample means to expect if IQPLUS has no effect. i. e. What kinds of means if µ = 100 and σ = 15? This is the sampling distribution of the 9 mean (Why?) 3

POPULATION DISTRIBUTION 10 SAMPLING DISTRIBUTION 11 What is the relationship between σ and the SD above? SAMPLING DISTRIBUTION OF THE MEAN The sampling distribution of the mean depends on Mean of sampled population Why? St. dev. of sampled population Why? Size of sample Why? 12 4

SAMPLING DISTRIBUTION OF THE MEAN Shape of the sampling distribution Approaches normal Why? Rate of approach depends on sample size Why? Basic theorem Central limit theorem 13 CENTRAL LIMIT THEOREM Central Tendency The mean of the Sampling Distribution of the mean is denoted as µ X Dispersion The Standard Deviation of the Sampling Distribution of the mean is called the Standard Error of the 14 Mean and is denoted as σ X CENTRAL LIMIT THEOREM Standard Error of the Mean We defined this manually in Chapter 8 And it can be calculated as: Shape σ = The shape of the sampling distribution of the mean will be normal if the original population is normally distributed OR if the sample size is reasonably large. 15 X σ n 5

DEMONSTRATION Let a population be very skewed Draw samples of size 3 and calculate means Draw samples of size 10 and calculate means Plot means Note changes in means, standard deviations, and shapes 16 PARENT POPULATION Frequency 3000 2000 1000 Skewed Population 0 Std. Dev = 2.43 Mean = 3.0 N = 10000.00 4.0 2.0 0.0 6.0 8.0 10.0 12.0 14.0 16.0 18.0 20.0 X 17 2000 Sampling Distribution Sample size = n = 3 Frequency 1000 0 13.00 12.00 11.00 10.00 9.00 8.00 7.00 6.00 5.00 4.00 3.00 2.00 1.00 0.00 Sample Mean Std. Dev = 1.40 Mean = 2.99 N = 10000.00 1600 Sampling Distribution Sample size = n = 10 1400 1200 Frequency 1000 800 600 400 200 0 6.50 6.00 5.50 5.00 4.50 4.00 3.50 3.00 2.50 2.00 1.50 1.00 Std. Dev =.77 Mean = 2.99 N = 10000.00 Sample Mean 6

DEMONSTRATION Means have stayed at 3.00 throughout Except for minor sampling error Standard deviations have decreased appropriately Shape has become more normal as we move from n = 3 to n = 10 See superimposed normal distribution for reference 19 TESTING HYPOTHESES: µ AND σ KNOWN Called a 1-sample Z-test H 0 : µ = 100 H 1 : µ 100 (Two-tailed) Calculate p (sample mean) = 106 if µ = 100 Use z from normal distribution Sampling distribution would be normal 20 USING Z TO TEST H 0 2-TAILED α =.05 Calculate z X µ z = = = = X σ X If z > + 1.96, reject H 0 (Why 1.96?) > 1.96 The difference is significant. 21 7

USING Z TO TEST H 0 1-TAILED α =.05 Calculate z (from last slide) If z > + 1.64, reject H 0 (Why 1.64?) > 1.64 The difference is significant. 22 Z-TEST Compare computed z to histogram of sampling distribution The results should look consistent. Logic of test Calculate probability of getting this mean if null true. Reject if that probability is too small. 23 TESTING HYPOTHESES: µ KNOWN σ NOT KNOWN Assume same example, but σ not known We can make a guess at σ with s But, unless we have a large sample, s is likely to underestimate σ (see next slide) So, a test based on the normal distribution will lead to biased results (e.g. more Type 1 errors) 24 8

SAMPLING DISTRIBUTION OF THE VARIANCE Frequency 1400 1200 1000 800 600 400 200 0 138.89 800.0 750.0 700.0 650.0 600.0 550.0 500.0 450.0 400.0 350.0 300.0 250.0 200.0 150.0 100.0 50.0 0.0 Let s say you have a population variance = 138.89 If n = 5 and you take 10,000 samples 58.94% < 138.89 Sample variance 25 TESTING HYPOTHESES: µ KNOWN σ NOT KNOWN Since s is the best estimate of σ; is the best estimate of σ X Since Z does not work in this case we need a different distribution One that is based on s Adjusts for the underestimation s X And takes sample size (i.e. degrees of freedom) into account 26 THE T DISTRIBUTION Symmetric, mean = median = mode = 0. Asymptotic tails Infinite family of t distributions, one for every possible df. For low df, the t distribution is more leptokurtic (e.g. spiked, thin, w/ fat tails) For high df, the t distribution is more normal With df =, the t distribution and the z distribution are equivalent. 27 9

THE T DISTRIBUTION 28 DEGREES OF FREEDOM Skewness of sampling distribution of variance decreases as n increases t will differ from z less as sample size increases Therefore need to adjust t accordingly Degrees of Freedom: df = n - 1 t based on df 29 TESTING HYPOTHESES: µ KNOWN σ NOT KNOWN Called a 1-sample t-test H 0 : µ = 100 H 1 : µ 100 (Two-tailed) Calculate p (sample mean) = 106 if µ = 100 Use t-table to look up critical value using degrees of freedom Compare t observed to t critical and make decision 30 10

USING T TO TEST H 0 2-TAILED α =.05 Same as z except for s in place of σ. Let s say for the 25, s = 7.78 t observed X µ = = = = s X With α =.05, df=24, 2-tailed t critical = (Table D.6; see next slide) Since > reject H 0 31 df Critical Values of Student's t 1-tailed 0.1 0.05 0.025 0.01 0.005 0.0005 2-tailed 0.2 0.1 0.05 0.02 0.01 0.001 1 3.078 6.314 12.706 31.821 63.657 636.619 2 1.886 2.92 4.303 6.965 9.925 31.598 3 1.683 2.353 3.182 4.5415 5.841 12.941 4 1.533 2.132 2.776 3.747 4.604 8.61 5 1.476 2.015 2.571 3.365 4.032 6.859 6 1.44 1.943 2.447 3.143 3.707 5.959 7 1.415 1.895 2.365 2.998 3.499 5.405 8 1.397 1.86 2.306 2.896 3.355 5.041 9 1.383 1.833 2.262 2.821 3.25 4.781 10 1.372 1.812 2.228 2.764 3.169 4.587 11 1.363 1.796 2.201 2.718 3.106 4.437 12 1.356 1.782 2.179 2.681 3.055 4.318 13 1.35 1.771 2.16 2.65 3.012 4.221 14 1.345 1.761 2.145 2.624 2.977 4.14 15 1.341 1.753 2.131 2.602 2.947 4.073 16 1.337 1.746 2.12 2.583 2.921 4.015 17 1.333 1.74 2.11 2.567 2.898 3.965 18 1.33 1.734 2.101 2.552 2.878 3.922 19 1.328 1.729 2.093 2.539 2.861 4.883 20 1.325 1.725 2.086 2.528 2.845 3.85 21 1.323 1.721 2.08 2.518 2.831 3.819 22 1.321 1.717 2.074 2.508 2.819 3.792 23 1.319 1.714 2.069 2.5 2.807 3.767 24 1.318 1.711 2.064 2.492 2.797 3.745 25 1.316 1.708 2.06 2.485 2.787 3.725 T DISTRIBUTION 32 USING T TO TEST H 0 1-TAILED α =.05 H 0 : µ 100 H 1 : µ > 100 (One-tailed) The t observed value is the same With α =.05, df=24, 1-tailed t critical = (Table D.6; see next slide) Since > reject H 0 33 11

df Critical Values of Student's t 1-tailed 0.1 0.05 0.025 0.01 0.005 0.0005 2-tailed 0.2 0.1 0.05 0.02 0.01 0.001 1 3.078 6.314 12.706 31.821 63.657 636.619 2 1.886 2.92 4.303 6.965 9.925 31.598 3 1.683 2.353 3.182 4.5415 5.841 12.941 4 1.533 2.132 2.776 3.747 4.604 8.61 5 1.476 2.015 2.571 3.365 4.032 6.859 6 1.44 1.943 2.447 3.143 3.707 5.959 7 1.415 1.895 2.365 2.998 3.499 5.405 8 1.397 1.86 2.306 2.896 3.355 5.041 9 1.383 1.833 2.262 2.821 3.25 4.781 10 1.372 1.812 2.228 2.764 3.169 4.587 11 1.363 1.796 2.201 2.718 3.106 4.437 12 1.356 1.782 2.179 2.681 3.055 4.318 13 1.35 1.771 2.16 2.65 3.012 4.221 14 1.345 1.761 2.145 2.624 2.977 4.14 15 1.341 1.753 2.131 2.602 2.947 4.073 16 1.337 1.746 2.12 2.583 2.921 4.015 17 1.333 1.74 2.11 2.567 2.898 3.965 18 1.33 1.734 2.101 2.552 2.878 3.922 19 1.328 1.729 2.093 2.539 2.861 4.883 20 1.325 1.725 2.086 2.528 2.845 3.85 21 1.323 1.721 2.08 2.518 2.831 3.819 22 1.321 1.717 2.074 2.508 2.819 3.792 23 1.319 1.714 2.069 2.5 2.807 3.767 24 1.318 1.711 2.064 2.492 2.797 3.745 25 1.316 1.708 2.06 2.485 2.787 3.725 T DISTRIBUTION 34 CONCLUSIONS With n = 25, t observed (24) = Because is larger than both (1-tailed) and (2- tailed) we reject H 0 under both 1- and 2-tailed hypotheses Conclude that taking IQPLUS leads to a higher IQ than normal. 35 FACTORS AFFECTING t test Difference between sample and population means Magnitude of sample variance Sample size Decision Significance level α One-tailed versus two-tailed test 36 12

SIZE OF THE EFFECT We know that the difference is significant. That doesn t mean that it is important. Population mean = 100, Sample mean = 106 Difference is 6 words or roughly a 6% increase. Is this large? 37 EFFECT SIZE Later we develop this more in terms of standard deviations. For Example: In our sample s = 7.78 X µ 106 100 6 Effect size = = = =.77 s 7.78 7.78 over 3/4 of a standard deviation 38 CONFIDENCE INTERVALS ON MEAN Sample mean is a point estimate We want interval estimate Given the sample mean we can calculate an interval that has a probability of containing the population mean This can be done if σ is known or not 39 13

CONFIDENCE INTERVALS ON MEAN If σ is known than the 95% CI is CI.95 = X ± 1.96( σ X ) If σ is not known than the 95% CI is ( tailed α = ) CI X t s.95 = ± * (2,.05) X 40 FOR OUR DATA ASSUMING σ KNOWN ( 1.96 * ) CI = X ± σ.95 X = 106 ± (1.96* ) = 106 ± = µ 41 FOR OUR DATA ASSUMING σ NOT KNOWN CI = X t * s.95 ± (2 tailed, α.05) X = = 106 ± ( * ) = 106 ± = µ 42 14

CONFIDENCE INTERVAL Neither interval includes 100 - the population mean of IQ Consistent with result of t test. Confidence interval and effect size tell us about the magnitude of the effect. What else can we conclude from confidence interval? 43 15