M249 Diagnostic Quiz

Similar documents
2011 Pearson Education, Inc

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

MAS187/AEF258. University of Newcastle upon Tyne

Chapter 4 Probability Distributions

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Random Variables and Probability Distributions

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

IOP 201-Q (Industrial Psychological Research) Tutorial 5

Discrete Random Variables and Probability Distributions

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

SEX DISCRIMINATION PROBLEM

A probability distribution shows the possible outcomes of an experiment and the probability of each of these outcomes.

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

The Normal Probability Distribution

Basic Procedure for Histograms

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

8.1 Estimation of the Mean and Proportion

Overview. Definitions. Definitions. Graphs. Chapter 4 Probability Distributions. probability distributions

Diploma Part 2. Quantitative Methods. Examiner s Suggested Answers

8.1 Binomial Distributions

MAS187/AEF258. University of Newcastle upon Tyne

Paper Reference. Paper Reference(s) 6683/01 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary

PROBABILITY DISTRIBUTIONS

Statistics 6 th Edition

Continuous Probability Distributions

physicsandmathstutor.com Paper Reference Statistics S1 Advanced/Advanced Subsidiary Wednesday 20 May 2009 Afternoon Time: 1 hour 30 minutes

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment

Contents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii)

Data Analysis and Statistical Methods Statistics 651

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

Elementary Statistics

Continuous random variables

DATA HANDLING Five-Number Summary

The Bernoulli distribution

Fundamentals of Statistics

2 Exploring Univariate Data

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Establishing a framework for statistical analysis via the Generalized Linear Model

Math 227 Elementary Statistics. Bluman 5 th edition

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Module 3: Sampling Distributions and the CLT Statistics (OA3102)

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

CHAPTER 2 Describing Data: Numerical

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

UNIT 4 NORMAL DISTRIBUTION: DEFINITION, CHARACTERISTICS AND PROPERTIES

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Lecture 2. Probability Distributions Theophanis Tsandilas

Notes on bioburden distribution metrics: The log-normal distribution

Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making

Probability Distribution Unit Review

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

Some Discrete Distribution Families

Chapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.

Lecture 2 Describing Data

Continuous Probability Distributions & Normal Distribution

6683/01 Edexcel GCE Statistics S1 Gold Level G2

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Point Estimation. Copyright Cengage Learning. All rights reserved.

Random variables. Contents

Discrete Random Variables and Probability Distributions. Stat 4570/5570 Based on Devore s book (Ed 8)

2 of PU_2015_375 Which of the following measures is more flexible when compared to other measures?

Lecture 9. Probability Distributions. Outline. Outline

Web Extension: Continuous Distributions and Estimating Beta with a Calculator

Chapter 5: Statistical Inference (in General)

ECON 214 Elements of Statistics for Economists

Unit2: Probabilityanddistributions. 3. Normal distribution

Normal Model (Part 1)

Frequency Distribution and Summary Statistics

Time allowed : 3 hours Maximum marks : 100. Total number of questions : 8 Total number of printed pages : 7 PART A

Lecture 9. Probability Distributions

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

Statistics S1 Advanced/Advanced Subsidiary

Statistical Methods in Practice STAT/MATH 3379

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

Uncertainty Analysis with UNICORN

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

3.1 Measures of Central Tendency

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Overview. Definitions. Definitions. Graphs. Chapter 5 Probability Distributions. probability distributions

LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL

Chapter 7: Point Estimation and Sampling Distributions

Statistics 13 Elementary Statistics

STA1510 (BASIC STATISTICS) AND STA1610 (INTRODUCTION TO STATISTICS) NOTES PART 1

ECON 214 Elements of Statistics for Economists 2016/2017

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

It is common in the field of mathematics, for example, geometry, to have theorems or postulates

EX-POST VERIFICATION OF PREDICTION MODELS OF WAGE DISTRIBUTIONS

Converting to the Standard Normal rv: Exponential PDF and CDF for x 0 Chapter 7: expected value of x

Statistics Class 15 3/21/2012

A.REPRESENTATION OF DATA

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

Exploring Data and Graphics

Commonly Used Distributions

Statistics for Business and Economics: Random Variables:Continuous

DATA SUMMARIZATION AND VISUALIZATION

INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS. 20 th May Subject CT3 Probability & Mathematical Statistics

5.3 Statistics and Their Distributions

Distributions in Excel

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley

Transcription:

THE OPEN UNIVERSITY Faculty of Mathematics and Computing M249 Diagnostic Quiz Prepared by the Course Team [Press to begin] c 2005, 2006 The Open University Last Revision Date: May 19, 2006 Version 4.2

Section 1: Introduction 2 1. Introduction In order to study M249, it is assumed that you have some knowledge of statistics as well as some basic mathematical skills. This quiz covers a number of key mathematical and statistical areas with which you should be familiar. Try each question for yourself, using your calculator if you wish, then click on the green section letter (e.g. (a) ) to see the solution. Click on the symbol at the end of the solution to return to the question. Use the and keys to move from Section to Section. There is some advice on evaluating your performance at the end of the quiz.

Section 2: Decimals and fractions 3 2. Decimals and fractions Exercise 1. (a) Evaluate 2.8372 1.8205 to 3 decimal places. (b) Evaluate 7.1946 2.011 to 3 significant figures. Exercise 2. (a) Write down 7 8 2 in its simplest fractional form. 21 (b) Write down 4 10 16 in its simplest fractional form. 15

Section 3: Formulas and equations 4 3. Formulas and equations Exercise 3. (a) Consider the following sample data. In a random sample of size n = 15, the sample mean was found to be x = 10.25 and the sample standard deviation s = 2.326. The 95% confidence limits (that is, the end-points of the 95% confidence interval) for the mean µ of the population from which the sample was drawn are defined to be x ± 1.96 s n Evaluate this confidence interval to 2 decimal places, using the summary statistics given. (b) If 5 2y + 14 = y + 8, find y.

Section 4: Powers, logarithms and the exponential function 5 4. Powers, logarithms and the exponential function Exercise 4. (a) Write 3 3 3 4 in the form 3 n and hence evaluate it. (b) Write ( 3 3) 4 in the form 3 n and hence evaluate it. (c) Write 3 4 /3 4 in the form 3 n and hence evaluate it. Exercise 5. This exercise is about logarithms to base e, which are sometimes called natural logarithms and may be denoted by ln, log e or just log. (a) To 4 decimal places, log 3 = 1.0986 and log 4 = 1.3863. Without evaluating any other logarithms, use these values to find log 12 to 3 decimal places. (b) If log x = 1.2, what is x, to 3 decimal places? Exercise 6. (a) Suppose that exp (x) = 2, for some number x. Without evaluating x, use this result to find exp ( x) and exp (2x). (b) If exp(x) = 3.2, what is x, to 3 decimal places?

Section 5: Graphs 1: Bar charts and histograms 6 5. Graphs 1: Bar charts and histograms Exercise 7. The bar chart below displays the numbers of defects found in a sample of 80 cars by the quality control division of a major vehicle manufacturer. [Press for questions.]

Section 5: Graphs 1: Bar charts and histograms 7 (a) What proportion of cars in the sample have at most 4 faults? (b) What proportion of cars in the sample have between 4 and 6 faults inclusive? (c) Without performing any calculations, would you expect the sample mean number of faults to be greater than the sample median? Give a reason for your answer. [Press to return to graph.] [Press for next exercise.]

Section 6: Graphs 2: Scatterplots 8 6. Graphs 2: Scatterplots Exercise 8. Consider the scatterplot below: [Press for questions.]

Section 6: Graphs 2: Scatterplots 9 (a) How would you characterise the trend in the data displayed by this graph? (For example, would you say that it is linear or non-linear, increasing, decreasing or constant?) (b) How would you describe the degree of scatter? (For example, would you say that it is increasing, decreasing or constant as x increases?) [Press to return to graph.] [Press for next exercise.]

Section 7: Measures of location and dispersion 10 7. Measures of location and dispersion Exercise 9. Numbers of defective components in a random sample of 4 production batches were found to be 8, 2, 2 and 20. (a) Find the mean, median and mode of this sample. (b) Calculate the sample standard deviation, to 3 decimal places.

Section 8: Probabilities from tables 11 8. Probabilities from tables Exercise 10. Consider the data tabulated below, which summarises the beer preferences of a sample of 150 beer drinkers, categorised by gender: Bitter Lager Stout Male 40 20 20 Female 30 35 5 (a) What percentage of the sample drink lager? (b) What percentage of the sample comprises female lager drinkers? (c) What percentage of females in the sample drink lager? (d) What percentage of lager drinkers in the sample are female?

Section 9: Probability distributions 1: Discrete distributions 12 9. Probability distributions 1: Discrete distributions Exercise 11. Consider the probability distribution of the discrete random variable x tabulated below. x -1 0 1 2 1 1 3 p (x) 4 2 16 1 16 (a) Find the mean of the distribution. (b) Find p (x 1).

Section 10: Probability distributions 2: Identify the distribution 13 10. Probability distributions 2: Identify the distribution Exercise 12. Consider the probability distribution represented by the following probability mass function. [Press for questions.]

Section 10: Probability distributions 2: Identify the distribution 14 Do you think that this probability mass function (p.m.f.) represents: (a) a normal distribution? (b) a binomial distribution? (c) a Poisson distribution? [Press to return to graph.] [Press for next exercise.]

Section 11: Hypothesis testing 15 11. Hypothesis testing Exercise 13. A hypothesis test has been undertaken to test the null hypothesis that there is no difference between the proportions of girls and boys who get a grade A in GCSE Mathematics. The significance probability (or p value) was found to be be 0.0034. (a) Which of the following two statements is true? (A) OR There is strong evidence that the underlying proportions are the same. (B) There is strong evidence that the underlying proportions are not the same.

Section 12: Confidence intervals 16 12. Confidence intervals Exercise 14. Suppose that ( 2.6, 3.4) is a 95% confidence interval for some parameter θ. Classify the following statements as true or false: (a) 3.3 lies outside the corresponding 99% confidence interval for θ. (b) On the basis of these data, it is very implausible that θ should equal zero.

Section 13: Correlation 17 13. Correlation Exercise 15. (a) For the data in the scatterplot below, do you think the correlation coefficient is likely to be 1, 0 or 0.8?

Section 13: Correlation 18 Exercise 16. (a) For the data in the scatterplot below, do you think the correlation coefficient is likely to be -1, 0 or 0.8?

Section 13: Correlation 19 Exercise 17. (a) For the data in the scatterplot below, do you think the correlation coefficient is likely to be -1, 0 or 0.8?

Section 14: Post-mortem 20 14. Post-mortem If you had difficulty in answering some of these questions, you might find it useful to look at the pre-registration version of the Introduction to Statistical Modelling, http://statistics.open.ac.uk/m249/introduction.pdf, which comprises a revision of the pre-requisites for M249. You should only attempt to read Sections 1, 3, 4 and 5; Sections 2 and 6 introduce the statistical package SPSS which you will not receive until you have registered for the course. (If you decide to study M249, you will need to work through this unit again, including the computing sections.) If you found the mathematics exercises, 1-6, very difficult, you might consider studying one of the mathematics entry level courses, MU120 or MST121, or the 10-point course S151 Mathematics for Science before embarking on M249. On the other hand, if you found the statistics exercises, 7-17, very difficult, you would be advised to study M248 before M249. If you have any queries about your suitability for the course, you should contact your Regional Office.

Solutions to Exercises 21 Solutions to Exercises Exercise 1(a) 2.8372 1.8205 = 5.1651226, which is 5.165 to 3 decimal places. Remember that, when rounding to 3 decimal places, if the digit in the 4 th decimal place is in the range 0 4, the number is rounded down, that is, the digit in the 3 rd decimal place is left unchanged. If it is in the range 5 9, the number is rounded up, that is, the digit in the 3 rd decimal place is increased by 1. This rule can be applied however many decimal places you are asked to round to.

Solutions to Exercises 22 Exercise 1(b) 7.1946 2.011 = 3.577623073 to the limits of calculator accuracy, which is 3.58 to 3 significant figures.

Solutions to Exercises 23 Exercise 2(a) Applying the usual cancellation rules, we have 7 8 2 21 = 1 8 2 3 = 1 4 1 3 = 1 12 (after dividing above and below by 7, then by 2).

Solutions to Exercises 24 Exercise 2(b) Remembering that a b c d = a b d, where a, b, c, d c represent any non-zero real numbers, and applying the cancellation rules, we get 4 10 16 15 = 4 10 15 16 = 4 2 3 16 = 1 2 3 4 = 3 8

Solutions to Exercises 25 Exercise 3(a) Substituting in the given formula, we get x ± 1.96 s ( ) 2.326 = 10.25 ± 1.96 n 15 = 10.25 ± (1.96 0.600570617) = 10.25 ± 1.17711841 To 2 decimal places, the required interval is therefore (9.07, 11.43).

Solutions to Exercises 26 Exercise 3(b) Applying the manipulation rules yields 5 2 y + 14 = y + 8 5 2 y y = 8 14 3 2 y = 6 y = 4 (The logical symbol means if and only if or is equivalent to.)

Solutions to Exercises 27 Exercise 4(a) Using the rule x a x b = x a+b gives 3 3 3 4 = 3 3+4 = 3 7 = 2187

Solutions to Exercises 28 Exercise 4(b) Using the rule (x a ) b = x ab gives ( 3 3 ) 4 = 3 3 4 = 3 12 = 531441

Solutions to Exercises 29 Exercise 4(c) Using the rule x a /x b = x a b gives 3 4 /3 4 = 3 4 4 = 3 0 = 1 remembering that, for any number x, x 0 = 1.

Solutions to Exercises 30 Exercise 5(a) Remembering that, for any positive real numbers x and y, log (xy) = log x + log y, we get to 3 decimal places. log 12 = log (3 4) = log 3 + log 4 = 1.0986 + 1.3863 = 2.4849 = 2.485

Solutions to Exercises 31 Exercise 5(b) Remembering that log is the inverse function of exp, the exponential function, we have to 3 decimal places. x = exp (1.2) = e 1.2 = 3.320116923 = 3.320

Solutions to Exercises 32 Exercise 6(a) Since exp ( x) = e x = 1 for all x, we have ex exp ( x) = 1 exp (x) = 1 2 Since exp (2x) = e 2x = (e x ) 2 for all x, we have exp (2x) = (exp (x)) 2 = 2 2 = 4

Solutions to Exercises 33 Exercise 6(b) of exp, we have Again remembering that log is the inverse function to 3 decimal places. x = log (3.2) = 1.16315081 = 1.163

Solutions to Exercises 34 Exercise 7(a) The number of cars with at most 4 faults is 10 + 14 + 20 + 12 + 7 = 63. The proportion is therefore 63 80 = 0.79.

Solutions to Exercises 35 Exercise 7(b) The number of cars with between 4 and 6 faults inclusive is 7 + 6 + 4 = 17. The proportion is therefore 17 80 = 0.21.

Solutions to Exercises 36 Exercise 7(c) Yes. For right-skew data, the sample mean is greater than the sample median.

Solutions to Exercises 37 Exercise 8(a) A visual inspection suggests that x and y could well be linearly related. y appears to be decreasing as x increases.

Solutions to Exercises 38 Exercise 8(b) The degree of scatter appears to be increasing as x increases.

Solutions to Exercises 39 Exercise 9(a) The mean is 8+2+2+20 4 = 8. The ordered sample is 2, 2, 8, 20, so the median is 2+8 2 = 5. The mode is the most frequently-occurring value, in this case 2.

Solutions to Exercises 40 Exercise 9(b) s = 1 n (x i x) 2 n 1 i=1 1 = {(8 8) 2 + (2 8) 2 + (2 8) 2 + (20 8) 2} 3 216 = 3 = 72 = 8.485281374 = 8.485, to 3 decimal places.

Solutions to Exercises 41 Exercise 10(a) The expanded table is: Bitter Lager Stout Totals Male 40 20 20 80 Female 30 35 5 70 Totals 70 55 25 150 The percentage of the sample who drink lager is therefore 55 150 = 36.67%, to 2 decimal places.

Solutions to Exercises 42 Exercise 10(b) The expanded table is: Bitter Lager Stout Totals Male 40 20 20 80 Female 30 35 5 70 Totals 70 55 25 150 The percentage of the sample who are female lager drinkers is therefore 35 150 = 23.33%, to 2 decimal places.

Solutions to Exercises 43 Exercise 10(c) The expanded table is: Bitter Lager Stout Totals Male 40 20 20 80 Female 30 35 5 70 Totals 70 55 25 150 The percentage of females in the sample who drink lager is therefore 35 70 = 50%.

Solutions to Exercises 44 Exercise 10(d) The expanded table is: Bitter Lager Stout Totals Male 40 20 20 80 Female 30 35 5 70 Totals 70 55 25 150 The percentage of lager drinkers in the sample who are female is therefore 35 55 = 63.64%, to 2 decimal places.

Solutions to Exercises 45 Exercise 11(a) The mean is ( E (x) = ( 1) 1 ) + 4 ( 0 1 2 ) ( + 1 3 ) ( + 2 1 ) 16 16 = 1 4 + 0 + 3 16 + 1 8 = 1 16

Solutions to Exercises 46 Exercise 11(b) p (x 1) = p (x = 1 or x = 2) = p (x = 1) + p (x = 2) = 3 16 + 1 16 = 1 4

Solutions to Exercises 47 Exercise 12(a) The p.m.f. does not represent a normal distribution since it is discrete, whereas the normal distribution is continuous. Also, the p.m.f. is not symmetrical, whereas the normal probability density function is symmetrical about the mean.

Solutions to Exercises 48 Exercise 12(b) The p.m.f. has the characteristics associated with a binomial distribution, in that it is discrete and unimodal and has a finite range. In fact, it is the distribution B (10, 0.8).

Solutions to Exercises 49 Exercise 12(c) The p.m.f. does not represent a Poisson distribution since the Poisson distribution is right-skew.

Solutions to Exercises 50 Exercise 13(a) The significance probability is the probability of obtaining data which are at least as extreme as those observed if the null hypothesis were true. This probability is very small and so we have strong evidence against the null hypothesis. It is highly unlikely that the underlying proportions are the same. Thus statement (A) is incorrect and statement (B) is correct.

Solutions to Exercises 51 Exercise 14(a) The statement is false. The 99% confidence interval for θ is centred on the same value (in this case 0.4) as the 95% confidence interval and is wider. So, since 3.3 lies inside the 95% confidence interval, it must also lie inside the 99% confidence interval.

Solutions to Exercises 52 Exercise 14(b) The statement is false. The 95% confidence interval gives a range of values of θ which are plausible at the 95% confidence level. Since this interval contains zero, it is plausible that θ = 0.

Solutions to Exercises 53 Exercise 15(a) The data points lie on a straight line with negative gradient. The correlation coefficient is 1.

Solutions to Exercises 54 Exercise 16(a) The data points appear to lie fairly close to a straight line with positive gradient. A correlation coefficient of 0.8 would be appropriate here.

Solutions to Exercises 55 Exercise 17(a) There appears to be no evidence of any linear trend in these data. A correlation coefficient of 0 would be appropriate here.