Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Size: px
Start display at page:

Download "Categorical. A general name for non-numerical data; the data is separated into categories of some kind."

Transcription

1 Chapter 5

2 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show, favourite music genre. Ordinal data Categorical data with some form of implied order. Eg. A question followed by the respondent selecting a level of satisfaction ie Very good good moderate - bad

3 Numerical data A general name for data that is numerical in its nature. Discrete data Numerical data where there is a countable number of values possible. Eg. What year were you born?, How many pets do you own? Continuous data Numerical data where there is an infinite number of values possible; often associated with measurement of some kind. The values can be any (real) numbers within a particular range. Eg. How tall are the players on Melbourne Victory squad? (the values recoded varies depending on the accuracy requested or the degree of accuracy possible with out measuring instrument)

4 We make this distinction between discrete and continuous data because there are different things we can do with each type. However, we often treat continuous data as if it were discrete. This is because we must record the data in some way. With the height example previously, if the question said to record the data to the nearest cm, then we would have discrete possibilities.

5 A structural method of recording statistical data. If the range of values is too big, group the data into class intervals. No. of siblings Tally marks Frequency 0 III 3 1 IIII 4 2 II 2 Total 9

6 We can also add in relative frequency and percentage frequency columns to compare values in different frequency tables where the total for each table is different. Relative frequency Divide the frequency by the total number of data values in the table. The relative frequency table should equate to 1. Percentage frequency Multiply the relative percentage by 100. The percentage frequency column should equate to 100.

7 Although a frequency table is a useful way of recording data, when we group data values we lose information. A stem plot preserves this lost information. Stem plots are usually divided into intervals of 10. Example: 12, 15, 23, 45, 56, 18, 44, 33, 23, 19, 34, 52, 59, 41, 23, 13, 9, 11, 15, 18 Note: We can easily identify the highest and lowest score. If we need to break the stem because of large numbers of leaf values, use a code such as 5 L to represent the values in the lower half of the 50 s and 5 U for the upper half STEM LEAF

8 A bar graph is used to display categorical (nominal) information. Bar graphs can easily be constructed from data represented in a frequency table. Gaps between bars to represent we are not dealing with continuous data. Example: Type of vehicle Sedan 16 Ute 7 Truck 8 Stationwagon 6 Motorbike 1 Total 38 Frequency Type of Vehicle

9 A single rectangle is divided into pieces according to the contribution of the category to the whole. An appropriate scale needs to be chosen to make the graph easier to draw. Find the percentage contribution for each category and base the scale on those values. Component Protein Fat Sat Fat Carbs Sugar No. of grams Note: Number of grams involved = 78.6 No. of grams Protein Fat Sat Fat Carbs Sugar

10 When dealing with numerical data we should not really use a bar chart. However, the temptation is strong when we have discrete data Uses strips instead of bars (to make it look different) Graph does not have a title but both axis need labels. Gaps between strips to represent discrete data

11 When dealing with continuous numerical data we use histograms. Like a bar chart, but the bars are joint together. There are no gaps in the data values so we leave no gaps in the diagram. Leave a half-column gap between the first column and the vertical axis. Note: indicates that that part of the horizontal axis has been left out. Make the intervals the same size.

12 When we have a frequency table that have already been set up and the intervals are not the same size, information can be misleading. Looking at the histogram, it seems to indicate that the most dangerous age range is 30-<40 years, but this is not correct, because the interval is larger we would expect more deaths. We need a relative measure to make such a decision.

13 To draw a histogram correctly when we have inconsistent intervals we need to a percentage frequency and a frequency density column. Frequency density Frequency density = percentage frequency class width Remember: Precentage frequency = frequency total number of data values 100

14 Example: Percentage frequency = = 1.0% 299 Frequency density = 1.0 = Note: add the vertical axis title frequency density

15 5.3.4: The frequency table gives information about the number of traffic fatalities for females in Victoria during draw a historgram of the data, taking note of the inconsistent class interval used.

16 Add the percentage frequency column and a frequency density column. Precentage frequency = frequency 100 total number of data values Frequency density = percentage frequency class width Age range No. of deaths Percentage frequency Frequency density 0-1 (1 / 98) x 100 = 1.0 (1.0 / 5) = (3 / 98) x 100 = 3.1 (3.1 / 8) = (5 / 98) x 100 = 5.1 (5.1 / 3) = (6 / 98) x 100 = 6.1 (6.1 / 2) = (11 / 98) x 100 = 11.2 (11.2 / 4) = (6 / 98) x 100 = 6.1 (6.1 / 4) = (5 / 98) x 100 = 5.1 (5.1 / 4) = (7 / 98) x 100 = 7.1 (7.1 / 10) = (12 / 98) x 100 = 12.2 (12.2 / 10) = (11 / 98) x 100 = 11.2 (11.2 / 10) = (16 / 98) x 100 = 16.3 (16.3 / 15) = (15 / 98) x 100 = 15.3 (15.3 / 15) = 1.02 Σf=98

17 Age range No. of deaths Percentage frequency Frequency density Σf=98

18 When we have a continuous data set recorded in class intervals we can draw what is called a cumulative frequency diagram. This shows the number of data values less than a particular value. Add a column to the frequency table labelled cumulative frequency Add a new first row to emphasis we are finding the number of data values less than the given value. The cumulative frequency column tells us how many values there are less than the right-hand end-point of the interval. Eg.

19 Example: Class interval (mass, kg) 10 - < < < <50 5 Frequency (number of students) Σf = 34 Class interval (mass, kg) Frequency (number of students) Cumulative frequency < < (0+5) 20 - < (5+8) 30 - < (13+16) 40 - < (29+5) Σf = 34

20 Once we have added the cumulative frequency column we can now draw the diagram. Cumulative frequency Mass (kg)

21 How to read the diagram What percentage of people are less than 35kg? Find the cumulative frequency: 20 people How many people in data set: = 58.82% 34 Cumulative frequency % of people surveyed are less than 35 kg. Mass (kg)

22 We can use our CAS to find only approximate values for percentiles. These arise from questions such as Under what mass are 70% of the students? of course you have also been given the distribution of the student weight. We have the following information Class interval (mass, kg) Frequency (Number of students) 10 - < < < <50 5 We need to enter the cumulative frequency values as the second column in our list. As a check, the total frequency should be the value in the last row.

23 Class interval (mass, kg) Frequency (Number of students) Cumulative frequency < < < < < Enter the data in the Lists & spreadsheet application. Call the first column x Enter the right-hand boundary values of the class interval Call the second column y Include the point (10,0)

24 2. Insert a new page,, and choose Data and Statistics. Label the axis 3. Join the dots by pressing Type > XY Line Plot > Plot

25 4. To estimate the 70 th percentile we need to know 70% of the cumulative frequency. In this case this is 0.7 x 34 = we now want to draw in the line y = to do this press > Analyse > Plot Function and fill in the dialog box. We can now approximate the x-value of the point of intersection by simply reading from the graph. Therefore, the 70 th percentile is approximately 37

26 Mean The measure of central tendency found by adding together all of the data values and dividing by the data set sum of the data values number of data values = Σx n = Σxf Σf Median The measure of central tendency that is the (physical) middle value of the data set for odd n, the median is the n th value. for even n, the median is the mean of the n 2 th and n th values. Mode The measure of central tendency that is the most frequent occurring data value

27 1. Enter the data in Lists & Spreadsheets using the midpoint values to represent each group. 2. Press > Statistics > Stat Calculations > One- Variable Statistics to bring up this dialogue box.

28 3. In this case we have only one list of data values as the second list is just the frequencies. So, to OK and press. This brings up another dialogue box which needs to be filled in as shown. (If you had just entered a list of ungrouped data values then you would leave Frequency List as 1 )

29 4. to OK and press and we get the summary statistics included in our table. Arrow down to highlight the value beside x and read of the value of the mean from the entry line at the bottom. In this case it is Continue to arrow down to find the value beside MedianX as this is the median. This tells us the median is 7, but as this is grouped data, we should really say the median occurs in the 5-9 group.

30 Find the mode, median and mean of the following: Score Frequency Mode: look in the frequency column for the highest occurring number Mode = 5 and 7

31 Median Find Σf Score Frequency Score Frequency Σf= 131 for odd n, the median is the n th value. Median = Median = th value = 66th value = 7

32 Mean: Add a new column to the frequency table and label it xf (x stands for the data value, f for the frequency), and fill it by multiplying the data values by the frequency for that value. Score Frequency Σf= 131 Score Frequency xf Σf= 131 Σxf = 927 Mean = Σxf Σf = = 7.08 (2 dp)

33 Find the modal class, median class and mean (2 dp) for the following grouped discrete data sets. Score Frequency Modal class: Look in the frequency column for the highest number; the modal class is the number associated with this. Modal class = 20-29

34 Median class 108 Find Σf 2 for even n, the median is the mean of the n th and n + 1 th values. 2 2 th value and th and 55 th values + 1 th value 54 th and 55 th values are in the interval Median class = Score Frequency Σf= 108

35 When we deal with grouped data we cannot find a single mode, instead we find the modal class. We can find a specific value for the mean but we need to find the these values that represent each of these class intervals. We use the median of the interval and call this the midpoint of the interval (x m ) We add two new columns to the frequency table x m and x m f

36 Add the new columns x m and x m f Score Frequency Score Frequency x m x m f Σf= 108 Σx m f = 4406 Find the mean: Mean = Σx mf Σf = = 40.80

37 For the following data sets draw a cumulative frequency curve to find an estimate for the median. Add a cumulative frequency column to the table. Note that <141 is included as a start to the class. Score Frequency 141-< < < < < < Score Frequency Cumulative f < < < < < < < Σf= 146

38 Draw the cumulative frequency curve Score Frequency Cumulative f < < < < < < < Σf= 146 for even n, the median is the mean of the n th and n + 1 th values. 2 2 Median = and + 1 = median between rd and 74 th value Median is approximately 156 Draw a line from the median position across to the curve and then draw a line from this point on the curve down to the horizontal axis.

39 The median divides the data set into two equal pieces. We can extend this idea by dividing the data set into any number of equal-sized pieces. We call these pieces quantiles. Some quantiles have special names. 4 equal pieces = quartiles 10 equal pieces = deciles 100 equal pieces = percentiles

40 Lower quartile (Q1 or QL) Upper quartile (Q3 or QU) We consider there to be approximately 25% of the data values in each of the quartiles.

41 Sometimes we want to find how the data spreads out from the central values (mean, median, mode) The range (maxx-minx) is the simplest measure of spread Sensitive to extreme values (outliers) Example: 1, 1, 1, 4, 6, 8, 8, 9, 10, 13, 13, 14 Range = 14 1 = 13 The range represents the difference between the highest score to the lowest score Interquartile range: IQR = Q 3 Q 1 Interquartile range (IQR) = upper quartile (Q 3 ) lower quartile (Q 1 ) The IQR represents the middle 50% of the data set.

42 5.5.1 Find the interquartile range for the following data set. Q 1 Q 3 2, 3, 5, 8, 9, 11, 15, 16, 18, 25, 36 IQR = Q 3 Q 1 IQR = 18 5 IQR = 13 median

43 We use quartiles to draw up boxplots. The key features of box plots are show in the diagram. The five values shown are sometimes referred to as the five-figure summary for a data set. The box plot needs a scale line associated with it. The box is the central box representing the middle 50% of the data. The whiskers are the lines that go out to the extreme values.

44 5.5.3a Find the interquartile range and draw a boxplot for the following data. Find Σf Σf = 24 Identify the median Median = mean of the n th and n + 1 th values. 2 2 Median between 12 th and 13 th value. = 2 Identify Q 1 and Q 3 12 values in the lower and upper half Q 1 is between the 6 th and 7 th value = 1 Q 3 is between the 18 th and 19 th value = 3 Score Frequency Calculate the IQR IQR = 3 1 = 2

45 Draw a box plot to represent the data Q 1 = 1 Q 3 = 3 Median = 2

46 5.5.3c Find the interquartile range and draw a boxplot for the following data. Find Σf Σf = 26 Identify the median Median = mean of the n th and n 2 2 Median between 13 th and 14 th value. = 30.5 Identify Q 1 and Q 3 13 values in the lower and upper half Q 1 is the 7 th value = 15 Q 3 is the20 th value = th values. Stem Leaf Calculate the IQR IQR = = 27

47 Q 1 = 15 Q 3 = 42 Median = 30.5

48 5.5.5 Draw a box plot for the following data which is displayed in a percentage cumulative frequency graph Draw a line from the 50% mark to estimate for the median Approx median = 18 Draw a line at the 25% mark and 75% mark to estimate Q1 and Q3 Approx Q 1 = 12 Approx Q 3 = 23.5 Calculate the IQR IQR = = 11.5

49 Q 1 = 12 Q 3 = 23.5 Median = 18

50 1. Enter the data the normal way. As we will be drawing a graph you need to name the column. We will use x here. Then get the summary statistics screen by pressing > Statistics > Stat Calculations > One-Variable Statistics. When you get to the dialog box you need to make X1 List x as this is the name you gave the column with the data in it. Leave Frequency List as 1 since we entered each value individually. You will be asked about the x say it is a Variable Reference.

51 2. Arrow down to get the five-figure summary on the screen and write the values into your exercise book. 3. Insert a new page ( ) and choose Add Data & Statistics. Name the horizontal axis with the name given to column A, in our case x. Now press > Plot Type > Box Plot and our boxplot appears. Move your cursor around to find out info The dot represents an outlier or extreme value. An outlier is a value that lies more than 1.5 x IQR away from the nearer of the upper or lower quartiles.

52 The standard deviation ( σ ) sigma is a measure of spread relating to how far the data deviates from the mean. I.e. How far from the centre we might expect to still find data. The larger the number, the more spread out the data is. Note: Population: when gather data from the whole group. Ie. The height of year 11 s Every single year 11 is measured. Mean of population: μ (mu) Sample: When we gather data from a select group to represent the population. Ie. The height of year 11 s one class may be measured to represent the whole group. Mean of sample: x (x bar) μ and x are measures of central tendency that tells us where we might expect to find the centre of the data set

53 We define variance as the mean of the squared deviations from the mean and use the symbol σ 2 (sigma squared) to represent it. Variance = σ 2 = Σ x μ 2 n For grouped intervals σ 2 = Σf x μ 2 Σf sum of the squared deviations from the mean number of values

54 We squared the deviations to make them positive before adding. To take account of this we now find the positive square root of the variance and call this the standard deviation ( σ ) of the population. So σ = Σf(x μ)2 Σf

55 There are two standard deviations calculated. sx is the standard deviation if the data represents a sample whereas σx is the standard deviation if the data represents a population. In this case we are using all the members of a team so we would use the σx value. 1. Find the midpoint for each class interval and add them to the table Distance covered (nearest km) Number of runners x

56 2. Enter the data and go to the One-Variable Statistics screen and make the entries shown. (it is slightly different to previously since we are not drawing a graph). 3. down to OK and press 4. the mean is x = 43 and the standard deviation we are interested in is σx = write these in your book. Note: the calculator has given us the values of the mean ( x), the sum of the x-values (Σx), and the sum of the values 2 (Σx 2 ). Note also that x is used for the mean whereas, strikly speaking, it should be μ, as this was a population. However, there is no difference between the values x and μ.

57 5.5.8a Find the standard deviation, correct to one decimal place, of the following sample data sets. 3, 3, 4, 6, 2, 1, 3, 4, 6, 7, 5, 3, 2, 1, 7, 9 Enter the data in your CAS using Lists & Spreadsheets Press Menu > Statistics > Stat Calculations > One-Variable Statistics. When prompted choose what you labelled column A sx = 2.3 (1 dp)

58 Comparative analysis Used when dealing with the analysis of more than one set of data Absolute analysis Used when dealing with the analysis of just one set of data Outliers (Extreme values) A value more than 1.5 x IQR away from the nearer of the upper and lower quartiles

59 Skewness A term used to describe data sets that are not symmetrical. Boxplots: Negative skew: indicates that the median is closer to the upper quartile Symmetrical: indicates that the median lies in the middle Positive skew: The median is closer to the lower quartile.

60 Histograms Positive skew: Negative skew: mode < median < mean mean < median < mode Degree of skewness A numerical value to indicate the degree of skewness. The degree of skewness = 3(mean median) standard deviation This value indicates the direction of the skew as well as its size. The larger the value the greater the degree of skewness

61 Skewed data sets are more likely to contain outliers Use median and IQR as summary statistics Mean and standard deviation are affected by extreme values

62 5.6.1 Describe the following data set, including a comment about its degree of skewness. The data set represents the score obtained out of 66 on a test. 60, 51, 47, 42, 53, 34, 47, 39, 56, 63, 35, 34, 50, 35, 41, 19, 48, 42, 37, 45, 29 Enter data into CAS, write down the five-figure summary: x (mean) = σx (st dev) = minx = 19 Q 1 = 35 Median = 42 Q 3 = 50.5 MaxX = 63

63 Calculate to determine if there are any outliers in the data set. IQR = = X IQR = No outliers because there is no value that lies further than Q 1 or Q 3 : = or (maxx = 63) = 11.5 (MinX = 19) Calculate the degree of skewness 3(mean median) = 3( ) = 0.34 standard deviation Data set contains no outliers, but is slightly positively skewed. This is confirmed by the mean lying closer to the lower quartile and the calculated value for the degree of skewness.

64 Composite bar charts Some can handle more than two categories

65 Back-to-back stemplots Can ONLY handle two categories

66 Comparative boxplots Several related boxplots using the SAME scale Can be used for a number of categories.

67 Male Female In the Lists & Spreadsheets application, name the first column A gender and column B life Enter the male data set in column B. In column A enter male press > Data > Fill down and press. A dashed box will appear. Scroll down column A until the last piece of data and press. 2. Return to column B and under the males life expectancy fill in the female life expectancy. Give all these pieces of data the label female in column A.

68 To create the box plot, press (to insert a new page) and select Add Data & Statistics. Label the horizontal axis Life and the vertical axis Gender, then press > Plot type > Box plot.

69 The table below gives the life expectancy for males and females in the 40 most populated countries of the world. Draw comparative boxplots for males and females and discuss your findings. Enter the data into your CAS to produce the boxplots. And five figure summary Both data sets are negatively skewed. Females outscore males on all of the five values. 50% of females values are greater than the value for which less than 25% of the males are greater. There are no outliers. The range of values and IQR are greater for females.

70 The bar chart below shows the number of drivers killed in accidents that involved the vehicle going off path on straight of going off path on curve for the years 2002 and compare the two years. In both years many more males were killed than females. In both years the youngest drivers killed were females, these drivers must have been driving illegally as they were under 16. In 2002, three underage male drivers were killed. More males died in 2002 in comparison to 2003 except for the age class 18-21where more males died In 2003 compared to Female figures are so low that useful comparisons cannot really be made.

71 The data below represents the height (cm) of the players for two clubs. Draw a back-to-back stemplot and use it to help make comparisons between the teams.

72 Find the median and IQR of both teams. WB median = 38 and = between the th and 20 th value = 186 cm WB IQR = = 12 Geelong median = Geelong IQR = = 9 Geelong appears to have the taller team; median height cm compared with 186 cm. Spread of height is slightly greater for WB with a larger IQR. WB modal class is 18 L and Geelong s is 19 L reinforcing that Geelong has the taller team.

2 Exploring Univariate Data

2 Exploring Univariate Data 2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting

More information

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]

A LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2] 1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

DATA HANDLING Five-Number Summary

DATA HANDLING Five-Number Summary DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

Ti 83/84. Descriptive Statistics for a List of Numbers

Ti 83/84. Descriptive Statistics for a List of Numbers Ti 83/84 Descriptive Statistics for a List of Numbers Quiz scores in a (fictitious) class were 10.5, 13.5, 8, 12, 11.3, 9, 9.5, 5, 15, 2.5, 10.5, 7, 11.5, 10, and 10.5. It s hard to get much of a sense

More information

Measures of Dispersion (Range, standard deviation, standard error) Introduction

Measures of Dispersion (Range, standard deviation, standard error) Introduction Measures of Dispersion (Range, standard deviation, standard error) Introduction We have already learnt that frequency distribution table gives a rough idea of the distribution of the variables in a sample

More information

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

2CORE. Summarising numerical data: the median, range, IQR and box plots

2CORE. Summarising numerical data: the median, range, IQR and box plots C H A P T E R 2CORE Summarising numerical data: the median, range, IQR and box plots How can we describe a distribution with just one or two statistics? What is the median, how is it calculated and what

More information

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x

More information

2 2 In general, to find the median value of distribution, if there are n terms in the distribution the

2 2 In general, to find the median value of distribution, if there are n terms in the distribution the THE MEDIAN TEMPERATURES MEDIAN AND CUMULATIVE FREQUENCY The median is the third type of statistical average you will use in his course. You met the other two, the mean and the mode in pack MS4. THE MEDIAN

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows

More information

Statistics (This summary is for chapters 18, 29 and section H of chapter 19)

Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x n =

More information

The Normal Distribution

The Normal Distribution Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we

More information

Frequency Distribution and Summary Statistics

Frequency Distribution and Summary Statistics Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

STAT 113 Variability

STAT 113 Variability STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2

More information

3.1 Measures of Central Tendency

3.1 Measures of Central Tendency 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent

More information

Lecture Week 4 Inspecting Data: Distributions

Lecture Week 4 Inspecting Data: Distributions Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your

More information

Descriptive Statistics (Devore Chapter One)

Descriptive Statistics (Devore Chapter One) Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

22.2 Shape, Center, and Spread

22.2 Shape, Center, and Spread Name Class Date 22.2 Shape, Center, and Spread Essential Question: Which measures of center and spread are appropriate for a normal distribution, and which are appropriate for a skewed distribution? Eplore

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line. Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,

More information

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment MBEJ 1023 Planning Analytical Methods Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment Contents What is statistics? Population and Sample Descriptive Statistics Inferential

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25 Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example

More information

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar Measures of Central Tendency 11.220 Lecture 5 22 February 2006 R. Ryznar Today s Content Wrap-up from yesterday Frequency Distributions The Mean, Median and Mode Levels of Measurement and Measures of Central

More information

IOP 201-Q (Industrial Psychological Research) Tutorial 5

IOP 201-Q (Industrial Psychological Research) Tutorial 5 IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,

More information

1. In a statistics class with 136 students, the professor records how much money each

1. In a statistics class with 136 students, the professor records how much money each so shows the data collected. student has in his or her possession during the first class of the semester. The histogram 1. In a statistics class with 136 students, the professor records how much money

More information

Some estimates of the height of the podium

Some estimates of the height of the podium Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40

More information

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION

MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION 1 Day 3 Summer 2017.07.31 DISTRIBUTION Symmetry Modality 单峰, 双峰 Skewness 正偏或负偏 Kurtosis 2 3 CHAPTER 4 Measures of Central Tendency 集中趋势

More information

Edexcel past paper questions

Edexcel past paper questions Edexcel past paper questions Statistics 1 Chapters 2-4 (Discrete) Statistics 1 Chapters 2-4 (Discrete) Page 1 Stem and leaf diagram Stem-and-leaf diagrams are used to represent data in its original form.

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Stemplots (or Stem-and-leaf plots) Stemplot and Boxplot T -- leading digits are called stems T -- final digits are called leaves STAT 74 Descriptive Statistics 2 Example: (number

More information

DATA ANALYSIS EXAM QUESTIONS

DATA ANALYSIS EXAM QUESTIONS DATA ANALYSIS EXAM QUESTIONS Question 1 (**) The number of phone text messages send by 11 different students is given below. 14, 25, 31, 36, 37, 41, 51, 52, 55, 79, 112. a) Find the lower quartile, the

More information

6683/01 Edexcel GCE Statistics S1 Gold Level G2

6683/01 Edexcel GCE Statistics S1 Gold Level G2 Paper Reference(s) 6683/01 Edexcel GCE Statistics S1 Gold Level G Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates

More information

CSC Advanced Scientific Programming, Spring Descriptive Statistics

CSC Advanced Scientific Programming, Spring Descriptive Statistics CSC 223 - Advanced Scientific Programming, Spring 2018 Descriptive Statistics Overview Statistics is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions.

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

Description of Data I

Description of Data I Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret

More information

Monte Carlo Simulation (Random Number Generation)

Monte Carlo Simulation (Random Number Generation) Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...

More information

Unit 2 Measures of Variation

Unit 2 Measures of Variation 1. (a) Weight in grams (w) 6 < w 8 4 8 < w 32 < w 1 6 1 < w 1 92 1 < w 16 8 6 Median 111, Inter-quartile range 3 Distance in km (d) < d 1 1 < d 2 17 2 < d 3 22 3 < d 4 28 4 < d 33 < d 6 36 Median 2.2,

More information

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values

More information

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda, MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, CONTENTS What is measures of dispersion? Why measures of dispersion? How measures of dispersions are calculated? Range Quartile

More information

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data. -3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data

More information

Edexcel past paper questions

Edexcel past paper questions Edexcel past paper questions Statistics 1 Chapters 2-4 (Continuous) S1 Chapters 2-4 Page 1 S1 Chapters 2-4 Page 2 S1 Chapters 2-4 Page 3 S1 Chapters 2-4 Page 4 Histograms When you are asked to draw a histogram

More information

Section3-2: Measures of Center

Section3-2: Measures of Center Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information

SUMMARY STATISTICS EXAMPLES AND ACTIVITIES

SUMMARY STATISTICS EXAMPLES AND ACTIVITIES Session 6 SUMMARY STATISTICS EXAMPLES AD ACTIVITIES Example 1.1 Expand the following: 1. X 2. 2 6 5 X 3. X 2 4 3 4 4. X 4 2 Solution 1. 2 3 2 X X X... X 2. 6 4 X X X X 4 5 6 5 3. X 2 X 3 2 X 4 2 X 5 2

More information

Exploring Data and Graphics

Exploring Data and Graphics Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data

More information

Manual for the TI-83, TI-84, and TI-89 Calculators

Manual for the TI-83, TI-84, and TI-89 Calculators Manual for the TI-83, TI-84, and TI-89 Calculators to accompany Mendenhall/Beaver/Beaver s Introduction to Probability and Statistics, 13 th edition James B. Davis Contents Chapter 1 Introduction...4 Chapter

More information

AP Statistics Unit 1 (Chapters 1-6) Extra Practice: Part 1

AP Statistics Unit 1 (Chapters 1-6) Extra Practice: Part 1 AP Statistics Unit 1 (Chapters 1-6) Extra Practice: Part 1 1. As part of survey of college students a researcher is interested in the variable class standing. She records a 1 if the student is a freshman,

More information

Descriptive Statistics

Descriptive Statistics Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

Example: Histogram for US household incomes from 2015 Table:

Example: Histogram for US household incomes from 2015 Table: 1 Example: Histogram for US household incomes from 2015 Table: Income level Relative frequency $0 - $14,999 11.6% $15,000 - $24,999 10.5% $25,000 - $34,999 10% $35,000 - $49,999 12.7% $50,000 - $74,999

More information

Applications of Data Dispersions

Applications of Data Dispersions 1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given

More information

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017

More information

Key: 18 5 = 1.85 cm. 5 a Stem Leaf. Key: 2 0 = 20 points. b Stem Leaf. Key: 2 0 = 20 cm. 6 a Stem Leaf. Key: 4 3 = 43 cm.

Key: 18 5 = 1.85 cm. 5 a Stem Leaf. Key: 2 0 = 20 points. b Stem Leaf. Key: 2 0 = 20 cm. 6 a Stem Leaf. Key: 4 3 = 43 cm. Answers EXERCISE. D D C B Numerical: a, b, c Categorical: c, d, e, f, g Discrete: c Continuous: a, b C C Categorical B A Categorical and ordinal Discrete Ordinal D EXERCISE. Stem Key: = Stem Key: = $ The

More information

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA. Name: ID# Section KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 11: BUSINESS STATISTICS I Semester 04 Major Exam #1 Sunday March 7, 005 Please circle your instructor

More information

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet.

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. 1 Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact, be aware that the

More information

Chapter 3. Lecture 3 Sections

Chapter 3. Lecture 3 Sections Chapter 3 Lecture 3 Sections 3.4 3.5 Measure of Position We would like to compare values from different data sets. We will introduce a z score or standard score. This measures how many standard deviation

More information

A.REPRESENTATION OF DATA

A.REPRESENTATION OF DATA A.REPRESENTATION OF DATA (a) GRAPHS : PART I Q: Why do we need a graph paper? Ans: You need graph paper to draw: (i) Histogram (ii) Cumulative Frequency Curve (iii) Frequency Polygon (iv) Box-and-Whisker

More information

Section 6-1 : Numerical Summaries

Section 6-1 : Numerical Summaries MAT 2377 (Winter 2012) Section 6-1 : Numerical Summaries With a random experiment comes data. In these notes, we learn techniques to describe the data. Data : We will denote the n observations of the random

More information

How Wealthy Are Europeans?

How Wealthy Are Europeans? How Wealthy Are Europeans? Grades: 7, 8, 11, 12 (course specific) Description: Organization of data of to examine measures of spread and measures of central tendency in examination of Gross Domestic Product

More information

Math Take Home Quiz on Chapter 2

Math Take Home Quiz on Chapter 2 Math 116 - Take Home Quiz on Chapter 2 Show the calculations that lead to the answer. Due date: Tuesday June 6th Name Time your class meets Provide an appropriate response. 1) A newspaper surveyed its

More information

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.

More information

Statistics S1 Advanced/Advanced Subsidiary

Statistics S1 Advanced/Advanced Subsidiary Paper Reference(s) 6683/01 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Tuesday 10 June 2014 Morning Time: 1 hour 30 minutes Materials required for examination Mathematical Formulae (Pink) Items

More information

4. DESCRIPTIVE STATISTICS

4. DESCRIPTIVE STATISTICS 4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in

More information

Summary of Statistical Analysis Tools EDAD 5630

Summary of Statistical Analysis Tools EDAD 5630 Summary of Statistical Analysis Tools EDAD 5630 Test Name Program Used Purpose Steps Main Uses/Applications in Schools Principal Component Analysis SPSS Measure Underlying Constructs Reliability SPSS Measure

More information

Numerical Descriptions of Data

Numerical Descriptions of Data Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =

More information

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center

More information

STOR 155 Practice Midterm 1 Fall 2009

STOR 155 Practice Midterm 1 Fall 2009 STOR 155 Practice Midterm 1 Fall 2009 INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE ON THE BUBBLE SHEET. YOU MUST BUBBLE-IN YOUR

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =

CHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) = Solutions to End-of-Section and Chapter Review Problems 225 CHAPTER 6 6.1 (a) P(Z < 1.20) = 0.88493 P(Z > 1.25) = 1 0.89435 = 0.10565 P(1.25 < Z < 1.70) = 0.95543 0.89435 = 0.06108 (d) P(Z < 1.25) or Z

More information

Lesson 12: Describing Distributions: Shape, Center, and Spread

Lesson 12: Describing Distributions: Shape, Center, and Spread : Shape, Center, and Spread Opening Exercise Distributions - Data are often summarized by graphs. We often refer to the group of data presented in the graph as a distribution. Below are examples of the

More information

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers. Chapter 3 Section3-: Measures of Center Section 3-3: Measurers of Variation Section 3-4: Measures of Relative Standing Section 3-5: Exploratory Data Analysis Describing Distributions with Numbers The overall

More information

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12)

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Descriptive statistics: - Measures of centrality (Mean, median, mode, trimmed mean) - Measures of spread (MAD, Standard deviation, variance) -

More information

SAMPLE. HSC formula sheet. Sphere V = 4 πr. Volume. A area of base

SAMPLE. HSC formula sheet. Sphere V = 4 πr. Volume. A area of base Area of an annulus A = π(r 2 r 2 ) R radius of the outer circle r radius of the inner circle HSC formula sheet Area of an ellipse A = πab a length of the semi-major axis b length of the semi-minor axis

More information

Solutions for practice questions: Chapter 9, Statistics

Solutions for practice questions: Chapter 9, Statistics Solutions for practice questions: Chapter 9, Statistics If you find any errors, please let me know at mailto:msfrisbie@pfrisbie.com. 1. We know that µ is the mean of 30 values of y, 30 30 i= 1 2 ( y i

More information

Diploma in Financial Management with Public Finance

Diploma in Financial Management with Public Finance Diploma in Financial Management with Public Finance Cohort: DFM/09/FT Jan Intake Examinations for 2009 Semester II MODULE: STATISTICS FOR FINANCE MODULE CODE: QUAN 1103 Duration: 2 Hours Reading time:

More information

Simple Descriptive Statistics

Simple Descriptive Statistics Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency

More information

Empirical Rule (P148)

Empirical Rule (P148) Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall

More information

the display, exploration and transformation of the data are demonstrated and biases typically encountered are highlighted.

the display, exploration and transformation of the data are demonstrated and biases typically encountered are highlighted. 1 Insurance data Generalized linear modeling is a methodology for modeling relationships between variables. It generalizes the classical normal linear model, by relaxing some of its restrictive assumptions,

More information

LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL

LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL There is a wide range of probability distributions (both discrete and continuous) available in Excel. They can be accessed through the Insert Function

More information

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table:

Example - Let X be the number of boys in a 4 child family. Find the probability distribution table: Chapter8 Probability Distributions and Statistics Section 8.1 Distributions of Random Variables tthe value of the result of the probability experiment is a RANDOM VARIABLE. Example - Let X be the number

More information

Fundamentals of Statistics

Fundamentals of Statistics CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct

More information

Descriptive Statistics Bios 662

Descriptive Statistics Bios 662 Descriptive Statistics Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-08-19 08:51 BIOS 662 1 Descriptive Statistics Descriptive Statistics Types of variables

More information

2 DESCRIPTIVE STATISTICS

2 DESCRIPTIVE STATISTICS Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled

More information

DESCRIPTIVE STATISTICS

DESCRIPTIVE STATISTICS DESCRIPTIVE STATISTICS INTRODUCTION Numbers and quantification offer us a very special language which enables us to express ourselves in exact terms. This language is called Mathematics. We will now learn

More information

MAS187/AEF258. University of Newcastle upon Tyne

MAS187/AEF258. University of Newcastle upon Tyne MAS187/AEF258 University of Newcastle upon Tyne 2005-6 Contents 1 Collecting and Presenting Data 5 1.1 Introduction...................................... 5 1.1.1 Examples...................................

More information

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical

More information

Continuous Probability Distributions

Continuous Probability Distributions 8.1 Continuous Probability Distributions Distributions like the binomial probability distribution and the hypergeometric distribution deal with discrete data. The possible values of the random variable

More information