3.1 Measures of Central Tendency
|
|
- Bonnie Cornelia Franklin
- 5 years ago
- Views:
Transcription
1
2 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent the number of bathroom in a given residence. Five residential properties are examined, and the value of x i is recorded for each. The observations are x 1 = 2, x 2 = 1, x 3 = 3, x 4 = 2, and x 5 = 3. (a) Find 5 x i and 5 x 2 i. (b) Find 5 (x i 3) and 5 (x i 3) 2. Solution: (a) The symbol 5 x i tells us to sum the x i values in the data set. Therefore, 5 x i = x 1 + x 2 + x 3 + x 4 + x 5 = = 11 (b) The symbol 5 x 2 i = x x x x x 2 5 = = (x i 3) tells us to subtract 3 from each x i value and then sum Therefore, 5 (x i 3) = (x 1 3) + (x 2 3) + (x 3 3) + (x 4 3) + (x 5 3) = (2 3) + (1 3) + (3 3) + (2 3) + (3 3) = ( 1) + ( 2) ( 1) + 0 = 4 5 (x i 3) 2 = (x 1 3) 2 + (x 2 3) 2 + (x 3 3) 2 + (x 4 3) 2 + (x 5 3) 2 = (2 3) 2 + (1 3) 2 + (3 3) 2 + (2 3) 2 + (3 3) 2 = ( 1) 2 + ( 2) 2 + (0) 2 + ( 1) 2 + (0) 2 = 6. 2
3 Example 2 Find xy if the variables x i,y i are given by i x i y i Solution: xy = xi y i = x 1 y 1 + x 2 y 2 + x 3 y 3 + x 4 y 4 + x 5 y 5 = = Mean Definition 1 The mean of a set of measurements is defined to be the sum of the measurements divided by the total number of measurements. Mean for ungrouped data The mean for ungrouped data is obtained by dividing the sum of all values by the number of values in the data set. Thus, (a) Mean for population data µ = x 1 + x 2 + x x N N = N x i N (b) Mean for sample data x = x 1 + x 2 + x x n n = n x i n where x i is the sum of all values, N is the population size, n is the sample size, µ is the population mean, x is the sample mean. Example 3 The monthly starting salaries for a sample of 12 Business School Graduates are as follows: Monthly Monthly Monthly Monthly Graduate Salary Graduate Salary Graduate Salary Graduate Salary
4 Solution: The sample mean is x = x 1 + x 2 + x x n n = = 25, = Mean for grouped data (a) Mean for population data µ = fi m i N (b) Mean for sample data x = fi m i n where f i denote the frequency of class i and m i denote the midpoint of class i. Example 4 Recall the frequency distribution in Example 2 (Chapter 2). Find the mean. Audit Time Frequency Class Midpoint (Days) f i m i f i m i fi = 20 fi m i = 380 Hence, the sample mean is Weighted Mean x = fi m i fi = = 19. Let X 1,X 2,...,X N be a set of N values, and let w 1,w 2,...,w N be the weight assigned to them. The weighted mean is found by dividing the sum of the products of the values and their weights by the sum of the weights. µ Weighted = w 1X 1 + w 2 X w N X N w 1 + w w N = wi X i wi 4
5 Example 5 It is decided that six observations, 12, 20, 17, 5, 9 and 22, should be given the weights 10, 4, 6, 18, 16 and 3 respectively. What is the weighted mean? Solution: The weighted mean is calculated as wi X i (10 12) + (4 20) + (6 17) + (18 5) + (16 9) + (3 22) = wi = = 10.6 Example 6 A mathematics class is divided into two sections, both of which are given the same test. Section 1 (41 students) has a mean score of 62; and section 2 (52 students) has a mean score of 68. Find the mean of the whole class correct to two decimal places. Solution: Since w 1 = 41,X 1 = 62,w 2 = 52,X 2 = 68, we find x = w 1X 1 + w 2 X 2 w 1 + w 2 = (41) (62) + (52) (68) = = Example 7 The examination results of an AD student are listed as follows: Subject Credit Grade Grade Point I 3 B+ 3.5 II 3 B 3 III 1 A+ 4.5 IV 4 D+ 1.5 V 5 D 1 Hence, the GPA of the student is calculated as follows GPA = = Geometric Mean The Geometric mean of a set of values is defined as the nth-root of the product of the n values. The geometric mean of X 1,X 2,...,X n is given by X G = n X 1 X 2 X n = (X 1 X 2 X n ) 1/n 5
6 Example 8 Find the geometric mean for the following set of data 8, 18, 24, 36, 64. Solution: By the formula, the geometric mean is = = 24 Application Geometric Mean Rate of Return R G = [(1 + R 1 ) (1 + R 2 ) (1 + R n )] 1/n 1 where R i is the rate of return in time period i. Example 9 The price of a stock has risen by 6%, 13%, 11% and 15% in each of 4 successive years, find the average percentage risen in the price of the stock. Solution: The geometric mean is given by [( ) ( ) ( ) ( )] 1/4 1 = (1.5290) 1/4 1 = The average rise = = 11.2%. This value of 11.2% can be translated as the constant increase necessary each year to produce the final year price, given the starting price Median Definition 2 The median of a set measurements is defined to be the middle value when the measurements are arranged from lowest to highest. Median for ungrouped data If there is an odd number of items, the median is the value of the middle item when all items are arranged in ascending order. If there is an even number of items, the median is the average value of the two middle items when all items are arranged in ascending order. 6
7 Example 10 Find the median of the given data: Solution: Arrange the data in ascending order }{{} 46 Median Hence, the median is the 3rd term 46. Example 11 Compute the median for Example 3. Solution: Arranging the 12 items in ascending order } 2090{{ 2120} Median= = Median for grouped data For grouped data, the median can be found by first identify the class containing the median, then apply the following formula: ( n/2 C median = l 1 + f m ) (l 2 l 1 ) where: l 1 is the lower class boundary of the median class; n C f m l 2 is the total frequency; is the cumulative frequency just before the median class; is the frequency of the median; is the upper class boundary containing the median. It is obvious that the median is affected by the total number of data but is independent of extreme values. However if the data is ungrouped and numerous, finding the median is tedious. Note that median may be applied in qualitative data if they can be ranked Mode Definition 3 The mode of a set of measurements is defined to be the measurements that occurs most often(with highest frequency). The mode may not exist, and even if it does exist it may not be unique. A distribution having only one mode is called unimodal. 7
8 Example 12 Refer to Example 5 (Chapter 2). Find the mode. Automobile Purchase frequency Chevrolet Cavalier 9 Ford Escort 14 Toyota Echo 8 Honda Accord 11 Hyundai Excel 8 Total 50 The mode is the Ford Escort. For grouped data, the mode can be found by first identify the largest frequency of that class, called modal class, then apply the following formula on the modal class: ( ) fa mode = l 1 + (l 2 l 1 ) f a + f b where: l 1 f a is the lower class boundary of the modal class; is the difference of the frequencies of the modal class with the previous class and is always positive; f b is the difference of the frequencies of the modal class with the following class and is always positive; l 2 is the upper class boundary of the modal class Characteristics of each measure of Central Tendency Mode 1. It is the most frequent or probable measurement in the data set. 2. There can be more than one mode for a data set. 3. It is not influenced by the extreme measurements. 4. Modes of subsets cannot be combined to determine the mode of the complete data set. 5. For group data, its value can change depending on the class used. 6. It is applicable for both qualitative and quantitative data. 8
9 Median 1. It is the central value 50% of the measurements lie above it and 50% fall below it. 2. There is only one median for a data set. 3. Medians of subsets cannot be combined to determine the median of the complete data set. 4. For grouped data, its value is rather stable even when the data are organized into different class. 5. It is applicable to quantitative data only. Mean 1. It is the average of the measurements in a data set. 2. There is only one mean for a data set. 3. Its value is influenced by extreme measurements, trimming can help to reduce the degree of influence. 4. Means of subsets can be combined to determine the mean of the complete data set. 5. It is applicable to quantitative data only Percentiles Definition 4 The pth percentile is a value such that at least p percent of the items take on this value or less and at least (100 p) percent of the items take on this value or more. Calculating the pth Percentile Step 1. Arrange the data in ascending order (rank order from smallest value to largest value). Step 2. Compute an index i as follows: i = ( p ) n 100 where p is the percentile of interest and n is the number of items. 9
10 Step 3. (a) If i is not an integer, round up. The next integer value greater than i denotes the position of the pth percentile. (b) If i is an integer, the pth percentile is the average of the data values in positions i and i + 1. Example 13 Determine the 85th percentile for Example 3. Solution: Step 1. Arrange the 12 data values in ascending order. Step 2. i = ( p ) ( ) 85 n = 12 = Step 3. Since i is not an integer, round up. The position of the 85th percentile is the next integer greater than 10.2, the 11th position Quartiles Definition 5 The 25th, 50th, and 75th percentiles of the data referred to as the first quartile, the second quartile, and third quartile, respectively. The quartiles can be used to divide the data into four parts, with each part containing approximately 25% of the data. Q 1 Q 2 Q 3 = first quartile = second quartile (Median) = third quartile Example 14 Find the quartiles for the given set of data. 6.1, 2.8, 1.2, 0.7, 4.3, 5.5, 5.9, 6.5, 7.6, 8.3, 9.6, 9.8, 12.9, 13.1, 18.5 Solution: For Q 1, i = n = 1 Round up 15 = 3.75 = 4 4 Hence, Q 1 = Observation at 4th position = 0.7. For Q 3, Hence, Q 3 = Observation at 12th position = 9.8. i = n = 3 Round up 15 = = 12 4 Example 15 Find the Q 1 and Q 3 for the Example 3. The ordered data are given as follows:
11 Solution: For Q 1, i = n = 1 12 = 3 integer 4 Hence, Q 1 = average of observation at 3rd position and 4th position = = For Q 3, i = n = 3 12 = 9 integer 4 Hence, Q 3 = average of observation at 9th position and 10th position = = Measures of Variation Range Definition 6 The range is the difference between the largest and smallest data values Interquartile Range Definition 7 The interquartile Range (IQR) is the difference between the third and first quartiles in a set of data. IQR = Q 3 Q 1 Example 16 Refer to Example 15. The range = = 615 and the IQR = = Variance and Standard Deviation Definition 8 The variance is the average of the squared differences between each of the observations in a set of data and the mean. The standard deviation is the positive square root of the variance. For ungrouped data 11
12 Variance N (a) Population σ 2 = (x i µ) 2 N Standard Deviation N (x i µ) 2 σ = σ 2 = N (b) Sample s 2 = n (x i x) 2 n 1 s = s 2 = n (x i x) 2 n 1 For grouped data (a) Population σ 2 = Variance Standard Deviation fi (m i µ) 2 σ = fi (m σ 2 i µ) = 2 fi fi (b) Sample s 2 = fi (m i x) 2 fi 1 s = s 2 = fi (m i x) 2 fi 1 Alternative Formula: (a) Ungrouped data: σ 2 = N x2 i N µ 2 N and s 2 = n x2 i n x 2 n 1 (b) Grouped data σ 2 = N f im 2 i N µ 2 N and s 2 = n f im 2 i n x 2 n 1 Example 17 Find the variance and standard deviation for the data given in Example 3. Solution: 12
13 Sample Deviation Squared Deviation Monthly Mean About the Mean About the Mean Salary (x i ) ( x) (x i x) (x i x) (xi x) = 0 (xi x) 2 = Thus, the variance is s 2 = 12 (x i x) 2 n 1 = and the standard deviation is s = s 2 = = = Example 18 Redo the last example by using the alternative formula. Solution: x i = = x 2 i = (2050) 2 + (2150) (2080) 2 = x = s 2 = 12 x i n = x2 i n ( x) 2 n 1 = 2140 = s = = (2140) =
14 Monthly Salary (x i ) x 2 i = = = = xi = x 2 i = Example 19 Find the variance and standard deviation for the data given in Example 4. Solution: Recall that the mean is x = 19. See Example 4. Audit Class Squared Time Frequency Midpoint Deviation Deviation (Days) f i m i (m i x) (m i x) 2 f i (m i x) f i (m i x) 2 The variance is and the standard deviation is s 2 = fi (m i x) 2 fi 1 = = 30 s = s 2 = 30 = Example 20 Redo the last example by using the alternative formula. Solution: Recall that the mean is x = 19. See Example 4. 14
15 Audit Class Time Frequency Midpoint (Days) f i m i f i m 2 i f i m 2 i The variance is s 2 = and the standard deviation is fi m 2 i n( x) 2 fi 1 = 7790 (20) (192 ) 20 1 s = s 2 = 30 = = = Coefficient of Variation The Coefficient of Variation is defined as follows CV = Standard Deviation Mean 100% When to use CV. 1. The data are in different units. 2. The data are in the same units, but the means are apart. Example 21 A study of the test scores for an in-plant course in management principles and the years of service of the employees enrolled in the course resulted in these statistics: The mean score was 200; the standard deviation was 40. The mean number of years of service was 20 years; the standard deviation was 2 years. Compare the relative dispersion in the two distributions using the coefficient of variation. 15
16 Solution: The distributions are in different units (test scores and years of service). Therefore, they are converted to coefficients of variation. For the test scores: For years of service: CV = (100) CV s x = (100) = (100) s x 20 (100) = 20 percent = 10 percent Interpreting, there is more dispersion relative to the mean in the distribution of test scores compared with the distribution of years of service (because 20 > 10 percent). The same procedure is used when the data are in the same units but the means are far apart. (See the following example.) Example 22 The variation in the annual incomes of executives is to be compared with the variation in incomes of unskilled employees. For a sample of executives, x = $500, 000 and s = $50, 000. For a sample of unskilled employees, x = $22, 000, and s = $2, 200. We are tempted to say that there is more dispersion in the annual incomes of the executives because $50, 000 > $2, 200. The means are so far apart, however, that we need to convert the statistics to coefficients of variation to make a meaningful comparison of the variation in annual incomes. Solution: For the executives: For the unskilled employees: CV = (100) CV = (100) s x s x $50, 000 $2, 200 = (100) = $500, 000 $22, 000 (100) = 10 percent = 10 percent There is no difference in the relative dispersion of the two groups. 3.4 Shape A important property of a set of data is its shape the manner in which the data are distribution. Either the distribution of the data is symmetrical or it not. If the distribution of data is not symmetrical, it is called asymmetrical or skewed. Mean > Median: Mean = Median: Mean < Median: positive or right-skewness symmetry or zero-skewness negative or left-skewness 16
17 The skewness is an abstract quantity which shows how data piled-up. A number of measures have been suggested to determine the skewness of a given distribution. One of the simplest one is known as Pearson s measure of skewness: Skewness = Mean - Mode Standard Deviation 3(Mean - Median) Standard Deviation Positive skewness arises when the mean is increased by some unusually high values; negative skewness occurs when the mean is reduced by some extremely low values. Data are symmetrical when there is no really extreme values in a particular direction so that low and high values balance each other out. For data sets that are extremely skewed, be wary of using the mean as a measure of the center of distribution. In this situation, a more meaningful measure of central tendency may be the median, which is more resistant to the influence of extreme measurements. 3.5 Box-and-Whisker Plot (Box Plot) A plot that shows the center, spread, and skewness of a data set. It is constructed by drawing a box and two whiskers that use the median, the first quartile, the third quartile, and the smallest and the largest values in the data set. 17
18 Example 23 The following data give the incomes (in thousands of dollars) for a sample of 13 households Construct a box-and-whisker plot for these data. Solution: The following five steps are performed to construct a box-and-whisker plot. Step 1 First, rank the data in increasing order and calculate the values of the median, the first quartile, the third quartile, and the interquartile range. The ranked data are Step 2 Determine the median, the quartiles, the smallest and the largest values in the given data set. These five values for our example are as follows. Median i = 13 2 First quartile Q 1 i = 13 4 = 6.5 = 7 th ordered data = 32 = 3.25 = 4th ordered data = 23 Third quartile Q 3 i = 3(13) 4 = 9.75 = 10th ordered data = 46 Smallest value = 17 Largest value = 92 Step 3 Draw a horizontal line and mark the income levels on it such that all the values in the given data set are covered. Above the horizontal line, draw a box with its left side at the position of the first quartile and the right side at the position of the third quartile. Inside the box, draw a vertical line at the position of the median. Step 4 By drawing two lines, join the points of the smallest and the largest values to the box. These values are 17 and 60 in this example as listed in Step 2. The two lines that join the box to these two values are called whiskers. 3.6 Uses of Standard Deviation The Empirical Rule For data having a bell-shaped distribution, 18
19 Approximately 68% of the items will be within one standard deviation of the mean. Approximately 95% of the items will be within two standard deviation of the mean. Almost all (99.7%) of the items will be within three standard deviation of the mean. Example 24 The age distribution of a sample of 5000 persons is bell-shaped with a mean of 40 years and a standard of 12 years. Determine the approximate percentage of people who are 16 to 64 years old. Solution: We will use the empirical rule to find the required percentage because the distribution of ages follows a bell-shaped curve from the given information, for the this distribution, x = 40 years and s = 12 years 19
20 Each of the two points, 16 and 64, is 24 units away from the mean. Dividing 24 by 12, we convert the distance between each of the two points and the mean in terms of standard deviation. Thus, the distance between 16 and 40 and between 40 and 64 is each equal to 2s. Because the are within two standard deviations of the mean is approximately 95% for a bell-shaped, approximately 95% of the people in the sample are 16 to 64 years old Chebyshev s Theorem At least (1 1/k 2 ) of the items in any data set must be within k standard deviation of the mean, where k is any value greater than 1. Example 25 For a statistics class, the mean for the midterm scores is 75 and the standard deviation is 8. Using Chebyshev s theorem, find the percentage of students who scored between 59 and 91. Solution: Let µ and σ be the mean and the standard deviation, respectively, of the midterm scores. The from the given information, µ = 75 and σ = 8 To find the percentage of students who scored between 59 and 91, the first step is to determine k. Each of the two points, 59 and 91, is 16 units away from the mean. 20
21 The value of k is obtained by dividing the distance between the mean and each point by the standard deviation. Thus, k = 16/8 = k 2 = 1 1 (2) 2 = 3 4 = 0.75 Hence, according to Chebyshev s theorem, at least 75% of the students scored between 59 and z-score A z-score measures how many standard deviations an observation is above or below the mean. z i = x i x s where z i = the z-score for item i, x = the sample mean s = the sample standard deviation. Example 26 Different typing skills are required for secretaries depending on whether one is working in a law office, an accounting firm, or for a research mathematical group at a major university. In order to evaluate candidates for these positions, an employment agency administers three distinct standardized typing samples. A time penalty has been incorporated into the scoring of each sample based on the number of typing errors. The mean and standard deviation for each test, together with the score achieved by a recent applicant, are given as follows. Sample Applicants s score Mean Standard deviation Law 141 sec 180 sec 30 sec Accounting 7 min 10 min 2 min Scientific 33 min 26 min 5 min For what type of position does this applicant seem to be best suited? Solution: First we compute z-score for each sample Law : z = = Accounting : z = 7 10 = Scientific : z = =
22 Since speed is of primary importance, we looking for the z-score that represents the greatest number of standards to the left of the mean and in our case that would be 1.5. Therefore, this applicant ranks higher among typists in accounting firms than when compared to typists in the other two areas, and consequently should be placed with an accounting firm. 22
23 3.7 Use Scientific Calculator to find the mean and standard deviation Example 27 Use the calculator to find the mean and standard deviation of the data set: 1, 2, 5, 6, 8, 9, 10, 12, 14, 18. Steps Function Keys Descriptions 1 MODE 3 Change to the statistical mode 2 SHIFT AC Clear all old data 3 1 M+ / RUN Input the first data 4 2 M+ / RUN Input the second data 5. Continue to input the data M+ / RUN Input the final data 7 SHIFT 1 µ or x, population mean or sample mean 8 SHIFT 2 σ population standard deviation 9 SHIFT 3 s sample standard deviation 10 Kout 1 x 2, sum of squares of all data 11 Kout 2 x, sum of all data 12 Kout 3 n population size or sample size 23
24 Kout 1 = x 2 = 975 Kout 2 = x = 85 Kout 3 = n = 10 SHIFT 1 = x = 8.5 SHIFT 2 = xσ n = SHIFT 3 = xσ n 1 = Example 28 Suppose that we want to find the mean and standard deviation of the following data: Audit Time Frequency Class Midpoint (Days) f i m i MODE 3 SHIFT AC M M M M M+ / RUN / RUN / RUN / RUN / RUN Kout 1 = x 2 = 7790 Kout 2 = x = 380 Kout 3 = n = 20 SHIFT 1 = x = 19 24
25 SHIFT 2 = xσ n = SHIFT 3 = xσ n 1 = Remarks: If your calculator has no MODE 3 function, then you may use the MODE 2 function. But all the x values will be moved to the y values. Eg. x 2 = y 2 = Kout 4. 25
Numerical Descriptions of Data
Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =
More informationChapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1
Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and
More information1 Describing Distributions with numbers
1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write
More informationChapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.
-3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data
More informationChapter 3 Descriptive Statistics: Numerical Measures Part A
Slides Prepared by JOHN S. LOUCKS St. Edward s University Slide 1 Chapter 3 Descriptive Statistics: Numerical Measures Part A Measures of Location Measures of Variability Slide Measures of Location Mean
More informationSimple Descriptive Statistics
Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency
More informationCHAPTER 2 Describing Data: Numerical
CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of
More informationECON 214 Elements of Statistics for Economists
ECON 214 Elements of Statistics for Economists Session 3 Presentation of Data: Numerical Summary Measures Part 2 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh
More informationOverview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution
PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations
More informationDescriptive Statistics
Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs
More informationDescriptive Statistics
Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations
More informationStatistics 114 September 29, 2012
Statistics 114 September 29, 2012 Third Long Examination TGCapistrano I. TRUE OR FALSE. Write True if the statement is always true; otherwise, write False. 1. The fifth decile is equal to the 50 th percentile.
More informationRefer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.
Refer to Ex 3-18 on page 123-124 Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B. Test on Chapter 3 Friday Sept 27 th. You are expected to provide
More informationMeasures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean
Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values
More informationMath 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment
Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class
More informationApplications of Data Dispersions
1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given
More information2 Exploring Univariate Data
2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting
More informationDATA SUMMARIZATION AND VISUALIZATION
APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296
More informationWeek 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.
Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.
More informationMEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,
MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, CONTENTS What is measures of dispersion? Why measures of dispersion? How measures of dispersions are calculated? Range Quartile
More informationStandardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis
Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem
More informationMeasures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms
Measures of Central Tendency: Ungrouped Data Measures of central tendency yield information about particular places or locations in a group of numbers. Common Measures of Location Mode Median Percentiles
More informationMeasures of Dispersion (Range, standard deviation, standard error) Introduction
Measures of Dispersion (Range, standard deviation, standard error) Introduction We have already learnt that frequency distribution table gives a rough idea of the distribution of the variables in a sample
More informationNumerical Measurements
El-Shorouk Academy Acad. Year : 2013 / 2014 Higher Institute for Computer & Information Technology Term : Second Year : Second Department of Computer Science Statistics & Probabilities Section # 3 umerical
More informationMeasures of Central tendency
Elementary Statistics Measures of Central tendency By Prof. Mirza Manzoor Ahmad In statistics, a central tendency (or, more commonly, a measure of central tendency) is a central or typical value for a
More informationSection3-2: Measures of Center
Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number
More informationBasic Procedure for Histograms
Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that
More informationHandout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25
Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example
More informationChapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1
Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.
More informationappstats5.notebook September 07, 2016 Chapter 5
Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.
More informationSome Characteristics of Data
Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key
More informationMeasures of Variation. Section 2-5. Dotplots of Waiting Times. Waiting Times of Bank Customers at Different Banks in minutes. Bank of Providence
Measures of Variation Section -5 1 Waiting Times of Bank Customers at Different Banks in minutes Jefferson Valley Bank 6.5 6.6 6.7 6.8 7.1 7.3 7.4 Bank of Providence 4. 5.4 5.8 6. 6.7 8.5 9.3 10.0 Mean
More informationStatistics vs. statistics
Statistics vs. statistics Question: What is Statistics (with a capital S)? Definition: Statistics is the science of collecting, organizing, summarizing and interpreting data. Note: There are 2 main ways
More information2 DESCRIPTIVE STATISTICS
Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled
More informationPopulation Mean GOALS. Characteristics of the Mean. EXAMPLE Population Mean. Parameter Versus Statistics. Describing Data: Numerical Measures
GOALS Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright 010 by The McGraw-Hill Companies, Inc. All rights reserved. 3-1. Calculate the arithmetic mean, weighted mean, median, mode,
More informationDot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.
Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,
More informationDATA HANDLING Five-Number Summary
DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest
More informationDavid Tenenbaum GEOG 090 UNC-CH Spring 2005
Simple Descriptive Statistics Review and Examples You will likely make use of all three measures of central tendency (mode, median, and mean), as well as some key measures of dispersion (standard deviation,
More informationCSC Advanced Scientific Programming, Spring Descriptive Statistics
CSC 223 - Advanced Scientific Programming, Spring 2018 Descriptive Statistics Overview Statistics is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions.
More informationMeasure of Variation
Measure of Variation Variation is the spread of a data set. The simplest measure is the range. Range the difference between the maximum and minimum data entries in the set. To find the range, the data
More informationFundamentals of Statistics
CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct
More informationDESCRIPTIVE STATISTICS
DESCRIPTIVE STATISTICS INTRODUCTION Numbers and quantification offer us a very special language which enables us to express ourselves in exact terms. This language is called Mathematics. We will now learn
More informationLecture 2 Describing Data
Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms
More informationSTAT 113 Variability
STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2
More informationMBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment
MBEJ 1023 Planning Analytical Methods Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment Contents What is statistics? Population and Sample Descriptive Statistics Inferential
More information9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives
Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical
More informationSUMMARY STATISTICS EXAMPLES AND ACTIVITIES
Session 6 SUMMARY STATISTICS EXAMPLES AD ACTIVITIES Example 1.1 Expand the following: 1. X 2. 2 6 5 X 3. X 2 4 3 4 4. X 4 2 Solution 1. 2 3 2 X X X... X 2. 6 4 X X X X 4 5 6 5 3. X 2 X 3 2 X 4 2 X 5 2
More informationEmpirical Rule (P148)
Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall
More informationNumerical Descriptive Measures. Measures of Center: Mean and Median
Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where
More informationNormal Model (Part 1)
Normal Model (Part 1) Formulas New Vocabulary The Standard Deviation as a Ruler The trick in comparing very different-looking values is to use standard deviations as our rulers. The standard deviation
More informationSome estimates of the height of the podium
Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40
More informationData that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.
Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer
More informationDescriptive Analysis
Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable
More informationBasic Sta)s)cs. Describing Data Measures of Spread
Basic Sta)s)cs Describing Data Measures of Spread Describing Data Learning Inten7ons Today we will understand: } Measures of Spread * Calculate the range of a sample * Determine quar7les and interquar7le
More informationPSYCHOLOGICAL STATISTICS
UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc COUNSELLING PSYCHOLOGY (2011 Admission Onwards) II Semester Complementary Course PSYCHOLOGICAL STATISTICS QUESTION BANK 1. The process of grouping
More informationFrequency Distribution and Summary Statistics
Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary
More informationNOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS
NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows
More informationBoth the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.
Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of
More informationUnit 2 Statistics of One Variable
Unit 2 Statistics of One Variable Day 6 Summarizing Quantitative Data Summarizing Quantitative Data We have discussed how to display quantitative data in a histogram It is useful to be able to describe
More informationSTAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model
STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good
More informationSTAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model
STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good
More information( ) P = = =
1. On a lunch counter, there are 5 oranges and 6 apples. If 3 pieces of fruit are selected, find the probability that 1 orange and apples are selected. Order does not matter Combinations: 5C1 (1 ) 6C P
More informationChapter 3-Describing Data: Numerical Measures
Chapter 3-Describing Data: Numerical Measures Jie Zhang Account and Information Systems Department College of Business Administration The University of Texas at El Paso jzhang6@utep.edu Jie Zhang, QMB
More informationChapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)
Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop
More informationCategorical. A general name for non-numerical data; the data is separated into categories of some kind.
Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,
More informationChapter 4. The Normal Distribution
Chapter 4 The Normal Distribution 1 Chapter 4 Overview Introduction 4-1 Normal Distributions 4-2 Applications of the Normal Distribution 4-3 The Central Limit Theorem 4-4 The Normal Approximation to the
More informationMath146 - Chapter 3 Handouts. The Greek Alphabet. Source: Page 1 of 39
Source: www.mathwords.com The Greek Alphabet Page 1 of 39 Some Miscellaneous Tips on Calculations Examples: Round to the nearest thousandth 0.92431 0.75693 CAUTION! Do not truncate numbers! Example: 1
More informationSTATS DOESN T SUCK! ~ CHAPTER 4
CHAPTER 4 QUESTION 1 The Geometric Mean Suppose you make a 2-year investment of $5,000 and it grows by 100% to $10,000 during the first year. During the second year, however, the investment suffers a 50%
More informationDescription of Data I
Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret
More informationDescribing Data: One Quantitative Variable
STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive
More informationBiostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras
Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions
More information3.3-Measures of Variation
3.3-Measures of Variation Variation: Variation is a measure of the spread or dispersion of a set of data from its center. Common methods of measuring variation include: 1. Range. Standard Deviation 3.
More informationIOP 201-Q (Industrial Psychological Research) Tutorial 5
IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,
More informationCopyright 2005 Pearson Education, Inc. Slide 6-1
Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is
More informationStatistics I Chapter 2: Analysis of univariate data
Statistics I Chapter 2: Analysis of univariate data Numerical summary Central tendency Location Spread Form mean quartiles range coeff. asymmetry median percentiles interquartile range coeff. kurtosis
More informationMeasures of Central Tendency Lecture 5 22 February 2006 R. Ryznar
Measures of Central Tendency 11.220 Lecture 5 22 February 2006 R. Ryznar Today s Content Wrap-up from yesterday Frequency Distributions The Mean, Median and Mode Levels of Measurement and Measures of Central
More informationMath 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet.
1 Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact, be aware that the
More informationLecture 9. Probability Distributions. Outline. Outline
Outline Lecture 9 Probability Distributions 6-1 Introduction 6- Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7- Properties of the Normal Distribution
More informationRandom Variables and Probability Distributions
Chapter 3 Random Variables and Probability Distributions Chapter Three Random Variables and Probability Distributions 3. Introduction An event is defined as the possible outcome of an experiment. In engineering
More informationDESCRIPTIVE STATISTICS II. Sorana D. Bolboacă
DESCRIPTIVE STATISTICS II Sorana D. Bolboacă OUTLINE Measures of centrality Measures of spread Measures of symmetry Measures of localization Mainly applied on quantitative variables 2 DESCRIPTIVE STATISTICS
More informationSection Introduction to Normal Distributions
Section 6.1-6.2 Introduction to Normal Distributions 2012 Pearson Education, Inc. All rights reserved. 1 of 105 Section 6.1-6.2 Objectives Interpret graphs of normal probability distributions Find areas
More informationNOTES: Chapter 4 Describing Data
NOTES: Chapter 4 Describing Data Intro to Statistics COLYER Spring 2017 Student Name: Page 2 Section 4.1 ~ What is Average? Objective: In this section you will understand the difference between the three
More informationLecture 9. Probability Distributions
Lecture 9 Probability Distributions Outline 6-1 Introduction 6-2 Probability Distributions 6-3 Mean, Variance, and Expectation 6-4 The Binomial Distribution Outline 7-2 Properties of the Normal Distribution
More informationTi 83/84. Descriptive Statistics for a List of Numbers
Ti 83/84 Descriptive Statistics for a List of Numbers Quiz scores in a (fictitious) class were 10.5, 13.5, 8, 12, 11.3, 9, 9.5, 5, 15, 2.5, 10.5, 7, 11.5, 10, and 10.5. It s hard to get much of a sense
More informationMini-Lecture 3.1 Measures of Central Tendency
Mini-Lecture 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data 3. Explain what it means for a
More informationSection 6-1 : Numerical Summaries
MAT 2377 (Winter 2012) Section 6-1 : Numerical Summaries With a random experiment comes data. In these notes, we learn techniques to describe the data. Data : We will denote the n observations of the random
More informationMEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION
MEASURES OF CENTRAL TENDENCY & VARIABILITY + NORMAL DISTRIBUTION 1 Day 3 Summer 2017.07.31 DISTRIBUTION Symmetry Modality 单峰, 双峰 Skewness 正偏或负偏 Kurtosis 2 3 CHAPTER 4 Measures of Central Tendency 集中趋势
More informationAP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE
AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,
More informationStat 101 Exam 1 - Embers Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.
More informationStatistics 511 Supplemental Materials
Gaussian (or Normal) Random Variable In this section we introduce the Gaussian Random Variable, which is more commonly referred to as the Normal Random Variable. This is a random variable that has a bellshaped
More informationEngineering Mathematics III. Moments
Moments Mean and median Mean value (centre of gravity) f(x) x f (x) x dx Median value (50th percentile) F(x med ) 1 2 P(x x med ) P(x x med ) 1 0 F(x) x med 1/2 x x Variance and standard deviation
More information4. DESCRIPTIVE STATISTICS
4. DESCRIPTIVE STATISTICS Descriptive Statistics is a body of techniques for summarizing and presenting the essential information in a data set. Eg: Here are daily high temperatures for Jan 16, 2009 in
More informationA LEVEL MATHEMATICS ANSWERS AND MARKSCHEMES SUMMARY STATISTICS AND DIAGRAMS. 1. a) 45 B1 [1] b) 7 th value 37 M1 A1 [2]
1. a) 45 [1] b) 7 th value 37 [] n c) LQ : 4 = 3.5 4 th value so LQ = 5 3 n UQ : 4 = 9.75 10 th value so UQ = 45 IQR = 0 f.t. d) Median is closer to upper quartile Hence negative skew [] Page 1 . a) Orders
More informationCHAPTER 6. ' From the table the z value corresponding to this value Z = 1.96 or Z = 1.96 (d) P(Z >?) =
Solutions to End-of-Section and Chapter Review Problems 225 CHAPTER 6 6.1 (a) P(Z < 1.20) = 0.88493 P(Z > 1.25) = 1 0.89435 = 0.10565 P(1.25 < Z < 1.70) = 0.95543 0.89435 = 0.06108 (d) P(Z < 1.25) or Z
More informationMoments and Measures of Skewness and Kurtosis
Moments and Measures of Skewness and Kurtosis Moments The term moment has been taken from physics. The term moment in statistical use is analogous to moments of forces in physics. In statistics the values
More informationVARIABILITY: Range Variance Standard Deviation
VARIABILITY: Range Variance Standard Deviation Measures of Variability Describe the extent to which scores in a distribution differ from each other. Distance Between the Locations of Scores in Three Distributions
More informationBasic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract
Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, 2013 Abstract Review summary statistics and measures of location. Discuss the placement exam as an exercise
More informationMATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1)
LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) Descriptive statistics are ways of summarizing large sets of quantitative (numerical) information. The best way to reduce a set of
More informationMath 140 Introductory Statistics. First midterm September
Math 140 Introductory Statistics First midterm September 23 2010 Box Plots Graphical display of 5 number summary Q1, Q2 (median), Q3, max, min Outliers If a value is more than 1.5 times the IQR from the
More informationContinuous Probability Distributions
8.1 Continuous Probability Distributions Distributions like the binomial probability distribution and the hypergeometric distribution deal with discrete data. The possible values of the random variable
More information