MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment

Size: px
Start display at page:

Download "MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment"

Transcription

1 MBEJ 1023 Planning Analytical Methods Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment Contents What is statistics? Population and Sample Descriptive Statistics Inferential Statistics Statistical data analysis Contents Scales of Measurement Skewness Measure of Dispersion Using Excel 1

2 What is statistics Statistics consists of a body of methods for collecting and analyzing data Agresti & Finlay, 1997 Statistics Raw data What kind and how much data need to be collected? Quantitative techniques How should we organize and summarize the data? How can we analyse the data and draw conclusions from it? Meaningful information How can we assess the strength of the conclusions and evaluate their uncertainty? Population and Sample What kind and how much data need to be collected? Population is the collection of all individuals or items under consideration in a statistical study Weiss, 1999 Sample is that part of the population from which information is collected Weiss,

3 Population and Sample Ideal survey: The sampled population= the target population For obvious reasons it is impossible A perfect sample: A scaled down version of the target population, mirroring every characteristic of the target populationp It is impossible A good sample: Reproduce the characteristics of the target population as closely as possible Descriptive Statistics How should we organize and summarize the data? Descriptive statistics consist of methods for organizing and summarizing information Weiss, 1999 Descriptive Statistics Central Tendency Mean, Median, Mode, Sum, Dispersion Std. deviation, Variance, Range, Minimum, Maximum, Distribution Normal, Chi-square, Binomial, Poisson, Geometric, Percentile Quartiles, Percentiles,. 3

4 Inferential Statistics How can we assess the strength of the conclusions and evaluate their uncertainty? Inferential statistics consist of methods for drawing and measuring the reliability of conclusions about population based on information obtained from a sample of the population Weiss, 1999 Inferential Statistics Point Estimation Interval Estimation Hypothesis Testing Confidence level Margin of error Statistical data analysis How can we analyse the data and draw conclusions from it? Scale of measurement Number of groups Nature of the relationship between groups Number of variables Assumptions of statistical tests 4

5 Statistical data analysis Begin Formulate the research problem Define population and sample Collect the data Do descriptive data analysis Use appropriate statistical methods to solve the research problem Report the results End Basic mathematical notations Variable Number of Observations, n Counter Variable A variable can be defined as a known characteristic or phenomenon of a population or sample. Variable Quantitative (e.g. height, income, etc.) Qualitative (e.g. gender, religion, etc.) Student s Weight (in kg) = {56, 45, 65, 47, 50} w = {56, 45, 65, 47, 50} Uppercase variable Population s characteristic Lowercase variable Sample s characteristic 5

6 Number of Observations, n x = {44, 71, 55, 32, 27} y = {3.5, 2.7, 3.0, 4.5, 5.2} z = {-8, 6, -4} For x and y, n = 5 For z, n = 3 Counter b = {90, 85, 76, 92, 85, 53, 74, 85, 90, 66} n = 10 To avoid misunderstanding same values for different observations, mathematicians use counter to refer to the individual value in a set of observations A counter normally is represented by the letters i, j and k b i i = 2 b 2 = 85 i = 3 b 3 = 76 Example Counter c = {91, 86, 77, 93, 86, 54, 75, 86, 95, 67, 80} a) i = 5 86 b) i = n 80 c) i = 1, 2,, n 91, 86, 77, 93, 86, 54, 75, 86, 95, 67, 80 d) c 9 95 e) a = {c 3, c 8 } a = {77, 86} Descriptive Statistics Central Tendency Mean, Median, Mode, Sum, Dispersion Std. deviation, Variance, Range, Minimum, Maximum, Distribution Normal, Chi-square, Binomial, Poisson, Geometric, Percentile Quartiles, Percentiles,. When your data are described correctly and adequately, everybody will have an insight on the features of your data Descriptive statistics help us to simplify large amounts of data in a sensible way. For instance, the Grade Point Average (GPA) describes the general performance of a student across a potentially wide range of course experiences. 6

7 Scales of Measurement The ways that numbers are being assigned to observations Measurement is basically the process of assigning numbers to observations according to certain rules Sprinthall, 2000 Scales of Measurement Nominal Establish identity (Apartment has pool = 1, Apartment does not have pool =0) Ordinal Place into an order and ranking (Apartments are ranked according to their prices) Interval Position along a continuous scale The scale does not have absolute zero (we cannot talk about no temperature) Ratio Measure on a ratio scale (Floor area) Zero has meaning Zero denotes the absence of something 1. Magnitude the ability to be counted 2. Order the ability to be ranked 3. Interval having equal distance 4. Rational zero a number zero on the scale that is meaningful The location of the distribution is assessed by its central tendency Central tendency, by definition, is a typical or representative score Mode Median Mean 7

8 Mode The mode is the most frequently occurring score value Distance, d, between home and workplace (in km) d M o = 14 The mode may be seen on a frequency distribution as the score value which corresponds to the highest point Mode A distribution may have more than one mode Age, a, when started working (in years) a Such distributions are called bimodal M o = 20, 23 Median The median is the score value which cuts the distribution in half, such that half the scores fall above the median and half fall below it (1) Order the scores from lowest to highest (2) If there are odd numbers of scores M d = X i i = (N + 1) / 2 (2) If there is an even number of scores M d = (X i + X i ) / i 1 = N / 2 i 2 = (N + 2) / 2 d a There are odd numbers of scores (11) i = (11 + 1) / 2 = 6 M d = d 6 M d = 14 There is an even number of scores (8) i 1 = 8 / 2 = 4 i 2 = (8 + 2) / 2 = 5 M d = (a 4 + a 5 ) / 2 M d = ( ) / 2 M d =

9 Calculating the median for the class interval data Monthly household income (in RM) Household Income No. of Households (X) (f) < RM1, RM1,001 RM2, RM2,001 RM3, RM3,001 RM4, > RM4,000 5 M d = the median ll = lower exact limit containing the n(0.50) score n = total number of scores cf = cumulative frequency of scores above the interval containing the n(0.50) score f i = frequency of scores in the interval containing the n(0.50) score w = width of the class interval Calculating the median for the class interval data Monthly household income (in RM) Household Income Exact Limits No. of Cumulative (X) Households Frequency (f) (cf) < 1, ,001 2, ,001 3, ,001 4, > 4, Class Boundaries are the midpoints between the upper class limit of a class and the lower class limit of the next class in the sequence n = 100 n (0.50) = 50 II = cf = 28 f i = 45 w = 1000 M d = ((50 28 ) / 45) 1000 M d = M d = the median ll = lower exact limit containing the n(0.50) score n = total number of scores cf = cumulative frequency of scores above the interval containing the n(0.50) score f i = frequency of scores in the interval containing the n(0.50) score w = width of the class interval Mean The population mean is the sum of the observations divided by the population size = The population mean, N = Population size a = Sample mean, n = Sample size = ( ) / 8 =

10 Mean Sometimes, we are given data arranged in frequency table Age No. of Respondents (x) (f) = the sample mean x i = individual observation f i = class frequency n = the number of classes = (19 (5) + 20 (4) + 21 (5) + 22 (3) + 23 (1) + 24 (2)) / = 417 / 20 = Mean Sometimes, data are further summarised using class intervals Age, x Frequency, f Class Intervals Midpoint, x i Frequency, f i x i f i Sum, = the sample mean x i = the midpoint of the class interval f i = class frequency n = the number of class intervals = 419 / 20 = Choosing Appropriate Measure of Central Tendency Measurement Scale Nominal Ordinal Interval/Ratio Measure of Central Tendency Mode: The value that appears most often in a distribution. Median: The value that t divides id the distribution ib ti of responses into two equal size groups (the value of the 50th percentile). Mode and Median Mean: The arithmetic average of a distribution. 10

11 Skewness Mode Median M o M d Mean The three measures of location can be used together to describe the central tendency of a distribution Skewness Symmetrical distribution It described a distribution that is normally distributed. The concept of Normal distribution is used in many statistical analysis and tests. Symmetrical distribution has a zero skewness. A positively skewed distribution occurs when both the mode and the median are located to the left of the mean If the mode and the median are located to the right of the mean, then we have a distribution that is negatively skewed. Skewness Pearson s Index of Skewness (I) = the sample mean = the median = the sample standard deviation 11

12 Skewness Example The sample mean of a set of data is 3.45, the median is 4.00 and the sample standard deviation is Compute the Pearson s Index of Skewness, and determine if the data is symmetrically distributed. = 3.45 = 4 = 1.22 I = 3 (3.45 4) / 1.22 I = The distribution is not symmetric around the mean. The distribution is negatively skewed. Measure of Dispersion The range The variance The standard deviation Measures of dispersion express quantitatively the degree of variation or dispersion of values in a population or in a sample Measure of Dispersion The range Range = Largest - smallest Distribution 1: Range = = 13 Distribution 2: Range = = 13 The range is greatly affected by extreme scores The range is not the most important measure of variability 12

13 Measure of Dispersion The variance The standard deviation The population variance ( within the population. ) is a measure of variability between observations X = the individual observation in the population = the population mean N = the size of the population The population standard deviation ( ) is the positive square root of the variance A small variance indicates that the data tends to be very close to the mean and hence to each other, while a high variance indicates that the data is very spread out around the mean and from each other. Measure of Dispersion The variance The standard deviation The sample variance Steps to compute the Variance Step 1 - Find the mean of the scores. Step 2 - Subtract the mean from every score. Step 3 - Square the results of Step 2. = Sample mean x i = individual observation n = Sample size The sample standard deviation Step 4 - Sum the results of Step 3. Step 5 - Divide the results of Step 4 by n-1. Step 6 - Take the square root of Step 5. The result at Step 5 is the sample variance. The sample standard deviation is obtained in Step 6. Measure of Dispersion The variance The standard deviation Example i x i x i -x (x i x) Total, Step 5 - Divide the results of Step 4 by n-1. S 2 = 28 / (5 1) = 7 Step 6 - Take the square root of Step 5. Step 1 - Find the mean of the scores. _ x = 35 / 5 = 7 Step 2 - Subtract the mean from every score. _ x i -x Step 3 - Square the results of Step 2. _ (x i x) 2 Step 4 - Sum the results of Step 3. _ (x i x) 2 = 28 13

14 Measure of Dispersion The variance The standard deviation The standard deviation measures variability in units of measurement, while the variance does so in units of measurement squared. For example, if one measured height in inches, then the standard deviation would be in inches, while the variance would be in inches squared. For this reason, the standard deviation is usually the preferred measure when describing the variability of distributions. The variance, however, has some unique properties which make it very useful later on in the course. Exercise: Use the following data and calculate the variance and the standard deviation. Age No. of Respondents (x) (f) If the number of data that we need to process exceeds a certain limit, we will find that even the simplest data analysis will be troublesome. There are various applications range from the very general spreadsheet applications like MS Excel and Lotus to a more advanced statistical applications like SPSS, Minitab, S-Plus and SAS to solve this problem. The spreadsheet applications are easier to learn but they lack advanced statistical functions. Statistical applications like SPSS have more data analysis capabilities but require advanced mathematical knowledge. Various applications share some common steps in performing statistical analysis 14

15 Perform Data Entry To enter a fresh new set of data, you can select the Type in data radio button and click OK. An easier approach, however, is just to click CANCEL. The title bar The menu bar The tool bar Column heading Once you are in the data editor, you can enter your data A variable can be in many forms such as numerical, strings, date, The number of decimal places that SPSS will display Which numbers represent which categories (for discrete data of both nominal and ordinal levels of measurement). For example, you could assign the labels 'Male and 'Female to the numeric values 1 and 2 The width of a variable is the number of characters SPSS will allow to be entered for the variable A string of text to indentify in more detail what a variable represents To name a column, just go to the Variable View 15

16 Tell SPSS what to do when encounter missing values in our data file. The columns property tells SPSS how wide the column should be for each variable. Don't confuse this one with width, which indicates how many digits of the number will be displayed. The column size indicates how much space is allocated rather than the degree to which it is filled. Indicates whether the information in the Data View should be left-justified, rightjustified, or cantered Our data is a scale, ordinal, or nominal data To name a column, just go to the Variable View It is always good computing practice to frequently save your data. In SPSS, almost all statistical analysis that you want to perform are located in the Analyze menu. The same goes to the descriptive statistical analysis that we want to conduct. To proceed with our descriptive statistical analysis, select the menu Analyze > Descriptive Statistics > Descriptives 16

17 To customize the analysis to be performed, click on the Options Now, a dialog box will appear prompting to you to select the variable(s) that you want to describe. To do this, click on the variable. With the variable x selected, click on the arrow to move the variable x into the selected variable(s) box on the right For this exercise, we will select several statistics that we have covered thus far. These options are Mean, Standard (Std.) Deviation, Variance, Range, Minimum and Maximum. Among these options only Mean is used to measure the central tendency, whereas the other statistics are used to measure the degree of dispersion. Using Excel Excel and SPSS use the cell paradigm Excel uses the workbook paradigm where a single workbook can contain many data sheets. Excel is a spreadsheet application and its greatest use is when you have a lot of data to manipulate. The spreadsheet applications are easier to learn but they lack advanced statistical functions. 17

18 Using Excel Data Entry Simply enter your data The data sheet in this example does not have any more space at the top of the sheet to insert our column header. We can solve this by inserting a new row at the top. Using Excel Column border To insert a new row in Excel, first right-click on the row number, above which you want to insert a new row. Among the options available in this pop-up menu is one called Insert. Click Insert and a new row is automatically inserted on-top of row. Double-clicking the column border will increase the width of the column to fit the widest of the text in that column. You can save your data by selecting the menu File > Save or by clicking the button. Using Excel In Excel, data manipulation is achieved through entering a set of formulas. To enter a formula in a cell, you must start by typing the equal ( = ) sign. If the equal ( = ) is not entered, Excel will treat the formula as text which means that no computation will be performed. Functions in Excel for computing descriptive statistics Functions Formula Examples Mean =Average(Cells) =Average(A2:A6) Mode =MODE(Cells) =MODE(A2:A6) Median =MEDIAN(Cells) =MEDIAN(A2:A6) Minimum =MIN(Cells) =MIN(A2:A6) Maximum =MAX(Cells) =MAX(A2:A6) Variance =VAR(Cells) =VAR(A2:A6) Standard Deviation =STDEV(Cells) =STDEV(A2:A6) To perform computation in Excel, first select the cell where you want the result to appear. Then, write the formula by first typing the equal ( = ) sign in the cell (or, in the formula box). Press Enter and the result of the computation will be shown in the cell that you have selected earlier. 18

19 Using Excel Formula box To refer to a cell, you need not type the cell number. Instead, you can click the cell which you want to use and the cell number will be inserted in the formula. This way, you can avoid referring to the wrong cell. We can use formulas that we ourselves defined for computing statistics that are not defined by Excel. Thank you Dr. Mehdi Moeinaddini mehdi@utm.my 19

CSC Advanced Scientific Programming, Spring Descriptive Statistics

CSC Advanced Scientific Programming, Spring Descriptive Statistics CSC 223 - Advanced Scientific Programming, Spring 2018 Descriptive Statistics Overview Statistics is the science of collecting, organizing, analyzing, and interpreting data in order to make decisions.

More information

3.1 Measures of Central Tendency

3.1 Measures of Central Tendency 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent

More information

Numerical Descriptive Measures. Measures of Center: Mean and Median

Numerical Descriptive Measures. Measures of Center: Mean and Median Steve Sawin Statistics Numerical Descriptive Measures Having seen the shape of a distribution by looking at the histogram, the two most obvious questions to ask about the specific distribution is where

More information

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations

More information

MATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1)

MATHEMATICS APPLIED TO BIOLOGICAL SCIENCES MVE PA 07. LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) LP07 DESCRIPTIVE STATISTICS - Calculating of statistical indicators (1) Descriptive statistics are ways of summarizing large sets of quantitative (numerical) information. The best way to reduce a set of

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

1.2 Describing Distributions with Numbers, Continued

1.2 Describing Distributions with Numbers, Continued 1.2 Describing Distributions with Numbers, Continued Ulrich Hoensch Thursday, September 6, 2012 Interquartile Range and 1.5 IQR Rule for Outliers The interquartile range IQR is the distance between the

More information

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Categorical. A general name for non-numerical data; the data is separated into categories of some kind. Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,

More information

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 3 Presentation of Data: Numerical Summary Measures Part 2 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh

More information

DESCRIPTIVE STATISTICS

DESCRIPTIVE STATISTICS DESCRIPTIVE STATISTICS INTRODUCTION Numbers and quantification offer us a very special language which enables us to express ourselves in exact terms. This language is called Mathematics. We will now learn

More information

Descriptive Statistics

Descriptive Statistics Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

SUMMARY STATISTICS EXAMPLES AND ACTIVITIES

SUMMARY STATISTICS EXAMPLES AND ACTIVITIES Session 6 SUMMARY STATISTICS EXAMPLES AD ACTIVITIES Example 1.1 Expand the following: 1. X 2. 2 6 5 X 3. X 2 4 3 4 4. X 4 2 Solution 1. 2 3 2 X X X... X 2. 6 4 X X X X 4 5 6 5 3. X 2 X 3 2 X 4 2 X 5 2

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information

Measures of Dispersion (Range, standard deviation, standard error) Introduction

Measures of Dispersion (Range, standard deviation, standard error) Introduction Measures of Dispersion (Range, standard deviation, standard error) Introduction We have already learnt that frequency distribution table gives a rough idea of the distribution of the variables in a sample

More information

Fundamentals of Statistics

Fundamentals of Statistics CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct

More information

Summary of Statistical Analysis Tools EDAD 5630

Summary of Statistical Analysis Tools EDAD 5630 Summary of Statistical Analysis Tools EDAD 5630 Test Name Program Used Purpose Steps Main Uses/Applications in Schools Principal Component Analysis SPSS Measure Underlying Constructs Reliability SPSS Measure

More information

Descriptive Analysis

Descriptive Analysis Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable

More information

IOP 201-Q (Industrial Psychological Research) Tutorial 5

IOP 201-Q (Industrial Psychological Research) Tutorial 5 IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc COUNSELLING PSYCHOLOGY (2011 Admission Onwards) II Semester Complementary Course PSYCHOLOGICAL STATISTICS QUESTION BANK 1. The process of grouping

More information

David Tenenbaum GEOG 090 UNC-CH Spring 2005

David Tenenbaum GEOG 090 UNC-CH Spring 2005 Simple Descriptive Statistics Review and Examples You will likely make use of all three measures of central tendency (mode, median, and mean), as well as some key measures of dispersion (standard deviation,

More information

Continuous Probability Distributions

Continuous Probability Distributions 8.1 Continuous Probability Distributions Distributions like the binomial probability distribution and the hypergeometric distribution deal with discrete data. The possible values of the random variable

More information

2 Exploring Univariate Data

2 Exploring Univariate Data 2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting

More information

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line. Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

Description of Data I

Description of Data I Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret

More information

LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL

LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL LAB 2 INSTRUCTIONS PROBABILITY DISTRIBUTIONS IN EXCEL There is a wide range of probability distributions (both discrete and continuous) available in Excel. They can be accessed through the Insert Function

More information

Numerical Descriptions of Data

Numerical Descriptions of Data Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =

More information

Exploring Data and Graphics

Exploring Data and Graphics Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data

More information

Simple Descriptive Statistics

Simple Descriptive Statistics Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency

More information

Chapter 3 Descriptive Statistics: Numerical Measures Part A

Chapter 3 Descriptive Statistics: Numerical Measures Part A Slides Prepared by JOHN S. LOUCKS St. Edward s University Slide 1 Chapter 3 Descriptive Statistics: Numerical Measures Part A Measures of Location Measures of Variability Slide Measures of Location Mean

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

GEOMORPHIC PROCESSES Laboratory #5: Flood Frequency Analysis

GEOMORPHIC PROCESSES Laboratory #5: Flood Frequency Analysis GEOMORPHIC PROCESSES 15-040-504 Laboratory #5: Flood Frequency Analysis Purpose: 1. Introduction to flood frequency analysis based on a log-normal and Log-Pearson Type III discharge frequency distribution

More information

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017

More information

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical

More information

Measures of Central tendency

Measures of Central tendency Elementary Statistics Measures of Central tendency By Prof. Mirza Manzoor Ahmad In statistics, a central tendency (or, more commonly, a measure of central tendency) is a central or typical value for a

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

2 DESCRIPTIVE STATISTICS

2 DESCRIPTIVE STATISTICS Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled

More information

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, Abstract Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 3: April 25, 2013 Abstract Review summary statistics and measures of location. Discuss the placement exam as an exercise

More information

VARIABILITY: Range Variance Standard Deviation

VARIABILITY: Range Variance Standard Deviation VARIABILITY: Range Variance Standard Deviation Measures of Variability Describe the extent to which scores in a distribution differ from each other. Distance Between the Locations of Scores in Three Distributions

More information

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda, MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, CONTENTS What is measures of dispersion? Why measures of dispersion? How measures of dispersions are calculated? Range Quartile

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

One Proportion Superiority by a Margin Tests

One Proportion Superiority by a Margin Tests Chapter 512 One Proportion Superiority by a Margin Tests Introduction This procedure computes confidence limits and superiority by a margin hypothesis tests for a single proportion. For example, you might

More information

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 05 Normal Distribution So far we have looked at discrete distributions

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information

DATA HANDLING Five-Number Summary

DATA HANDLING Five-Number Summary DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest

More information

M249 Diagnostic Quiz

M249 Diagnostic Quiz THE OPEN UNIVERSITY Faculty of Mathematics and Computing M249 Diagnostic Quiz Prepared by the Course Team [Press to begin] c 2005, 2006 The Open University Last Revision Date: May 19, 2006 Version 4.2

More information

Statistics vs. statistics

Statistics vs. statistics Statistics vs. statistics Question: What is Statistics (with a capital S)? Definition: Statistics is the science of collecting, organizing, summarizing and interpreting data. Note: There are 2 main ways

More information

DazStat. Introduction. Installation. DazStat is an Excel add-in for Excel 2003 and Excel 2007.

DazStat. Introduction. Installation. DazStat is an Excel add-in for Excel 2003 and Excel 2007. DazStat Introduction DazStat is an Excel add-in for Excel 2003 and Excel 2007. DazStat is one of a series of Daz add-ins that are planned to provide increasingly sophisticated analytical functions particularly

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

Lecture Week 4 Inspecting Data: Distributions

Lecture Week 4 Inspecting Data: Distributions Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your

More information

Master of Science in Strategic Management Degree Master of Science in Strategic Supply Chain Management Degree

Master of Science in Strategic Management Degree Master of Science in Strategic Supply Chain Management Degree CHINHOYI UNIVERSITY OF TECHNOLOGY SCHOOL OF BUSINESS SCIENCES AND MANAGEMENT POST GRADUATE PROGRAMME Master of Science in Strategic Management Degree Master of Science in Strategic Supply Chain Management

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

Monte Carlo Simulation (Random Number Generation)

Monte Carlo Simulation (Random Number Generation) Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...

More information

Edexcel past paper questions

Edexcel past paper questions Edexcel past paper questions Statistics 1 Chapters 2-4 (Discrete) Statistics 1 Chapters 2-4 (Discrete) Page 1 Stem and leaf diagram Stem-and-leaf diagrams are used to represent data in its original form.

More information

Population Mean GOALS. Characteristics of the Mean. EXAMPLE Population Mean. Parameter Versus Statistics. Describing Data: Numerical Measures

Population Mean GOALS. Characteristics of the Mean. EXAMPLE Population Mean. Parameter Versus Statistics. Describing Data: Numerical Measures GOALS Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright 010 by The McGraw-Hill Companies, Inc. All rights reserved. 3-1. Calculate the arithmetic mean, weighted mean, median, mode,

More information

Statistics (This summary is for chapters 18, 29 and section H of chapter 19)

Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Statistics (This summary is for chapters 18, 29 and section H of chapter 19) Mean, Median, Mode Mode: most common value Median: middle value (when the values are in order) Mean = total how many = x n =

More information

DESCRIPTIVE STATISTICS II. Sorana D. Bolboacă

DESCRIPTIVE STATISTICS II. Sorana D. Bolboacă DESCRIPTIVE STATISTICS II Sorana D. Bolboacă OUTLINE Measures of centrality Measures of spread Measures of symmetry Measures of localization Mainly applied on quantitative variables 2 DESCRIPTIVE STATISTICS

More information

The Mode: An Example. The Mode: An Example. Measure of Central Tendency: The Mode. Measure of Central Tendency: The Median

The Mode: An Example. The Mode: An Example. Measure of Central Tendency: The Mode. Measure of Central Tendency: The Median Chapter 4: What is a measure of Central Tendency? Numbers that describe what is typical of the distribution You can think of this value as where the middle of a distribution lies (the median). or The value

More information

Descriptive Statistics (Devore Chapter One)

Descriptive Statistics (Devore Chapter One) Descriptive Statistics (Devore Chapter One) 1016-345-01 Probability and Statistics for Engineers Winter 2010-2011 Contents 0 Perspective 1 1 Pictorial and Tabular Descriptions of Data 2 1.1 Stem-and-Leaf

More information

Copyright 2005 Pearson Education, Inc. Slide 6-1

Copyright 2005 Pearson Education, Inc. Slide 6-1 Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is

More information

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar

Measures of Central Tendency Lecture 5 22 February 2006 R. Ryznar Measures of Central Tendency 11.220 Lecture 5 22 February 2006 R. Ryznar Today s Content Wrap-up from yesterday Frequency Distributions The Mean, Median and Mode Levels of Measurement and Measures of Central

More information

CABARRUS COUNTY 2008 APPRAISAL MANUAL

CABARRUS COUNTY 2008 APPRAISAL MANUAL STATISTICS AND THE APPRAISAL PROCESS PREFACE Like many of the technical aspects of appraising, such as income valuation, you have to work with and use statistics before you can really begin to understand

More information

NCSS Statistical Software. Reference Intervals

NCSS Statistical Software. Reference Intervals Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and

More information

Introduction to Basic Excel Functions and Formulae Note: Basic Functions Note: Function Key(s)/Input Description 1. Sum 2. Product

Introduction to Basic Excel Functions and Formulae Note: Basic Functions Note: Function Key(s)/Input Description 1. Sum 2. Product Introduction to Basic Excel Functions and Formulae Excel has some very useful functions that you can use when working with formulae. This worksheet has been designed using Excel 2010 however the basic

More information

Descriptive Statistics: Measures of Central Tendency and Crosstabulation. 789mct_dispersion_asmp.pdf

Descriptive Statistics: Measures of Central Tendency and Crosstabulation. 789mct_dispersion_asmp.pdf 789mct_dispersion_asmp.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lectures 7-9: Measures of Central Tendency, Dispersion, and Assumptions Lecture 7: Descriptive Statistics: Measures of Central Tendency

More information

Chapter 6 Confidence Intervals

Chapter 6 Confidence Intervals Chapter 6 Confidence Intervals Section 6-1 Confidence Intervals for the Mean (Large Samples) VOCABULARY: Point Estimate A value for a parameter. The most point estimate of the population parameter is the

More information

KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI

KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI 88 P a g e B S ( B B A ) S y l l a b u s KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI Course Title : STATISTICS Course Number : BA(BS) 532 Credit Hours : 03 Course 1. Statistical

More information

GOALS. Describing Data: Displaying and Exploring Data. Dot Plots - Examples. Dot Plots. Dot Plot Minitab Example. Stem-and-Leaf.

GOALS. Describing Data: Displaying and Exploring Data. Dot Plots - Examples. Dot Plots. Dot Plot Minitab Example. Stem-and-Leaf. Describing Data: Displaying and Exploring Data Chapter 4 GOALS 1. Develop and interpret a dot plot.. Develop and interpret a stem-and-leaf display. 3. Compute and understand quartiles, deciles, and percentiles.

More information

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet.

Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. 1 Math 2200 Fall 2014, Exam 1 You may use any calculator. You may not use any cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact, be aware that the

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

Contents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii)

Contents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii) Contents (ix) Contents Preface... (vii) CHAPTER 1 An Overview of Statistical Applications 1.1 Introduction... 1 1. Probability Functions and Statistics... 1..1 Discrete versus Continuous Functions... 1..

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

The normal distribution is a theoretical model derived mathematically and not empirically.

The normal distribution is a theoretical model derived mathematically and not empirically. Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.

More information

Descriptive Statistics in Analysis of Survey Data

Descriptive Statistics in Analysis of Survey Data Descriptive Statistics in Analysis of Survey Data March 2013 Kenneth M Coleman Mohammad Nizamuddiin Khan Survey: Definition A survey is a systematic method for gathering information from (a sample of)

More information

A CLEAR UNDERSTANDING OF THE INDUSTRY

A CLEAR UNDERSTANDING OF THE INDUSTRY A CLEAR UNDERSTANDING OF THE INDUSTRY IS CFA INSTITUTE INVESTMENT FOUNDATIONS RIGHT FOR YOU? Investment Foundations is a certificate program designed to give you a clear understanding of the investment

More information

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12)

Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Wk 2 Hrs 1 (Tue, Jan 10) Wk 2 - Hr 2 and 3 (Thur, Jan 12) Descriptive statistics: - Measures of centrality (Mean, median, mode, trimmed mean) - Measures of spread (MAD, Standard deviation, variance) -

More information

Statistics 511 Supplemental Materials

Statistics 511 Supplemental Materials Gaussian (or Normal) Random Variable In this section we introduce the Gaussian Random Variable, which is more commonly referred to as the Normal Random Variable. This is a random variable that has a bellshaped

More information

Measures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms

Measures of Central Tendency: Ungrouped Data. Mode. Median. Mode -- Example. Median: Example with an Odd Number of Terms Measures of Central Tendency: Ungrouped Data Measures of central tendency yield information about particular places or locations in a group of numbers. Common Measures of Location Mode Median Percentiles

More information

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form:

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form: 1 Exercise One Note that the data is not grouped! 1.1 Calculate the mean ROI Below you find the raw data in tabular form: Obs Data 1 18.5 2 18.6 3 17.4 4 12.2 5 19.7 6 5.6 7 7.7 8 9.8 9 19.9 10 9.9 11

More information

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data. -3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data

More information

Software Tutorial ormal Statistics

Software Tutorial ormal Statistics Software Tutorial ormal Statistics The example session with the teaching software, PG2000, which is described below is intended as an example run to familiarise the user with the package. This documented

More information

MgtOp 215 TEST 1 (Golden) Spring 2016 Dr. Ahn. Read the following instructions very carefully before you start the test.

MgtOp 215 TEST 1 (Golden) Spring 2016 Dr. Ahn. Read the following instructions very carefully before you start the test. MgtOp 15 TEST 1 (Golden) Spring 016 Dr. Ahn Name: ID: Section (Circle one): 4, 5, 6 Read the following instructions very carefully before you start the test. This test is closed book and notes; one summary

More information

Frequency Distributions

Frequency Distributions Frequency Distributions January 8, 2018 Contents Frequency histograms Relative Frequency Histograms Cumulative Frequency Graph Frequency Histograms in R Using the Cumulative Frequency Graph to Estimate

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

MA 1125 Lecture 05 - Measures of Spread. Wednesday, September 6, Objectives: Introduce variance, standard deviation, range.

MA 1125 Lecture 05 - Measures of Spread. Wednesday, September 6, Objectives: Introduce variance, standard deviation, range. MA 115 Lecture 05 - Measures of Spread Wednesday, September 6, 017 Objectives: Introduce variance, standard deviation, range. 1. Measures of Spread In Lecture 04, we looked at several measures of central

More information

MgtOp S 215 Chapter 8 Dr. Ahn

MgtOp S 215 Chapter 8 Dr. Ahn MgtOp S 215 Chapter 8 Dr. Ahn An estimator of a population parameter is a rule that tells us how to use the sample values,,, to estimate the parameter, and is a statistic. An estimate is the value obtained

More information

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING INTRODUCTION XLSTAT makes accessible to anyone a powerful, complete and user-friendly data analysis and statistical solution. Accessibility to

More information

Probability distributions

Probability distributions Probability distributions Introduction What is a probability? If I perform n eperiments and a particular event occurs on r occasions, the relative frequency of this event is simply r n. his is an eperimental

More information

Two-Sample T-Test for Superiority by a Margin

Two-Sample T-Test for Superiority by a Margin Chapter 219 Two-Sample T-Test for Superiority by a Margin Introduction This procedure provides reports for making inference about the superiority of a treatment mean compared to a control mean from data

More information

Data Analysis. BCF106 Fundamentals of Cost Analysis

Data Analysis. BCF106 Fundamentals of Cost Analysis Data Analysis BCF106 Fundamentals of Cost Analysis June 009 Chapter 5 Data Analysis 5.0 Introduction... 3 5.1 Terminology... 3 5. Measures of Central Tendency... 5 5.3 Measures of Dispersion... 7 5.4 Frequency

More information

Frequency Distribution and Summary Statistics

Frequency Distribution and Summary Statistics Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary

More information

Applications of Data Dispersions

Applications of Data Dispersions 1 Applications of Data Dispersions Key Definitions Standard Deviation: The standard deviation shows how far away each value is from the mean on average. Z-Scores: The distance between the mean and a given

More information

Data screening, transformations: MRC05

Data screening, transformations: MRC05 Dale Berger Data screening, transformations: MRC05 This is a demonstration of data screening and transformations for a regression analysis. Our interest is in predicting current salary from education level

More information