Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

Size: px
Start display at page:

Download "Introduction to Computational Finance and Financial Econometrics Descriptive Statistics"

Transcription

1 You can t see this text! Introduction to Computational Finance and Financial Econometrics Descriptive Statistics Eric Zivot Summer 2015 Eric Zivot (Copyright 2015) Descriptive Statistics 1 / 28

2 Outline 1 Univariate Descriptive Statistics 2 Bivariate Descriptive Statistics 3 Time Series Descriptive Statistics Eric Zivot (Copyright 2015) Descriptive Statistics 2 / 28

3 Covariance Stationarity {..., X 1,..., X T,...} = {X t } is a covariance stationary stochastic process, and each X t is identically distributed with unknown pdf f (x). Recall, E[X t ] = µ indep of t var(x t ) = σ 2 indep of t cov(x t, X t j ) = γ j indep of t cor(x t, X t j ) = ρ j indep of t Eric Zivot (Copyright 2015) Descriptive Statistics 3 / 28

4 Descriptive Statistics Observed Sample: {X 1 = x 1,..., X T = x T } = {x t } T t=1 are observations generated by the stochastic process. Descriptive Statistics: Data summaries (statistics) to describe certain features of the data, to learn about the unknown pdf, f (x), and to capture the observed dependencies in the data. Eric Zivot (Copyright 2015) Descriptive Statistics 4 / 28

5 Time Plots Line plot of time series data with time/dates on horizontal axis. Visualization of data - uncover trends, assess stationarity and time dependence Spot unusual behavior Plotting multiple time series can reveal commonality across series Eric Zivot (Copyright 2015) Descriptive Statistics 5 / 28

6 Histograms Goal: Describe the shape of the distribution of the data {x t } T t=1 Hisogram Construction: 1 Order data from smallest to largest values 2 Divide range into N equally spaced bins [ ] 3 Count number of observations in each bin 4 Create bar chart (optionally normalize area to equal 1) Eric Zivot (Copyright 2015) Descriptive Statistics 6 / 28

7 R Functions Function hist() density() Description compute histogram compute smoothed histogram Note: The density() function computes a smoothed (kernel density) estimate of the unknown pdf at the point x using the formula: ˆf (x) = 1 Tb T ( ) x xt k b t=1 k( ) = kernel function b = bandwidth (smoothing) parameter where k( ) is a pdf symmetric about zero (typically the standard normal distribution). See Ruppert Chapter 4 for details. Eric Zivot (Copyright 2015) Descriptive Statistics 7 / 28

8 Empirical Quantiles/Percentiles Percentiles: For α [0, 1], the 100 α th percentile (empirical quantile) of a sample of data is the data value ˆq α such that α 100% of the data are less than ˆq α. Quartiles: ˆq.25 = first quartile ˆq.50 = second quartile (median) ˆq.75 = third quartile ˆq.75 ˆq.25 = interquartile range (IQR) Eric Zivot (Copyright 2015) Descriptive Statistics 8 / 28

9 R Functions Function sort() min() max() range() quantile() median() IQR() summary() Description sort elements of data vector compute minimum value of data vector compute maximum value of data vector compute min and max of a data vector compute empirical quantiles compute median compute inter-quartile range compute summary statistics Eric Zivot (Copyright 2015) Descriptive Statistics 9 / 28

10 Historical Value-at-Risk Let {R t } T t=1 denote a sample of T simple monthly returns on an investment, and let $W 0 be the initial value of an investment. For α (0, 1), the historical VaR α is: $W 0 ˆq R α ˆq R α = empirical α 100% quantile of {R t } T t=1 Note: For continuously compounded returns {r t } T t=1 use: $W 0 (exp(ˆq r α) 1) ˆq r α = empirical α 100% quantile of {r t } T t=1 Eric Zivot (Copyright 2015) Descriptive Statistics 10 / 28

11 Sample Statistics Plug-In Principle: Estimate population quantities using sample statistics. Sample Average (Mean): 1 T T x t = x = ˆµ x t=1 Sample Variance: 1 T 1 T (x t x) 2 = sx 2 = ˆσ x 2 t=1 Sample Standard Deviation: s 2 x = s x = ˆσ x Eric Zivot (Copyright 2015) Descriptive Statistics 11 / 28

12 Sample Statistics cont. Sample Skewness: 1 T 1 Sample Kurtosis: 1 T 1 T (x t x) 3 /sx 3 = ŝkew t=1 T (x t x) 4 /sx 4 = kurt t=1 Sample Excess Kurtosis: kurt 3 Eric Zivot (Copyright 2015) Descriptive Statistics 12 / 28

13 R Functions Function Package Description mean() base compute sample mean colmeans() base compute column means of matrix var() stats compute sample variance sd() stats compute sample standard deviation skewness() PerformanceAnalytics compute sample skewness kurtosis() PerformanceAnalytics compute sample excess kurtosis Note: Use the R function apply(), to apply functions over rows or columns of a matrix or data.frame. Eric Zivot (Copyright 2015) Descriptive Statistics 13 / 28

14 Empirical Cumulative Distribution Function Recall, the CDF of a random variable X is: F X (x) = Pr(X x) The empirical CDF of a random sample is: ˆF X (x) = 1 n (#x i x) = number of x i values x sample size Eric Zivot (Copyright 2015) Descriptive Statistics 14 / 28

15 Empirical Cumulative Distribution Function cont. How to compute and plot ˆF X (x) for a sample {x 1,..., x n }. Sort data from smallest to largest values: {x (1),..., x (n) } and compute ˆF X (x) at these points Plot ˆF X (x) against sorted data {x (1),..., x (n) } Use the R function ecdf() Note: x (1),..., x (n) are called the order statistics. In particular, x (1) = min(x 1,..., x n ) and x (n) = max(x 1,..., x n ). Eric Zivot (Copyright 2015) Descriptive Statistics 15 / 28

16 Quantile-Quantile (QQ) Plots A QQ plot is useful for comparing your data with the quantiles of a distribution (usually the normal distribution) that you think is appropriate for your data. You interpret the QQ plot in the following way: If the points fall close to a straight line, your conjectured distribution is appropriate If the points do not fall close to a straight line, your conjectured distribution is not appropriate and you should consider a different distribution Eric Zivot (Copyright 2015) Descriptive Statistics 16 / 28

17 R Functions Function Package Description qqnorm() stats QQ-plot against normal distribution qqline() stats draw straight line on QQ-plot qqplot() car QQ-plot against specified distribution Eric Zivot (Copyright 2015) Descriptive Statistics 17 / 28

18 Outliers Extremely large or small values are called outliers Outliers can greatly influence the values of common descriptive statistics. In particular, the sample mean, variance, standard deviation, skewness and kurtosis Percentile measures are more robust to outliers: outliers do not greatly influence these measures (e.g. median instead of mean; IQR instead of SD) Eric Zivot (Copyright 2015) Descriptive Statistics 18 / 28

19 Outliers cont. IQR (interquartile range) - outlier robust measure of spread: IQR = q.75 q.25 Moderate Outlier: ˆq IQR < x < ˆq IQR ˆq.25 3 IQR < x < ˆq IQR Extreme Outlier: x > ˆq IQR x < ˆq.25 3 IQR Eric Zivot (Copyright 2015) Descriptive Statistics 19 / 28

20 Boxplots A box plot displays the locations of the basic features of the distribution of one-dimensional data the median, the upper and lower quartiles, outer fences that indicate the extent of your data beyond the quartiles, and outliers, if any. R functions Function Package Description boxplot() graphics box plots for multiple series chart.boxplot() PerformanceAnalytics box plots for asset returns Eric Zivot (Copyright 2015) Descriptive Statistics 20 / 28

21 Outline 1 Univariate Descriptive Statistics 2 Bivariate Descriptive Statistics 3 Time Series Descriptive Statistics Eric Zivot (Copyright 2015) Descriptive Statistics 21 / 28

22 Bivariate Descriptive Statistics {..., (X 1, Y 1 ), (X 2, Y 2 ),... (X T, Y T ),...} = {(X t, Y t )} Covariance stationary bivariate stochastic process with realized values: {(x 1, y 1 ), (x 2, y 2 ),... (x T, y T )} = {(x t, y t )} T t=1 Scatterplot: XY plot of bivariate data R functions: plot(), pairs() Eric Zivot (Copyright 2015) Descriptive Statistics 22 / 28

23 Bivariate Descriptive Statistics cont. Sample Covariance: 1 T 1 Sample Correlation: T (x t x)(y t ȳ) = s xy = ˆσ xy t=1 s xy s x s y = r xy = ˆρ xy Eric Zivot (Copyright 2015) Descriptive Statistics 23 / 28

24 R Functions Function Package Description var() stats compute sample variance-covariance matrix cov() stats compute sample variance-covariance matrix cor() stats compute sample correlation matrix Eric Zivot (Copyright 2015) Descriptive Statistics 24 / 28

25 Outline 1 Univariate Descriptive Statistics 2 Bivariate Descriptive Statistics 3 Time Series Descriptive Statistics Eric Zivot (Copyright 2015) Descriptive Statistics 25 / 28

26 Time Series Descriptive Statistics Sample Autocovariance: ˆγ j = 1 T 1 T t=j+1 Sample Autocorrelation: ˆρ j = ˆγ j, j = 1, 2,... ˆσ 2 (x t x)(x t j x) j = 1, 2,... Sample Autocorrelation Function (SACF): Plot ˆρ j against j Eric Zivot (Copyright 2015) Descriptive Statistics 26 / 28

27 R Functions Function Package Description acf() stats compute and plot sample autocorrelations chart.acf() PerformanceAnalytics plot sample autocorrelations chart.acfplus() PerformanceAnalytics plot sample autocorrelations Eric Zivot (Copyright 2015) Descriptive Statistics 27 / 28

28 You can t see this text! faculty.washington.edu/ezivot/ Eric Zivot (Copyright 2015) Descriptive Statistics 28 / 28

Chapter 1. Descriptive Statistics for Financial Data. 1.1 UnivariateDescriptiveStatistics

Chapter 1. Descriptive Statistics for Financial Data. 1.1 UnivariateDescriptiveStatistics Chapter 1 Descriptive Statistics for Financial Data Updated: July 7, 2014 In this chapter we use graphical and numerical descriptive statistics to study the distribution and dependence properties of daily

More information

I. Return Calculations (20 pts, 4 points each)

I. Return Calculations (20 pts, 4 points each) University of Washington Winter 015 Department of Economics Eric Zivot Econ 44 Midterm Exam Solutions This is a closed book and closed note exam. However, you are allowed one page of notes (8.5 by 11 or

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

Midterm Exam. b. What are the continuously compounded returns for the two stocks?

Midterm Exam. b. What are the continuously compounded returns for the two stocks? University of Washington Fall 004 Department of Economics Eric Zivot Economics 483 Midterm Exam This is a closed book and closed note exam. However, you are allowed one page of notes (double-sided). Answer

More information

appstats5.notebook September 07, 2016 Chapter 5

appstats5.notebook September 07, 2016 Chapter 5 Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.

More information

1 Describing Distributions with numbers

1 Describing Distributions with numbers 1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write

More information

Random Variables and Probability Distributions

Random Variables and Probability Distributions Chapter 3 Random Variables and Probability Distributions Chapter Three Random Variables and Probability Distributions 3. Introduction An event is defined as the possible outcome of an experiment. In engineering

More information

Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR

Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Nelson Mark University of Notre Dame Fall 2017 September 11, 2017 Introduction

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Stemplots (or Stem-and-leaf plots) Stemplot and Boxplot T -- leading digits are called stems T -- final digits are called leaves STAT 74 Descriptive Statistics 2 Example: (number

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

Section3-2: Measures of Center

Section3-2: Measures of Center Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number

More information

Economics 483. Midterm Exam. 1. Consider the following monthly data for Microsoft stock over the period December 1995 through December 1996:

Economics 483. Midterm Exam. 1. Consider the following monthly data for Microsoft stock over the period December 1995 through December 1996: University of Washington Summer Department of Economics Eric Zivot Economics 3 Midterm Exam This is a closed book and closed note exam. However, you are allowed one page of handwritten notes. Answer all

More information

Chen-wei Chiu ECON 424 Eric Zivot July 17, Lab 4. Part I Descriptive Statistics. I. Univariate Graphical Analysis 1. Separate & Same Graph

Chen-wei Chiu ECON 424 Eric Zivot July 17, Lab 4. Part I Descriptive Statistics. I. Univariate Graphical Analysis 1. Separate & Same Graph Chen-wei Chiu ECON 424 Eric Zivot July 17, 2014 Part I Descriptive Statistics I. Univariate Graphical Analysis 1. Separate & Same Graph Lab 4 Time Series Plot Bar Graph The plots show that the returns

More information

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.

Normal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial. Lecture 21,22, 23 Text: A Course in Probability by Weiss 8.5 STAT 225 Introduction to Probability Models March 31, 2014 Standard Sums of Whitney Huang Purdue University 21,22, 23.1 Agenda 1 2 Standard

More information

Descriptive Statistics Bios 662

Descriptive Statistics Bios 662 Descriptive Statistics Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-08-19 08:51 BIOS 662 1 Descriptive Statistics Descriptive Statistics Types of variables

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

2 Exploring Univariate Data

2 Exploring Univariate Data 2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting

More information

Section 6-1 : Numerical Summaries

Section 6-1 : Numerical Summaries MAT 2377 (Winter 2012) Section 6-1 : Numerical Summaries With a random experiment comes data. In these notes, we learn techniques to describe the data. Data : We will denote the n observations of the random

More information

Introduction to R (2)

Introduction to R (2) Introduction to R (2) Boxplots Boxplots are highly efficient tools for the representation of the data distributions. The five number summary can be located in boxplots. Additionally, we can distinguish

More information

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values

More information

Lecture 1: Empirical Properties of Returns

Lecture 1: Empirical Properties of Returns Lecture 1: Empirical Properties of Returns Econ 589 Eric Zivot Spring 2011 Updated: March 29, 2011 Daily CC Returns on MSFT -0.3 r(t) -0.2-0.1 0.1 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996

More information

MVE051/MSG Lecture 7

MVE051/MSG Lecture 7 MVE051/MSG810 2017 Lecture 7 Petter Mostad Chalmers November 20, 2017 The purpose of collecting and analyzing data Purpose: To build and select models for parts of the real world (which can be used for

More information

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1

Chapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.

More information

Monte Carlo Simulation (Random Number Generation)

Monte Carlo Simulation (Random Number Generation) Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...

More information

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.

Dot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line. Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,

More information

The Normal Distribution

The Normal Distribution Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we

More information

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,

More information

Some estimates of the height of the podium

Some estimates of the height of the podium Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40

More information

Describing Data: One Quantitative Variable

Describing Data: One Quantitative Variable STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive

More information

Statistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015

Statistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015 Statistical Analysis of Data from the Stock Markets UiO-STK4510 Autumn 2015 Sampling Conventions We observe the price process S of some stock (or stock index) at times ft i g i=0,...,n, we denote it by

More information

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25

Handout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25 Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example

More information

STAT 113 Variability

STAT 113 Variability STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2

More information

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution

Overview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations

More information

Topic 8: Model Diagnostics

Topic 8: Model Diagnostics Topic 8: Model Diagnostics Outline Diagnostics to check model assumptions Diagnostics concerning X Diagnostics using the residuals Diagnostics and remedial measures Diagnostics: look at the data to diagnose

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

Simple Descriptive Statistics

Simple Descriptive Statistics Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency

More information

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go

More information

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial

Lecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial Lecture 23 STAT 225 Introduction to Probability Models April 4, 2014 approximation Whitney Huang Purdue University 23.1 Agenda 1 approximation 2 approximation 23.2 Characteristics of the random variable:

More information

Lecture 6: Non Normal Distributions

Lecture 6: Non Normal Distributions Lecture 6: Non Normal Distributions and their Uses in GARCH Modelling Prof. Massimo Guidolin 20192 Financial Econometrics Spring 2015 Overview Non-normalities in (standardized) residuals from asset return

More information

Exploring Data and Graphics

Exploring Data and Graphics Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data

More information

Frequency Distribution and Summary Statistics

Frequency Distribution and Summary Statistics Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

Financial Time Series and Their Characteristics

Financial Time Series and Their Characteristics Financial Time Series and Their Characteristics Egon Zakrajšek Division of Monetary Affairs Federal Reserve Board Summer School in Financial Mathematics Faculty of Mathematics & Physics University of Ljubljana

More information

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of

More information

Statistics I Chapter 2: Analysis of univariate data

Statistics I Chapter 2: Analysis of univariate data Statistics I Chapter 2: Analysis of univariate data Numerical summary Central tendency Location Spread Form mean quartiles range coeff. asymmetry median percentiles interquartile range coeff. kurtosis

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS

NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows

More information

STA 248 H1S Winter 2008 Assignment 1 Solutions

STA 248 H1S Winter 2008 Assignment 1 Solutions 1. (a) Measures of location: STA 248 H1S Winter 2008 Assignment 1 Solutions i. The mean, 100 1=1 x i/100, can be made arbitrarily large if one of the x i are made arbitrarily large since the sample size

More information

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.) Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop

More information

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Categorical. A general name for non-numerical data; the data is separated into categories of some kind. Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,

More information

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center

More information

Empirical Rule (P148)

Empirical Rule (P148) Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall

More information

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis

Standardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem

More information

Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making

Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making May 30, 2016 The purpose of this case study is to give a brief introduction to a heavy-tailed distribution and its distinct behaviors in

More information

Math 140 Introductory Statistics. First midterm September

Math 140 Introductory Statistics. First midterm September Math 140 Introductory Statistics First midterm September 23 2010 Box Plots Graphical display of 5 number summary Q1, Q2 (median), Q3, max, min Outliers If a value is more than 1.5 times the IQR from the

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali Part I Descriptive Statistics 1 Introduction and Framework... 3 1.1 Population, Sample, and Observations... 3 1.2 Variables.... 4 1.2.1 Qualitative and Quantitative Variables.... 5 1.2.2 Discrete and Continuous

More information

Numerical Descriptions of Data

Numerical Descriptions of Data Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =

More information

Unit 2 Statistics of One Variable

Unit 2 Statistics of One Variable Unit 2 Statistics of One Variable Day 6 Summarizing Quantitative Data Summarizing Quantitative Data We have discussed how to display quantitative data in a histogram It is useful to be able to describe

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data

SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu September 5, 2015

More information

The Constant Expected Return Model

The Constant Expected Return Model Chapter 1 The Constant Expected Return Model Date: February 5, 2015 The first model of asset returns we consider is the very simple constant expected return (CER) model. This model is motivated by the

More information

Copyright 2005 Pearson Education, Inc. Slide 6-1

Copyright 2005 Pearson Education, Inc. Slide 6-1 Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is

More information

Introduction to Computational Finance and Financial Econometrics Introduction to Portfolio Theory

Introduction to Computational Finance and Financial Econometrics Introduction to Portfolio Theory You can t see this text! Introduction to Computational Finance and Financial Econometrics Introduction to Portfolio Theory Eric Zivot Spring 2015 Eric Zivot (Copyright 2015) Introduction to Portfolio Theory

More information

Description of Data I

Description of Data I Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret

More information

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay. Solutions to Final Exam.

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay. Solutions to Final Exam. The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay Solutions to Final Exam Problem A: (32 pts) Answer briefly the following questions. 1. Suppose

More information

Econ 424/CFRM 462 Portfolio Risk Budgeting

Econ 424/CFRM 462 Portfolio Risk Budgeting Econ 424/CFRM 462 Portfolio Risk Budgeting Eric Zivot August 14, 2014 Portfolio Risk Budgeting Idea: Additively decompose a measure of portfolio risk into contributions from the individual assets in the

More information

Business Statistics 41000: Probability 3

Business Statistics 41000: Probability 3 Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

Lecture Week 4 Inspecting Data: Distributions

Lecture Week 4 Inspecting Data: Distributions Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your

More information

Descriptive Statistics

Descriptive Statistics Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs

More information

SOLUTIONS TO THE LAB 1 ASSIGNMENT

SOLUTIONS TO THE LAB 1 ASSIGNMENT SOLUTIONS TO THE LAB 1 ASSIGNMENT Question 1 Excel produces the following histogram of pull strengths for the 100 resistors: 2 20 Histogram of Pull Strengths (lb) Frequency 1 10 0 9 61 63 6 67 69 71 73

More information

Modern Methods of Data Analysis - SS 2009

Modern Methods of Data Analysis - SS 2009 Modern Methods of Data Analysis Lecture II (7.04.09) Contents: Characterize data samples Characterize distributions Correlations, covariance Reminder: Average of a Sample arithmetic mean of data set: weighted

More information

Much of what appears here comes from ideas presented in the book:

Much of what appears here comes from ideas presented in the book: Chapter 11 Robust statistical methods Much of what appears here comes from ideas presented in the book: Huber, Peter J. (1981), Robust statistics, John Wiley & Sons (New York; Chichester). There are many

More information

Standard Normal, Inverse Normal and Sampling Distributions

Standard Normal, Inverse Normal and Sampling Distributions Standard Normal, Inverse Normal and Sampling Distributions Section 5.5 & 6.6 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-3339 Cathy

More information

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017

More information

Financial Econometrics Notes. Kevin Sheppard University of Oxford

Financial Econometrics Notes. Kevin Sheppard University of Oxford Financial Econometrics Notes Kevin Sheppard University of Oxford Monday 15 th January, 2018 2 This version: 22:52, Monday 15 th January, 2018 2018 Kevin Sheppard ii Contents 1 Probability, Random Variables

More information

How Wealthy Are Europeans?

How Wealthy Are Europeans? How Wealthy Are Europeans? Grades: 7, 8, 11, 12 (course specific) Description: Organization of data of to examine measures of spread and measures of central tendency in examination of Gross Domestic Product

More information

Session 5: Associations

Session 5: Associations Session 5: Associations Li (Sherlly) Xie http://www.nemoursresearch.org/open/statclass/february2013/ Session 5 Flow 1. Bivariate data visualization Cross-Tab Stacked bar plots Box plot Scatterplot 2. Correlation

More information

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.

Chapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data. -3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data

More information

Standard Deviation. Lecture 18 Section Robb T. Koether. Hampden-Sydney College. Mon, Sep 26, 2011

Standard Deviation. Lecture 18 Section Robb T. Koether. Hampden-Sydney College. Mon, Sep 26, 2011 Standard Deviation Lecture 18 Section 5.3.4 Robb T. Koether Hampden-Sydney College Mon, Sep 26, 2011 Robb T. Koether (Hampden-Sydney College) Standard Deviation Mon, Sep 26, 2011 1 / 42 Outline 1 Variability

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

Descriptive Statistics for Financial Time Series

Descriptive Statistics for Financial Time Series Descriptive Statistics for Financial Time Series Econ 424/Amath 462 Summer 2014 Eric Zivot Updated: July 10, 2014 # Load libraries > library(tseries) > library(performanceanalytics) Data for Examples #

More information

Introduction to Descriptive Statistics

Introduction to Descriptive Statistics Introduction to Descriptive Statistics 17.871 Types of Variables ~Nominal (Quantitative) Nominal (Qualitative) categorical Ordinal Interval or ratio Describing data Moment Non-mean based measure Center

More information

Mini-Lecture 3.1 Measures of Central Tendency

Mini-Lecture 3.1 Measures of Central Tendency Mini-Lecture 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data 3. Explain what it means for a

More information

NCSS Statistical Software. Reference Intervals

NCSS Statistical Software. Reference Intervals Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and

More information

3.1 Measures of Central Tendency

3.1 Measures of Central Tendency 3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent

More information

Simulation Lecture Notes and the Gentle Lentil Case

Simulation Lecture Notes and the Gentle Lentil Case Simulation Lecture Notes and the Gentle Lentil Case General Overview of the Case What is the decision problem presented in the case? What are the issues Sanjay must consider in deciding among the alternative

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good

More information

Lecture 18 Section Mon, Feb 16, 2009

Lecture 18 Section Mon, Feb 16, 2009 The s the Lecture 18 Section 5.3.4 Hampden-Sydney College Mon, Feb 16, 2009 Outline The s the 1 2 3 The 4 s 5 the 6 The s the Exercise 5.12, page 333. The five-number summary for the distribution of income

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 10 (MWF) Checking for normality of the data using the QQplot Suhasini Subba Rao Checking for

More information

Fitting financial time series returns distributions: a mixture normality approach

Fitting financial time series returns distributions: a mixture normality approach Fitting financial time series returns distributions: a mixture normality approach Riccardo Bramante and Diego Zappa * Abstract Value at Risk has emerged as a useful tool to risk management. A relevant

More information

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.

Center and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers. Chapter 3 Section3-: Measures of Center Section 3-3: Measurers of Variation Section 3-4: Measures of Relative Standing Section 3-5: Exploratory Data Analysis Describing Distributions with Numbers The overall

More information

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model

STAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good

More information

Amath 546/Econ 589 Univariate GARCH Models

Amath 546/Econ 589 Univariate GARCH Models Amath 546/Econ 589 Univariate GARCH Models Eric Zivot April 24, 2013 Lecture Outline Conditional vs. Unconditional Risk Measures Empirical regularities of asset returns Engle s ARCH model Testing for ARCH

More information