Introduction to Computational Finance and Financial Econometrics Descriptive Statistics
|
|
- David Blair
- 5 years ago
- Views:
Transcription
1 You can t see this text! Introduction to Computational Finance and Financial Econometrics Descriptive Statistics Eric Zivot Summer 2015 Eric Zivot (Copyright 2015) Descriptive Statistics 1 / 28
2 Outline 1 Univariate Descriptive Statistics 2 Bivariate Descriptive Statistics 3 Time Series Descriptive Statistics Eric Zivot (Copyright 2015) Descriptive Statistics 2 / 28
3 Covariance Stationarity {..., X 1,..., X T,...} = {X t } is a covariance stationary stochastic process, and each X t is identically distributed with unknown pdf f (x). Recall, E[X t ] = µ indep of t var(x t ) = σ 2 indep of t cov(x t, X t j ) = γ j indep of t cor(x t, X t j ) = ρ j indep of t Eric Zivot (Copyright 2015) Descriptive Statistics 3 / 28
4 Descriptive Statistics Observed Sample: {X 1 = x 1,..., X T = x T } = {x t } T t=1 are observations generated by the stochastic process. Descriptive Statistics: Data summaries (statistics) to describe certain features of the data, to learn about the unknown pdf, f (x), and to capture the observed dependencies in the data. Eric Zivot (Copyright 2015) Descriptive Statistics 4 / 28
5 Time Plots Line plot of time series data with time/dates on horizontal axis. Visualization of data - uncover trends, assess stationarity and time dependence Spot unusual behavior Plotting multiple time series can reveal commonality across series Eric Zivot (Copyright 2015) Descriptive Statistics 5 / 28
6 Histograms Goal: Describe the shape of the distribution of the data {x t } T t=1 Hisogram Construction: 1 Order data from smallest to largest values 2 Divide range into N equally spaced bins [ ] 3 Count number of observations in each bin 4 Create bar chart (optionally normalize area to equal 1) Eric Zivot (Copyright 2015) Descriptive Statistics 6 / 28
7 R Functions Function hist() density() Description compute histogram compute smoothed histogram Note: The density() function computes a smoothed (kernel density) estimate of the unknown pdf at the point x using the formula: ˆf (x) = 1 Tb T ( ) x xt k b t=1 k( ) = kernel function b = bandwidth (smoothing) parameter where k( ) is a pdf symmetric about zero (typically the standard normal distribution). See Ruppert Chapter 4 for details. Eric Zivot (Copyright 2015) Descriptive Statistics 7 / 28
8 Empirical Quantiles/Percentiles Percentiles: For α [0, 1], the 100 α th percentile (empirical quantile) of a sample of data is the data value ˆq α such that α 100% of the data are less than ˆq α. Quartiles: ˆq.25 = first quartile ˆq.50 = second quartile (median) ˆq.75 = third quartile ˆq.75 ˆq.25 = interquartile range (IQR) Eric Zivot (Copyright 2015) Descriptive Statistics 8 / 28
9 R Functions Function sort() min() max() range() quantile() median() IQR() summary() Description sort elements of data vector compute minimum value of data vector compute maximum value of data vector compute min and max of a data vector compute empirical quantiles compute median compute inter-quartile range compute summary statistics Eric Zivot (Copyright 2015) Descriptive Statistics 9 / 28
10 Historical Value-at-Risk Let {R t } T t=1 denote a sample of T simple monthly returns on an investment, and let $W 0 be the initial value of an investment. For α (0, 1), the historical VaR α is: $W 0 ˆq R α ˆq R α = empirical α 100% quantile of {R t } T t=1 Note: For continuously compounded returns {r t } T t=1 use: $W 0 (exp(ˆq r α) 1) ˆq r α = empirical α 100% quantile of {r t } T t=1 Eric Zivot (Copyright 2015) Descriptive Statistics 10 / 28
11 Sample Statistics Plug-In Principle: Estimate population quantities using sample statistics. Sample Average (Mean): 1 T T x t = x = ˆµ x t=1 Sample Variance: 1 T 1 T (x t x) 2 = sx 2 = ˆσ x 2 t=1 Sample Standard Deviation: s 2 x = s x = ˆσ x Eric Zivot (Copyright 2015) Descriptive Statistics 11 / 28
12 Sample Statistics cont. Sample Skewness: 1 T 1 Sample Kurtosis: 1 T 1 T (x t x) 3 /sx 3 = ŝkew t=1 T (x t x) 4 /sx 4 = kurt t=1 Sample Excess Kurtosis: kurt 3 Eric Zivot (Copyright 2015) Descriptive Statistics 12 / 28
13 R Functions Function Package Description mean() base compute sample mean colmeans() base compute column means of matrix var() stats compute sample variance sd() stats compute sample standard deviation skewness() PerformanceAnalytics compute sample skewness kurtosis() PerformanceAnalytics compute sample excess kurtosis Note: Use the R function apply(), to apply functions over rows or columns of a matrix or data.frame. Eric Zivot (Copyright 2015) Descriptive Statistics 13 / 28
14 Empirical Cumulative Distribution Function Recall, the CDF of a random variable X is: F X (x) = Pr(X x) The empirical CDF of a random sample is: ˆF X (x) = 1 n (#x i x) = number of x i values x sample size Eric Zivot (Copyright 2015) Descriptive Statistics 14 / 28
15 Empirical Cumulative Distribution Function cont. How to compute and plot ˆF X (x) for a sample {x 1,..., x n }. Sort data from smallest to largest values: {x (1),..., x (n) } and compute ˆF X (x) at these points Plot ˆF X (x) against sorted data {x (1),..., x (n) } Use the R function ecdf() Note: x (1),..., x (n) are called the order statistics. In particular, x (1) = min(x 1,..., x n ) and x (n) = max(x 1,..., x n ). Eric Zivot (Copyright 2015) Descriptive Statistics 15 / 28
16 Quantile-Quantile (QQ) Plots A QQ plot is useful for comparing your data with the quantiles of a distribution (usually the normal distribution) that you think is appropriate for your data. You interpret the QQ plot in the following way: If the points fall close to a straight line, your conjectured distribution is appropriate If the points do not fall close to a straight line, your conjectured distribution is not appropriate and you should consider a different distribution Eric Zivot (Copyright 2015) Descriptive Statistics 16 / 28
17 R Functions Function Package Description qqnorm() stats QQ-plot against normal distribution qqline() stats draw straight line on QQ-plot qqplot() car QQ-plot against specified distribution Eric Zivot (Copyright 2015) Descriptive Statistics 17 / 28
18 Outliers Extremely large or small values are called outliers Outliers can greatly influence the values of common descriptive statistics. In particular, the sample mean, variance, standard deviation, skewness and kurtosis Percentile measures are more robust to outliers: outliers do not greatly influence these measures (e.g. median instead of mean; IQR instead of SD) Eric Zivot (Copyright 2015) Descriptive Statistics 18 / 28
19 Outliers cont. IQR (interquartile range) - outlier robust measure of spread: IQR = q.75 q.25 Moderate Outlier: ˆq IQR < x < ˆq IQR ˆq.25 3 IQR < x < ˆq IQR Extreme Outlier: x > ˆq IQR x < ˆq.25 3 IQR Eric Zivot (Copyright 2015) Descriptive Statistics 19 / 28
20 Boxplots A box plot displays the locations of the basic features of the distribution of one-dimensional data the median, the upper and lower quartiles, outer fences that indicate the extent of your data beyond the quartiles, and outliers, if any. R functions Function Package Description boxplot() graphics box plots for multiple series chart.boxplot() PerformanceAnalytics box plots for asset returns Eric Zivot (Copyright 2015) Descriptive Statistics 20 / 28
21 Outline 1 Univariate Descriptive Statistics 2 Bivariate Descriptive Statistics 3 Time Series Descriptive Statistics Eric Zivot (Copyright 2015) Descriptive Statistics 21 / 28
22 Bivariate Descriptive Statistics {..., (X 1, Y 1 ), (X 2, Y 2 ),... (X T, Y T ),...} = {(X t, Y t )} Covariance stationary bivariate stochastic process with realized values: {(x 1, y 1 ), (x 2, y 2 ),... (x T, y T )} = {(x t, y t )} T t=1 Scatterplot: XY plot of bivariate data R functions: plot(), pairs() Eric Zivot (Copyright 2015) Descriptive Statistics 22 / 28
23 Bivariate Descriptive Statistics cont. Sample Covariance: 1 T 1 Sample Correlation: T (x t x)(y t ȳ) = s xy = ˆσ xy t=1 s xy s x s y = r xy = ˆρ xy Eric Zivot (Copyright 2015) Descriptive Statistics 23 / 28
24 R Functions Function Package Description var() stats compute sample variance-covariance matrix cov() stats compute sample variance-covariance matrix cor() stats compute sample correlation matrix Eric Zivot (Copyright 2015) Descriptive Statistics 24 / 28
25 Outline 1 Univariate Descriptive Statistics 2 Bivariate Descriptive Statistics 3 Time Series Descriptive Statistics Eric Zivot (Copyright 2015) Descriptive Statistics 25 / 28
26 Time Series Descriptive Statistics Sample Autocovariance: ˆγ j = 1 T 1 T t=j+1 Sample Autocorrelation: ˆρ j = ˆγ j, j = 1, 2,... ˆσ 2 (x t x)(x t j x) j = 1, 2,... Sample Autocorrelation Function (SACF): Plot ˆρ j against j Eric Zivot (Copyright 2015) Descriptive Statistics 26 / 28
27 R Functions Function Package Description acf() stats compute and plot sample autocorrelations chart.acf() PerformanceAnalytics plot sample autocorrelations chart.acfplus() PerformanceAnalytics plot sample autocorrelations Eric Zivot (Copyright 2015) Descriptive Statistics 27 / 28
28 You can t see this text! faculty.washington.edu/ezivot/ Eric Zivot (Copyright 2015) Descriptive Statistics 28 / 28
Chapter 1. Descriptive Statistics for Financial Data. 1.1 UnivariateDescriptiveStatistics
Chapter 1 Descriptive Statistics for Financial Data Updated: July 7, 2014 In this chapter we use graphical and numerical descriptive statistics to study the distribution and dependence properties of daily
More informationI. Return Calculations (20 pts, 4 points each)
University of Washington Winter 015 Department of Economics Eric Zivot Econ 44 Midterm Exam Solutions This is a closed book and closed note exam. However, you are allowed one page of notes (8.5 by 11 or
More informationLecture 2 Describing Data
Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms
More informationMidterm Exam. b. What are the continuously compounded returns for the two stocks?
University of Washington Fall 004 Department of Economics Eric Zivot Economics 483 Midterm Exam This is a closed book and closed note exam. However, you are allowed one page of notes (double-sided). Answer
More informationappstats5.notebook September 07, 2016 Chapter 5
Chapter 5 Describing Distributions Numerically Chapter 5 Objective: Students will be able to use statistics appropriate to the shape of the data distribution to compare of two or more different data sets.
More information1 Describing Distributions with numbers
1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write
More informationRandom Variables and Probability Distributions
Chapter 3 Random Variables and Probability Distributions Chapter Three Random Variables and Probability Distributions 3. Introduction An event is defined as the possible outcome of an experiment. In engineering
More informationFinancial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR
Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Nelson Mark University of Notre Dame Fall 2017 September 11, 2017 Introduction
More informationExploratory Data Analysis
Exploratory Data Analysis Stemplots (or Stem-and-leaf plots) Stemplot and Boxplot T -- leading digits are called stems T -- final digits are called leaves STAT 74 Descriptive Statistics 2 Example: (number
More informationChapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1
Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and
More informationDATA SUMMARIZATION AND VISUALIZATION
APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296
More informationSection3-2: Measures of Center
Chapter 3 Section3-: Measures of Center Notation Suppose we are making a series of observations, n of them, to be exact. Then we write x 1, x, x 3,K, x n as the values we observe. Thus n is the total number
More informationEconomics 483. Midterm Exam. 1. Consider the following monthly data for Microsoft stock over the period December 1995 through December 1996:
University of Washington Summer Department of Economics Eric Zivot Economics 3 Midterm Exam This is a closed book and closed note exam. However, you are allowed one page of handwritten notes. Answer all
More informationChen-wei Chiu ECON 424 Eric Zivot July 17, Lab 4. Part I Descriptive Statistics. I. Univariate Graphical Analysis 1. Separate & Same Graph
Chen-wei Chiu ECON 424 Eric Zivot July 17, 2014 Part I Descriptive Statistics I. Univariate Graphical Analysis 1. Separate & Same Graph Lab 4 Time Series Plot Bar Graph The plots show that the returns
More informationNormal Distribution. Notes. Normal Distribution. Standard Normal. Sums of Normal Random Variables. Normal. approximation of Binomial.
Lecture 21,22, 23 Text: A Course in Probability by Weiss 8.5 STAT 225 Introduction to Probability Models March 31, 2014 Standard Sums of Whitney Huang Purdue University 21,22, 23.1 Agenda 1 2 Standard
More informationDescriptive Statistics Bios 662
Descriptive Statistics Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-08-19 08:51 BIOS 662 1 Descriptive Statistics Descriptive Statistics Types of variables
More informationStat 101 Exam 1 - Embers Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.
More information2 Exploring Univariate Data
2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting
More informationSection 6-1 : Numerical Summaries
MAT 2377 (Winter 2012) Section 6-1 : Numerical Summaries With a random experiment comes data. In these notes, we learn techniques to describe the data. Data : We will denote the n observations of the random
More informationIntroduction to R (2)
Introduction to R (2) Boxplots Boxplots are highly efficient tools for the representation of the data distributions. The five number summary can be located in boxplots. Additionally, we can distinguish
More informationMeasures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean
Measure of Center Measures of Center The value at the center or middle of a data set 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) 1 2 Mean Notation The measure of center obtained by adding the values
More informationLecture 1: Empirical Properties of Returns
Lecture 1: Empirical Properties of Returns Econ 589 Eric Zivot Spring 2011 Updated: March 29, 2011 Daily CC Returns on MSFT -0.3 r(t) -0.2-0.1 0.1 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996
More informationMVE051/MSG Lecture 7
MVE051/MSG810 2017 Lecture 7 Petter Mostad Chalmers November 20, 2017 The purpose of collecting and analyzing data Purpose: To build and select models for parts of the real world (which can be used for
More informationChapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1
Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.
More informationMonte Carlo Simulation (Random Number Generation)
Monte Carlo Simulation (Random Number Generation) Revised: 10/11/2017 Summary... 1 Data Input... 1 Analysis Options... 6 Summary Statistics... 6 Box-and-Whisker Plots... 7 Percentiles... 9 Quantile Plots...
More informationDot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.
Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,
More informationThe Normal Distribution
Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we
More informationAP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE
AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,
More informationSome estimates of the height of the podium
Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40
More informationDescribing Data: One Quantitative Variable
STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive
More informationStatistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015
Statistical Analysis of Data from the Stock Markets UiO-STK4510 Autumn 2015 Sampling Conventions We observe the price process S of some stock (or stock index) at times ft i g i=0,...,n, we denote it by
More informationHandout 4 numerical descriptive measures part 2. Example 1. Variance and Standard Deviation for Grouped Data. mf N 535 = = 25
Handout 4 numerical descriptive measures part Calculating Mean for Grouped Data mf Mean for population data: µ mf Mean for sample data: x n where m is the midpoint and f is the frequency of a class. Example
More informationSTAT 113 Variability
STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2
More informationOverview/Outline. Moving beyond raw data. PSY 464 Advanced Experimental Design. Describing and Exploring Data The Normal Distribution
PSY 464 Advanced Experimental Design Describing and Exploring Data The Normal Distribution 1 Overview/Outline Questions-problems? Exploring/Describing data Organizing/summarizing data Graphical presentations
More informationTopic 8: Model Diagnostics
Topic 8: Model Diagnostics Outline Diagnostics to check model assumptions Diagnostics concerning X Diagnostics using the residuals Diagnostics and remedial measures Diagnostics: look at the data to diagnose
More informationBasic Procedure for Histograms
Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that
More informationMath 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment
Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class
More informationSTATISTICAL DISTRIBUTIONS AND THE CALCULATOR
STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either
More informationSimple Descriptive Statistics
Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency
More informationPercentiles, STATA, Box Plots, Standardizing, and Other Transformations
Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go
More informationLecture 23. STAT 225 Introduction to Probability Models April 4, Whitney Huang Purdue University. Normal approximation to Binomial
Lecture 23 STAT 225 Introduction to Probability Models April 4, 2014 approximation Whitney Huang Purdue University 23.1 Agenda 1 approximation 2 approximation 23.2 Characteristics of the random variable:
More informationLecture 6: Non Normal Distributions
Lecture 6: Non Normal Distributions and their Uses in GARCH Modelling Prof. Massimo Guidolin 20192 Financial Econometrics Spring 2015 Overview Non-normalities in (standardized) residuals from asset return
More informationExploring Data and Graphics
Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data
More informationFrequency Distribution and Summary Statistics
Frequency Distribution and Summary Statistics Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. Stemplot 2. Frequency table 3. Summary
More informationData that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.
Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer
More informationFinancial Time Series and Their Characteristics
Financial Time Series and Their Characteristics Egon Zakrajšek Division of Monetary Affairs Federal Reserve Board Summer School in Financial Mathematics Faculty of Mathematics & Physics University of Ljubljana
More informationBoth the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.
Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of
More informationStatistics I Chapter 2: Analysis of univariate data
Statistics I Chapter 2: Analysis of univariate data Numerical summary Central tendency Location Spread Form mean quartiles range coeff. asymmetry median percentiles interquartile range coeff. kurtosis
More informationNOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS
NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows
More informationSTA 248 H1S Winter 2008 Assignment 1 Solutions
1. (a) Measures of location: STA 248 H1S Winter 2008 Assignment 1 Solutions i. The mean, 100 1=1 x i/100, can be made arbitrarily large if one of the x i are made arbitrarily large since the sample size
More informationChapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)
Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop
More informationCategorical. A general name for non-numerical data; the data is separated into categories of some kind.
Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,
More informationThe Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).
We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center
More informationEmpirical Rule (P148)
Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall
More informationStandardized Data Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis
Descriptive Statistics (Part 2) 4 Chapter Percentiles, Quartiles and Box Plots Grouped Data Skewness and Kurtosis McGraw-Hill/Irwin Copyright 2009 by The McGraw-Hill Companies, Inc. Chebyshev s Theorem
More informationCase Study: Heavy-Tailed Distribution and Reinsurance Rate-making
Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making May 30, 2016 The purpose of this case study is to give a brief introduction to a heavy-tailed distribution and its distinct behaviors in
More informationMath 140 Introductory Statistics. First midterm September
Math 140 Introductory Statistics First midterm September 23 2010 Box Plots Graphical display of 5 number summary Q1, Q2 (median), Q3, max, min Outliers If a value is more than 1.5 times the IQR from the
More informationCHAPTER 2 Describing Data: Numerical
CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of
More informationSome Characteristics of Data
Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key
More informationContents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali
Part I Descriptive Statistics 1 Introduction and Framework... 3 1.1 Population, Sample, and Observations... 3 1.2 Variables.... 4 1.2.1 Qualitative and Quantitative Variables.... 5 1.2.2 Discrete and Continuous
More informationNumerical Descriptions of Data
Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =
More informationUnit 2 Statistics of One Variable
Unit 2 Statistics of One Variable Day 6 Summarizing Quantitative Data Summarizing Quantitative Data We have discussed how to display quantitative data in a histogram It is useful to be able to describe
More informationWeek 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.
Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow
More informationSYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data
SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu September 5, 2015
More informationThe Constant Expected Return Model
Chapter 1 The Constant Expected Return Model Date: February 5, 2015 The first model of asset returns we consider is the very simple constant expected return (CER) model. This model is motivated by the
More informationCopyright 2005 Pearson Education, Inc. Slide 6-1
Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is
More informationIntroduction to Computational Finance and Financial Econometrics Introduction to Portfolio Theory
You can t see this text! Introduction to Computational Finance and Financial Econometrics Introduction to Portfolio Theory Eric Zivot Spring 2015 Eric Zivot (Copyright 2015) Introduction to Portfolio Theory
More informationDescription of Data I
Description of Data I (Summary and Variability measures) Objectives: Able to understand how to summarize the data Able to understand how to measure the variability of the data Able to use and interpret
More informationThe University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay. Solutions to Final Exam.
The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay Solutions to Final Exam Problem A: (32 pts) Answer briefly the following questions. 1. Suppose
More informationEcon 424/CFRM 462 Portfolio Risk Budgeting
Econ 424/CFRM 462 Portfolio Risk Budgeting Eric Zivot August 14, 2014 Portfolio Risk Budgeting Idea: Additively decompose a measure of portfolio risk into contributions from the individual assets in the
More informationBusiness Statistics 41000: Probability 3
Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404
More informationDescriptive Statistics
Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations
More informationLecture Week 4 Inspecting Data: Distributions
Lecture Week 4 Inspecting Data: Distributions Introduction to Research Methods & Statistics 2013 2014 Hemmo Smit So next week No lecture & workgroups But Practice Test on-line (BB) Enter data for your
More informationDescriptive Statistics
Petra Petrovics Descriptive Statistics 2 nd seminar DESCRIPTIVE STATISTICS Definition: Descriptive statistics is concerned only with collecting and describing data Methods: - statistical tables and graphs
More informationSOLUTIONS TO THE LAB 1 ASSIGNMENT
SOLUTIONS TO THE LAB 1 ASSIGNMENT Question 1 Excel produces the following histogram of pull strengths for the 100 resistors: 2 20 Histogram of Pull Strengths (lb) Frequency 1 10 0 9 61 63 6 67 69 71 73
More informationModern Methods of Data Analysis - SS 2009
Modern Methods of Data Analysis Lecture II (7.04.09) Contents: Characterize data samples Characterize distributions Correlations, covariance Reminder: Average of a Sample arithmetic mean of data set: weighted
More informationMuch of what appears here comes from ideas presented in the book:
Chapter 11 Robust statistical methods Much of what appears here comes from ideas presented in the book: Huber, Peter J. (1981), Robust statistics, John Wiley & Sons (New York; Chichester). There are many
More informationStandard Normal, Inverse Normal and Sampling Distributions
Standard Normal, Inverse Normal and Sampling Distributions Section 5.5 & 6.6 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-3339 Cathy
More informationSummarising Data. Summarising Data. Examples of Types of Data. Types of Data
Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017
More informationFinancial Econometrics Notes. Kevin Sheppard University of Oxford
Financial Econometrics Notes Kevin Sheppard University of Oxford Monday 15 th January, 2018 2 This version: 22:52, Monday 15 th January, 2018 2018 Kevin Sheppard ii Contents 1 Probability, Random Variables
More informationHow Wealthy Are Europeans?
How Wealthy Are Europeans? Grades: 7, 8, 11, 12 (course specific) Description: Organization of data of to examine measures of spread and measures of central tendency in examination of Gross Domestic Product
More informationSession 5: Associations
Session 5: Associations Li (Sherlly) Xie http://www.nemoursresearch.org/open/statclass/february2013/ Session 5 Flow 1. Bivariate data visualization Cross-Tab Stacked bar plots Box plot Scatterplot 2. Correlation
More informationChapter 2: Descriptive Statistics. Mean (Arithmetic Mean): Found by adding the data values and dividing the total by the number of data.
-3: Measure of Central Tendency Chapter : Descriptive Statistics The value at the center or middle of a data set. It is a tool for analyzing data. Part 1: Basic concepts of Measures of Center Ex. Data
More informationStandard Deviation. Lecture 18 Section Robb T. Koether. Hampden-Sydney College. Mon, Sep 26, 2011
Standard Deviation Lecture 18 Section 5.3.4 Robb T. Koether Hampden-Sydney College Mon, Sep 26, 2011 Robb T. Koether (Hampden-Sydney College) Standard Deviation Mon, Sep 26, 2011 1 / 42 Outline 1 Variability
More informationLecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution
More informationDescriptive Statistics for Financial Time Series
Descriptive Statistics for Financial Time Series Econ 424/Amath 462 Summer 2014 Eric Zivot Updated: July 10, 2014 # Load libraries > library(tseries) > library(performanceanalytics) Data for Examples #
More informationIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics 17.871 Types of Variables ~Nominal (Quantitative) Nominal (Qualitative) categorical Ordinal Interval or ratio Describing data Moment Non-mean based measure Center
More informationMini-Lecture 3.1 Measures of Central Tendency
Mini-Lecture 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data 3. Explain what it means for a
More informationNCSS Statistical Software. Reference Intervals
Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and
More information3.1 Measures of Central Tendency
3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent
More informationSimulation Lecture Notes and the Gentle Lentil Case
Simulation Lecture Notes and the Gentle Lentil Case General Overview of the Case What is the decision problem presented in the case? What are the issues Sanjay must consider in deciding among the alternative
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial
More informationSTAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model
STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good
More informationLecture 18 Section Mon, Feb 16, 2009
The s the Lecture 18 Section 5.3.4 Hampden-Sydney College Mon, Feb 16, 2009 Outline The s the 1 2 3 The 4 s 5 the 6 The s the Exercise 5.12, page 333. The five-number summary for the distribution of income
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 10 (MWF) Checking for normality of the data using the QQplot Suhasini Subba Rao Checking for
More informationFitting financial time series returns distributions: a mixture normality approach
Fitting financial time series returns distributions: a mixture normality approach Riccardo Bramante and Diego Zappa * Abstract Value at Risk has emerged as a useful tool to risk management. A relevant
More informationCenter and Spread. Measures of Center and Spread. Example: Mean. Mean: the balance point 2/22/2009. Describing Distributions with Numbers.
Chapter 3 Section3-: Measures of Center Section 3-3: Measurers of Variation Section 3-4: Measures of Relative Standing Section 3-5: Exploratory Data Analysis Describing Distributions with Numbers The overall
More informationSTAT Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model
STAT 203 - Chapter 6 The Standard Deviation (SD) as a Ruler and The Normal Model In Chapter 5, we introduced a few measures of center and spread, and discussed how the mean and standard deviation are good
More informationAmath 546/Econ 589 Univariate GARCH Models
Amath 546/Econ 589 Univariate GARCH Models Eric Zivot April 24, 2013 Lecture Outline Conditional vs. Unconditional Risk Measures Empirical regularities of asset returns Engle s ARCH model Testing for ARCH
More information