Data analysis methods in weather and climate research

Similar documents
STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.

STATISTICS and PROBABILITY

Chapter 8. Introduction to Statistical Inference

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION

Applied Statistics I

Lecture Data Science

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Financial Econometrics Jeffrey R. Russell. Midterm 2014 Suggested Solutions. TA: B. B. Deng

Midterm Exam. b. What are the continuously compounded returns for the two stocks?

Financial Econometrics Jeffrey R. Russell Midterm 2014

Module 4: Point Estimation Statistics (OA3102)

Chapter 7: Point Estimation and Sampling Distributions

The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Data Distributions and Normality

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz

Random Variables and Probability Distributions

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Introduction to Statistical Data Analysis II

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Financial Time Series and Their Characteristics

Probability & Statistics

Chapter 5. Statistical inference for Parametric Models

12 The Bootstrap and why it works

Confidence Intervals Introduction

Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR

Much of what appears here comes from ideas presented in the book:

Chapter 7 - Lecture 1 General concepts and criteria

MgtOp S 215 Chapter 8 Dr. Ahn

Application of the Bootstrap Estimating a Population Mean

Are Market Neutral Hedge Funds Really Market Neutral?

Statistics for Business and Economics

Point Estimators. STATISTICS Lecture no. 10. Department of Econometrics FEM UO Brno office 69a, tel

A NEW POINT ESTIMATOR FOR THE MEDIAN OF GAMMA DISTRIBUTION

Linda Allen, Jacob Boudoukh and Anthony Saunders, Understanding Market, Credit and Operational Risk: The Value at Risk Approach

AP Statistics Chapter 6 - Random Variables

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی

Point Estimation. Principle of Unbiased Estimation. When choosing among several different estimators of θ, select one that is unbiased.

Module 3: Sampling Distributions and the CLT Statistics (OA3102)

Chapter 9. Sampling Distributions. A sampling distribution is created by, as the name suggests, sampling.

MATH 3200 Exam 3 Dr. Syring

Topic 8: Model Diagnostics

Market Risk Analysis Volume IV. Value-at-Risk Models

Modelling of Long-Term Risk

Discrete probability distributions

Asymptotic Distribution Free Interval Estimation

Basic Procedure for Histograms

Lecture 9 - Sampling Distributions and the CLT

Point Estimation. Edwin Leuven

CS145: Probability & Computing

Definition 9.1 A point estimate is any function T (X 1,..., X n ) of a random sample. We often write an estimator of the parameter θ as ˆθ.

Descriptive Statistics

PRMIA Exam 8002 PRM Certification - Exam II: Mathematical Foundations of Risk Measurement Version: 6.0 [ Total Questions: 132 ]

Measures of Center. Mean. 1. Mean 2. Median 3. Mode 4. Midrange (rarely used) Measure of Center. Notation. Mean

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

M249 Diagnostic Quiz

Analysis of truncated data with application to the operational risk estimation

Notice that X2 and Y2 are skewed. Taking the SQRT of Y2 reduces the skewness greatly.

Computing Statistics ID1050 Quantitative & Qualitative Reasoning

Continuous Probability Distributions & Normal Distribution

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Chapter 6 Confidence Intervals

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study

Statistical analysis and bootstrapping

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Examples: Random Variables. Discrete and Continuous Random Variables. Probability Distributions

CHAPTER 5. Introduction to Risk, Return, and the Historical Record INVESTMENTS BODIE, KANE, MARCUS

Quantitative Methods

STA Module 3B Discrete Random Variables

Contents. 1 Introduction. Math 321 Chapter 5 Confidence Intervals. 1 Introduction 1

Econ 6900: Statistical Problems. Instructor: Yogesh Uppal

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali

Chapter 8: Sampling distributions of estimators Sections

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

Tests for One Variance

Chapter 5: Statistical Inference (in General)

STA Rev. F Learning Objectives. What is a Random Variable? Module 5 Discrete Random Variables

Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance

STAT/MATH 395 PROBABILITY II

Can we use kernel smoothing to estimate Value at Risk and Tail Value at Risk?

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

Simulation Lecture Notes and the Gentle Lentil Case

Shifting our focus. We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?

Chapter 5 Discrete Probability Distributions. Random Variables Discrete Probability Distributions Expected Value and Variance

Chapter 7. Sampling Distributions

PIVOTAL QUANTILE ESTIMATES IN VAR CALCULATIONS. Peter Schaller, Bank Austria Creditanstalt (BA-CA) Wien,

Paper Reference. Paper Reference(s) 6683/01 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary

Back to estimators...

Chapter 4: Estimation

Section 7.2. Estimating a Population Proportion

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

1. The number of dental claims for each insured in a calendar year is distributed as a Geometric distribution with variance of

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

The SAS System 11:03 Monday, November 11,

Value (x) probability Example A-2: Construct a histogram for population Ψ.

Modelling catastrophic risk in international equity markets: An extreme value approach. JOHN COTTER University College Dublin

physicsandmathstutor.com Paper Reference Statistics S1 Advanced/Advanced Subsidiary Wednesday 20 May 2009 Afternoon Time: 1 hour 30 minutes

Chapter 7 Sampling Distributions and Point Estimation of Parameters

2011 Pearson Education, Inc

Transcription:

Data analysis methods in weather and climate research Dr. David B. Stephenson Department of Meteorology University of Reading www.met.rdg.ac.uk/cag 5. Parameter estimation Fitting probability models he sampling distribution of an estimator Error bars and Confidence intervals ypes of estimator Accuracy bias and efficiency of estimators (c 004 D.B.Stephenson@reading.ac.uk 1 5. Probability modelling in 6 steps Modelling strategy: 1. Explore the data sample (EDA. Identify a suitable distribution 3. Fit the distribution to the data by estimation of the parameters 4. Check the goodness-of-fit 5. Make out-of-sample predictions 6. Go back to 1 or if needed (c 004 D.B.Stephenson@reading.ac.uk 5. Parameter estimation Estimate the values of population parameter(s that give the best fit of the probability model ~ f ( x; to the observed sample of data. Note: we fit the model to the data NO the data to the model! (c 004 D.B.Stephenson@reading.ac.uk 3 (c 000 Dr. David Stephenson

5. Sample statistics and estimators Parameters are estimated by using sample statistics [] of the original random variables. Such sample statistics are known as estimators. For example the population mean parameter in the Normal distribution N ( can be estimated by the sample mean x. his is known as a point estimate. he hat symbol denotes estimate of. (c 004 D.B.Stephenson@reading.ac.uk 4 5. Interval estimates Rather than just give a single best estimate of a parameter ( point estimate it is more informative to give a likely range of possible values in other words an interval estimate. he simplest way to do this is to quote the best estimate plus/minus the standard deviation in this estimate: he standard deviation quantifies the amount of uncertainty in the estimate caused by sampling. (c 004 D.B.Stephenson@reading.ac.uk 5 5. Sampling distribution Each sample statistic [] is distributed with its own sampling distribution : ~ f ( n he sampling distribution depends on: Choice of sample statistic; Sample size n; Parameters of the original distribution f ( ~ (c 004 D.B.Stephenson@reading.ac.uk 6 (c 000 Dr. David Stephenson

5. Mean of iid normally distributed variables For iid ( independent and identically distributed normally distributed random variables: E( ~ N( ~ N( Var ( / n (c 004 D.B.Stephenson@reading.ac.uk 7 Sampling distribution of sample mean / n 5. Central Limit heorem ~ lim n f ( ~ and independen t N( / n his works for ANY f( with finite mean and variance and explains why we see so many variables that are normally distributed e.g. mean errors due to many random effects. (c 004 D.B.Stephenson@reading.ac.uk 8 5. Definition of standard error he standard error is the standard deviation of the sample statistic. i.e. Var( e.g. for sample mean of iid variables: n (c 004 D.B.Stephenson@reading.ac.uk 9 (c 000 Dr. David Stephenson

5. Confidence intervals (C.I. s he (1 100% confidence interval of a sample statistic is the interval between the / and the 1 / quantiles of the sampling distribution. Pr{ t t } 1 / confidence level 1 / here is probability 1 overlap the true value. that the interval will (c 004 D.B.Stephenson@reading.ac.uk 10 5. Examples of confidence intervals (c 004 D.B.Stephenson@reading.ac.uk 11 5. Some commonly used C.I. s Alpha 1-alpha Zc Description 0.50 0.50 0.68 50% C.I. +/- probable error 0.3 0.68 1.00 68% C.I. +/- 1 std. errors 0.05 0.95 1.96~ 95% C.I. ~+/- std. errors 0.001 0.999 3.9 99.9% C.I. ~+/- 3 std. errors (c 004 D.B.Stephenson@reading.ac.uk 1 (c 000 Dr. David Stephenson

5. Choice of estimator Many ways to choose the estimators such as: Method of moments use sample moments e.g. mean variance skewness etc. Robust estimation use rank statistics such as the median IQR etc. instead. Maximum Likelihood Estimation choose estimator so that it maximises the likelihood of our data sampling occurring. (c 004 D.B.Stephenson@reading.ac.uk 13 5. Accuracy bias and efficiency he accuracy of an estimator can be quantified as follows: Mean Squared Error E (( ( E( Var( squared " bias" " efficiency" here is invariably a trade-off between bias and efficiency. (c 004 D.B.Stephenson@reading.ac.uk 14 5. Summary Model is fit to the data by using sample statistics (estimators to estimate the true model parameters Interval estimates give a range of probable values rather than a single point estimate Each estimator has its own probability distribution known as a sampling distribution he standard deviation of the estimator is known as the standard error he sampling distribution can be used to construct confidence intervals in which the true value is most likely to be found. here are several methods for estimating parameters: moment method robust estimation maximum likelihood estimation. Different estimators have different accuracies bias and efficiency. (c 004 D.B.Stephenson@reading.ac.uk 15 (c 000 Dr. David Stephenson