Improving the accuracy of estimates for complex sampling in auditing 1.

Size: px
Start display at page:

Download "Improving the accuracy of estimates for complex sampling in auditing 1."

Transcription

1 Improving the accuracy of estimates for complex sampling in auditing 1. Y. G. Berger 1 P. M. Chiodini 2 M. Zenga 2 1 University of Southampton (UK) 2 University of Milano-Bicocca (Italy) The research leading to these results has received support under the European Commissions 7th Framework Programme (FP7/ ) under grant agreement n312691, InGRID Inclusive Growth Research Infrastructure Diffusion. Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

2 Overview 1 Audit Sampling 2 Definitions 3 Simulation study 4 Conclusions and Future research Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

3 AUDIT SAMPLING In auditing the goal is to verify if the values of the accounts reported by the company are not materially misstated To determine the total error in the amount reported by the company auditors should audit all accounts in the population of accounts. This is not possible as it is too costly and time expensive! In practice auditors verify only a sample of accounts to estimate the error of the total population of accounts (total amount error & error rate) Common statistical methods to select an audit sample are by without replacement or by probability proportional to size (PPS) Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

4 SAMPLING PLANS IN AUDITING For the auditor the only interest is to verify that the error rate is within a pre-assigned value, if the observed rate is greater than the pre-assigned value a census is made. Practically speaking all the sampling methods are reliable for audit sampling, by the way Monetary Unit Sampling (MUS) - a particular case of the PPS - is the most popular! This sampling method directs efforts towards high-valued items which contain the greatest potential of large overstatement Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

5 SAMPLING PLANS IN AUDITING The way the auditors make their choices in terms of sampling strategies is frequently based on personal experience! In general: book values distributions are highly positively skewed with different percentage of errors Two main scenarios can be met: - a population with relatively large number of small accounts combined with high rate of mistakes - a population with relatively large number of accounts combined with a small rate of mistakes Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

6 DEFINITIONS An accounting population consist of N line items with book (or recorded) values, y 1, y 2,, y N and total book amount T y defined by: T y = N y i. i=1 The audited (true) amount of the N line items in the population is denoted by x 1, x 2,..., x N and the total audited amount is: T x = N x i. i=1 Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

7 DEFINITIONS The error in item i, is z i = y i x i, 1 i N. When z i > 0, the i-th item is said to be overstated and when z i < 0, it is understated. When z i = 0, the account is said to be error free. The total error amount is defined as: N T z = z i. For y i 0, t i = z i y i = y i x i y i is called the fractional error or taint. The values (x 1, x 2,..., x N ) are unknown before sampling, whereas (y 1, y 2,..., y N ) are known. It is assumed that the amount of any overstatement does not exceed the stated recorded value. i=1 Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

8 DEFINITIONS The purpose of the audit is to estimate the total error amount T z : n n T z = z i = t i y i i=1 i=1 obtained by the examination of a sample of n items of the account. Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

9 Stringer Bound (Stringer, 1963) Let T 1,..., T n be the independent random variables which represent the taintings. The distribution of these taintings is some unknown mixture of distributions on the interval [0; 1], so that Pr(0 T i 1) = 1. Let denote µ = E(T i ) and let 0 t 1:n t 2:n... t n:n 1 be the ordered statistics of (T 1, T 2,..., T n ). For α (0; 1) and i = 0, 1,..., n 1 let p = p n (i; 1 α) be the unique solution of: with p n (n; 1 α) = 1. i k=0 ( ) n p k (1 p) n k = α k Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

10 Stringer Bound (Stringer, 1963) The Stringer method for obtaining an upper bound for the total overstatement error can be obtained by combining the upper limits for the sample error rates with the taints: ˆT z = T y p n(0, 1 α) + T y n [p n(i, 1 α) p n(i 1, 1 α)]t n i+1:n i=1 then p n(i, 1 α) is the (1 α) upper confidence limit for the binomial parameter when i errors are observed in a sample of size n. Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

11 Stringer Bound (Stringer, 1963) Equivalently for a given α, n and number of errors i, it is possible to find the value p n (i, 1 α) that satisfies: i j=0 ( ) n [p n (i, 1 α)] j [1 p n (i, 1 α)] n j = α. j The Stringer bound is sometimes calculated using the Poisson approximation for obtaining the upper confidence limits p n (i, 1 α). Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

12 Empirical Likelihood estimator EL is a non-parametric likelihood. Hartley and Rao in 1968 first introduced it in the context of survey sampling as scale-load approach. From early 2000 the EL approach has been introduced also in survey sampling literature. EL approach provides non-parametric confidence intervals similar to the parametric likelihood ratio intervals. Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

13 Empirical Likelihood estimator The shape and the orientation of the EL intervals are completely determined by the data. Chen et al. (2003) obtained EL intervals on the population mean for populations containing many zero values thats the case of audit sampling. Parametric likelihood ratio intervals based on parametric mixture distributions perform better than the standard normal theory intervals in terms of coverage, but EL intervals perform better under deviations from the assumed mixture model, by providing non-coverage rate below lower bound closer to the nominal value and also larger lower bound. Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

14 EL approach (Berger, 2016) Let U be a finite populations of N units. θ 0 is the unique solution G(θ) = 0, G(θ) = g i (θ) i U The empirical log-likelihood function is defined l (m) = log( m i ) = log(m i ) (1) i S i S m i is estimated by the value ˆm i which maximize l (m) subject m i 0 m i c = C i S Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

15 EL approach (Berger, 2016) The solution is given by where π i = np i. ˆm i = {(t + η) T c i } 1 = (π i + η T c i ) 1 Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

16 Maximum EL estimator The empirical log-likelihood ratio function is defined by ˆr(θ) = 2{l( ˆm) l( ˆm, θ)} where l( ˆm) = log( ˆm i ) i S l( ˆm, θ) = log( ˆm i (θ)) i S ˆm i (θ) is the value that maximize (1) subject to - m i 0 - m i ci = C with ci = (ci T, 0) T and C = (C T, 0) T i S Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

17 HOW TO EVALUATE AUDITING RISK Previous studies has already demonstrated that different sampling plans such as Systematic Sampling or Probability Proportional to Size plans (such as Unrestricted Random Sampling, Lahiri Sampling,...) are all reliable even for all sample size! We evaluate the efficiency of EL Bound respect the Stringer Bound. The parametr of interest is the ratio of error-per-euro (Taint) Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

18 SIMULATION STUDY We simulated data using a real accounting population of credit invoices of an audited society. 40,0 30,0 Frequency 20,0 10,0 0, Value Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

19 SIMULATION STUDY X LogNormal(7.001, 1.71). N = m = 1000 samples. Error randomly associated to the X i (no fraudulent hypothesis) Error rates : + 5%, + 10% + 20%. Taint simulated values: ; ; ; Sample fractions: * 0.05 * 0.1 Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

20 SIMULATION STUDY To compare different estimators Tightness, MSE and Coverage Probability have been computed Tightness is used to indicate how close the bound value is to the true error rate: (AV (ˆt) t) t 100 If it is small the sampling method is said to be tight and if it is large is said to be conservative Variability of the bound: this is an indicator of the uncertainty of the bound. This is measured by Mean Squared Error (MSE): 1 m m (ˆt i t) 2 where ˆt i is the estimated value for the error rate at the ith replicate. i=1 Coverage probability: for a specific bound it refers to the proportion of replications for which a bound is greater than or equal to the true population error amount. A bound is considered unreliable if its coverage is significantly below the specified nominal coverage, otherwise it is reliable. Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

21 Case Study I: Taint EL Stringer % Error % n/n Tight. MSE COV Tight. MSE COV % 8.745E % 8.012E % 3.275E % 3.121E % 1.311E % 6.532E % 2.003E % 9.041E % 3.096E % 8.521E % 1.281E % 8.301E Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

22 Case Study II: Taint EL Stringer % Error % n/n Tight. MSE COV Tight. MSE COV % 9.451E % 1.713E % 7.921E % 1.312E % 4.251E % 9.101E % 1.741E % 1.700E % 7.096E % 3.512E % 2.283E % 7.192E-04 1 Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

23 Case Study III: Taint EL Stringer % Error % n/n Tight. MSE COV Tight. MSE COV % 2.217E % 1.653E % 4.628E % 7.510E % 1.115E % 1.210E % 4.941E % 2.612E % 2.140E % 1.300E % 8.162E % 6.010E Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

24 Results * EL is generally better than SB for estimating error rate * EL is tighter for the estimation of the real error rate. SB is more conservative! * MSE of SB is greater respect EL. * EL and SB can be considered reliable. Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

25 Conclusions and Future research * We introduced di EL Bound to estimate the Upper Bound for the estimation of the error rate * In general EL Bound perform better respect the Stringer Bound. + Other distributions (Dagum, Gamma,...) for the X values. + Other hypothesys on the generation of the error (on the queues,...) Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

26 References Arens, A.A., Loebbecke, J.K., (1981). Applications of Statistical sampling to Auditing, Prentice-Hall, Inc.. Berger, Y. G., De La Riva Torres, O. (2015). Empirical likelihood confidence intervals for complex sampling designs, Journal of the Royal Statistical Society: Series B (Statistical Methodology),78(2), Chen, J., Chen, S. Y., and Rao, J. N. K. (2003). Empirical likelihood confidence intervals for the mean of a population containing many zero values. The Canadian Journal of Statistics 31, Chiodini P.M., Zenga M. (2014) Efficiency of the sample plans for symmetric and non-symmetric distributions in auditing: a comparison. in Contribution to sampling Statistics, ed Mecatti F., Conti PL., Ranalli MG Horgan, J.M. (2008). Monetary-unit sampling old and new, School of Computing, Dublin City University, Dublin,Ireland. Nandram, H.N., (2009). Applications of Statistical sampling to Auditing, Monetary unit sampling: Improving estimation of the total audit error, Advances in Accounting, incorporating Advances in International Accounting, 25, Owen Art B. (1988). Empirical likelihood ratio confidence intervals for a single functional, Biometrika 75(2), Rao, J. N. K. (2006). Empirical Likelihood Methods for Sample Survey Data:An Overview AUSTRIAN JOURNAL OF STATISTICS, 35(2-3), Stringer, K.W. (1963). Practical aspects of statistical sampling in auditing. Proceedings of the Business and Economics Statistics Section, ASA Berger, Chiodini, Zenga Improving the accuracy of estimates / 26

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is: **BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,

More information

MATH 3200 Exam 3 Dr. Syring

MATH 3200 Exam 3 Dr. Syring . Suppose n eligible voters are polled (randomly sampled) from a population of size N. The poll asks voters whether they support or do not support increasing local taxes to fund public parks. Let M be

More information

Statistics Class 15 3/21/2012

Statistics Class 15 3/21/2012 Statistics Class 15 3/21/2012 Quiz 1. Cans of regular Pepsi are labeled to indicate that they contain 12 oz. Data Set 17 in Appendix B lists measured amounts for a sample of Pepsi cans. The same statistics

More information

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic

More information

On Performance of Confidence Interval Estimate of Mean for Skewed Populations: Evidence from Examples and Simulations

On Performance of Confidence Interval Estimate of Mean for Skewed Populations: Evidence from Examples and Simulations On Performance of Confidence Interval Estimate of Mean for Skewed Populations: Evidence from Examples and Simulations Khairul Islam 1 * and Tanweer J Shapla 2 1,2 Department of Mathematics and Statistics

More information

Confidence Intervals Introduction

Confidence Intervals Introduction Confidence Intervals Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample mean X is a point estimate of the population mean μ

More information

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Statistical Intervals. Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 7 Statistical Intervals Chapter 7 Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 6 The Normal Distribution And Other Continuous Distributions

Statistics for Managers Using Microsoft Excel/SPSS Chapter 6 The Normal Distribution And Other Continuous Distributions Statistics for Managers Using Microsoft Excel/SPSS Chapter 6 The Normal Distribution And Other Continuous Distributions 1999 Prentice-Hall, Inc. Chap. 6-1 Chapter Topics The Normal Distribution The Standard

More information

On Some Statistics for Testing the Skewness in a Population: An. Empirical Study

On Some Statistics for Testing the Skewness in a Population: An. Empirical Study Available at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 12, Issue 2 (December 2017), pp. 726-752 Applications and Applied Mathematics: An International Journal (AAM) On Some Statistics

More information

Point Estimation. Principle of Unbiased Estimation. When choosing among several different estimators of θ, select one that is unbiased.

Point Estimation. Principle of Unbiased Estimation. When choosing among several different estimators of θ, select one that is unbiased. Point Estimation Point Estimation Definition A point estimate of a parameter θ is a single number that can be regarded as a sensible value for θ. A point estimate is obtained by selecting a suitable statistic

More information

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2

Determining Sample Size. Slide 1 ˆ ˆ. p q n E = z α / 2. (solve for n by algebra) n = E 2 Determining Sample Size Slide 1 E = z α / 2 ˆ ˆ p q n (solve for n by algebra) n = ( zα α / 2) 2 p ˆ qˆ E 2 Sample Size for Estimating Proportion p When an estimate of ˆp is known: Slide 2 n = ˆ ˆ ( )

More information

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5)

ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5) ECO220Y Estimation: Confidence Interval Estimator for Sample Proportions Readings: Chapter 11 (skip 11.5) Fall 2011 Lecture 10 (Fall 2011) Estimation Lecture 10 1 / 23 Review: Sampling Distributions Sample

More information

Module 4: Point Estimation Statistics (OA3102)

Module 4: Point Estimation Statistics (OA3102) Module 4: Point Estimation Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 8.1-8.4 Revision: 1-12 1 Goals for this Module Define

More information

Introduction to Statistical Data Analysis II

Introduction to Statistical Data Analysis II Introduction to Statistical Data Analysis II JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? Preface

More information

Homework Problems Stat 479

Homework Problems Stat 479 Chapter 2 1. Model 1 is a uniform distribution from 0 to 100. Determine the table entries for a generalized uniform distribution covering the range from a to b where a < b. 2. Let X be a discrete random

More information

Chapter 8: Sampling distributions of estimators Sections

Chapter 8: Sampling distributions of estimators Sections Chapter 8 continued Chapter 8: Sampling distributions of estimators Sections 8.1 Sampling distribution of a statistic 8.2 The Chi-square distributions 8.3 Joint Distribution of the sample mean and sample

More information

Applied Statistics I

Applied Statistics I Applied Statistics I Liang Zhang Department of Mathematics, University of Utah July 14, 2008 Liang Zhang (UofU) Applied Statistics I July 14, 2008 1 / 18 Point Estimation Liang Zhang (UofU) Applied Statistics

More information

EX-POST VERIFICATION OF PREDICTION MODELS OF WAGE DISTRIBUTIONS

EX-POST VERIFICATION OF PREDICTION MODELS OF WAGE DISTRIBUTIONS EX-POST VERIFICATION OF PREDICTION MODELS OF WAGE DISTRIBUTIONS LUBOŠ MAREK, MICHAL VRABEC University of Economics, Prague, Faculty of Informatics and Statistics, Department of Statistics and Probability,

More information

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality Point Estimation Some General Concepts of Point Estimation Statistical inference = conclusions about parameters Parameters == population characteristics A point estimate of a parameter is a value (based

More information

CH 5 Normal Probability Distributions Properties of the Normal Distribution

CH 5 Normal Probability Distributions Properties of the Normal Distribution Properties of the Normal Distribution Example A friend that is always late. Let X represent the amount of minutes that pass from the moment you are suppose to meet your friend until the moment your friend

More information

Inference of Several Log-normal Distributions

Inference of Several Log-normal Distributions Inference of Several Log-normal Distributions Guoyi Zhang 1 and Bose Falk 2 Abstract This research considers several log-normal distributions when variances are heteroscedastic and group sizes are unequal.

More information

BIO5312 Biostatistics Lecture 5: Estimations

BIO5312 Biostatistics Lecture 5: Estimations BIO5312 Biostatistics Lecture 5: Estimations Yujin Chung September 27th, 2016 Fall 2016 Yujin Chung Lec5: Estimations Fall 2016 1/34 Recap Yujin Chung Lec5: Estimations Fall 2016 2/34 Today s lecture and

More information

Some developments about a new nonparametric test based on Gini s mean difference

Some developments about a new nonparametric test based on Gini s mean difference Some developments about a new nonparametric test based on Gini s mean difference Claudio Giovanni Borroni and Manuela Cazzaro Dipartimento di Metodi Quantitativi per le Scienze Economiche ed Aziendali

More information

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Chapter 7 Sampling Distributions and Point Estimation of Parameters Chapter 7 Sampling Distributions and Point Estimation of Parameters Part 1: Sampling Distributions, the Central Limit Theorem, Point Estimation & Estimators Sections 7-1 to 7-2 1 / 25 Statistical Inferences

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions. ME3620 Theory of Engineering Experimentation Chapter III. Random Variables and Probability Distributions Chapter III 1 3.2 Random Variables In an experiment, a measurement is usually denoted by a variable

More information

Mixed Logit or Random Parameter Logit Model

Mixed Logit or Random Parameter Logit Model Mixed Logit or Random Parameter Logit Model Mixed Logit Model Very flexible model that can approximate any random utility model. This model when compared to standard logit model overcomes the Taste variation

More information

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods ANZIAM J. 49 (EMAC2007) pp.c642 C665, 2008 C642 Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods S. Ahmad 1 M. Abdollahian 2 P. Zeephongsekul

More information

Chapter 5. Statistical inference for Parametric Models

Chapter 5. Statistical inference for Parametric Models Chapter 5. Statistical inference for Parametric Models Outline Overview Parameter estimation Method of moments How good are method of moments estimates? Interval estimation Statistical Inference for Parametric

More information

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions SGSB Workshop: Using Statistical Data to Make Decisions Module 2: The Logic of Statistical Inference Dr. Tom Ilvento January 2006 Dr. Mugdim Pašić Key Objectives Understand the logic of statistical inference

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

MgtOp S 215 Chapter 8 Dr. Ahn

MgtOp S 215 Chapter 8 Dr. Ahn MgtOp S 215 Chapter 8 Dr. Ahn An estimator of a population parameter is a rule that tells us how to use the sample values,,, to estimate the parameter, and is a statistic. An estimate is the value obtained

More information

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION International Days of Statistics and Economics, Prague, September -3, MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION Diana Bílková Abstract Using L-moments

More information

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are

Chapter 7 presents the beginning of inferential statistics. The two major activities of inferential statistics are Chapter 7 presents the beginning of inferential statistics. Concept: Inferential Statistics The two major activities of inferential statistics are 1 to use sample data to estimate values of population

More information

Chapter 7 - Lecture 1 General concepts and criteria

Chapter 7 - Lecture 1 General concepts and criteria Chapter 7 - Lecture 1 General concepts and criteria January 29th, 2010 Best estimator Mean Square error Unbiased estimators Example Unbiased estimators not unique Special case MVUE Bootstrap General Question

More information

GENERATION OF APPROXIMATE GAMMA SAMPLES BY PARTIAL REJECTION

GENERATION OF APPROXIMATE GAMMA SAMPLES BY PARTIAL REJECTION IASC8: December 5-8, 8, Yokohama, Japan GEERATIO OF APPROXIMATE GAMMA SAMPLES BY PARTIAL REJECTIO S.H. Ong 1 Wen Jau Lee 1 Institute of Mathematical Sciences, University of Malaya, 563 Kuala Lumpur, MALAYSIA

More information

A Stratified Sampling Plan for Billing Accuracy in Healthcare Systems

A Stratified Sampling Plan for Billing Accuracy in Healthcare Systems A Stratified Sampling Plan for Billing Accuracy in Healthcare Systems Jirachai Buddhakulsomsiri Parthana Parthanadee Swatantra Kachhal Department of Industrial and Manufacturing Systems Engineering The

More information

STRESS-STRENGTH RELIABILITY ESTIMATION

STRESS-STRENGTH RELIABILITY ESTIMATION CHAPTER 5 STRESS-STRENGTH RELIABILITY ESTIMATION 5. Introduction There are appliances (every physical component possess an inherent strength) which survive due to their strength. These appliances receive

More information

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY

LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY LESSON 7 INTERVAL ESTIMATION SAMIE L.S. LY 1 THIS WEEK S PLAN Part I: Theory + Practice ( Interval Estimation ) Part II: Theory + Practice ( Interval Estimation ) z-based Confidence Intervals for a Population

More information

Statistics 13 Elementary Statistics

Statistics 13 Elementary Statistics Statistics 13 Elementary Statistics Summer Session I 2012 Lecture Notes 5: Estimation with Confidence intervals 1 Our goal is to estimate the value of an unknown population parameter, such as a population

More information

Appendix A. Selecting and Using Probability Distributions. In this appendix

Appendix A. Selecting and Using Probability Distributions. In this appendix Appendix A Selecting and Using Probability Distributions In this appendix Understanding probability distributions Selecting a probability distribution Using basic distributions Using continuous distributions

More information

Chapter 7. Inferences about Population Variances

Chapter 7. Inferences about Population Variances Chapter 7. Inferences about Population Variances Introduction () The variability of a population s values is as important as the population mean. Hypothetical distribution of E. coli concentrations from

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.

More information

On modelling of electricity spot price

On modelling of electricity spot price , Rüdiger Kiesel and Fred Espen Benth Institute of Energy Trading and Financial Services University of Duisburg-Essen Centre of Mathematics for Applications, University of Oslo 25. August 2010 Introduction

More information

Probability & Statistics

Probability & Statistics Probability & Statistics BITS Pilani K K Birla Goa Campus Dr. Jajati Keshari Sahoo Department of Mathematics Statistics Descriptive statistics Inferential statistics /38 Inferential Statistics 1. Involves:

More information

1 Sampling Distributions

1 Sampling Distributions 1 Sampling Distributions 1.1 Statistics and Sampling Distributions When a random sample is selected the numerical descriptive measures calculated from such a sample are called statistics. These statistics

More information

Sampling Distributions and the Central Limit Theorem

Sampling Distributions and the Central Limit Theorem Sampling Distributions and the Central Limit Theorem February 18 Data distributions and sampling distributions So far, we have discussed the distribution of data (i.e. of random variables in our sample,

More information

χ 2 distributions and confidence intervals for population variance

χ 2 distributions and confidence intervals for population variance χ 2 distributions and confidence intervals for population variance Let Z be a standard Normal random variable, i.e., Z N(0, 1). Define Y = Z 2. Y is a non-negative random variable. Its distribution is

More information

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] 1 High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] High-frequency data have some unique characteristics that do not appear in lower frequencies. At this class we have: Nonsynchronous

More information

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations Journal of Statistical and Econometric Methods, vol. 2, no.3, 2013, 49-55 ISSN: 2051-5057 (print version), 2051-5065(online) Scienpress Ltd, 2013 Omitted Variables Bias in Regime-Switching Models with

More information

4-1. Chapter 4. Commonly Used Distributions by The McGraw-Hill Companies, Inc. All rights reserved.

4-1. Chapter 4. Commonly Used Distributions by The McGraw-Hill Companies, Inc. All rights reserved. 4-1 Chapter 4 Commonly Used Distributions 2014 by The Companies, Inc. All rights reserved. Section 4.1: The Bernoulli Distribution 4-2 We use the Bernoulli distribution when we have an experiment which

More information

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 8-26-2016 On Some Test Statistics for Testing the Population Skewness and Kurtosis:

More information

12/1/2017. Chapter. Copyright 2009 by The McGraw-Hill Companies, Inc. 8B-2

12/1/2017. Chapter. Copyright 2009 by The McGraw-Hill Companies, Inc. 8B-2 Sampling Distributions and Estimation (Part ) 8 Chapter Proportion C.I. for the Difference of Two s, m 1 -m C.I. for the Difference of Two Proportions, p 1 -p Population Variance, s McGraw-Hill/Irwin Copyright

More information

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS Questions 1-307 have been taken from the previous set of Exam C sample questions. Questions no longer relevant

More information

Back to estimators...

Back to estimators... Back to estimators... So far, we have: Identified estimators for common parameters Discussed the sampling distributions of estimators Introduced ways to judge the goodness of an estimator (bias, MSE, etc.)

More information

Homework Problems Stat 479

Homework Problems Stat 479 Chapter 10 91. * A random sample, X1, X2,, Xn, is drawn from a distribution with a mean of 2/3 and a variance of 1/18. ˆ = (X1 + X2 + + Xn)/(n-1) is the estimator of the distribution mean θ. Find MSE(

More information

Confidence Intervals for the Difference Between Two Means with Tolerance Probability

Confidence Intervals for the Difference Between Two Means with Tolerance Probability Chapter 47 Confidence Intervals for the Difference Between Two Means with Tolerance Probability Introduction This procedure calculates the sample size necessary to achieve a specified distance from the

More information

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 10: Continuous RV Families. Prof. Vince Calhoun

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 10: Continuous RV Families. Prof. Vince Calhoun ECE 340 Probabilistic Methods in Engineering M/W 3-4:15 Lecture 10: Continuous RV Families Prof. Vince Calhoun 1 Reading This class: Section 4.4-4.5 Next class: Section 4.6-4.7 2 Homework 3.9, 3.49, 4.5,

More information

Lecture 2. Probability Distributions Theophanis Tsandilas

Lecture 2. Probability Distributions Theophanis Tsandilas Lecture 2 Probability Distributions Theophanis Tsandilas Comment on measures of dispersion Why do common measures of dispersion (variance and standard deviation) use sums of squares: nx (x i ˆµ) 2 i=1

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

5.3 Statistics and Their Distributions

5.3 Statistics and Their Distributions Chapter 5 Joint Probability Distributions and Random Samples Instructor: Lingsong Zhang 1 Statistics and Their Distributions 5.3 Statistics and Their Distributions Statistics and Their Distributions Consider

More information

Bayesian Inference for Volatility of Stock Prices

Bayesian Inference for Volatility of Stock Prices Journal of Modern Applied Statistical Methods Volume 3 Issue Article 9-04 Bayesian Inference for Volatility of Stock Prices Juliet G. D'Cunha Mangalore University, Mangalagangorthri, Karnataka, India,

More information

Two Populations Hypothesis Testing

Two Populations Hypothesis Testing Two Populations Hypothesis Testing Two Proportions (Large Independent Samples) Two samples are said to be independent if the data from the first sample is not connected to the data from the second sample.

More information

1/2 2. Mean & variance. Mean & standard deviation

1/2 2. Mean & variance. Mean & standard deviation Question # 1 of 10 ( Start time: 09:46:03 PM ) Total Marks: 1 The probability distribution of X is given below. x: 0 1 2 3 4 p(x): 0.73? 0.06 0.04 0.01 What is the value of missing probability? 0.54 0.16

More information

1. You are given the following information about a stationary AR(2) model:

1. You are given the following information about a stationary AR(2) model: Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4

More information

5/5/2014 یادگیري ماشین. (Machine Learning) ارزیابی فرضیه ها دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی. Evaluating Hypothesis (بخش دوم)

5/5/2014 یادگیري ماشین. (Machine Learning) ارزیابی فرضیه ها دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی. Evaluating Hypothesis (بخش دوم) یادگیري ماشین درس نوزدهم (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی ارزیابی فرضیه ها Evaluating Hypothesis (بخش دوم) 1 فهرست مطالب خطاي نمونه Error) (Sample خطاي واقعی Error) (True

More information

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates A point estimate is a single number, a confidence interval provides additional information about the variability of the estimate Lower

More information

Probability Models.S2 Discrete Random Variables

Probability Models.S2 Discrete Random Variables Probability Models.S2 Discrete Random Variables Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Results of an experiment involving uncertainty are described by one or more random

More information

M249 Diagnostic Quiz

M249 Diagnostic Quiz THE OPEN UNIVERSITY Faculty of Mathematics and Computing M249 Diagnostic Quiz Prepared by the Course Team [Press to begin] c 2005, 2006 The Open University Last Revision Date: May 19, 2006 Version 4.2

More information

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.

Version A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise. Math 224 Q Exam 3A Fall 217 Tues Dec 12 Version A Problem 1. Let X be the continuous random variable defined by the following pdf: { 1 x/2 when x 2, f(x) otherwise. (a) Compute the mean µ E[X]. E[X] x

More information

Analysis of truncated data with application to the operational risk estimation

Analysis of truncated data with application to the operational risk estimation Analysis of truncated data with application to the operational risk estimation Petr Volf 1 Abstract. Researchers interested in the estimation of operational risk often face problems arising from the structure

More information

Chapter 4 Probability Distributions

Chapter 4 Probability Distributions Slide 1 Chapter 4 Probability Distributions Slide 2 4-1 Overview 4-2 Random Variables 4-3 Binomial Probability Distributions 4-4 Mean, Variance, and Standard Deviation for the Binomial Distribution 4-5

More information

Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design

Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design Chapter 545 Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design Introduction This procedure calculates power and sample size of statistical tests of equivalence of two means

More information

12 The Bootstrap and why it works

12 The Bootstrap and why it works 12 he Bootstrap and why it works For a review of many applications of bootstrap see Efron and ibshirani (1994). For the theory behind the bootstrap see the books by Hall (1992), van der Waart (2000), Lahiri

More information

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS Part 1: Introduction Sampling Distributions & the Central Limit Theorem Point Estimation & Estimators Sections 7-1 to 7-2 Sample data

More information

Homework: (Due Wed) Chapter 10: #5, 22, 42

Homework: (Due Wed) Chapter 10: #5, 22, 42 Announcements: Discussion today is review for midterm, no credit. You may attend more than one discussion section. Bring 2 sheets of notes and calculator to midterm. We will provide Scantron form. Homework:

More information

A Hidden Markov Model Approach to Information-Based Trading: Theory and Applications

A Hidden Markov Model Approach to Information-Based Trading: Theory and Applications A Hidden Markov Model Approach to Information-Based Trading: Theory and Applications Online Supplementary Appendix Xiangkang Yin and Jing Zhao La Trobe University Corresponding author, Department of Finance,

More information

Small Area Estimation of Poverty Indicators using Interval Censored Income Data

Small Area Estimation of Poverty Indicators using Interval Censored Income Data Small Area Estimation of Poverty Indicators using Interval Censored Income Data Paul Walter 1 Marcus Groß 1 Timo Schmid 1 Nikos Tzavidis 2 1 Chair of Statistics and Econometrics, Freie Universit?t Berlin

More information

1 Bayesian Bias Correction Model

1 Bayesian Bias Correction Model 1 Bayesian Bias Correction Model Assuming that n iid samples {X 1,...,X n }, were collected from a normal population with mean µ and variance σ 2. The model likelihood has the form, P( X µ, σ 2, T n >

More information

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD MAJOR POINTS Sampling distribution of the mean revisited Testing hypotheses: sigma known An example Testing hypotheses:

More information

10/1/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1

10/1/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Pivotal subject: distributions of statistics. Foundation linchpin important crucial You need sampling distributions to make inferences:

More information

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution Debasis Kundu 1, Rameshwar D. Gupta 2 & Anubhav Manglick 1 Abstract In this paper we propose a very convenient

More information

Statistical estimation

Statistical estimation Statistical estimation Statistical modelling: theory and practice Gilles Guillot gigu@dtu.dk September 3, 2013 Gilles Guillot (gigu@dtu.dk) Estimation September 3, 2013 1 / 27 1 Introductory example 2

More information

The Probability of Legislative Shirking: Estimation and Validation

The Probability of Legislative Shirking: Estimation and Validation The Probability of Legislative Shirking: Estimation and Validation Serguei Kaniovski David Stadelmann November 21, 2015 Abstract We introduce a binomial mixture model for estimating the probability of

More information

Chapter 9: Sampling Distributions

Chapter 9: Sampling Distributions Chapter 9: Sampling Distributions 9. Introduction This chapter connects the material in Chapters 4 through 8 (numerical descriptive statistics, sampling, and probability distributions, in particular) with

More information

12. Conditional heteroscedastic models (ARCH) MA6622, Ernesto Mordecki, CityU, HK, 2006.

12. Conditional heteroscedastic models (ARCH) MA6622, Ernesto Mordecki, CityU, HK, 2006. 12. Conditional heteroscedastic models (ARCH) MA6622, Ernesto Mordecki, CityU, HK, 2006. References for this Lecture: Robert F. Engle. Autoregressive Conditional Heteroscedasticity with Estimates of Variance

More information

Chapter Seven: Confidence Intervals and Sample Size

Chapter Seven: Confidence Intervals and Sample Size Chapter Seven: Confidence Intervals and Sample Size A point estimate is: The best point estimate of the population mean µ is the sample mean X. Three Properties of a Good Estimator 1. Unbiased 2. Consistent

More information

Estimation of a Ramsay-Curve IRT Model using the Metropolis-Hastings Robbins-Monro Algorithm

Estimation of a Ramsay-Curve IRT Model using the Metropolis-Hastings Robbins-Monro Algorithm 1 / 34 Estimation of a Ramsay-Curve IRT Model using the Metropolis-Hastings Robbins-Monro Algorithm Scott Monroe & Li Cai IMPS 2012, Lincoln, Nebraska Outline 2 / 34 1 Introduction and Motivation 2 Review

More information

Practice Exam 1. Loss Amount Number of Losses

Practice Exam 1. Loss Amount Number of Losses Practice Exam 1 1. You are given the following data on loss sizes: An ogive is used as a model for loss sizes. Determine the fitted median. Loss Amount Number of Losses 0 1000 5 1000 5000 4 5000 10000

More information

NCSS Statistical Software. Reference Intervals

NCSS Statistical Software. Reference Intervals Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and

More information

Comparison of design-based sample mean estimate with an estimate under re-sampling-based multiple imputations

Comparison of design-based sample mean estimate with an estimate under re-sampling-based multiple imputations Comparison of design-based sample mean estimate with an estimate under re-sampling-based multiple imputations Recai Yucel 1 Introduction This section introduces the general notation used throughout this

More information

Chapter 8 Estimation

Chapter 8 Estimation Chapter 8 Estimation There are two important forms of statistical inference: estimation (Confidence Intervals) Hypothesis Testing Statistical Inference drawing conclusions about populations based on samples

More information

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is:

. 13. The maximum error (margin of error) of the estimate for μ (based on known σ) is: Statistics Sample Exam 3 Solution Chapters 6 & 7: Normal Probability Distributions & Estimates 1. What percent of normally distributed data value lie within 2 standard deviations to either side of the

More information

Technology Support Center Issue

Technology Support Center Issue United States Office of Office of Solid EPA/600/R-02/084 Environmental Protection Research and Waste and October 2002 Agency Development Emergency Response Technology Support Center Issue Estimation of

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 7 Estimation: Single Population Copyright 010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-1 Confidence Intervals Contents of this chapter: Confidence

More information

Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models

Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models Economic Review (Otaru University of Commerce), Vo.59, No.4, 4-48, March, 009 Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models Haruhiko

More information

Module 3: Sampling Distributions and the CLT Statistics (OA3102)

Module 3: Sampling Distributions and the CLT Statistics (OA3102) Module 3: Sampling Distributions and the CLT Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chpt 7.1-7.3, 7.5 Revision: 1-12 1 Goals for

More information

Consistent estimators for multilevel generalised linear models using an iterated bootstrap

Consistent estimators for multilevel generalised linear models using an iterated bootstrap Multilevel Models Project Working Paper December, 98 Consistent estimators for multilevel generalised linear models using an iterated bootstrap by Harvey Goldstein hgoldstn@ioe.ac.uk Introduction Several

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution

More information

THE USE OF THE LOGNORMAL DISTRIBUTION IN ANALYZING INCOMES

THE USE OF THE LOGNORMAL DISTRIBUTION IN ANALYZING INCOMES International Days of tatistics and Economics Prague eptember -3 011 THE UE OF THE LOGNORMAL DITRIBUTION IN ANALYZING INCOME Jakub Nedvěd Abstract Object of this paper is to examine the possibility of

More information