The Two Sample T-test with One Variance Unknown
|
|
- Heather Bailey
- 6 years ago
- Views:
Transcription
1 The Two Sample T-test with One Variance Unknown Arnab Maity Department of Statistics, Texas A&M University, College Station TX , U.S.A. Michael Sherman Department of Statistics, Texas A&M University, College Station TX , U.S.A. Summary We consider the situation in two sample testing when one variance is assumed to be known while the other variance is considered unknown. This situation arises, for example, when one is interested in comparing a standard treatment with a new treatment. Although this situation occurs relatively infrequently, our example discusses the important tool of moment matching and makes the classic two sample Satterthwaite t-approximation transparent. Key words: Two sample t-test; Moments; Satterthwaite; Behrens-Fisher.
2 Introduction The two sample t-test is a mainstay in statistical practice and is introduced in most, if not all, introductory statistics classes. A common assumption for the two samples is that they come from normally distributed populations with different means. When the two variances are known this leads directly to a test statistic with a standard normal distribution. If the variances are unknown, under the assumption of equal variances the two sample test uses a pooled estimate of variance. In this case the resulting test statistic has an exact t-distribution. In the case where the two variances may differ the two variances are estimated separately. The resulting test statistic does not have an exact t-distribution. Nevertheless, it is common to approximate the distribution by a t-distribution with a random degrees of freedom based on the data. In this note we consider the case where one of the variances is assumed to be known and the other is treated as unknown. This situation arises in practice when a standard treatment is being compared to a new treatment. The standard treatment leads to a known variance, while the new treatment may have a different mean (which is our main interest) and may well have a different variance. We sketch the usual distribution theory for the known cases and use the methodology to derive an approximate t-test in the two sample case with one variance known and one unknown. An interesting observation is that the method of deriving Satterthwaite s approximate degrees of freedom for the unequal variance case works for the case when one variance is assumed known and the other one is estimated. We derive the approximate t-test in the case of one unknown variance, illustrate with an example, and carry out a small simulation to evaluate the accuracy of the approximate t-distribution. This example illustrates an important approximation technique of moment matching and makes the Satterthwaite approximation transparent. This note was motivated by a question from a graduate student in an introductory class for engineering students. Hypothesis Tests for a Difference in Means of Two Normal Distributions Let Y,..., Y n Normal(µ, σ) and Y,..., Y m Normal(µ, σ) where the notation means independent and identically distributed as. Throughout, we are interested in the
3 quantity µ µ and we want to test H : µ µ =, for a fixed difference of interest.. An Overview of Existing Tests: Both Variances are Known or Both are Unknown Assume that the sample size from the first population is n and from the second m. simplest case arises when variances of both the population, i.e., σ and σ are known. In this case, the z-statistic, Z = Ȳ Ȳ σ n + σ m is used. Under H, Z has a Normal(, ) distribution and the test can be conducted based on this fact. A more complicated situation occurs when the population variances are unknown. In this case the above z-test does not apply anymore since the variances need to be estimated. If we assume σ = σ = σ, it is reasonable to combine the two samples to estimate σ, rather than estimating separately for the two samples. The usual pooled estimator of σ is given by Spooled = (n )S + (m )S, n + m where S = n i= (Y i Ȳ) /(n ) and S = m i= (Y i Ȳ) /(m ). Then, the test statistic T = Ȳ Ȳ ( Spooled + ) n m is used and has an exact t-distribution with γ = n + m degrees of freedom, under H. A more realistic case arises when σ σ. This is known as the Behrens-Fisher problem. In this case the test statistic, The T = Ȳ Ȳ ( ) () S + S n m ( ) σ ( ) does not have an exact t-distribution. The difficulty lies in the fact that + σ S + S n m n m does not have an appropriate χ distribution. However, Satterthwaite (94, 946) proposed a method which approximates the exact distribution by a suitable χ distribution. Thus, using Satterthwaite s approximation, ( ) σ ( ) γ n + σ S m n + S χ γ m, approximately.
4 The value of γ is obtained by matching the variances on both sides of the above equation. ( ) σ { This gives γ = + σ (σ /n) + (σ /m) n m n m }. γ is estimated by γ, where we substitute S and S in place of σ and σ. Thus, T has an approximate t-distribution with γ degrees of freedom. We use an entirely analogous method in Section. in the case of one variance assumed to be known. See, e.g., Scheffé (97) for more background on the Behrens-Fisher problem. For further information on estimation of the degrees of freedom see, e.g., Ames and Webster (99) or Moser and Stevens (99). The two sample t-test is a special case of the general linear model Y = Xβ + ɛ, where ɛ is distributed as Normal[, V (θ)] and β is a column vector of fixed effect parameters. The test statistic used to test a linear hypothesis, H : l T β =, is t = lt β l TĈl. () where Ĉ is an estimate of the covariance matrix of β. The general Satterthwaite approximation for the denominator degrees of freedom is computed as γ gen = (lt Ĉl) g T Ag (3) where g is the gradient of l T Cl (with respect to θ), evaluated at θ and A is the asymptotic variance-covariance matrix of θ obtained from the second derivative matrix of the likelihood equations. In this general situation, t in () has an approximate t-distribution with γ gen degrees of freedom. See Giesbrecht and Burns (985) and SAS/STAT 9. User s Guide (4) for details.. The Variance of One Population is Known but the Other is Unknown (σ known but σ unknown) The situation, where the variance of one population is known but the other unknown, arises when a new treatment is compared to an old or standard treatment. The variance for the standard treatment may be considered known from historical data, while the variance caused by the new treatment may not be the same as the old one. In this case, pooling the samples may not be a good idea since the variances may be very different from each other. On the 3
5 other hand, using T as in () requires an estimate for the variance of the standard treatment which is assumed to be known. Since σ is known, a reasonable test statistic is: T 3 = Ȳ Ȳ σ n + S m. (4) ( ) σ ( ) However, T 3 does not have an exact t-distribution since + σ σ + S does not have n m n m an appropriate χ distribution. Hence, our goal is to find the appropriate degrees of freedom, γ 3, such that ( ) σ ( ) γ 3 n + σ σ m n + S χ γ m 3, approximately. (5) One of the ways to achieve this is to use the method of moments. The two sides in (5) clearly have the same expectations so we seek to match variances: ( ) σ γ3 ( ) n + σ σ var m n + S = γ 3 m ( ) γ 3 = { (σ /m) σ m } n + σ, m since var(s ) = σ 4 /(m ). Since σ is unknown, and not assumed to be the known value of σ, we estimate it by S and obtain γ 3 = ( ) σ ( Hence we can say that γ 3 n + σ σ + S m n m ( ) σ + S n m (S /m) m. (6) ) χ bγ 3, approximately. Hence, under H T 3 = Ȳ Ȳ σ n + S m t bγ3, approximately. Based on this we can now conduct level-α t-tests: Alternative hypothesis Rejection criterion H : µ µ T 3 > t α/,bγ3 or T 3 < t α/,bγ3 H : µ µ > H : µ µ < T 3 > t α,bγ3 T 3 < t α,bγ3 4
6 Note that γ 3 in (6) can be derived as a special case of γ gen in (3) by considering a model where Y = (Y,..., Y n, Y,..., Y m ) T, β = (µ, µ ) T, l = (, ) T, X is a (n+m) matrix such that its first n rows are (, ) and last m rows are (, ), and V is a (m + n) (m + n) diagonal matrix with first n diagonal entries being the known variance σ and the others being σ [mathematical details are available from the authors]. 3 A Data Example We consider data from problem. in the textbook of Montgomery and Runger (3). We quote the problem: A polymer is manufactured in a batch chemical process. Viscosity measurements are normally made on each batch, and long experience with the process has indicated that the variability of the process is fairly stable with σ =. Fifteen batch viscosity measurements are given as follows: 74, 78, 776, 76, 745, 759, 795, 756, 74, 74, 76, 749, 739, 747, 74. A process change is made which involves switching the type of catalyst involved in the process. Following the process change, eight batch viscosity measurements are taken: 735, 775, 79, 755, 783, 76, 738, 78. Assume that process variability is unaffected by the catalyst change. Find a 9% confidence interval on the difference in mean batch viscosity resulting from the process change. Note that in the problem it is assumed that switching the type of catalyst does not affect the variance. We are interested in a possible shift of the mean and hence it seems we should allow for a shift in the variances as well. Analysis: We proceed by assuming that σ = is known but that σ is unknown, i.e., that the process variability is affected by the catalyst change. Then we use the approximate t-test developed in Section. with degrees of freedom ( ) σ + S n m γ 3 = = 5.5. (S /m) m Also, n = 5 and m = 8. Hence a 9% CI is given by (Ȳ Ȳ t.5,bγ3 σ /n + S /m, Ȳ Ȳ + t.5,bγ3 σ /n + S /m) = (.6638, 9.338). 5
7 For comparison s sake we consider the assumption that the process variability is unaffected by the catalyst change. In this situation, σ = σ =. So a 9% CI for µ µ is given by (Ȳ Ȳ z.5 σ /n + σ/m, Ȳ Ȳ + z.5 σ /n + σ/m) = (.773, 7.773). We see that the two intervals are similar but that the first interval is wider than the second. This is due to the assumption that the variance is known in the post process change group. The rough similarity between the two intervals is compatible with the usual F-test for equality of variances. Testing: H : σ = vs H : σ, we find that p value =.678. It is interesting to see how the intervals compare in a case where the variances are more different. We take the same example as in previous section but change the y values to : Y,new = [76, 83, 73, 755, 8, 786, 74, 79], so that is sample variance is In this case, results are given below, Assumption 9% CI for µ µ H : µ µ = σ and σ are known ( 33.73, ) Reject H σ is known and σ is unknown ( 44.78, 5.478) Fail to reject H We see that the results are very different. If we perform a test H : µ µ =, based on the intervals, these two methods yield different conclusions at α =.. A simulation, based on the preceding data values, was conducted with n = 5 and m = 8 where we generate, data sets with Y,..., Y n Normal(µ = 75., σ = ) and Y,..., Y m Normal(µ = 75., σ = 4). Tests for H : µ µ = vs H : µ µ are performed at α =. for each of the data sets and type I errors are estimated. The test based on T 3 gives a type I error of. whereas tests based on T and T produce type I errors of.6 and., respectively. Next, we compare the empirical distribution of our test statistic with the approximate t-distribution. We take n = and m = 5 with σ = 4 and σ =. We generate, data sets and for each data set we compute the two sample t-statistic, T 3. It is interesting to compare the empirical distribution of T, T and T 3 with their corresponding hypothesized t-distributions. Kernel density estimates of T, T and T 3 (using an Epanechnikov kernel) are shown in Figures, and 3, respectively. It is clear the agreement is very close for T and T 3. However, 6
8 for T the discrepancy is clearly visible. We see that assumption of equal variance is not desirable in this case. Finally, we compare the power curves of the tests based on T and T 3. We consider two configurations: Setup. n = 5 and m = 5 with σ = 4 and σ = Setup. n = and m = 5 with σ = 4 and σ = We take µ = and µ = µ + (for between 5 and 5) and estimate power of the tests for different values of. Results for Setup and Setup, based on, simulations, are given in Figures 4 and 5, respectively. It is evident that if the size of the sample with known variance is small, the test based on T 3 produces larger power than that of T. This effect, however, diminishes as the sample size increases. We reiterate that this example comes from a query from a student. Its practical importance is limited by the fact that typically both variances are unknown and for this reason T is more generally applicable than T 3. References Ames, M.H. and Webster, J.T. (99). On Estimating Approximate Degrees of Freedom, The American Statistician, 45, Giesbrecht, F.G. and Burns, J.C. (985). Two-Stage Analysis Based on a Mixed Model: Large-Sample Asymptotic Theory and Small-Sample Simulation Results, Biometrics, 4, Montgomery, D.C. and Runger, G.C. (3). Applied Statistics and Probability for Engineers, 3rd edition, John Wiley and Sons. Moser, B.K. and Stevens, G.R. (99). Homogeneity of Variance in the Two-Sample Means Test, The American Statistician, 46, 9-. SAS Institute Inc. (4). SAS/STAT R 9. User s Guide. Cary, NC:SAS Institute Inc. Satterthwaite, F.E. (94). Synthesis of Variance, Psychometrika, 6, Satterthwaite, F.E. (946). An Approximate Distribution of Estimates of Variance Components, Biometrics Bulletin, 6, -4. 7
9 Scheffé, H. (97). Practical Solutions of the Behrens-Fisher Problem, Journal of the American Statistical Association, 65,
10 Figure : Plots of the density estimate of T (solid line) and actual t-density with γ = n+m degrees of freedom (dashed line) along with the critical values (vertical lines) for two sided 5% tests for H : µ µ =.
11 Figure : Plots of the density estimate of T (solid line) and actual t-density with γ degrees of freedom (dashed line) along with the critical values (vertical lines) for two sided 5% tests for H : µ µ =.
12 Figure 3: Plots of density estimate of T 3 (solid line) and actual t-density with γ 3 degrees of freedom (dashed line) along with the critical values (vertical lines) for two sided 5% tests for H : µ µ =.
13 .9 T3 T Figure 4: Power curves of the t-tests based on T 3 (solid line) and T (dashed line) when n = 5 and m = 5.
14 .9 T3 T Figure 5: Power curves of the t-tests based on T 3 (solid line) and T (dashed line) when n = and m = 5.
Statistical Methodology. A note on a two-sample T test with one variance unknown
Statistical Methodology 8 (0) 58 534 Contents lists available at SciVerse ScienceDirect Statistical Methodology journal homepage: www.elsevier.com/locate/stamet A note on a two-sample T test with one variance
More informationExperimental Design and Statistics - AGA47A
Experimental Design and Statistics - AGA47A Czech University of Life Sciences in Prague Department of Genetics and Breeding Fall/Winter 2014/2015 Matúš Maciak (@ A 211) Office Hours: M 14:00 15:30 W 15:30
More informationInferences on Correlation Coefficients of Bivariate Log-normal Distributions
Inferences on Correlation Coefficients of Bivariate Log-normal Distributions Guoyi Zhang 1 and Zhongxue Chen 2 Abstract This article considers inference on correlation coefficients of bivariate log-normal
More informationAustralian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model
AENSI Journals Australian Journal of Basic and Applied Sciences Journal home page: wwwajbaswebcom Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model Khawla Mustafa Sadiq University
More informationInterval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems
Interval estimation September 29, 2017 STAT 151 Class 7 Slide 1 Outline of Topics 1 Basic ideas 2 Sampling variation and CLT 3 Interval estimation using X 4 More general problems STAT 151 Class 7 Slide
More informationQuantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples
Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu
More informationChapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.
Chapter 14 : Statistical Inference 1 Chapter 14 : Introduction to Statistical Inference Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Data x
More information8.1 Estimation of the Mean and Proportion
8.1 Estimation of the Mean and Proportion Statistical inference enables us to make judgments about a population on the basis of sample information. The mean, standard deviation, and proportions of a population
More information8.3 CI for μ, σ NOT known (old 8.4)
GOALS: 1. Learn the properties of the student t distribution and the t curve. 2. Understand how degrees of freedom, df, relate to t curves. 3. Recognize that t curves approach the SNC as df increases.
More informationChapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29
Chapter 5 Univariate time-series analysis () Chapter 5 Univariate time-series analysis 1 / 29 Time-Series Time-series is a sequence fx 1, x 2,..., x T g or fx t g, t = 1,..., T, where t is an index denoting
More informationThe Two-Sample Independent Sample t Test
Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 The General Formula The Equal-n Formula 4 5 6 Independence Normality Homogeneity of Variances 7 Non-Normality Unequal
More informationSTATISTICS and PROBABILITY
Introduction to Statistics Atatürk University STATISTICS and PROBABILITY LECTURE: SAMPLING DISTRIBUTIONS and POINT ESTIMATIONS Prof. Dr. İrfan KAYMAZ Atatürk University Engineering Faculty Department of
More informationWeek 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals
Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :
More informationPoint Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage
6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic
More informationHomework Problems Stat 479
Chapter 10 91. * A random sample, X1, X2,, Xn, is drawn from a distribution with a mean of 2/3 and a variance of 1/18. ˆ = (X1 + X2 + + Xn)/(n-1) is the estimator of the distribution mean θ. Find MSE(
More informationSuperiority by a Margin Tests for the Ratio of Two Proportions
Chapter 06 Superiority by a Margin Tests for the Ratio of Two Proportions Introduction This module computes power and sample size for hypothesis tests for superiority of the ratio of two independent proportions.
More informationTwo-Sample T-Tests using Effect Size
Chapter 419 Two-Sample T-Tests using Effect Size Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the effect size is specified rather
More informationدرس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی
یادگیري ماشین توزیع هاي نمونه و تخمین نقطه اي پارامترها Sampling Distributions and Point Estimation of Parameter (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی درس هفتم 1 Outline Introduction
More informationUnit 5: Sampling Distributions of Statistics
Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate
More informationUnit 5: Sampling Distributions of Statistics
Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate
More informationTwo hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER
Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.
More informationCourse information FN3142 Quantitative finance
Course information 015 16 FN314 Quantitative finance This course is aimed at students interested in obtaining a thorough grounding in market finance and related empirical methods. Prerequisite If taken
More informationConover Test of Variances (Simulation)
Chapter 561 Conover Test of Variances (Simulation) Introduction This procedure analyzes the power and significance level of the Conover homogeneity test. This test is used to test whether two or more population
More informationStatistical Tables Compiled by Alan J. Terry
Statistical Tables Compiled by Alan J. Terry School of Science and Sport University of the West of Scotland Paisley, Scotland Contents Table 1: Cumulative binomial probabilities Page 1 Table 2: Cumulative
More informationMuch of what appears here comes from ideas presented in the book:
Chapter 11 Robust statistical methods Much of what appears here comes from ideas presented in the book: Huber, Peter J. (1981), Robust statistics, John Wiley & Sons (New York; Chichester). There are many
More informationTests for Two Independent Sensitivities
Chapter 75 Tests for Two Independent Sensitivities Introduction This procedure gives power or required sample size for comparing two diagnostic tests when the outcome is sensitivity (or specificity). In
More informationFinal Exam Suggested Solutions
University of Washington Fall 003 Department of Economics Eric Zivot Economics 483 Final Exam Suggested Solutions This is a closed book and closed note exam. However, you are allowed one page of handwritten
More informationSAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS
Science SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS Kalpesh S Tailor * * Assistant Professor, Department of Statistics, M K Bhavnagar University,
More informationPreviously, when making inferences about the population mean, μ, we were assuming the following simple conditions:
Chapter 17 Inference about a Population Mean Conditions for inference Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: (1) Our data (observations)
More informationMixed models in R using the lme4 package Part 3: Inference based on profiled deviance
Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance Douglas Bates Department of Statistics University of Wisconsin - Madison Madison January 11, 2011
More informationA Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution
A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution Debasis Kundu 1, Rameshwar D. Gupta 2 & Anubhav Manglick 1 Abstract In this paper we propose a very convenient
More information**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:
**BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,
More informationTime Observations Time Period, t
Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Time Series and Forecasting.S1 Time Series Models An example of a time series for 25 periods is plotted in Fig. 1 from the numerical
More informationFINITE SAMPLE DISTRIBUTIONS OF RISK-RETURN RATIOS
Available Online at ESci Journals Journal of Business and Finance ISSN: 305-185 (Online), 308-7714 (Print) http://www.escijournals.net/jbf FINITE SAMPLE DISTRIBUTIONS OF RISK-RETURN RATIOS Reza Habibi*
More informationA New Multivariate Kurtosis and Its Asymptotic Distribution
A ew Multivariate Kurtosis and Its Asymptotic Distribution Chiaki Miyagawa 1 and Takashi Seo 1 Department of Mathematical Information Science, Graduate School of Science, Tokyo University of Science, Tokyo,
More informationExtend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty
Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty George Photiou Lincoln College University of Oxford A dissertation submitted in partial fulfilment for
More informationIntroduction to the Maximum Likelihood Estimation Technique. September 24, 2015
Introduction to the Maximum Likelihood Estimation Technique September 24, 2015 So far our Dependent Variable is Continuous That is, our outcome variable Y is assumed to follow a normal distribution having
More information1. You are given the following information about a stationary AR(2) model:
Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4
More informationROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices
ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices Bachelier Finance Society Meeting Toronto 2010 Henley Business School at Reading Contact Author : d.ledermann@icmacentre.ac.uk Alexander
More informationMaximum Likelihood Estimation
Maximum Likelihood Estimation The likelihood and log-likelihood functions are the basis for deriving estimators for parameters, given data. While the shapes of these two functions are different, they have
More information[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright
Faculty and Institute of Actuaries Claims Reserving Manual v.2 (09/1997) Section D7 [D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright 1. Introduction
More informationSample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method
Meng-Jie Lu 1 / Wei-Hua Zhong 1 / Yu-Xiu Liu 1 / Hua-Zhang Miao 1 / Yong-Chang Li 1 / Mu-Huo Ji 2 Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method Abstract:
More informationInference of Several Log-normal Distributions
Inference of Several Log-normal Distributions Guoyi Zhang 1 and Bose Falk 2 Abstract This research considers several log-normal distributions when variances are heteroscedastic and group sizes are unequal.
More informationLecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions
Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions ELE 525: Random Processes in Information Systems Hisashi Kobayashi Department of Electrical Engineering
More informationDuangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ
Process Monitoring for Correlated Gamma Distributed Data Using Generalized Linear Model Based Control Charts Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State
More informationχ 2 distributions and confidence intervals for population variance
χ 2 distributions and confidence intervals for population variance Let Z be a standard Normal random variable, i.e., Z N(0, 1). Define Y = Z 2. Y is a non-negative random variable. Its distribution is
More informationA Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims
International Journal of Business and Economics, 007, Vol. 6, No. 3, 5-36 A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims Wan-Kai Pang * Department of Applied
More informationLecture 3: Factor models in modern portfolio choice
Lecture 3: Factor models in modern portfolio choice Prof. Massimo Guidolin Portfolio Management Spring 2016 Overview The inputs of portfolio problems Using the single index model Multi-index models Portfolio
More informationOmitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations
Journal of Statistical and Econometric Methods, vol. 2, no.3, 2013, 49-55 ISSN: 2051-5057 (print version), 2051-5065(online) Scienpress Ltd, 2013 Omitted Variables Bias in Regime-Switching Models with
More informationStatistical Intervals (One sample) (Chs )
7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and
More informationSYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data
SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu September 5, 2015
More informationCalibration of Interest Rates
WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,
More informationEstimation of a parametric function associated with the lognormal distribution 1
Communications in Statistics Theory and Methods Estimation of a parametric function associated with the lognormal distribution Jiangtao Gou a,b and Ajit C. Tamhane c, a Department of Mathematics and Statistics,
More informationObjective Bayesian Analysis for Heteroscedastic Regression
Analysis for Heteroscedastic Regression & Esther Salazar Universidade Federal do Rio de Janeiro Colóquio Inter-institucional: Modelos Estocásticos e Aplicações 2009 Collaborators: Marco Ferreira and Thais
More informationDistribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.
Lecture 34 Section 10.2 Hampden-Sydney College Fri, Oct 31, 2008 Outline 1 2 3 4 5 6 7 8 Exercise 10.4, page 633. A psychologist is studying the distribution of IQ scores of girls at an alternative high
More informationMVE051/MSG Lecture 7
MVE051/MSG810 2017 Lecture 7 Petter Mostad Chalmers November 20, 2017 The purpose of collecting and analyzing data Purpose: To build and select models for parts of the real world (which can be used for
More informationNon-Inferiority Tests for the Difference Between Two Proportions
Chapter 0 Non-Inferiority Tests for the Difference Between Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority tests of the difference in twosample
More informationConfidence Intervals for One-Sample Specificity
Chapter 7 Confidence Intervals for One-Sample Specificity Introduction This procedures calculates the (whole table) sample size necessary for a single-sample specificity confidence interval, based on a
More informationTests for Paired Means using Effect Size
Chapter 417 Tests for Paired Means using Effect Size Introduction This procedure provides sample size and power calculations for a one- or two-sided paired t-test when the effect size is specified rather
More informationChapter Seven: Confidence Intervals and Sample Size
Chapter Seven: Confidence Intervals and Sample Size A point estimate is: The best point estimate of the population mean µ is the sample mean X. Three Properties of a Good Estimator 1. Unbiased 2. Consistent
More informationLog-linear Modeling Under Generalized Inverse Sampling Scheme
Log-linear Modeling Under Generalized Inverse Sampling Scheme Soumi Lahiri (1) and Sunil Dhar (2) (1) Department of Mathematical Sciences New Jersey Institute of Technology University Heights, Newark,
More informationProbabilistic Analysis of the Economic Impact of Earthquake Prediction Systems
The Minnesota Journal of Undergraduate Mathematics Probabilistic Analysis of the Economic Impact of Earthquake Prediction Systems Tiffany Kolba and Ruyue Yuan Valparaiso University The Minnesota Journal
More informationTests for One Variance
Chapter 65 Introduction Occasionally, researchers are interested in the estimation of the variance (or standard deviation) rather than the mean. This module calculates the sample size and performs power
More informationModelling Returns: the CER and the CAPM
Modelling Returns: the CER and the CAPM Carlo Favero Favero () Modelling Returns: the CER and the CAPM 1 / 20 Econometric Modelling of Financial Returns Financial data are mostly observational data: they
More informationAn Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications.
An Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications. Joint with Prof. W. Ning & Prof. A. K. Gupta. Department of Mathematics and Statistics
More informationConfidence Intervals Introduction
Confidence Intervals Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample mean X is a point estimate of the population mean μ
More informationINSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION
INSTITUTE AND FACULTY OF ACTUARIES Curriculum 2019 SPECIMEN EXAMINATION Subject CS1A Actuarial Statistics Time allowed: Three hours and fifteen minutes INSTRUCTIONS TO THE CANDIDATE 1. Enter all the candidate
More informationNBER WORKING PAPER SERIES A REHABILITATION OF STOCHASTIC DISCOUNT FACTOR METHODOLOGY. John H. Cochrane
NBER WORKING PAPER SERIES A REHABILIAION OF SOCHASIC DISCOUN FACOR MEHODOLOGY John H. Cochrane Working Paper 8533 http://www.nber.org/papers/w8533 NAIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts
More informationDealing with forecast uncertainty in inventory models
Dealing with forecast uncertainty in inventory models 19th IIF workshop on Supply Chain Forecasting for Operations Lancaster University Dennis Prak Supervisor: Prof. R.H. Teunter June 29, 2016 Dennis Prak
More informationLecture 39 Section 11.5
on Lecture 39 Section 11.5 Hampden-Sydney College Mon, Nov 10, 2008 Outline 1 on 2 3 on 4 on Exercise 11.27, page 715. A researcher was interested in comparing body weights for two strains of laboratory
More informationAn Empirical Examination of Traditional Equity Valuation Models: The case of the Athens Stock Exchange
European Research Studies, Volume 7, Issue (1-) 004 An Empirical Examination of Traditional Equity Valuation Models: The case of the Athens Stock Exchange By G. A. Karathanassis*, S. N. Spilioti** Abstract
More informationM249 Diagnostic Quiz
THE OPEN UNIVERSITY Faculty of Mathematics and Computing M249 Diagnostic Quiz Prepared by the Course Team [Press to begin] c 2005, 2006 The Open University Last Revision Date: May 19, 2006 Version 4.2
More informationTests for Two ROC Curves
Chapter 65 Tests for Two ROC Curves Introduction Receiver operating characteristic (ROC) curves are used to summarize the accuracy of diagnostic tests. The technique is used when a criterion variable is
More informationSTAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.
STAT 509: Statistics for Engineers Dr. Dewei Wang Applied Statistics and Probability for Engineers Sixth Edition Douglas C. Montgomery George C. Runger 7 Point CHAPTER OUTLINE 7-1 Point Estimation 7-2
More informationOn the Distribution of Multivariate Sample Skewness for Assessing Multivariate Normality
On the Distribution of Multivariate Sample Skewness for Assessing Multivariate Normality Naoya Okamoto and Takashi Seo Department of Mathematical Information Science, Faculty of Science, Tokyo University
More informationStochastic Models. Statistics. Walt Pohl. February 28, Department of Business Administration
Stochastic Models Statistics Walt Pohl Universität Zürich Department of Business Administration February 28, 2013 The Value of Statistics Business people tend to underestimate the value of statistics.
More informationRiskTorrent: Using Portfolio Optimisation for Media Streaming
RiskTorrent: Using Portfolio Optimisation for Media Streaming Raul Landa, Miguel Rio Communications and Information Systems Research Group Department of Electronic and Electrical Engineering University
More informationECON 214 Elements of Statistics for Economists
ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education
More informationTwo-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models
Economic Review (Otaru University of Commerce), Vo.59, No.4, 4-48, March, 009 Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models Haruhiko
More informationSimulation Wrap-up, Statistics COS 323
Simulation Wrap-up, Statistics COS 323 Today Simulation Re-cap Statistics Variance and confidence intervals for simulations Simulation wrap-up FYI: No class or office hours Thursday Simulation wrap-up
More informationECON FINANCIAL ECONOMICS
ECON 337901 FINANCIAL ECONOMICS Peter Ireland Boston College Fall 2017 These lecture notes by Peter Ireland are licensed under a Creative Commons Attribution-NonCommerical-ShareAlike 4.0 International
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning Monte Carlo Methods Mark Schmidt University of British Columbia Winter 2019 Last Time: Markov Chains We can use Markov chains for density estimation, d p(x) = p(x 1 ) p(x }{{}
More informationECON FINANCIAL ECONOMICS
ECON 337901 FINANCIAL ECONOMICS Peter Ireland Boston College Spring 2018 These lecture notes by Peter Ireland are licensed under a Creative Commons Attribution-NonCommerical-ShareAlike 4.0 International
More informationInt. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS001) p approach
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS001) p.5901 What drives short rate dynamics? approach A functional gradient descent Audrino, Francesco University
More informationApplied Statistics I
Applied Statistics I Liang Zhang Department of Mathematics, University of Utah July 14, 2008 Liang Zhang (UofU) Applied Statistics I July 14, 2008 1 / 18 Point Estimation Liang Zhang (UofU) Applied Statistics
More informationMarket Risk Analysis Volume I
Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii
More informationContinuous Distributions
Quantitative Methods 2013 Continuous Distributions 1 The most important probability distribution in statistics is the normal distribution. Carl Friedrich Gauss (1777 1855) Normal curve A normal distribution
More informationChapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means
Chapter 11: Inference for Distributions 11.1 Inference for Means of a Population 11.2 Comparing Two Means 1 Population Standard Deviation In the previous chapter, we computed confidence intervals and performed
More informationCan we use kernel smoothing to estimate Value at Risk and Tail Value at Risk?
Can we use kernel smoothing to estimate Value at Risk and Tail Value at Risk? Ramon Alemany, Catalina Bolancé and Montserrat Guillén Riskcenter - IREA Universitat de Barcelona http://www.ub.edu/riskcenter
More informationσ 2 : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics
σ : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics CONTENTS Estimating other parameters besides μ Estimating variance Confidence intervals for σ Hypothesis tests for σ Estimating standard
More informationDesign of Engineering Experiments Part 9 Experiments with Random Factors
Design of ngineering xperiments Part 9 xperiments with Random Factors Text reference, Chapter 13, Pg. 484 Previous chapters have considered fixed factors A specific set of factor levels is chosen for the
More informationChapter 7. Inferences about Population Variances
Chapter 7. Inferences about Population Variances Introduction () The variability of a population s values is as important as the population mean. Hypothetical distribution of E. coli concentrations from
More informationμ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics
μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics CONTENTS Estimating parameters The sampling distribution Confidence intervals for μ Hypothesis tests for μ The t-distribution Comparison
More informationTwo Populations Hypothesis Testing
Two Populations Hypothesis Testing Two Proportions (Large Independent Samples) Two samples are said to be independent if the data from the first sample is not connected to the data from the second sample.
More informationLecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.
ENM 207 Lecture 12 Some Useful Continuous Distributions Normal Distribution The most important continuous probability distribution in entire field of statistics. Its graph, called the normal curve, is
More informationFinancial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR
Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Nelson Mark University of Notre Dame Fall 2017 September 11, 2017 Introduction
More informationLogit Models for Binary Data
Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response
More informationConfidence Intervals for Pearson s Correlation
Chapter 801 Confidence Intervals for Pearson s Correlation Introduction This routine calculates the sample size needed to obtain a specified width of a Pearson product-moment correlation coefficient confidence
More informationOn Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study
Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 8-26-2016 On Some Test Statistics for Testing the Population Skewness and Kurtosis:
More informationModelling strategies for bivariate circular data
Modelling strategies for bivariate circular data John T. Kent*, Kanti V. Mardia, & Charles C. Taylor Department of Statistics, University of Leeds 1 Introduction On the torus there are two common approaches
More information