The Two Sample T-test with One Variance Unknown

Size: px
Start display at page:

Download "The Two Sample T-test with One Variance Unknown"

Transcription

1 The Two Sample T-test with One Variance Unknown Arnab Maity Department of Statistics, Texas A&M University, College Station TX , U.S.A. Michael Sherman Department of Statistics, Texas A&M University, College Station TX , U.S.A. Summary We consider the situation in two sample testing when one variance is assumed to be known while the other variance is considered unknown. This situation arises, for example, when one is interested in comparing a standard treatment with a new treatment. Although this situation occurs relatively infrequently, our example discusses the important tool of moment matching and makes the classic two sample Satterthwaite t-approximation transparent. Key words: Two sample t-test; Moments; Satterthwaite; Behrens-Fisher.

2 Introduction The two sample t-test is a mainstay in statistical practice and is introduced in most, if not all, introductory statistics classes. A common assumption for the two samples is that they come from normally distributed populations with different means. When the two variances are known this leads directly to a test statistic with a standard normal distribution. If the variances are unknown, under the assumption of equal variances the two sample test uses a pooled estimate of variance. In this case the resulting test statistic has an exact t-distribution. In the case where the two variances may differ the two variances are estimated separately. The resulting test statistic does not have an exact t-distribution. Nevertheless, it is common to approximate the distribution by a t-distribution with a random degrees of freedom based on the data. In this note we consider the case where one of the variances is assumed to be known and the other is treated as unknown. This situation arises in practice when a standard treatment is being compared to a new treatment. The standard treatment leads to a known variance, while the new treatment may have a different mean (which is our main interest) and may well have a different variance. We sketch the usual distribution theory for the known cases and use the methodology to derive an approximate t-test in the two sample case with one variance known and one unknown. An interesting observation is that the method of deriving Satterthwaite s approximate degrees of freedom for the unequal variance case works for the case when one variance is assumed known and the other one is estimated. We derive the approximate t-test in the case of one unknown variance, illustrate with an example, and carry out a small simulation to evaluate the accuracy of the approximate t-distribution. This example illustrates an important approximation technique of moment matching and makes the Satterthwaite approximation transparent. This note was motivated by a question from a graduate student in an introductory class for engineering students. Hypothesis Tests for a Difference in Means of Two Normal Distributions Let Y,..., Y n Normal(µ, σ) and Y,..., Y m Normal(µ, σ) where the notation means independent and identically distributed as. Throughout, we are interested in the

3 quantity µ µ and we want to test H : µ µ =, for a fixed difference of interest.. An Overview of Existing Tests: Both Variances are Known or Both are Unknown Assume that the sample size from the first population is n and from the second m. simplest case arises when variances of both the population, i.e., σ and σ are known. In this case, the z-statistic, Z = Ȳ Ȳ σ n + σ m is used. Under H, Z has a Normal(, ) distribution and the test can be conducted based on this fact. A more complicated situation occurs when the population variances are unknown. In this case the above z-test does not apply anymore since the variances need to be estimated. If we assume σ = σ = σ, it is reasonable to combine the two samples to estimate σ, rather than estimating separately for the two samples. The usual pooled estimator of σ is given by Spooled = (n )S + (m )S, n + m where S = n i= (Y i Ȳ) /(n ) and S = m i= (Y i Ȳ) /(m ). Then, the test statistic T = Ȳ Ȳ ( Spooled + ) n m is used and has an exact t-distribution with γ = n + m degrees of freedom, under H. A more realistic case arises when σ σ. This is known as the Behrens-Fisher problem. In this case the test statistic, The T = Ȳ Ȳ ( ) () S + S n m ( ) σ ( ) does not have an exact t-distribution. The difficulty lies in the fact that + σ S + S n m n m does not have an appropriate χ distribution. However, Satterthwaite (94, 946) proposed a method which approximates the exact distribution by a suitable χ distribution. Thus, using Satterthwaite s approximation, ( ) σ ( ) γ n + σ S m n + S χ γ m, approximately.

4 The value of γ is obtained by matching the variances on both sides of the above equation. ( ) σ { This gives γ = + σ (σ /n) + (σ /m) n m n m }. γ is estimated by γ, where we substitute S and S in place of σ and σ. Thus, T has an approximate t-distribution with γ degrees of freedom. We use an entirely analogous method in Section. in the case of one variance assumed to be known. See, e.g., Scheffé (97) for more background on the Behrens-Fisher problem. For further information on estimation of the degrees of freedom see, e.g., Ames and Webster (99) or Moser and Stevens (99). The two sample t-test is a special case of the general linear model Y = Xβ + ɛ, where ɛ is distributed as Normal[, V (θ)] and β is a column vector of fixed effect parameters. The test statistic used to test a linear hypothesis, H : l T β =, is t = lt β l TĈl. () where Ĉ is an estimate of the covariance matrix of β. The general Satterthwaite approximation for the denominator degrees of freedom is computed as γ gen = (lt Ĉl) g T Ag (3) where g is the gradient of l T Cl (with respect to θ), evaluated at θ and A is the asymptotic variance-covariance matrix of θ obtained from the second derivative matrix of the likelihood equations. In this general situation, t in () has an approximate t-distribution with γ gen degrees of freedom. See Giesbrecht and Burns (985) and SAS/STAT 9. User s Guide (4) for details.. The Variance of One Population is Known but the Other is Unknown (σ known but σ unknown) The situation, where the variance of one population is known but the other unknown, arises when a new treatment is compared to an old or standard treatment. The variance for the standard treatment may be considered known from historical data, while the variance caused by the new treatment may not be the same as the old one. In this case, pooling the samples may not be a good idea since the variances may be very different from each other. On the 3

5 other hand, using T as in () requires an estimate for the variance of the standard treatment which is assumed to be known. Since σ is known, a reasonable test statistic is: T 3 = Ȳ Ȳ σ n + S m. (4) ( ) σ ( ) However, T 3 does not have an exact t-distribution since + σ σ + S does not have n m n m an appropriate χ distribution. Hence, our goal is to find the appropriate degrees of freedom, γ 3, such that ( ) σ ( ) γ 3 n + σ σ m n + S χ γ m 3, approximately. (5) One of the ways to achieve this is to use the method of moments. The two sides in (5) clearly have the same expectations so we seek to match variances: ( ) σ γ3 ( ) n + σ σ var m n + S = γ 3 m ( ) γ 3 = { (σ /m) σ m } n + σ, m since var(s ) = σ 4 /(m ). Since σ is unknown, and not assumed to be the known value of σ, we estimate it by S and obtain γ 3 = ( ) σ ( Hence we can say that γ 3 n + σ σ + S m n m ( ) σ + S n m (S /m) m. (6) ) χ bγ 3, approximately. Hence, under H T 3 = Ȳ Ȳ σ n + S m t bγ3, approximately. Based on this we can now conduct level-α t-tests: Alternative hypothesis Rejection criterion H : µ µ T 3 > t α/,bγ3 or T 3 < t α/,bγ3 H : µ µ > H : µ µ < T 3 > t α,bγ3 T 3 < t α,bγ3 4

6 Note that γ 3 in (6) can be derived as a special case of γ gen in (3) by considering a model where Y = (Y,..., Y n, Y,..., Y m ) T, β = (µ, µ ) T, l = (, ) T, X is a (n+m) matrix such that its first n rows are (, ) and last m rows are (, ), and V is a (m + n) (m + n) diagonal matrix with first n diagonal entries being the known variance σ and the others being σ [mathematical details are available from the authors]. 3 A Data Example We consider data from problem. in the textbook of Montgomery and Runger (3). We quote the problem: A polymer is manufactured in a batch chemical process. Viscosity measurements are normally made on each batch, and long experience with the process has indicated that the variability of the process is fairly stable with σ =. Fifteen batch viscosity measurements are given as follows: 74, 78, 776, 76, 745, 759, 795, 756, 74, 74, 76, 749, 739, 747, 74. A process change is made which involves switching the type of catalyst involved in the process. Following the process change, eight batch viscosity measurements are taken: 735, 775, 79, 755, 783, 76, 738, 78. Assume that process variability is unaffected by the catalyst change. Find a 9% confidence interval on the difference in mean batch viscosity resulting from the process change. Note that in the problem it is assumed that switching the type of catalyst does not affect the variance. We are interested in a possible shift of the mean and hence it seems we should allow for a shift in the variances as well. Analysis: We proceed by assuming that σ = is known but that σ is unknown, i.e., that the process variability is affected by the catalyst change. Then we use the approximate t-test developed in Section. with degrees of freedom ( ) σ + S n m γ 3 = = 5.5. (S /m) m Also, n = 5 and m = 8. Hence a 9% CI is given by (Ȳ Ȳ t.5,bγ3 σ /n + S /m, Ȳ Ȳ + t.5,bγ3 σ /n + S /m) = (.6638, 9.338). 5

7 For comparison s sake we consider the assumption that the process variability is unaffected by the catalyst change. In this situation, σ = σ =. So a 9% CI for µ µ is given by (Ȳ Ȳ z.5 σ /n + σ/m, Ȳ Ȳ + z.5 σ /n + σ/m) = (.773, 7.773). We see that the two intervals are similar but that the first interval is wider than the second. This is due to the assumption that the variance is known in the post process change group. The rough similarity between the two intervals is compatible with the usual F-test for equality of variances. Testing: H : σ = vs H : σ, we find that p value =.678. It is interesting to see how the intervals compare in a case where the variances are more different. We take the same example as in previous section but change the y values to : Y,new = [76, 83, 73, 755, 8, 786, 74, 79], so that is sample variance is In this case, results are given below, Assumption 9% CI for µ µ H : µ µ = σ and σ are known ( 33.73, ) Reject H σ is known and σ is unknown ( 44.78, 5.478) Fail to reject H We see that the results are very different. If we perform a test H : µ µ =, based on the intervals, these two methods yield different conclusions at α =.. A simulation, based on the preceding data values, was conducted with n = 5 and m = 8 where we generate, data sets with Y,..., Y n Normal(µ = 75., σ = ) and Y,..., Y m Normal(µ = 75., σ = 4). Tests for H : µ µ = vs H : µ µ are performed at α =. for each of the data sets and type I errors are estimated. The test based on T 3 gives a type I error of. whereas tests based on T and T produce type I errors of.6 and., respectively. Next, we compare the empirical distribution of our test statistic with the approximate t-distribution. We take n = and m = 5 with σ = 4 and σ =. We generate, data sets and for each data set we compute the two sample t-statistic, T 3. It is interesting to compare the empirical distribution of T, T and T 3 with their corresponding hypothesized t-distributions. Kernel density estimates of T, T and T 3 (using an Epanechnikov kernel) are shown in Figures, and 3, respectively. It is clear the agreement is very close for T and T 3. However, 6

8 for T the discrepancy is clearly visible. We see that assumption of equal variance is not desirable in this case. Finally, we compare the power curves of the tests based on T and T 3. We consider two configurations: Setup. n = 5 and m = 5 with σ = 4 and σ = Setup. n = and m = 5 with σ = 4 and σ = We take µ = and µ = µ + (for between 5 and 5) and estimate power of the tests for different values of. Results for Setup and Setup, based on, simulations, are given in Figures 4 and 5, respectively. It is evident that if the size of the sample with known variance is small, the test based on T 3 produces larger power than that of T. This effect, however, diminishes as the sample size increases. We reiterate that this example comes from a query from a student. Its practical importance is limited by the fact that typically both variances are unknown and for this reason T is more generally applicable than T 3. References Ames, M.H. and Webster, J.T. (99). On Estimating Approximate Degrees of Freedom, The American Statistician, 45, Giesbrecht, F.G. and Burns, J.C. (985). Two-Stage Analysis Based on a Mixed Model: Large-Sample Asymptotic Theory and Small-Sample Simulation Results, Biometrics, 4, Montgomery, D.C. and Runger, G.C. (3). Applied Statistics and Probability for Engineers, 3rd edition, John Wiley and Sons. Moser, B.K. and Stevens, G.R. (99). Homogeneity of Variance in the Two-Sample Means Test, The American Statistician, 46, 9-. SAS Institute Inc. (4). SAS/STAT R 9. User s Guide. Cary, NC:SAS Institute Inc. Satterthwaite, F.E. (94). Synthesis of Variance, Psychometrika, 6, Satterthwaite, F.E. (946). An Approximate Distribution of Estimates of Variance Components, Biometrics Bulletin, 6, -4. 7

9 Scheffé, H. (97). Practical Solutions of the Behrens-Fisher Problem, Journal of the American Statistical Association, 65,

10 Figure : Plots of the density estimate of T (solid line) and actual t-density with γ = n+m degrees of freedom (dashed line) along with the critical values (vertical lines) for two sided 5% tests for H : µ µ =.

11 Figure : Plots of the density estimate of T (solid line) and actual t-density with γ degrees of freedom (dashed line) along with the critical values (vertical lines) for two sided 5% tests for H : µ µ =.

12 Figure 3: Plots of density estimate of T 3 (solid line) and actual t-density with γ 3 degrees of freedom (dashed line) along with the critical values (vertical lines) for two sided 5% tests for H : µ µ =.

13 .9 T3 T Figure 4: Power curves of the t-tests based on T 3 (solid line) and T (dashed line) when n = 5 and m = 5.

14 .9 T3 T Figure 5: Power curves of the t-tests based on T 3 (solid line) and T (dashed line) when n = and m = 5.

Statistical Methodology. A note on a two-sample T test with one variance unknown

Statistical Methodology. A note on a two-sample T test with one variance unknown Statistical Methodology 8 (0) 58 534 Contents lists available at SciVerse ScienceDirect Statistical Methodology journal homepage: www.elsevier.com/locate/stamet A note on a two-sample T test with one variance

More information

Experimental Design and Statistics - AGA47A

Experimental Design and Statistics - AGA47A Experimental Design and Statistics - AGA47A Czech University of Life Sciences in Prague Department of Genetics and Breeding Fall/Winter 2014/2015 Matúš Maciak (@ A 211) Office Hours: M 14:00 15:30 W 15:30

More information

Inferences on Correlation Coefficients of Bivariate Log-normal Distributions

Inferences on Correlation Coefficients of Bivariate Log-normal Distributions Inferences on Correlation Coefficients of Bivariate Log-normal Distributions Guoyi Zhang 1 and Zhongxue Chen 2 Abstract This article considers inference on correlation coefficients of bivariate log-normal

More information

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model AENSI Journals Australian Journal of Basic and Applied Sciences Journal home page: wwwajbaswebcom Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model Khawla Mustafa Sadiq University

More information

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems Interval estimation September 29, 2017 STAT 151 Class 7 Slide 1 Outline of Topics 1 Basic ideas 2 Sampling variation and CLT 3 Interval estimation using X 4 More general problems STAT 151 Class 7 Slide

More information

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu

More information

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Chapter 14 : Statistical Inference 1 Chapter 14 : Introduction to Statistical Inference Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Data x

More information

8.1 Estimation of the Mean and Proportion

8.1 Estimation of the Mean and Proportion 8.1 Estimation of the Mean and Proportion Statistical inference enables us to make judgments about a population on the basis of sample information. The mean, standard deviation, and proportions of a population

More information

8.3 CI for μ, σ NOT known (old 8.4)

8.3 CI for μ, σ NOT known (old 8.4) GOALS: 1. Learn the properties of the student t distribution and the t curve. 2. Understand how degrees of freedom, df, relate to t curves. 3. Recognize that t curves approach the SNC as df increases.

More information

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29 Chapter 5 Univariate time-series analysis () Chapter 5 Univariate time-series analysis 1 / 29 Time-Series Time-series is a sequence fx 1, x 2,..., x T g or fx t g, t = 1,..., T, where t is an index denoting

More information

The Two-Sample Independent Sample t Test

The Two-Sample Independent Sample t Test Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 The General Formula The Equal-n Formula 4 5 6 Independence Normality Homogeneity of Variances 7 Non-Normality Unequal

More information

STATISTICS and PROBABILITY

STATISTICS and PROBABILITY Introduction to Statistics Atatürk University STATISTICS and PROBABILITY LECTURE: SAMPLING DISTRIBUTIONS and POINT ESTIMATIONS Prof. Dr. İrfan KAYMAZ Atatürk University Engineering Faculty Department of

More information

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :

More information

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic

More information

Homework Problems Stat 479

Homework Problems Stat 479 Chapter 10 91. * A random sample, X1, X2,, Xn, is drawn from a distribution with a mean of 2/3 and a variance of 1/18. ˆ = (X1 + X2 + + Xn)/(n-1) is the estimator of the distribution mean θ. Find MSE(

More information

Superiority by a Margin Tests for the Ratio of Two Proportions

Superiority by a Margin Tests for the Ratio of Two Proportions Chapter 06 Superiority by a Margin Tests for the Ratio of Two Proportions Introduction This module computes power and sample size for hypothesis tests for superiority of the ratio of two independent proportions.

More information

Two-Sample T-Tests using Effect Size

Two-Sample T-Tests using Effect Size Chapter 419 Two-Sample T-Tests using Effect Size Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the effect size is specified rather

More information

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی یادگیري ماشین توزیع هاي نمونه و تخمین نقطه اي پارامترها Sampling Distributions and Point Estimation of Parameter (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی درس هفتم 1 Outline Introduction

More information

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate

More information

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.

More information

Course information FN3142 Quantitative finance

Course information FN3142 Quantitative finance Course information 015 16 FN314 Quantitative finance This course is aimed at students interested in obtaining a thorough grounding in market finance and related empirical methods. Prerequisite If taken

More information

Conover Test of Variances (Simulation)

Conover Test of Variances (Simulation) Chapter 561 Conover Test of Variances (Simulation) Introduction This procedure analyzes the power and significance level of the Conover homogeneity test. This test is used to test whether two or more population

More information

Statistical Tables Compiled by Alan J. Terry

Statistical Tables Compiled by Alan J. Terry Statistical Tables Compiled by Alan J. Terry School of Science and Sport University of the West of Scotland Paisley, Scotland Contents Table 1: Cumulative binomial probabilities Page 1 Table 2: Cumulative

More information

Much of what appears here comes from ideas presented in the book:

Much of what appears here comes from ideas presented in the book: Chapter 11 Robust statistical methods Much of what appears here comes from ideas presented in the book: Huber, Peter J. (1981), Robust statistics, John Wiley & Sons (New York; Chichester). There are many

More information

Tests for Two Independent Sensitivities

Tests for Two Independent Sensitivities Chapter 75 Tests for Two Independent Sensitivities Introduction This procedure gives power or required sample size for comparing two diagnostic tests when the outcome is sensitivity (or specificity). In

More information

Final Exam Suggested Solutions

Final Exam Suggested Solutions University of Washington Fall 003 Department of Economics Eric Zivot Economics 483 Final Exam Suggested Solutions This is a closed book and closed note exam. However, you are allowed one page of handwritten

More information

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS

SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS Science SAMPLE STANDARD DEVIATION(s) CHART UNDER THE ASSUMPTION OF MODERATENESS AND ITS PERFORMANCE ANALYSIS Kalpesh S Tailor * * Assistant Professor, Department of Statistics, M K Bhavnagar University,

More information

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: Chapter 17 Inference about a Population Mean Conditions for inference Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: (1) Our data (observations)

More information

Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance

Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance Douglas Bates Department of Statistics University of Wisconsin - Madison Madison January 11, 2011

More information

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution Debasis Kundu 1, Rameshwar D. Gupta 2 & Anubhav Manglick 1 Abstract In this paper we propose a very convenient

More information

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is: **BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,

More information

Time Observations Time Period, t

Time Observations Time Period, t Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Time Series and Forecasting.S1 Time Series Models An example of a time series for 25 periods is plotted in Fig. 1 from the numerical

More information

FINITE SAMPLE DISTRIBUTIONS OF RISK-RETURN RATIOS

FINITE SAMPLE DISTRIBUTIONS OF RISK-RETURN RATIOS Available Online at ESci Journals Journal of Business and Finance ISSN: 305-185 (Online), 308-7714 (Print) http://www.escijournals.net/jbf FINITE SAMPLE DISTRIBUTIONS OF RISK-RETURN RATIOS Reza Habibi*

More information

A New Multivariate Kurtosis and Its Asymptotic Distribution

A New Multivariate Kurtosis and Its Asymptotic Distribution A ew Multivariate Kurtosis and Its Asymptotic Distribution Chiaki Miyagawa 1 and Takashi Seo 1 Department of Mathematical Information Science, Graduate School of Science, Tokyo University of Science, Tokyo,

More information

Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty

Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty George Photiou Lincoln College University of Oxford A dissertation submitted in partial fulfilment for

More information

Introduction to the Maximum Likelihood Estimation Technique. September 24, 2015

Introduction to the Maximum Likelihood Estimation Technique. September 24, 2015 Introduction to the Maximum Likelihood Estimation Technique September 24, 2015 So far our Dependent Variable is Continuous That is, our outcome variable Y is assumed to follow a normal distribution having

More information

1. You are given the following information about a stationary AR(2) model:

1. You are given the following information about a stationary AR(2) model: Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4

More information

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices Bachelier Finance Society Meeting Toronto 2010 Henley Business School at Reading Contact Author : d.ledermann@icmacentre.ac.uk Alexander

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation The likelihood and log-likelihood functions are the basis for deriving estimators for parameters, given data. While the shapes of these two functions are different, they have

More information

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright Faculty and Institute of Actuaries Claims Reserving Manual v.2 (09/1997) Section D7 [D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright 1. Introduction

More information

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method Meng-Jie Lu 1 / Wei-Hua Zhong 1 / Yu-Xiu Liu 1 / Hua-Zhang Miao 1 / Yong-Chang Li 1 / Mu-Huo Ji 2 Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method Abstract:

More information

Inference of Several Log-normal Distributions

Inference of Several Log-normal Distributions Inference of Several Log-normal Distributions Guoyi Zhang 1 and Bose Falk 2 Abstract This research considers several log-normal distributions when variances are heteroscedastic and group sizes are unequal.

More information

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions ELE 525: Random Processes in Information Systems Hisashi Kobayashi Department of Electrical Engineering

More information

Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ

Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ Process Monitoring for Correlated Gamma Distributed Data Using Generalized Linear Model Based Control Charts Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State

More information

χ 2 distributions and confidence intervals for population variance

χ 2 distributions and confidence intervals for population variance χ 2 distributions and confidence intervals for population variance Let Z be a standard Normal random variable, i.e., Z N(0, 1). Define Y = Z 2. Y is a non-negative random variable. Its distribution is

More information

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims International Journal of Business and Economics, 007, Vol. 6, No. 3, 5-36 A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims Wan-Kai Pang * Department of Applied

More information

Lecture 3: Factor models in modern portfolio choice

Lecture 3: Factor models in modern portfolio choice Lecture 3: Factor models in modern portfolio choice Prof. Massimo Guidolin Portfolio Management Spring 2016 Overview The inputs of portfolio problems Using the single index model Multi-index models Portfolio

More information

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations Journal of Statistical and Econometric Methods, vol. 2, no.3, 2013, 49-55 ISSN: 2051-5057 (print version), 2051-5065(online) Scienpress Ltd, 2013 Omitted Variables Bias in Regime-Switching Models with

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data

SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data SYSM 6304 Risk and Decision Analysis Lecture 2: Fitting Distributions to Data M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu September 5, 2015

More information

Calibration of Interest Rates

Calibration of Interest Rates WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,

More information

Estimation of a parametric function associated with the lognormal distribution 1

Estimation of a parametric function associated with the lognormal distribution 1 Communications in Statistics Theory and Methods Estimation of a parametric function associated with the lognormal distribution Jiangtao Gou a,b and Ajit C. Tamhane c, a Department of Mathematics and Statistics,

More information

Objective Bayesian Analysis for Heteroscedastic Regression

Objective Bayesian Analysis for Heteroscedastic Regression Analysis for Heteroscedastic Regression & Esther Salazar Universidade Federal do Rio de Janeiro Colóquio Inter-institucional: Modelos Estocásticos e Aplicações 2009 Collaborators: Marco Ferreira and Thais

More information

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether.

Distribution. Lecture 34 Section Fri, Oct 31, Hampden-Sydney College. Student s t Distribution. Robb T. Koether. Lecture 34 Section 10.2 Hampden-Sydney College Fri, Oct 31, 2008 Outline 1 2 3 4 5 6 7 8 Exercise 10.4, page 633. A psychologist is studying the distribution of IQ scores of girls at an alternative high

More information

MVE051/MSG Lecture 7

MVE051/MSG Lecture 7 MVE051/MSG810 2017 Lecture 7 Petter Mostad Chalmers November 20, 2017 The purpose of collecting and analyzing data Purpose: To build and select models for parts of the real world (which can be used for

More information

Non-Inferiority Tests for the Difference Between Two Proportions

Non-Inferiority Tests for the Difference Between Two Proportions Chapter 0 Non-Inferiority Tests for the Difference Between Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority tests of the difference in twosample

More information

Confidence Intervals for One-Sample Specificity

Confidence Intervals for One-Sample Specificity Chapter 7 Confidence Intervals for One-Sample Specificity Introduction This procedures calculates the (whole table) sample size necessary for a single-sample specificity confidence interval, based on a

More information

Tests for Paired Means using Effect Size

Tests for Paired Means using Effect Size Chapter 417 Tests for Paired Means using Effect Size Introduction This procedure provides sample size and power calculations for a one- or two-sided paired t-test when the effect size is specified rather

More information

Chapter Seven: Confidence Intervals and Sample Size

Chapter Seven: Confidence Intervals and Sample Size Chapter Seven: Confidence Intervals and Sample Size A point estimate is: The best point estimate of the population mean µ is the sample mean X. Three Properties of a Good Estimator 1. Unbiased 2. Consistent

More information

Log-linear Modeling Under Generalized Inverse Sampling Scheme

Log-linear Modeling Under Generalized Inverse Sampling Scheme Log-linear Modeling Under Generalized Inverse Sampling Scheme Soumi Lahiri (1) and Sunil Dhar (2) (1) Department of Mathematical Sciences New Jersey Institute of Technology University Heights, Newark,

More information

Probabilistic Analysis of the Economic Impact of Earthquake Prediction Systems

Probabilistic Analysis of the Economic Impact of Earthquake Prediction Systems The Minnesota Journal of Undergraduate Mathematics Probabilistic Analysis of the Economic Impact of Earthquake Prediction Systems Tiffany Kolba and Ruyue Yuan Valparaiso University The Minnesota Journal

More information

Tests for One Variance

Tests for One Variance Chapter 65 Introduction Occasionally, researchers are interested in the estimation of the variance (or standard deviation) rather than the mean. This module calculates the sample size and performs power

More information

Modelling Returns: the CER and the CAPM

Modelling Returns: the CER and the CAPM Modelling Returns: the CER and the CAPM Carlo Favero Favero () Modelling Returns: the CER and the CAPM 1 / 20 Econometric Modelling of Financial Returns Financial data are mostly observational data: they

More information

An Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications.

An Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications. An Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications. Joint with Prof. W. Ning & Prof. A. K. Gupta. Department of Mathematics and Statistics

More information

Confidence Intervals Introduction

Confidence Intervals Introduction Confidence Intervals Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample mean X is a point estimate of the population mean μ

More information

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION INSTITUTE AND FACULTY OF ACTUARIES Curriculum 2019 SPECIMEN EXAMINATION Subject CS1A Actuarial Statistics Time allowed: Three hours and fifteen minutes INSTRUCTIONS TO THE CANDIDATE 1. Enter all the candidate

More information

NBER WORKING PAPER SERIES A REHABILITATION OF STOCHASTIC DISCOUNT FACTOR METHODOLOGY. John H. Cochrane

NBER WORKING PAPER SERIES A REHABILITATION OF STOCHASTIC DISCOUNT FACTOR METHODOLOGY. John H. Cochrane NBER WORKING PAPER SERIES A REHABILIAION OF SOCHASIC DISCOUN FACOR MEHODOLOGY John H. Cochrane Working Paper 8533 http://www.nber.org/papers/w8533 NAIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts

More information

Dealing with forecast uncertainty in inventory models

Dealing with forecast uncertainty in inventory models Dealing with forecast uncertainty in inventory models 19th IIF workshop on Supply Chain Forecasting for Operations Lancaster University Dennis Prak Supervisor: Prof. R.H. Teunter June 29, 2016 Dennis Prak

More information

Lecture 39 Section 11.5

Lecture 39 Section 11.5 on Lecture 39 Section 11.5 Hampden-Sydney College Mon, Nov 10, 2008 Outline 1 on 2 3 on 4 on Exercise 11.27, page 715. A researcher was interested in comparing body weights for two strains of laboratory

More information

An Empirical Examination of Traditional Equity Valuation Models: The case of the Athens Stock Exchange

An Empirical Examination of Traditional Equity Valuation Models: The case of the Athens Stock Exchange European Research Studies, Volume 7, Issue (1-) 004 An Empirical Examination of Traditional Equity Valuation Models: The case of the Athens Stock Exchange By G. A. Karathanassis*, S. N. Spilioti** Abstract

More information

M249 Diagnostic Quiz

M249 Diagnostic Quiz THE OPEN UNIVERSITY Faculty of Mathematics and Computing M249 Diagnostic Quiz Prepared by the Course Team [Press to begin] c 2005, 2006 The Open University Last Revision Date: May 19, 2006 Version 4.2

More information

Tests for Two ROC Curves

Tests for Two ROC Curves Chapter 65 Tests for Two ROC Curves Introduction Receiver operating characteristic (ROC) curves are used to summarize the accuracy of diagnostic tests. The technique is used when a criterion variable is

More information

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved. STAT 509: Statistics for Engineers Dr. Dewei Wang Applied Statistics and Probability for Engineers Sixth Edition Douglas C. Montgomery George C. Runger 7 Point CHAPTER OUTLINE 7-1 Point Estimation 7-2

More information

On the Distribution of Multivariate Sample Skewness for Assessing Multivariate Normality

On the Distribution of Multivariate Sample Skewness for Assessing Multivariate Normality On the Distribution of Multivariate Sample Skewness for Assessing Multivariate Normality Naoya Okamoto and Takashi Seo Department of Mathematical Information Science, Faculty of Science, Tokyo University

More information

Stochastic Models. Statistics. Walt Pohl. February 28, Department of Business Administration

Stochastic Models. Statistics. Walt Pohl. February 28, Department of Business Administration Stochastic Models Statistics Walt Pohl Universität Zürich Department of Business Administration February 28, 2013 The Value of Statistics Business people tend to underestimate the value of statistics.

More information

RiskTorrent: Using Portfolio Optimisation for Media Streaming

RiskTorrent: Using Portfolio Optimisation for Media Streaming RiskTorrent: Using Portfolio Optimisation for Media Streaming Raul Landa, Miguel Rio Communications and Information Systems Research Group Department of Electronic and Electrical Engineering University

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models

Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models Economic Review (Otaru University of Commerce), Vo.59, No.4, 4-48, March, 009 Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models Haruhiko

More information

Simulation Wrap-up, Statistics COS 323

Simulation Wrap-up, Statistics COS 323 Simulation Wrap-up, Statistics COS 323 Today Simulation Re-cap Statistics Variance and confidence intervals for simulations Simulation wrap-up FYI: No class or office hours Thursday Simulation wrap-up

More information

ECON FINANCIAL ECONOMICS

ECON FINANCIAL ECONOMICS ECON 337901 FINANCIAL ECONOMICS Peter Ireland Boston College Fall 2017 These lecture notes by Peter Ireland are licensed under a Creative Commons Attribution-NonCommerical-ShareAlike 4.0 International

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning Monte Carlo Methods Mark Schmidt University of British Columbia Winter 2019 Last Time: Markov Chains We can use Markov chains for density estimation, d p(x) = p(x 1 ) p(x }{{}

More information

ECON FINANCIAL ECONOMICS

ECON FINANCIAL ECONOMICS ECON 337901 FINANCIAL ECONOMICS Peter Ireland Boston College Spring 2018 These lecture notes by Peter Ireland are licensed under a Creative Commons Attribution-NonCommerical-ShareAlike 4.0 International

More information

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS001) p approach

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS001) p approach Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS001) p.5901 What drives short rate dynamics? approach A functional gradient descent Audrino, Francesco University

More information

Applied Statistics I

Applied Statistics I Applied Statistics I Liang Zhang Department of Mathematics, University of Utah July 14, 2008 Liang Zhang (UofU) Applied Statistics I July 14, 2008 1 / 18 Point Estimation Liang Zhang (UofU) Applied Statistics

More information

Market Risk Analysis Volume I

Market Risk Analysis Volume I Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii

More information

Continuous Distributions

Continuous Distributions Quantitative Methods 2013 Continuous Distributions 1 The most important probability distribution in statistics is the normal distribution. Carl Friedrich Gauss (1777 1855) Normal curve A normal distribution

More information

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means Chapter 11: Inference for Distributions 11.1 Inference for Means of a Population 11.2 Comparing Two Means 1 Population Standard Deviation In the previous chapter, we computed confidence intervals and performed

More information

Can we use kernel smoothing to estimate Value at Risk and Tail Value at Risk?

Can we use kernel smoothing to estimate Value at Risk and Tail Value at Risk? Can we use kernel smoothing to estimate Value at Risk and Tail Value at Risk? Ramon Alemany, Catalina Bolancé and Montserrat Guillén Riskcenter - IREA Universitat de Barcelona http://www.ub.edu/riskcenter

More information

σ 2 : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

σ 2 : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics σ : ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics CONTENTS Estimating other parameters besides μ Estimating variance Confidence intervals for σ Hypothesis tests for σ Estimating standard

More information

Design of Engineering Experiments Part 9 Experiments with Random Factors

Design of Engineering Experiments Part 9 Experiments with Random Factors Design of ngineering xperiments Part 9 xperiments with Random Factors Text reference, Chapter 13, Pg. 484 Previous chapters have considered fixed factors A specific set of factor levels is chosen for the

More information

Chapter 7. Inferences about Population Variances

Chapter 7. Inferences about Population Variances Chapter 7. Inferences about Population Variances Introduction () The variability of a population s values is as important as the population mean. Hypothetical distribution of E. coli concentrations from

More information

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics

μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics μ: ESTIMATES, CONFIDENCE INTERVALS, AND TESTS Business Statistics CONTENTS Estimating parameters The sampling distribution Confidence intervals for μ Hypothesis tests for μ The t-distribution Comparison

More information

Two Populations Hypothesis Testing

Two Populations Hypothesis Testing Two Populations Hypothesis Testing Two Proportions (Large Independent Samples) Two samples are said to be independent if the data from the first sample is not connected to the data from the second sample.

More information

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics.

Lecture 12. Some Useful Continuous Distributions. The most important continuous probability distribution in entire field of statistics. ENM 207 Lecture 12 Some Useful Continuous Distributions Normal Distribution The most important continuous probability distribution in entire field of statistics. Its graph, called the normal curve, is

More information

Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR

Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Nelson Mark University of Notre Dame Fall 2017 September 11, 2017 Introduction

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response

More information

Confidence Intervals for Pearson s Correlation

Confidence Intervals for Pearson s Correlation Chapter 801 Confidence Intervals for Pearson s Correlation Introduction This routine calculates the sample size needed to obtain a specified width of a Pearson product-moment correlation coefficient confidence

More information

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 8-26-2016 On Some Test Statistics for Testing the Population Skewness and Kurtosis:

More information

Modelling strategies for bivariate circular data

Modelling strategies for bivariate circular data Modelling strategies for bivariate circular data John T. Kent*, Kanti V. Mardia, & Charles C. Taylor Department of Statistics, University of Leeds 1 Introduction On the torus there are two common approaches

More information