Characteristics of measures of directional dependence - Monte Carlo studies

Size: px
Start display at page:

Download "Characteristics of measures of directional dependence - Monte Carlo studies"

Transcription

1 Characteristics of measures of directional dependence - Monte Carlo studies Alexander von Eye Richard P. DeShon Michigan State University

2 Characteristics of measures of directional dependence - Monte Carlo studies abstract Recent results (Dodge & Rousson, 000; von Eye & DeShon, 008) show that, in the context of linear models, the response variable will always have less skew than the explanatory variable. This applies accordingly to the kurtosis of the two variables. These facts can be used to determine the direction of dependence. Specifically, using third and fourth order moments, and information concerning the deviation of variables from normality, it can be ascertained which of two variables is the response and which is the explanatory variable. In this article, the ratio of two skewness measures, the ratio of two kurtosis measures, and one omnibus test of deviation from normality are examined. Simulation studies are reported that allow one to answer the question whether, these measures (1) are sensitive to various data distributions, () sample size, and (3) a simple correlation structure. Results suggest that all three measures are highly sensitive to these factors. The ratio of two kurtosis measures is sensitive in particular to the correlation structure. It is concluded that (1) directional dependence can be based on various types of deviation from normality, () measures that respond to deviations based on skewness and kurtosis have characteristics that make them prime candidates for determining directional dependence, and (3) each of the measures proposed thus far is highly sensitive to either specific or omnibus deviations from normality.

3 Characteristics of measures of directional dependence - A Monte Carlo study It is widely believed that the tools of statistics cannot be used to determine the direction of effect in data collected using observational research methods. One reason for this is that it is easy to show that standard statistical methods such as regression analysis do not allow one to derive conclusions about directional dependence (von Eye & DeShon, 008). Another reason is that, as a widely used adage states, correlation does not imply causation (but it sure is a hint; see Lynd- Stevenson, 007; Tufte, 006). Recently, matters have taken a most interesting and dramatic turn. To be able to derive conclusions about directional dependence, authors (e.g., Dodge & Rousson, 000) have proposed moving away from standard methods of analysis which involve employing no more than first and second order moments. Instead, the authors propose using at least third order moments, that is indicators of skewness. Recently, von Eye and DeShon (008) proposed to also use fourth order moments, that is, indicators of kurtosis. Based on recent work, it is now possible to determine which variable in a linear model functions as the response variable and which functions as the explanatory variable (i.e., directional dependence). This evidence is made possible through a fundamental shift in the representation of linear models. Virtually all statistical analyses of continuous variables in psychological research are based on knowledge of the first two moments (i.e., means, variances, and covariances) of the joint distribution of two or more variables. This is entirely reasonable under the common normality assumption. However, it is also well known that the variables in empirical research are not normally distributed (Micceri, 1989) and, therefore, many of the higher order moments such as skew and kurtosis deviate from expectancy. Thus, higher order moments provide critical information that can be used to determine directional dependency. Specifically, under standard assumptions for linear models, the response variable is a linear combination of an explanatory variable - that need not be normally distributed - and a normally distributed error. If the explanatory variable has a nonnormal distribution, and the error term is normally distributed, then the response variable is a linear convolution of these two distributions that must have a distribution closer to normality than the explanatory variable. In this article, we review a selection of the new approaches to determining directional

4 directional dependence - simulations; p. 4 dependency. Specifically, we focus on those approaches that use third and fourth order moments (see, for example, Dodge & Rousson, 000, 001; Muddapur, 003; Shimizu, Hoyer, Hyvärinen, & Kerminen, 006; Shimizu & Kano, 006; Sungur, 005; von Eye & DeShon, 008). We then ask questions concerning the characteristics of two measures used to derive conclusions about directional dependence. We answer these questions based on results from Monte Carlo simulation studies. 1. Assessing Directional Dependence The ideas that guided the recent and very active research area of devising methods for the determination of directional dependence are based on the proposition that, if one random variate is determined by one or several others, it will have describable distributional characteristics. Statistical methods that are based on first and second order moments, for example, regression analysis, are unable to reflect these characteristics (for an illustration, see von Eye & DeShon, 008). Therefore, it appears reasonable to conclude that statistical evidence cannot be used to determine the direction of effects or the direction of causality. However, higher order moments provide information that can be used to determine the direction of effects. Dodge and Rousson (000, 001) presented a proof for the proposition that the response variable will always have less skew than the explanatory variable. Based on this result, they proposed comparing the skewness of the response and the explanatory variables. Von Eye and DeShon (008) discussed a generalization of this proposition, according to which any deviation from normality will be less pronounced in the response variable than in the explanatory variable. Therefore, the widely used Mardia tests of skewness and kurtosis (Mardia, 1970, 1980; cf. von Eye, 005, 006; von Eye & Gardiner, 004) as well as omnibus tests of normality such as the one proposed by D Agostino (1971; D Agostino, Belanger, & D Agostino, 1990; D Agostino & Pearson, 1973) can be used to determine direction of dependence. In the following sections, we provide an overview of three approaches to determining the direction of effects. The first is based on Dodge and Rousson s (000) proposal of using skewness. The second approach, proposed by von Eye and DeShon (008), is based on the omnibus measure of deviation from normality proposed by D Agostino (1971). The third approach, also proposed by

5 directional dependence - simulations; p. 5 von Eye and DeShon (008), involves using the ratio of two kurtosis measures. 1.1 Skewness- and Kurtosis-Based Approaches Dodge and Rousson s idea (000) of using skewness to determine direction of effect (details follow below) goes back to Karl Pearson. In an article that was published in 1895, Pearson noted that skewness and kurtosis can be used to appraise deviations from normality. Tests of (multi)normality that are based on skewness have been proposed, for example, by Mardia (1970, 1980). Dodge and Rousson s procedure involves the following steps. Let Y be the response variable and X the explanatory variable. Then, the skewness of X and the skewness of Y are defined by the third moment, or and respectively. The fourth moment, the kurtosis of X and Y, is and respectively. For X, Y N(0; 1), ã = 0 and ä = 3.0. Definition and computational aspects of central moments can be found in many textbooks (e.g., Hogg & Tanis, 1993; see also D Agostino & Pearson, 1973; Walwyn, 005). To come to a conclusion about the direction of effect, Dodge and Rousson (000) propose using the Bravais-Pearson correlation where cov X,Y is the covariance between X and Y, and ó X and ó Y are the standard deviations of X and Y, respectively (for interpretations of ñ, see, for example, Dodge & Rousson, 001; Falk & Well, 1996; Rodgers & Nicewander, 1988; Rovine & von Eye, 1997). Now, using ã and ã, Dodge and Rousson (000, 001) showed that the cube of the Y X correlation coefficient can be given by the ratio of the skewness of the response variable to the

6 directional dependence - simulations; p. 6 skewness of the explanatory variable, or if ã X 0. This equation implies that the skewness of the response variable is always smaller than the skewness of the explanatory variable (in absolute values, because the range of the correlation is -1 ñ +1). If, however, the cube of the correlation is greater than 1.0, (1) the result is incorrect by definition, () researchers may redefine the status of X and Y as response and explanatory variables, or (3) X and Y are not involved in a directed or causal relationship at all. According to von Eye and DeShon (008) there are five interesting implications of this relationship: (1) If a symmetric error, for example, normally distributed noise, is added to a skewed explanatory variable, the response variable will be less skewed; () the cube of the correlation coefficient can be interpreted as the percentage of skewness that is left after a linear model was applied to describe the relationship between X and Y; (3) if the ratio of the two skewness scores lies outside [-1; +1], the cube of the correlation suggests that not X but Y may be the explanatory variable; (4) if X is perfectly symmetric as is the case in normal or in uniform distributions, directional dependence cannot be determined; and (5) above and beyond what is generally known about the Bravais-Pearson s correlation, ñ does, in the context of linear modeling, possess asymmetric properties (Dodge & Rousson, 001). In a way similar to using third order moments, one can also use fourth order moments to determine the direction of effect (von Eye & Deshon, 008; cf. Dodge & Rousson, 001). 3 Specifically, in addition to ñ, one can calculate 3 for ä X > 0. As for ñ, Y can be interpreted as the response or dependent variable if î < 1.0. If î 1.0, either X is the response variable, or X and Y are not involved in a directed relationship. The relationship of î with the Bravais-Pearson correlation still needs to be explored in detail. However, the use of î is based on the original idea that variates that are completely determined will show no

7 more than random deviation from normality. 1. von Eye and DeShon s Deviation-from-Normality Approach directional dependence - simulations; p. 7 An alternative approach to the one presented in Section 1.1 was proposed von von Eye and DeSon (008). This approach is based on a statistic proposed by D Agostino (1971; cf. D Agostino et al., 1990; D Agostino & Pearson, 1973). This statistic is omnibus to both skewness and kurtosis violations of normality. It uses skewness as well as kurtosis information, and creates a score that is distributed as with degrees of freedom. Specifically, consider the third standardized moment, which, for the normal distribution assumes the value of 0, and the fourth standardized moment, which, for the normal distribution, assumes the value of 3. Based on these two moments, D Agostino (1971) proposes a test that enables researchers to detect deviations from normality that are due to either skewness or kurtosis. The test statistic is where z( b 1) and z(b ) are the normal approximations to b 1and b (detailed equations can be found in D Agostino et al., 1990). The K statistic is approximately distributed as with df =, when the population is normally distributed. To determine the direction of effect, the K scores of two variables are compared in a fashion parallel to the comparison of two skewness scores. 3 In the following sections, we present results from simulation studies for the measures ñ and 3 K. We begin with ñ.. Monte Carlo Simulation Studies on the Direction of Effects The following Monte Carlo studies were designed to answer questions concerning the behavior of ñ 3 and K under various conditions (the detailed questions will be asked at the beginning of Sections.1 and.). In each of the Monte Carlo studies, the following factors were varied.

8 Simulation results for d; p. 8 Shape of distribution. The following five distribution shapes were realized (cf. Von Eye, 005; von Eye, von Eye, & Bogat, 006): (1) Normal distribution. The generator GASDEV from the Numerical Recipes FORTRAN collection (Press, Flannery, Teukolsky, & Vetterling, 1989) was used to create N(0;1)- distributed data (see also Sicking, 1994). The data created using GASDEV were expected not to deviate from multinormality. () Uniform distribution. The generator RANDOM, available in the Power Station s PortLib function pool, was used to create pseudo random numbers, z, from the interval 0 z < 1. The data created using RANDOM were expected to deviate from multinormality. They are symmetric, that is, skew will be about zero, but they show kurtosis characterized by heavy tails. (3) Logarithmic distribution. Uniform variates x were subjected to the logarithmic transformation log(x). Specifically, a transformed variable x was created as The resulting data were expected to exhibit some skewness and elevated kurtosis. (4) Inverse Laplace-transformed. The Laplace probability distribution, also known as double exponential distribution, is for x < á, - < x <, and â > 0. This distribution has a mean of zero, a skewness of zero, and a kurtosis of 0. For â = 1 and x centered scores, the probability distribution becomes A uniform distribution has no skew but exhibits increased kurtosis. Performing an inverse Laplace transformation on a uniform distribution should, therefore, result in a distribution with reduced kurtosis and possibly elevated skewness. The Laplace function has no inverse. Therefore, the transformation introduced by von Eye and von Eye (005) was performed. Application of this transformation to the uniformly distributed random numbers results in a distribution with both slightly elevated skewness and elevated kurtosis. The kurtosis of the

9 Simulation results for d; p. 9 transformed uniform distribution has a positive sign. The kurtosis of the uniform distribution was negative. In other words, this transformation changed the distribution from being heavytailed to heavy around the belt line. 1/3 (5) Cube root transformation. This transformation was used to create x = ½ x from the uniform x scores. Considering that the uniform scores that were cube root-transformed had no skewness and an only slightly elevated kurtosis, the resulting scores should have elevated skewness and elevated kurtosis. 3 For the simulations of ñ and î, two random variates were created for the calculation of the ratio of the two skewness and the two kurtosis scores. Each of these variates was subjected to the same transformations. Crossed, 5 x 5 patterns resulted. 3 In brief, for the simulation of ñ, the following data sets were created: (1) normally distributed variates; () uniformly distributed variates; (3) logarithmically-transformed variates (from a uniform distribution); (4) uniformly distributed variates that were subjected to the substitute of the inverse Laplace transformation; and (5) cube root-transformed variates. Magnitude of variable intercorrelation. The correlation that was taken into account varied between 0.0 and 0.6, in increments of 0.. The correlation was created between the original, normally distributed variable and the variates that resulted from the transformations described above (for details on how the correlations were created, see von Eye, 005). Sample size (N): The sample size in the simulation runs varied from 00 to 100 objects, in increments of Direction of Effects - A Monte Carlo Study on the Ratio of Two Skewness Scores To follow the notation proposed by Dodge and Rousson (000) and also used by von Eye and DeShon 3 (008), we will, from now on, use the character d to denote ñ. The Monte Carlo study for d was designed to address the following questions: 1. Is the measure d sensitive to various shapes of distributions?. What is the á curve of d; that is, how many null hypotheses are rejected? The design that was used to study the behavior of d had 5 (distribution shape) x 5 (distribution shape of the second random variate) x 4 (correlation levels) x 6 (sample sizes) = 600 cells. Each run was repeated 100 times so that a total of 60,000 samples was available for analysis. The repetitions

10 Simulation results for d; p. 10 were used as a covariate. In the following sections, we answer the two questions that were posed above. We begin with Question Is the Measure d Sensitive to Various Transformations of Data? To answer the first question, the sensitivity of the d values to the five independent variables of the simulation was examined using a standard ANOVA. The ratio of the two skewness scores, d, was used as the dependent variable. Results are summarized in Table 1. The overall multiple R was Table 1: 5 (Distribution Shape; ITRANS1) x 5 (Distribution Shape of second Random Variate; ITRANS) x 4 (Correlation Levels; RHO) x 6 (Sample Sizes;N) ANOVA with d as Dependent Measure and Repetitions (REP) as Covariate Source Type III SS df Mean Squares F-ratio p-value ITRANS1 5.63E E ITRANS 1.586E RHO N 1.938E ITRANS1*ITRANS 3.075E ITRANS1*RHO ITRANS1*N 1.63E ITRANS*RHO ITRANS*N RHO*N ITRANS1*ITRANS*RHO ITRANS1*ITRANS*N ITRANS1*RHO*N ITRANS*RHO*N ITRANS1*ITRANS*RHO*N REP Error The results in Table 1 show that, with the exception of the repetition variable, all effects are significant. By far the strongest effect was that of Shape of Distribution (ITRANS1). Figure 1 displays the distribution of d, by Shape of Distribution.

11 simulation results ; p. 11 Figure 1: Distribution of d for (1) Normal, () Uniform, (3), Inverse-Laplace Transformed, (4) Cube Root-transformed, and (5) Logarithmic Distributions Figure 1 suggests that d (labeled D in Figure 1) resembles a leptokurtic normal distribution for normal random variates (Panel 1). In contrast, for uniform distributions, d varies within a wide range (Panel ). As expected, many of the d scores are negative, thus reflecting the platykurtic shape of the distribution. The inverse Laplace transformation creates distributions that are skewed (Panel 3). In addition, d is mostly negative and shows a wider range than for the normal distribution. This applies accordingly to the distribution that is created by the cube root transformation (Panel 4). This distribution is asymmetric and leptokurtic. In contrast, the logarithmic transformation results in an asymmetric and platykurtic distribution of d. The second transformation variable (ITRANS) has the second strongest effect. It results in similar distributions as the first. Therefore, it is not illustrated here.

12 simulation results ; p. 1 The third strongest effect is that of the main effect of N. It is not depicted here. N has the effect that the range of d values increases, and the distribution becomes more asymmetric. The interaction of Shape of Distribution with N shows that the standard deviation of the distribution of d increases in particular for normally distributed data (also not illustrated here). All of the other effects, although significant, explain only small portions of the variance of d. Therefore, they will not be discussed in detail in the present context. More interesting is the effect of the interaction of the two transformation variables. In the following five figures (Figure through 6), we demonstrate the effect that the five distribution shapes have on the distribution of d, given a particular distribution. We begin with the distribution of d when the first of the two distributions (numerator of the d ratio) is normal (Figure ). Figure : Distribution of d When the First Variable Is Normal and the Second Is (1) Normal, () Uniform, (3), Inverse Laplace-transformed, (4) Cube Root-transformed, and (5) Logarithmic Figure shows that the distribution of d is leptokurtic normal when the first of the two variables is normal. In the ideal case, d will, under each of these conditions, be zero because normal distribution show no skew. Random variations yield normally distributed scores for d.

13 simulation results ; p. 13 Figure 3: Distribution of d When the First Variable Is Uniform and the Second Is (1) Normal, () Uniform, (3), Inverse Laplace-transformed, (4) Cube Root-transformed, and (5) Logarithmic Figure 3 shows that, when the first variable is uniformly distributed, d is skewed with a bias for negative scores, unless the second variable is normally distributed (Panel 1) or cube root-transformed. In these cases, the distribution of d is asymmetric but with a mean of zero. Figure 4 shows that d assumes extreme scores, most of them negative, when the first distribution is inverse Laplace-transformed (cf. Panel 3 in Figure 1). This applies even when the second variable is normally distributed (Panel 1).

14 simulation results ; p. 14 Figure 4: Distribution of d When the First Variable Is Inverse Laplace-transformed and the Second Is (1) Normal, () Uniform, (3), Inverse Laplace-transformed, (4) Cube Roottransformed, and (5) Logarithmic Figure 5: Distribution of d When the First Variable Is Cube Root-transformed and the Second Is (1) Normal, () Uniform, (3), Inverse Laplace-transformed, (4) Cube Roottransformed, and (5) Logarithmic

15 simulation results ; p. 15 Figure 5 shows that, when the first variable is cube root-transformed, d can assume a wide range of scores, in particular when the second variable is uniformly or inverse Laplace-transformed. The range is much narrower when the second variable is normally distributed. The range is narrow and most scores are positive when the second variable is also cube root-transformed. Figure 6: Distribution of d When the First Variable Is Logarithmic and the Second Is (1) Normal, () Uniform, (3), Inverse Laplace-transformed, (4) Cube Root-transformed, and (5) Logarithmic Figure 6 shows that d can assume a wide range of mostly negative scores when the first variable was subjected to a logarithmic transformation. The only exception is the case in which the second variable was cube root-transformed (Panel 4). In this case, the range is narrow, the distribution is asymmetric, and the scores vary about zero..1. á Curves of d In this section, we ask what proportion of cases coefficient d identifies as extreme, that is, outside of the 5% interval. The alpha curves depicted in Figure 7 were created based on the normally distributed data (ITRANS = 1) in the simulation.

16 simulation results ; p. 16 Figure 7: Left Panel: á-curve of d, by N; Logarithmic Smoother; Right Panel: Histogram of d; Normal Smoother The left Panel of Figure 7 shows that d is very sensitive, in particular for large samples. It identifies about 5% of the cases as extreme as long as the sample size is 400 cases or less. For more than 400 cases, far more than 5% of the cases are identified as extreme. We conclude that the use of d can come with the risk of an increased á error. The right panel of 7 shows why d tends to suggest non-conservative decisions. The distribution of d is both lepto- and platykurtic. More cases appear at the extreme end of the distribution than one would expect for a (leptokurtic) normal distribution. For each of the transformations, far more cases than 5% were identified as extreme. This was as expected. For the inverse Laplace-transformed data, all of the cases were identified as extreme.. Direction of Effects - A Monte Carlo Study on the Measure K In this section, we present results from a simulation study on D Agostino s measure K of deviation from normality (see also D Agostino et al., 1990; D Agostino & Pearson, 1973; von Eye & DeShon, 008). This measure is omnibus to deviations due to either skewness or kurtosis. Specifically, the following questions are addressed:

17 1. Is the measure K sensitive to various shapes of distributions?. Is the measure K distributed as with df =? 3. What is the á curve of K ; that is, how many null hypotheses are rejected? simulation results ; p. 17 To address these questions, the same factors were varied as in the Monte Carlo study for K. In addition, the factors covered the same range of scores. However, the design for the study was different. The dependent measure was the measure K. Therefore, there was no need to include two dependent variables in the simulations. The resulting design thus had 5 (distribution shape) x 4 (correlation levels) x 6 (sample sizes) = 10 cells. Each run was repeated 500 times so that a total of 60,000 samples was available for analysis. In the following sections, we answer the three questions that were posed above...1 Is the measure K sensitive to various transformations of data? In a first analysis, the sensitivity of the K values to the three factors of the simulation was examined using ANOVA. The repetitions were taken into account as a covariate. Results are summarized in Table. Table : 5 (Distribution Shape; ITRANS) x 4 (Correlation Level; RHO) x 6 (Sample Size; N) ANOVA with K as Dependent Measure and Repetitions (REP) as Covariate Source Type III SS df Mean Squares F-ratio p-value ITRANS RHO N ITRANS1*RHO ITRANS1*N RHO*N ITRANS1*RHO*N REP Error The ANOVA table shows that, with the exception of the repetitions, each of the effects is significant. Clearly the strongest effect is observed for shape of distribution (ITRANS). Figure 8 illustrates the

18 1 effect. simulation results ; p. 18 Figure 8 shows that, for normal distributions, K approximates a -distribution with degrees of freedom (Panel 1; this issue will be taken up in more detail, below). For uniform distributions, that is, for distributions with no skew, the variation of K shows more extreme scores (Panel ). In contrast and as expected, the K for the inverse-laplace transformed distributions (Panel 3) hovered about zero, with few extreme scores. The cube root-transformed scores as well as the scores after the logarithmic transformation resulted in skewed distributions with the largest numbers of extreme scores (Panels 4 and 5, respectively 3). Figure 8: Distribution of K for (1) Normal, () Uniform, (3) Inverse-Laplace Transformed, (4) Cube Root Transformed, and (5) Logarithmic Distributions 1 To obtain a more detailed look of the distributions under the conditions of the simulation, we excluded all values x > 30 for Figures 8 and 9. The percentage of eliminated scores was 1.9. For the following Figures, all scores were used.

19 simulation results ; p. 19 Figure 9: Distribution of K by Magnitude of Correlation Figure 9 shows that, although significant, ñ hardly changes the nature of the distribution. None of the remaining significant effects is illustrated here. They reflect, for example, the anticipated increase in deviations from normality that comes with the sample size increase. Compared to the effect of shape of distribution, each of the remaining effects is rather weak. Although significant, none of these effects explained large portions of variance. Therefore, they will not be discussed in the remainder of this article.

20 simulation results ; p. 0.. Distributional Characteristics of K To answer the second question, a probability plot of the measure K was created with reference to a -distribution with df =? Figure 10 shows this plot, with box plots of the observed and expected distributions at the margins. Figure 10: Probability Plot of K with Reference to a -Distribution with df = Clearly, the approximation of the -distribution with df = is miserable. However, one has to consider that the data used for the plot in Figure 10 represent a mixture of five different distributions and four magnitudes of correlation. Therefore, and in order to do justice to the measure K, we now examine the probability plot exclusively for normally distributed. Figure 11 displays this plot, with box plots of the observed and expected distributions at the margins.

21 simulation results ; p. 1 Figure 11: Probability Plot of K with Reference to a -Distribution with df =, for Normally Distributed Data Figure 11 shows that the distribution of K is reasonably close to a -distribution with df =. The measure seems slightly biased such that (1) more cases with values close to zero are observed than expected based on the -distribution with df =, and () fewer cases with large scores are observed than expected. The correlation structure does not make a difference that would be visible in the printed graphs. We conclude that K can be used to examine the direction of effects. The characteristics of K in the context of hypothesis testing are discussed in the next section...3 K as a Significance Test In a first step towards answering the question concerning the characteristics of K in the context of hypothesis testing, we create curves representing the proportion of rejected null hypotheses for varying sample sizes for each of the examined distributions. The null hypothesis for the K test is that the sample is drawn from an underlying distribution that is normal. Therefore, the rejection rate curves for samples from a normal distribution reflect the ability of the test to maintain the stated á

22 simulation results ; p. level. For all other distributions, the rejection rate reflects the power of the test to detect samples drawn from non-normal distributions. Figure 1 displays the rejection rates of the K test based on 5000 replications for samples of size 0 to 500 from the normal distribution, the uniform distribution, inverse-laplace transformed, cube root transformed, and logarithmic distributions. Figure 1: Rejection Rate Curves for K for (1) Normal, () Uniform, (3), Inverse-Laplace Transformed, (4) Cube Root-transformed, and (5) Logarithmic Distributions As can be seen in Figure 1, the K test maintains the desired á level of.05 across the range of samples sizes investigated for the normal distribution. It does appear that there is a very slight elevation in rejection rates for small sample sizes. With respect to power, the K test has low to

23 simulation results ; p. 3 moderate power to detect deviations from normality consistent with the uniform and cubed-root distributions with sample sizes smaller than 60. With sample sizes of approximately 80 all distributions are detected at acceptable levels (e.g., rejection rates greater than 80%) and the power to detect deviations from normality in all nonnormal distributions studies is near 100% with samples larger than Direction of Effects - A Monte Carlo Study on the Ratio of Two Kurtosis Scores The Monte Carlo study for î was designed to address the following questions: 1. Is the measure î sensitive to various shapes of distributions?. What is the á curve of î; that is, how many null hypotheses are rejected? The design that was used to study the behavior of î had 5 (distribution shape) x 5 (distribution shape of the second random variate) x 4 (correlation levels) x 6 (sample sizes) = 600 cells. Each run was repeated 00 times so that a total of 10,000 samples was available for analysis. The repetitions were used as a covariate. In the following sections, we answer the two questions that were posed above. We begin with Question 1. That is, we ask whether the measure î is sensitive to various transformations of data? To answer this question, the sensitivity of the î values to the five independent variables of the simulation was examined using a standard ANOVA. The ratio of the two kurtosis scores, î, was used as the dependent variable. Results are summarized in Table 3. The overall multiple R was

24 simulation results ; p. 4 Table 3: 5 (Distribution Shape; ITRANS1) x 5 (Distribution Shape of second Random Variate; ITRANS) x 4 (Correlation Levels; RHO) x 6 (Sample Sizes;N) ANOVA with î as Dependent Measure and Repetitions (REP) as Covariate Source Type III SS df Mean Squares F-ratio p-value ITRANS ITRANS RHO N ITRANS*ITRANS ITRANS*RHO ITRANS*N ITRANS*RHO ITRANS*N RHO*N ITRANS*ITRANS*RHO ITRANS*ITRANS*N ITRANS*RHO*N ITRANS*RHO*N ITRANS*ITRANS*RHO*N REP Error The results in Table 3 show that all effects are significant. In contrast to the ANOVAs of d and K, however, the strongest effect on kurtosis was that of Correlation (RHO). Figure 13 displays the distribution of î, by RHO.

25 simulation results ; p. 5 Figure 13: Distribution of î (Labeled as RATIOKURT), by Magnitude of Correlation Figure 13 suggests that the spread of î, the ratio of the two kurtosis measures, increases as the correlation between the two variables increases. Note that, for ñ = 0 and ignoring the outliers, the distribution of î is almost symmetric. This effect is dependent on both the sample size and the shape of the distribution. Considering only the normally distributed data (ITRANS = 1 and ITRANS = 1) and ñ = 0, we find not only that the spread of ñ shrinks as the sample size increases. We also find that the distribution of î becomes more symmetric as the sample size increases. This is illustrated in Figure 14.

26 simulation results ; p. 6 Figure 14: Distribution of î, by Sample Size, for Two Normally Distributed Variates The second strongest effect was that of the sample size, N. It is not depicted here (see, however, Figure 14). The effect is that the spread of î increases with the sample size. Figure 15: Distribution of î for (1) Normal, () Uniform, (3) Inverse-Laplace Transformed, (4)

27 Cube Root Transformed, and (5) Logarithmic Distributions simulation results ; p. 7 The third largest effect was found for distribution shape. Figure 15 displays the distribution of î, by shape of distribution (ITRANS). It shows that the spread of î varies with the shape of the distribution. Specifically, î has the widest spread for the cube root transformed (Panel 4) and the uniform distributions (Panel ). The smallest spread was observed for the normal distribution (Panel 1). None of the remaining significant main effects and interactions will be depicted. Although significant, the effects make only little contributions to the magnitude (or distribution) of î. 3. Comparing d, î, and K In this section, we compare the three measures d, î, and K. Considering that d and î are based on different types of deviation from normality, skewness and kurtosis, and K is omnibus to both, one would expect the correlation between d and î to be very low, and the correlations of either with K higher. Table 4 shows the correlation table for these three measures for all 10,000 simulation runs in Section.3. Table 4: Correlations Among d, î, and K, Aggregated over All Conditions Varied in Section 3. Measure d î î.18 K The correlation results in Table 4 disconfirm our expectations in the sense that the correlation between î and K is smaller than the correlation between d and î. However, one has to consider that

28 simulation results ; p. 8 4 out of the five distributions that went into the data pool were non-normal. Therefore, we now ask whether this correlation pattern is dependent upon the shape of the simulated distributions. Table 5 displays the correlation pattern, by distribution. Table 5: Correlations Among d, î, and K, by Shape of Distribution (Each Correlation is Based on 4,000 Runs) Measure d î Normal Distribution î -.07 K Uniform Distribution î.19 K Inverse Laplace-Transformed î.3 K Cube-Root Transformed î.08 K Logarithmic î.6 K Clearly, the correlations in Table 5 match our expectations, in particular for the normal (and the cube root-transformed) distribution. In addition, these correlations reflect the characteristics of the

29 simulation results ; p. 9 generated distributions. 4. Discussion The development of methods for the determination of the direction of effects using the means of statistics has, recently, taken an interesting turn (for recent theoretical developments in the discussion of causality, see Lynd-Stevenson, 007). Farthest developed are the approaches that resort to using third and fourth order moments. In the present research, we report results from Monte Carlo simulation studies on the behavior of the ratio, d, of two skewness measures, the ratio, î, of two kurtosis measures, and an omnibus statistic, K, of deviation from normality. The factors varied in the simulations were shape of distribution [(1) Normal, () Uniform, (3), Inverse Laplace-transformed, (4) Cube Root-transformed, and (5) Logarithmic], sample size, and correlation structure. Each of these factors had significant effects. As expected, the strongest effects came with shape of distribution. Only for the ratio î, the strongest effect was that for the correlation of the two simulated variates. The á curves that were based on the normally distributed data suggest that the omnibus measure is sensitive to both skewness and kurtosis violations of normality. In addition, it tends to suggest statistical decisions quite close to the nominal level á, and that regardless of sample size. The distributional characteristics of the ratio of two skewness measures and the ratio of two krtosis measures still need to be determined. The question as to which of the three measures discussed here, d, î, and K, can be recommended can be answered based on their characteristics. If researchers consider differences between measures of skewness, kurtosis or, in more general terms, measures of deviation from normality only if the deviation from normality is significant, K or any other omnibus test will be the measure of choice. This includes the simultaneous use of Mardia s (1970) tests, even von Eye and Gardiner s (004) local tests of deviation from normality. If, however, the focus is on the effects of an explanatory variable as they reflect in particular in either skewness or kurtosis, d and î are the measures of choice.

30 simulation results ; p. 30 A most important issue to discuss is the status of the methods presented here in the process of determining direction of effects. From our perspective, there can be no doubt that theory must exist before variables are subjected to the procedures discussed in the literature. Clearly, it makes no sense to conclude that just any two variables that differ in skewness are in a causal relationship with each other. The status of the methods discussed here is, therefore, that of tools that can be used to build theory. These methods do not replace conceptual and theoretical thinking.

31 simulation results ; p. 31 References D Agostino, R.B. (1971). Transformation to normality of the null distribution of g 1. Biometrika, 57, D Agostino, R.B., Belanger, A., & D Agostino, R.B. Jr. (1990). A suggestion for using powerful and informative tests of normality. The American Statistician, 44, D Agostino, R.B., & Pearson, E.S. (1973). Testing departures from normality. 1. Fuller empirical results for the distribution of b and b 1. Biometrika, 60, Dodge, Y., & Rousson, V. (000). Direction dependence in a regression line. Communications in Statistics - Theory and Methods, 3, Dodge, Y., & Rousson, V. (001). On asymmetric properties of the correlation coefficient in the regression setting. The American Statistician, 55, Everitt, B.S. (1998). The Cambridge dictionary of statistics. Cambridge, UK: Cambridge University Press. th Hogg, R.V., & Tanis, E.A. (1993). Probability and statistical inference (4 ed.). New York: Macmillan. Falk, R., and Well, A. D. (1996). Correlation as Probability of Common Descent. Multivariate Behavioral Research, 31, Lynd-Stevenson, R.M. (007). Concerns regarding the traditional paradigm for causal research: The unified paradigm and causal research in scientific Psychology. Review of General Psychology, 11, Mardia, K.V. (1970). Measures of multivariate skewness and kurtosis with applications. Biometrika, 57, Mardia, K.V. (1980). Tests of univariate and mutivariate normality. In P.R. Krishnaiah (ed.), Handbook of statistics (vol. 1; pp 79-30). Amsterdam: North Holland. Micceri, T. (1989). The unicorn, the normal curve, and other improbable creatures. Psychological Bulletin, 105,

32 simulation results ; p. 3 Muddapur, M.V. (003). On directional dependence in a regression line. Communications in Statistics. Theory and Methods, 3, Pearson, K. (1895). Contributions to the mathematical theory of evolution. Philosophical Transactions of the Royal Society of London, 91, Press, W.H., Flannery, B.P., Teukolsky, S.A., & Vetterling, W.T. (1989). Numerical recipes. The art of scientific computing (FORTRAN version). Cambridge: Cambridge University Press. Rodgers, J.L., & Nicewander, W.A. (1988). Thirteen ways to look at the correlation coefficient. The American Statistician, 4, Rovine, M.J., & von Eye, A. (1997). A 14th way to look at a correlation coefficient: Correlation as the proportion of matches. The American Statistician, 51, Shimizu, S., Hoyer, P.O., Hyvärinen, A., & Kerminen, A. (006). A linear non-gaussian acyclic model for causal discovery. Journal of Machine Learning Research, 7, Shimizu, S., & Kano, Y. (006). Use of non-normality in structural equation modeling: Application to direction of causation. Journal of Statistical Planning and Inference, 136 (in press). Sicking, F.J. (1994). GASDEV. Sungur, E.A. (005). A note on directional dependence in regression setting. Communications in Statistics. Theory and Methods, 34, Tufte, E.R. (006). The cognitive style of PowerPoint: Pitching out corrupts within. Cheshire, CT: Graphics Press. von Eye, A. (005). Comparing tests of multinormality - A Monte Carlo study. InterStat, (October, 005). von Eye, A. (006). Comparing Tests of Multinormality under Sparse Data Conditions - a Monte Carlo Study. InterStat, (May, 006). von Eye, A., & DeShon, R.P. (008). Moment Approaches to Directional Dependency. Under editorial review. von Eye, A., & Gardiner, J.C. (004). Locating deviations from multivariate normality.

33 simulation results ; p. 33 Understanding Statistics, 3, von Eye, A., & von Eye, M. (005). Can one use Cohen s kappa to examine disagreement? Methodology, 1, von Eye, A.,von Eye, M., & Bogat, G.A. (006). Multinormality and symmetry: A comparison of two statistical tests. Psychology Science, 48, Walwyn, R. (005). Moments. In B.S. Everitt, & D.C. Howell (eds.), Handbook of Statistics in Social Science (pp ). Chichester, UK: John Wiley.

Comparing Tests of Multinormality under Sparse Data Conditions - a Monte Carlo Study

Comparing Tests of Multinormality under Sparse Data Conditions - a Monte Carlo Study Comparing Tests of Multinormality under Sparse Data Conditions - a Monte Carlo Study Alexander von Eye 1 Michigan State University 1 The author is indebted to G. Anne Bogat for helpful comments on an earlier

More information

Resampling techniques to determine direction of effects in linear regression models

Resampling techniques to determine direction of effects in linear regression models Resampling techniques to determine direction of effects in linear regression models Wolfgang Wiedermann, Michael Hagmann, Michael Kossmeier, & Alexander von Eye University of Vienna, Department of Psychology

More information

Robust Critical Values for the Jarque-bera Test for Normality

Robust Critical Values for the Jarque-bera Test for Normality Robust Critical Values for the Jarque-bera Test for Normality PANAGIOTIS MANTALOS Jönköping International Business School Jönköping University JIBS Working Papers No. 00-8 ROBUST CRITICAL VALUES FOR THE

More information

Appendix A (Pornprasertmanit & Little, in press) Mathematical Proof

Appendix A (Pornprasertmanit & Little, in press) Mathematical Proof Appendix A (Pornprasertmanit & Little, in press) Mathematical Proof Definition We begin by defining notations that are needed for later sections. First, we define moment as the mean of a random variable

More information

Market Risk Analysis Volume I

Market Risk Analysis Volume I Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii

More information

Shape Measures based on Mean Absolute Deviation with Graphical Display

Shape Measures based on Mean Absolute Deviation with Graphical Display International Journal of Business and Statistical Analysis ISSN (2384-4663) Int. J. Bus. Stat. Ana. 1, No. 1 (July-2014) Shape Measures based on Mean Absolute Deviation with Graphical Display E.A. Habib*

More information

Chapter 7. Inferences about Population Variances

Chapter 7. Inferences about Population Variances Chapter 7. Inferences about Population Variances Introduction () The variability of a population s values is as important as the population mean. Hypothetical distribution of E. coli concentrations from

More information

starting on 5/1/1953 up until 2/1/2017.

starting on 5/1/1953 up until 2/1/2017. An Actuary s Guide to Financial Applications: Examples with EViews By William Bourgeois An actuary is a business professional who uses statistics to determine and analyze risks for companies. In this guide,

More information

2018 AAPM: Normal and non normal distributions: Why understanding distributions are important when designing experiments and analyzing data

2018 AAPM: Normal and non normal distributions: Why understanding distributions are important when designing experiments and analyzing data Statistical Failings that Keep Us All in the Dark Normal and non normal distributions: Why understanding distributions are important when designing experiments and Conflict of Interest Disclosure I have

More information

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS Melfi Alrasheedi School of Business, King Faisal University, Saudi

More information

The Two-Sample Independent Sample t Test

The Two-Sample Independent Sample t Test Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 The General Formula The Equal-n Formula 4 5 6 Independence Normality Homogeneity of Variances 7 Non-Normality Unequal

More information

GENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy

GENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy GENERATION OF STANDARD NORMAL RANDOM NUMBERS Naveen Kumar Boiroju and M. Krishna Reddy Department of Statistics, Osmania University, Hyderabad- 500 007, INDIA Email: nanibyrozu@gmail.com, reddymk54@gmail.com

More information

KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI

KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI 88 P a g e B S ( B B A ) S y l l a b u s KARACHI UNIVERSITY BUSINESS SCHOOL UNIVERSITY OF KARACHI BS (BBA) VI Course Title : STATISTICS Course Number : BA(BS) 532 Credit Hours : 03 Course 1. Statistical

More information

Asymptotic Distribution Free Interval Estimation

Asymptotic Distribution Free Interval Estimation D.L. Coffman et al.: ADF Intraclass Correlation 2008 Methodology Hogrefe Coefficient 2008; & Huber Vol. Publishers for 4(1):4 9 ICC Asymptotic Distribution Free Interval Estimation for an Intraclass Correlation

More information

ESTIMATION OF MODIFIED MEASURE OF SKEWNESS. Elsayed Ali Habib *

ESTIMATION OF MODIFIED MEASURE OF SKEWNESS. Elsayed Ali Habib * Electronic Journal of Applied Statistical Analysis EJASA, Electron. J. App. Stat. Anal. (2011), Vol. 4, Issue 1, 56 70 e-issn 2070-5948, DOI 10.1285/i20705948v4n1p56 2008 Università del Salento http://siba-ese.unile.it/index.php/ejasa/index

More information

Lecture 6: Non Normal Distributions

Lecture 6: Non Normal Distributions Lecture 6: Non Normal Distributions and their Uses in GARCH Modelling Prof. Massimo Guidolin 20192 Financial Econometrics Spring 2015 Overview Non-normalities in (standardized) residuals from asset return

More information

Lectures delivered by Prof.K.K.Achary, YRC

Lectures delivered by Prof.K.K.Achary, YRC Lectures delivered by Prof.K.K.Achary, YRC Given a data set, we say that it is symmetric about a central value if the observations are distributed symmetrically about the central value. In symmetrically

More information

Does Calendar Time Portfolio Approach Really Lack Power?

Does Calendar Time Portfolio Approach Really Lack Power? International Journal of Business and Management; Vol. 9, No. 9; 2014 ISSN 1833-3850 E-ISSN 1833-8119 Published by Canadian Center of Science and Education Does Calendar Time Portfolio Approach Really

More information

A Skewed Truncated Cauchy Logistic. Distribution and its Moments

A Skewed Truncated Cauchy Logistic. Distribution and its Moments International Mathematical Forum, Vol. 11, 2016, no. 20, 975-988 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/imf.2016.6791 A Skewed Truncated Cauchy Logistic Distribution and its Moments Zahra

More information

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions SGSB Workshop: Using Statistical Data to Make Decisions Module 2: The Logic of Statistical Inference Dr. Tom Ilvento January 2006 Dr. Mugdim Pašić Key Objectives Understand the logic of statistical inference

More information

Business Statistics 41000: Probability 3

Business Statistics 41000: Probability 3 Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404

More information

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1

Chapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and

More information

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study

On Some Test Statistics for Testing the Population Skewness and Kurtosis: An Empirical Study Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 8-26-2016 On Some Test Statistics for Testing the Population Skewness and Kurtosis:

More information

A New Multivariate Kurtosis and Its Asymptotic Distribution

A New Multivariate Kurtosis and Its Asymptotic Distribution A ew Multivariate Kurtosis and Its Asymptotic Distribution Chiaki Miyagawa 1 and Takashi Seo 1 Department of Mathematical Information Science, Graduate School of Science, Tokyo University of Science, Tokyo,

More information

Simple Descriptive Statistics

Simple Descriptive Statistics Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency

More information

Descriptive Statistics for Educational Data Analyst: A Conceptual Note

Descriptive Statistics for Educational Data Analyst: A Conceptual Note Recommended Citation: Behera, N.P., & Balan, R. T. (2016). Descriptive statistics for educational data analyst: a conceptual note. Pedagogy of Learning, 2 (3), 25-30. Descriptive Statistics for Educational

More information

Fundamentals of Statistics

Fundamentals of Statistics CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct

More information

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL Isariya Suttakulpiboon MSc in Risk Management and Insurance Georgia State University, 30303 Atlanta, Georgia Email: suttakul.i@gmail.com,

More information

2. Copula Methods Background

2. Copula Methods Background 1. Introduction Stock futures markets provide a channel for stock holders potentially transfer risks. Effectiveness of such a hedging strategy relies heavily on the accuracy of hedge ratio estimation.

More information

A New Test for Correlation on Bivariate Nonnormal Distributions

A New Test for Correlation on Bivariate Nonnormal Distributions Journal of Modern Applied Statistical Methods Volume 5 Issue Article 8 --06 A New Test for Correlation on Bivariate Nonnormal Distributions Ping Wang Great Basin College, ping.wang@gbcnv.edu Ping Sa University

More information

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means

Chapter 11: Inference for Distributions Inference for Means of a Population 11.2 Comparing Two Means Chapter 11: Inference for Distributions 11.1 Inference for Means of a Population 11.2 Comparing Two Means 1 Population Standard Deviation In the previous chapter, we computed confidence intervals and performed

More information

Simulation of probability distributions commonly used in hydrological frequency analysis

Simulation of probability distributions commonly used in hydrological frequency analysis HYDROLOGICAL PROCESSES Hydrol. Process. 2, 5 6 (27) Published online May 26 in Wiley InterScience (www.interscience.wiley.com) DOI: 2/hyp.676 Simulation of probability distributions commonly used in hydrological

More information

Power of t-test for Simple Linear Regression Model with Non-normal Error Distribution: A Quantile Function Distribution Approach

Power of t-test for Simple Linear Regression Model with Non-normal Error Distribution: A Quantile Function Distribution Approach Available Online Publications J. Sci. Res. 4 (3), 609-622 (2012) JOURNAL OF SCIENTIFIC RESEARCH www.banglajol.info/index.php/jsr of t-test for Simple Linear Regression Model with Non-normal Error Distribution:

More information

FV N = PV (1+ r) N. FV N = PVe rs * N 2011 ELAN GUIDES 3. The Future Value of a Single Cash Flow. The Present Value of a Single Cash Flow

FV N = PV (1+ r) N. FV N = PVe rs * N 2011 ELAN GUIDES 3. The Future Value of a Single Cash Flow. The Present Value of a Single Cash Flow QUANTITATIVE METHODS The Future Value of a Single Cash Flow FV N = PV (1+ r) N The Present Value of a Single Cash Flow PV = FV (1+ r) N PV Annuity Due = PVOrdinary Annuity (1 + r) FV Annuity Due = FVOrdinary

More information

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda,

MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE. Dr. Bijaya Bhusan Nanda, MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, CONTENTS What is measures of dispersion? Why measures of dispersion? How measures of dispersions are calculated? Range Quartile

More information

Some Characteristics of Data

Some Characteristics of Data Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key

More information

An Insight Into Heavy-Tailed Distribution

An Insight Into Heavy-Tailed Distribution An Insight Into Heavy-Tailed Distribution Annapurna Ravi Ferry Butar Butar ABSTRACT The heavy-tailed distribution provides a much better fit to financial data than the normal distribution. Modeling heavy-tailed

More information

chapter 2-3 Normal Positive Skewness Negative Skewness

chapter 2-3 Normal Positive Skewness Negative Skewness chapter 2-3 Testing Normality Introduction In the previous chapters we discussed a variety of descriptive statistics which assume that the data are normally distributed. This chapter focuses upon testing

More information

CABARRUS COUNTY 2008 APPRAISAL MANUAL

CABARRUS COUNTY 2008 APPRAISAL MANUAL STATISTICS AND THE APPRAISAL PROCESS PREFACE Like many of the technical aspects of appraising, such as income valuation, you have to work with and use statistics before you can really begin to understand

More information

12.1 One-Way Analysis of Variance. ANOVA - analysis of variance - used to compare the means of several populations.

12.1 One-Way Analysis of Variance. ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

I. Return Calculations (20 pts, 4 points each)

I. Return Calculations (20 pts, 4 points each) University of Washington Winter 015 Department of Economics Eric Zivot Econ 44 Midterm Exam Solutions This is a closed book and closed note exam. However, you are allowed one page of notes (8.5 by 11 or

More information

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions ELE 525: Random Processes in Information Systems Hisashi Kobayashi Department of Electrical Engineering

More information

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION International Days of Statistics and Economics, Prague, September -3, MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION Diana Bílková Abstract Using L-moments

More information

On Some Statistics for Testing the Skewness in a Population: An. Empirical Study

On Some Statistics for Testing the Skewness in a Population: An. Empirical Study Available at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 12, Issue 2 (December 2017), pp. 726-752 Applications and Applied Mathematics: An International Journal (AAM) On Some Statistics

More information

Chapter 4 Variability

Chapter 4 Variability Chapter 4 Variability PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter and Larry B. Wallnau Chapter 4 Learning Outcomes 1 2 3 4 5

More information

An Improved Version of Kurtosis Measure and Their Application in ICA

An Improved Version of Kurtosis Measure and Their Application in ICA International Journal of Wireless Communication and Information Systems (IJWCIS) Vol 1 No 1 April, 011 6 An Improved Version of Kurtosis Measure and Their Application in ICA Md. Shamim Reza 1, Mohammed

More information

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology FE670 Algorithmic Trading Strategies Lecture 4. Cross-Sectional Models and Trading Strategies Steve Yang Stevens Institute of Technology 09/26/2013 Outline 1 Cross-Sectional Methods for Evaluation of Factor

More information

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018 ` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.

More information

A Test of the Normality Assumption in the Ordered Probit Model *

A Test of the Normality Assumption in the Ordered Probit Model * A Test of the Normality Assumption in the Ordered Probit Model * Paul A. Johnson Working Paper No. 34 March 1996 * Assistant Professor, Vassar College. I thank Jahyeong Koo, Jim Ziliak and an anonymous

More information

discussion Papers Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models

discussion Papers Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models discussion Papers Discussion Paper 2007-13 March 26, 2007 Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models Christian B. Hansen Graduate School of Business at the

More information

Web Science & Technologies University of Koblenz Landau, Germany. Lecture Data Science. Statistics and Probabilities JProf. Dr.

Web Science & Technologies University of Koblenz Landau, Germany. Lecture Data Science. Statistics and Probabilities JProf. Dr. Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Statistics and Probabilities JProf. Dr. Claudia Wagner Data Science Open Position @GESIS Student Assistant Job in Data

More information

Correlation vs. Trends in Portfolio Management: A Common Misinterpretation

Correlation vs. Trends in Portfolio Management: A Common Misinterpretation Correlation vs. rends in Portfolio Management: A Common Misinterpretation Francois-Serge Lhabitant * Abstract: wo common beliefs in finance are that (i) a high positive correlation signals assets moving

More information

IOP 201-Q (Industrial Psychological Research) Tutorial 5

IOP 201-Q (Industrial Psychological Research) Tutorial 5 IOP 201-Q (Industrial Psychological Research) Tutorial 5 TRUE/FALSE [1 point each] Indicate whether the sentence or statement is true or false. 1. To establish a cause-and-effect relation between two variables,

More information

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی یادگیري ماشین توزیع هاي نمونه و تخمین نقطه اي پارامترها Sampling Distributions and Point Estimation of Parameter (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی درس هفتم 1 Outline Introduction

More information

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority Chapter 235 Analysis of 2x2 Cross-Over Designs using -ests for Non-Inferiority Introduction his procedure analyzes data from a two-treatment, two-period (2x2) cross-over design where the goal is to demonstrate

More information

St. Xavier s College Autonomous Mumbai. Syllabus For 2 nd Semester Course in Statistics (June 2015 onwards)

St. Xavier s College Autonomous Mumbai. Syllabus For 2 nd Semester Course in Statistics (June 2015 onwards) St. Xavier s College Autonomous Mumbai Syllabus For 2 nd Semester Course in Statistics (June 2015 onwards) Contents: Theory Syllabus for Courses: S.STA.2.01 Descriptive Statistics (B) S.STA.2.02 Statistical

More information

Copyrighted 2007 FINANCIAL VARIABLES EFFECT ON THE U.S. GROSS PRIVATE DOMESTIC INVESTMENT (GPDI)

Copyrighted 2007 FINANCIAL VARIABLES EFFECT ON THE U.S. GROSS PRIVATE DOMESTIC INVESTMENT (GPDI) FINANCIAL VARIABLES EFFECT ON THE U.S. GROSS PRIVATE DOMESTIC INVESTMENT (GPDI) 1959-21 Byron E. Bell Department of Mathematics, Olive-Harvey College Chicago, Illinois, 6628, USA Abstract I studied what

More information

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali Part I Descriptive Statistics 1 Introduction and Framework... 3 1.1 Population, Sample, and Observations... 3 1.2 Variables.... 4 1.2.1 Qualitative and Quantitative Variables.... 5 1.2.2 Discrete and Continuous

More information

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron Statistical Models of Stocks and Bonds Zachary D Easterling: Department of Economics The University of Akron Abstract One of the key ideas in monetary economics is that the prices of investments tend to

More information

Data Distributions and Normality

Data Distributions and Normality Data Distributions and Normality Definition (Non)Parametric Parametric statistics assume that data come from a normal distribution, and make inferences about parameters of that distribution. These statistical

More information

ECON Introductory Econometrics. Lecture 1: Introduction and Review of Statistics

ECON Introductory Econometrics. Lecture 1: Introduction and Review of Statistics ECON4150 - Introductory Econometrics Lecture 1: Introduction and Review of Statistics Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 1-2 Lecture outline 2 What is econometrics? Course

More information

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,

More information

Chapter 18: The Correlational Procedures

Chapter 18: The Correlational Procedures Introduction: In this chapter we are going to tackle about two kinds of relationship, positive relationship and negative relationship. Positive Relationship Let's say we have two values, votes and campaign

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation The likelihood and log-likelihood functions are the basis for deriving estimators for parameters, given data. While the shapes of these two functions are different, they have

More information

Establishing a framework for statistical analysis via the Generalized Linear Model

Establishing a framework for statistical analysis via the Generalized Linear Model PSY349: Lecture 1: INTRO & CORRELATION Establishing a framework for statistical analysis via the Generalized Linear Model GLM provides a unified framework that incorporates a number of statistical methods

More information

Inferences on Correlation Coefficients of Bivariate Log-normal Distributions

Inferences on Correlation Coefficients of Bivariate Log-normal Distributions Inferences on Correlation Coefficients of Bivariate Log-normal Distributions Guoyi Zhang 1 and Zhongxue Chen 2 Abstract This article considers inference on correlation coefficients of bivariate log-normal

More information

1/42. Wirtschaftsuniversität Wien, Nicola Loperfido, Urbino University

1/42. Wirtschaftsuniversität Wien, Nicola Loperfido, Urbino University Nicola Loperfido Università degli Studi di Urbino "Carlo Bo, Dipartimento di Economia, Società e Politica Via Saffi 42, Urbino (PU), ITALY e-mail: nicola.loperfido@uniurb.it 1/42 Outline Finite mixtures

More information

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION

Module Tag PSY_P2_M 7. PAPER No.2: QUANTITATIVE METHODS MODULE No.7: NORMAL DISTRIBUTION Subject Paper No and Title Module No and Title Paper No.2: QUANTITATIVE METHODS Module No.7: NORMAL DISTRIBUTION Module Tag PSY_P2_M 7 TABLE OF CONTENTS 1. Learning Outcomes 2. Introduction 3. Properties

More information

Prerequisites for modeling price and return data series for the Bucharest Stock Exchange

Prerequisites for modeling price and return data series for the Bucharest Stock Exchange Theoretical and Applied Economics Volume XX (2013), No. 11(588), pp. 117-126 Prerequisites for modeling price and return data series for the Bucharest Stock Exchange Andrei TINCA The Bucharest University

More information

CHAPTER II LITERATURE STUDY

CHAPTER II LITERATURE STUDY CHAPTER II LITERATURE STUDY 2.1. Risk Management Monetary crisis that strike Indonesia during 1998 and 1999 has caused bad impact to numerous government s and commercial s bank. Most of those banks eventually

More information

Analysis of Messy Data (Outliers etc.)

Analysis of Messy Data (Outliers etc.) Analysis of Messy Data (Outliers etc.) Saif Shahin The University of Texas at Austin Entry for International Encyclopedia of Communication Research Methods Saif Shahin School of Journalism The University

More information

Empirical tests of directional dependence

Empirical tests of directional dependence Empirical tests of directional dependence Felix Thoemmes, Sarah Moore, & Marina Yamasaki Cornell University Directional dependence Dodge & Rousson (2000, 2001) Attempt at devising a statistical tests that

More information

Window Width Selection for L 2 Adjusted Quantile Regression

Window Width Selection for L 2 Adjusted Quantile Regression Window Width Selection for L 2 Adjusted Quantile Regression Yoonsuh Jung, The Ohio State University Steven N. MacEachern, The Ohio State University Yoonkyung Lee, The Ohio State University Technical Report

More information

Asymmetric Price Transmission: A Copula Approach

Asymmetric Price Transmission: A Copula Approach Asymmetric Price Transmission: A Copula Approach Feng Qiu University of Alberta Barry Goodwin North Carolina State University August, 212 Prepared for the AAEA meeting in Seattle Outline Asymmetric price

More information

Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions

Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions Technical Note: An Improved Range Chart for Normal and Long-Tailed Symmetrical Distributions Pandu Tadikamalla, 1 Mihai Banciu, 1 Dana Popescu 2 1 Joseph M. Katz Graduate School of Business, University

More information

Some developments about a new nonparametric test based on Gini s mean difference

Some developments about a new nonparametric test based on Gini s mean difference Some developments about a new nonparametric test based on Gini s mean difference Claudio Giovanni Borroni and Manuela Cazzaro Dipartimento di Metodi Quantitativi per le Scienze Economiche ed Aziendali

More information

On the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal

On the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal The Korean Communications in Statistics Vol. 13 No. 2, 2006, pp. 255-266 On the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal Hea-Jung Kim 1) Abstract This paper

More information

14.1 Moments of a Distribution: Mean, Variance, Skewness, and So Forth. 604 Chapter 14. Statistical Description of Data

14.1 Moments of a Distribution: Mean, Variance, Skewness, and So Forth. 604 Chapter 14. Statistical Description of Data 604 Chapter 14. Statistical Description of Data In the other category, model-dependent statistics, we lump the whole subject of fitting data to a theory, parameter estimation, least-squares fits, and so

More information

Contents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii)

Contents. An Overview of Statistical Applications CHAPTER 1. Contents (ix) Preface... (vii) Contents (ix) Contents Preface... (vii) CHAPTER 1 An Overview of Statistical Applications 1.1 Introduction... 1 1. Probability Functions and Statistics... 1..1 Discrete versus Continuous Functions... 1..

More information

ELEMENTS OF MONTE CARLO SIMULATION

ELEMENTS OF MONTE CARLO SIMULATION APPENDIX B ELEMENTS OF MONTE CARLO SIMULATION B. GENERAL CONCEPT The basic idea of Monte Carlo simulation is to create a series of experimental samples using a random number sequence. According to the

More information

The Great Moderation Flattens Fat Tails: Disappearing Leptokurtosis

The Great Moderation Flattens Fat Tails: Disappearing Leptokurtosis The Great Moderation Flattens Fat Tails: Disappearing Leptokurtosis WenShwo Fang Department of Economics Feng Chia University 100 WenHwa Road, Taichung, TAIWAN Stephen M. Miller* College of Business University

More information

Fitting financial time series returns distributions: a mixture normality approach

Fitting financial time series returns distributions: a mixture normality approach Fitting financial time series returns distributions: a mixture normality approach Riccardo Bramante and Diego Zappa * Abstract Value at Risk has emerged as a useful tool to risk management. A relevant

More information

8.1 Estimation of the Mean and Proportion

8.1 Estimation of the Mean and Proportion 8.1 Estimation of the Mean and Proportion Statistical inference enables us to make judgments about a population on the basis of sample information. The mean, standard deviation, and proportions of a population

More information

2 DESCRIPTIVE STATISTICS

2 DESCRIPTIVE STATISTICS Chapter 2 Descriptive Statistics 47 2 DESCRIPTIVE STATISTICS Figure 2.1 When you have large amounts of data, you will need to organize it in a way that makes sense. These ballots from an election are rolled

More information

Statistical Evidence and Inference

Statistical Evidence and Inference Statistical Evidence and Inference Basic Methods of Analysis Understanding the methods used by economists requires some basic terminology regarding the distribution of random variables. The mean of a distribution

More information

Impact of Unemployment and GDP on Inflation: Imperial study of Pakistan s Economy

Impact of Unemployment and GDP on Inflation: Imperial study of Pakistan s Economy International Journal of Current Research in Multidisciplinary (IJCRM) ISSN: 2456-0979 Vol. 2, No. 6, (July 17), pp. 01-10 Impact of Unemployment and GDP on Inflation: Imperial study of Pakistan s Economy

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

Moments and Measures of Skewness and Kurtosis

Moments and Measures of Skewness and Kurtosis Moments and Measures of Skewness and Kurtosis Moments The term moment has been taken from physics. The term moment in statistical use is analogous to moments of forces in physics. In statistics the values

More information

Graphic-1: Market-Regimes with 4 states

Graphic-1: Market-Regimes with 4 states The Identification of Market-Regimes with a Hidden-Markov Model by Dr. Chrilly Donninger Chief Scientist, Sibyl-Project Sibyl-Working-Paper, June 2012 http://www.godotfinance.com/ Financial assets follow

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

NCSS Statistical Software. Reference Intervals

NCSS Statistical Software. Reference Intervals Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and

More information

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality Point Estimation Some General Concepts of Point Estimation Statistical inference = conclusions about parameters Parameters == population characteristics A point estimate of a parameter is a value (based

More information

A Robust Test for Normality

A Robust Test for Normality A Robust Test for Normality Liangjun Su Guanghua School of Management, Peking University Ye Chen Guanghua School of Management, Peking University Halbert White Department of Economics, UCSD March 11, 2006

More information

Model Construction & Forecast Based Portfolio Allocation:

Model Construction & Forecast Based Portfolio Allocation: QBUS6830 Financial Time Series and Forecasting Model Construction & Forecast Based Portfolio Allocation: Is Quantitative Method Worth It? Members: Bowei Li (303083) Wenjian Xu (308077237) Xiaoyun Lu (3295347)

More information

Keywords coefficient omega, reliability, Likert-type ítems.

Keywords coefficient omega, reliability, Likert-type ítems. ASYMPTOTICALLY DISTRIBUTION FREE (ADF) INTERVAL ESTIMATION OF COEFFICIENT ALPHA IE Working Paper WP06-4 05-1-006 Alberto Maydeu Olivares Donna L. Coffman Instituto de Empresa The Methodology Center Marketing

More information

An Examination of the Predictive Abilities of Economic Derivative Markets. Jennifer McCabe

An Examination of the Predictive Abilities of Economic Derivative Markets. Jennifer McCabe An Examination of the Predictive Abilities of Economic Derivative Markets Jennifer McCabe The Leonard N. Stern School of Business Glucksman Institute for Research in Securities Markets Faculty Advisor:

More information

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical

More information

On Using Asymptotic Critical Values in Testing for Multivariate Normality

On Using Asymptotic Critical Values in Testing for Multivariate Normality On Using Asymptotic Critical Values in Testing for Multivariate Normality Christopher J. Mecklin 1 and Daniel J. Mundfrom 2 1 Department of Mathematics and Statistics,Murray State University; 2 Department

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method Meng-Jie Lu 1 / Wei-Hua Zhong 1 / Yu-Xiu Liu 1 / Hua-Zhang Miao 1 / Yong-Chang Li 1 / Mu-Huo Ji 2 Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method Abstract:

More information