Introduction to R (2)
|
|
- Warren Cain
- 5 years ago
- Views:
Transcription
1 Introduction to R (2) Boxplots Boxplots are highly efficient tools for the representation of the data distributions. The five number summary can be located in boxplots. Additionally, we can distinguish outliers using boxplots. Remember the three distributions in the previous handout (figures 6, 7 and 8). Note that these distributions are symmetric, left skewed and right skewed respectively. Here is how I created figure 1 below: > par(mfrow=c(3,2)) > hist(n1) > boxplot(n1) > hist(n3) > boxplot(n3) > hist(n5) > boxplot(n5) Note that the command par(mf row) creates a 3 by 2 grid in the graphic area. The side-by-side boxplots are very helpful in comparing two or more distributions. For example figure 2 shows a side-by-side boxplots for the two skewed distributions of figure 1. > boxplot(n3,n5) 1
2 Histogram of n n1 Histogram of n n3 Histogram of n n5 Figure 1. Histograms along with boxplots for the distributions of the previous handout. 2
3 Figure 2. Side-by-side boxplots for the two skewed-distributions of figure 1. 3
4 Linear Transformations and Their Effect on x and s. To experiment with the idea of linear transformations, let s assume that we have the following sample of observations. > test1 [1] [10] [19] [28] [37] [46] [55] [64] [73] [82] [91] [100] the mean and the standard deviation of these number are: > mean(test1) [1] > sd(test1) [1] Note also that the distribution of these numbers is fairly symmetric as shown in figure 3. Let us perform the following simple linear transformation on these numbers: Y =3+2 X (1) 4
5 Histogram of test test1 Figure 3. Histogram of the data set test1. 5
6 Here is the code: > test2<-2*test1+3 > mean(test2) [1] > sd(test2) [1] > hist(test2) Note that the shape of the distribution remains intact yet the linear transformation affects the mean and the standard deviation: > *2 + 3 [1] > *2 [1]
7 Histogram of test test2 Figure 4. Histogram of the linearly transformed data set or test2. 7
8 Verification of the 68% - 95% % Rule To verify the rule, first I generate 100,000 observations from a normal distribution whose mean is 0 and whose standard deviation is 1. Next, I count the number of simulations whose values are between -1 to 1, -2 to 2 and -3 to 3 respectively. > test3<-rnorm(100000,0,1) > mean(test3) [1] > sd(test3) [1] > length(test3[test3>-1 & test3<1]) [1] > length(test3[test3>-2 & test3<2]) [1] > length(test3[test3>-3 & test3<3]) [1] Standardization By subtracting data points from their mean and dividing them by their standard deviation, we will generate a new data set whose mean is 0 and whose standard deviation is 1. Here is an example: > test<-c(1,2,3,4,5) > mean(test) [1] 3 > sd(test) [1] > mean(test-mean(test)) [1] 0 8
9 Histogram of test test3 Figure 5. Histogram of the 100,000 simulated observations from a Normal(0,1) distribution. 9
10 > sd(test/sd(test)) [1] 1 > z.test<-(test-mean(test))/sd(test) > z.test [1] > mean(z.test) [1] 0 > sd(z.test) [1] 1 10
11 Areas under the Normal Curve We can re-produce the z table using R. The two commands of interest are qnorm and pnorm to calculate the quantiles and the probabilities respectively. For example to find the area associated with Z< 1.3, we read from the z table. We can do the same using the pnorm command: > pnorm(-1.3,0,1) [1] Note that the syntax is such that we input the cut-off value of interest, followed by the mean and the standard deviation of the normal distribution. In that context, we are not bound to the standard normal distribution, but we can calculate the areas under any normal distribution. Conversely, we can calculate the cut-off value below which the area of the standard normal is say We should get 1.3 back. This is done using the qnorm command: > qnorm( ,0,1) [1] -1.3 Normal Quantile Plots or QQ- Assessing Normality: plots These are the most powerful tools for assessing the normality of a data set. The idea is relatively simple. We want to know whether the empirical quantiles of our data will match with the theoretical quantiles of a standard normal. This can be achieved by plotting the data distributions against their associated empirical and theoretical quantiles. Data will form a 45-degree line if normality holds. R does provides QQ-plots by the qqnorm command. Here is the QQ-plot for the data set test3 (figure 1): > qqnorm(test3) In figure 2, we demonstrate the qqplot of a symmetric, a left-skewed and a right-skewed distribution. The code below, shows you how I generated this figure. > par(mfrow=c(3,2)) > hist(n1) > qqnorm(n1) > hist(n3) 11
12 0 2 4 Sample Quantiles 2 4 Normal Q Q Plot Theoretical Quantiles Figure 1. The Normal Quantile plot for test
13 > qqnorm(n3) > hist(n5) > qqnorm(n5) 13
14 Histogram of n1 Normal Q Q Plot Sample Quantiles n Theoretical Quantiles Histogram of n3 Normal Q Q Plot Sample Quantiles n Theoretical Quantiles Histogram of n5 Normal Q Q Plot Sample Quantiles n Theoretical Quantiles Figure 2. The Normal Quantile plots for symmetric and asymmetric distributions. 14
15 Data Relationships Importing Text-files > grades<-read.table(" > grades Verbal Math GPA > dim(grades) [1] > grades[,1] [1] [19] [37] [55]
16 [73] [91] > grades[,"verbal"] [1] [19] [37] [55] [73] [91] > hist(grades[,"verbal"]) > summary(grades[,1]) Min. 1st Qu. Median Mean 3rd Qu. Max > verbal<-grades[,1] > stem(verbal) The decimal point is 2 digit(s) to the right of the
17 Histogram of grades[, 1] grades[, 1] Figure 3. The histogram of the first column of grades generated by R Histogram of grades[, "Verbal"] grades[, "Verbal"] Figure 4. The histogram of the first column of grades (called Verbal ) 17
18 grades[, 3] grades[, 1] Figure 5. The scatterplot for Verbal versus Math Scatterplots and Pearson s Correlation To create scatterplots, the command is simply plot. > plot(grades[,1],grades[,2]) To calculate the correlation coefficient between any two random variables, we can use the cor command: > cor(grades[,1],grades[,2]) [1] > cor(grades[,1],grades[,3]) [1] > cor(grades[,2],grades[,3]) [1]
19 grades[, 2] grades[, 1] Figure 6. The scatterplot for Verbal versus GPA grades[, 3] grades[, 2] Figure 7. The scatterplot for Math versus GPA 19
Introduction to Computational Finance and Financial Econometrics Descriptive Statistics
You can t see this text! Introduction to Computational Finance and Financial Econometrics Descriptive Statistics Eric Zivot Summer 2015 Eric Zivot (Copyright 2015) Descriptive Statistics 1 / 28 Outline
More informationChapter 3. Numerical Descriptive Measures. Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1
Chapter 3 Numerical Descriptive Measures Copyright 2016 Pearson Education, Ltd. Chapter 3, Slide 1 Objectives In this chapter, you learn to: Describe the properties of central tendency, variation, and
More informationYork University MATH 1131 (FALL 2005): Introduction to Statistics Mid Term Test Friday, Oct 28, 2005
York University MATH 1131 (FALL 2005): Introduction to Statistics Mid Term Test Friday, Oct 28, 2005 Last Name: Given Names: Student Number: Signature : DO NOT WRITE IN THIS AREA Read the following instructions
More informationInverse Normal Distribution and Approximation to Binomial
Inverse Normal Distribution and Approximation to Binomial Section 5.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 16-3339 Cathy Poliak,
More informationExploring Data and Graphics
Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data
More informationThe Normal Distribution
Stat 6 Introduction to Business Statistics I Spring 009 Professor: Dr. Petrutza Caragea Section A Tuesdays and Thursdays 9:300:50 a.m. Chapter, Section.3 The Normal Distribution Density Curves So far we
More informationSTATISTICAL DISTRIBUTIONS AND THE CALCULATOR
STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either
More informationStat 101 Exam 1 - Embers Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.
More informationSome estimates of the height of the podium
Some estimates of the height of the podium 24 36 40 40 40 41 42 44 46 48 50 53 65 98 1 5 number summary Inter quartile range (IQR) range = max min 2 1.5 IQR outlier rule 3 make a boxplot 24 36 40 40 40
More informationSTAT 113 Variability
STAT 113 Variability Colin Reimer Dawson Oberlin College September 14, 2017 1 / 48 Outline Last Time: Shape and Center Variability Boxplots and the IQR Variance and Standard Deviaton Transformations 2
More informationWe will also use this topic to help you see how the standard deviation might be useful for distributions which are normally distributed.
We will discuss the normal distribution in greater detail in our unit on probability. However, as it is often of use to use exploratory data analysis to determine if the sample seems reasonably normally
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 10 (MWF) Checking for normality of the data using the QQplot Suhasini Subba Rao Checking for
More informationAssignment 3-Solutions
Assignment 3-Solutions Question 1. - Joint Probability Mass Function Consider the function x y 1.0 1.0 1.5 2.0 1.5 3.0 2.5 4.0 3.0 4.0 Determine the following: (a) Show that If is a valid probability mass
More informationMath 140 Introductory Statistics. First midterm September
Math 140 Introductory Statistics First midterm September 23 2010 Box Plots Graphical display of 5 number summary Q1, Q2 (median), Q3, max, min Outliers If a value is more than 1.5 times the IQR from the
More informationEmpirical Rule (P148)
Interpreting the Standard Deviation Numerical Descriptive Measures for Quantitative data III Dr. Tom Ilvento FREC 408 We can use the standard deviation to express the proportion of cases that might fall
More informationHonor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination.
Name: OUTLINE SOLUTIONS University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas.
More informationFirst Midterm Examination Econ 103, Statistics for Economists February 16th, 2016
First Midterm Examination Econ 103, Statistics for Economists February 16th, 2016 You will have 70 minutes to complete this exam. Graphing calculators, notes, and textbooks are not permitted. I pledge
More informationMath 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment
Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class
More informationDot Plot: A graph for displaying a set of data. Each numerical value is represented by a dot placed above a horizontal number line.
Introduction We continue our study of descriptive statistics with measures of dispersion, such as dot plots, stem and leaf displays, quartiles, percentiles, and box plots. Dot plots, a stem-and-leaf display,
More informationThe Normal Probability Distribution
1 The Normal Probability Distribution Key Definitions Probability Density Function: An equation used to compute probabilities for continuous random variables where the output value is greater than zero
More informationNOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS
NOTES TO CONSIDER BEFORE ATTEMPTING EX 2C BOX PLOTS A box plot is a pictorial representation of the data and can be used to get a good idea and a clear picture about the distribution of the data. It shows
More informationChapter 3. Descriptive Measures. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1
Chapter 3 Descriptive Measures Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Descriptive Measures Mean, Median and Mode Copyright 2016, 2012, 2008 Pearson Education, Inc.
More informationBasic Procedure for Histograms
Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that
More informationChapter 4. The Normal Distribution
Chapter 4 The Normal Distribution 1 Chapter 4 Overview Introduction 4-1 Normal Distributions 4-2 Applications of the Normal Distribution 4-3 The Central Limit Theorem 4-4 The Normal Approximation to the
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 10 (MWF) Checking for normality of the data using the QQplot Suhasini Subba Rao Review of previous
More informationIt is common in the field of mathematics, for example, geometry, to have theorems or postulates
CHAPTER 5 POPULATION DISTRIBUTIONS It is common in the field of mathematics, for example, geometry, to have theorems or postulates that establish guiding principles for understanding analysis of data.
More informationApplication of the Bootstrap Estimating a Population Mean
Application of the Bootstrap Estimating a Population Mean Movie Average Shot Lengths Sources: Barry Sands Average Shot Length Movie Database L. Chihara and T. Hesterberg (2011). Mathematical Statistics
More informationStandard Normal Calculations
Standard Normal Calculations Section 4.3 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 10-2311 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationSOLUTIONS TO THE LAB 1 ASSIGNMENT
SOLUTIONS TO THE LAB 1 ASSIGNMENT Question 1 Excel produces the following histogram of pull strengths for the 100 resistors: 2 20 Histogram of Pull Strengths (lb) Frequency 1 10 0 9 61 63 6 67 69 71 73
More informationLecture 3: Data Description - Multiple Attributes
Lecture 3: Data Description - Multiple Attributes Graham Elliott December 2008 Graham Elliott () December 2008 1 / 25 The Basic Objective Most interesting problems relate not to means etc. but to relationships
More informationFound under MATH NUM
While you wait Edit the last line of your z-score program : Disp round(z, 2) Found under MATH NUM Bluman, Chapter 6 1 Sec 6.2 Bluman, Chapter 6 2 Bluman, Chapter 6 3 6.2 Applications of the Normal Distributions
More informationSTA 248 H1S Winter 2008 Assignment 1 Solutions
1. (a) Measures of location: STA 248 H1S Winter 2008 Assignment 1 Solutions i. The mean, 100 1=1 x i/100, can be made arbitrarily large if one of the x i are made arbitrarily large since the sample size
More informationDATA SUMMARIZATION AND VISUALIZATION
APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296
More informationStandard Normal, Inverse Normal and Sampling Distributions
Standard Normal, Inverse Normal and Sampling Distributions Section 5.5 & 6.6 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-3339 Cathy
More informationSummary of Statistical Analysis Tools EDAD 5630
Summary of Statistical Analysis Tools EDAD 5630 Test Name Program Used Purpose Steps Main Uses/Applications in Schools Principal Component Analysis SPSS Measure Underlying Constructs Reliability SPSS Measure
More informationProbability and distributions
2 Probability and distributions The concepts of randomness and probability are central to statistics. It is an empirical fact that most experiments and investigations are not perfectly reproducible. The
More informationCopyright 2005 Pearson Education, Inc. Slide 6-1
Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is
More informationReview: Chebyshev s Rule. Measures of Dispersion II. Review: Empirical Rule. Review: Empirical Rule. Auto Batteries Example, p 59.
Review: Chebyshev s Rule Measures of Dispersion II Tom Ilvento STAT 200 Is based on a mathematical theorem for any data At least ¾ of the measurements will fall within ± 2 standard deviations from the
More informationMM and ML for a sample of n = 30 from Gamma(3,2) ===============================================
and for a sample of n = 30 from Gamma(3,2) =============================================== Generate the sample with shape parameter α = 3 and scale parameter λ = 2 > x=rgamma(30,3,2) > x [1] 0.7390502
More informationAP Statistics Chapter 6 - Random Variables
AP Statistics Chapter 6 - Random 6.1 Discrete and Continuous Random Objective: Recognize and define discrete random variables, and construct a probability distribution table and a probability histogram
More informationProf. Thistleton MAT 505 Introduction to Probability Lecture 3
Sections from Text and MIT Video Lecture: Sections 2.1 through 2.5 http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-041-probabilistic-systemsanalysis-and-applied-probability-fall-2010/video-lectures/lecture-1-probability-models-and-axioms/
More informationR Lab Session : Part 2
R Lab Session : Part 2 To see a review of how to start R, look at the beginning of Lab1 http://www-stat.stanford.edu/ epurdom/rlab.htm Probability Calculations The following examples demonstrate how to
More informationAP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE
AP STATISTICS Name: FALL SEMESTSER FINAL EXAM STUDY GUIDE Period: *Go over Vocabulary Notecards! *This is not a comprehensive review you still should look over your past notes, homework/practice, Quizzes,
More informationBoth the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.
Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need. For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of
More informationIntroduction to Statistical Data Analysis II
Introduction to Statistical Data Analysis II JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? Preface
More informationI. Return Calculations (20 pts, 4 points each)
University of Washington Winter 015 Department of Economics Eric Zivot Econ 44 Midterm Exam Solutions This is a closed book and closed note exam. However, you are allowed one page of notes (8.5 by 11 or
More informationNCSS Statistical Software. Reference Intervals
Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and
More informationUnit2: Probabilityanddistributions. 3. Normal distribution
Announcements Unit: Probabilityanddistributions 3 Normal distribution Sta 101 - Spring 015 Duke University, Department of Statistical Science February, 015 Peer evaluation 1 by Friday 11:59pm Office hours:
More informationChapter ! Bell Shaped
Chapter 6 6-1 Business Statistics: A First Course 5 th Edition Chapter 7 Continuous Probability Distributions Learning Objectives In this chapter, you learn:! To compute probabilities from the normal distribution!
More informationSome Characteristics of Data
Some Characteristics of Data Not all data is the same, and depending on some characteristics of a particular dataset, there are some limitations as to what can and cannot be done with that data. Some key
More informationPopulations and Samples Bios 662
Populations and Samples Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-08-22 16:29 BIOS 662 1 Populations and Samples Random Variables Random sample: result
More informationMAKING SENSE OF DATA Essentials series
MAKING SENSE OF DATA Essentials series THE NORMAL DISTRIBUTION Copyright by City of Bradford MDC Prerequisites Descriptive statistics Charts and graphs The normal distribution Surveys and sampling Correlation
More informationFinancial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR
Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR Nelson Mark University of Notre Dame Fall 2017 September 11, 2017 Introduction
More informationGGraph. Males Only. Premium. Experience. GGraph. Gender. 1 0: R 2 Linear = : R 2 Linear = Page 1
GGraph 9 Gender : R Linear =.43 : R Linear =.769 8 7 6 5 4 3 5 5 Males Only GGraph Page R Linear =.43 R Loess 9 8 7 6 5 4 5 5 Explore Case Processing Summary Cases Valid Missing Total N Percent N Percent
More informationDescribing Data: One Quantitative Variable
STAT 250 Dr. Kari Lock Morgan The Big Picture Describing Data: One Quantitative Variable Population Sampling SECTIONS 2.2, 2.3 One quantitative variable (2.2, 2.3) Statistical Inference Sample Descriptive
More informationQQ Plots Stat 342, Spring 2014 Prof. Guttorp - TA Aaron Zimmerman
QQ Plots Stat 342, Spring 2014 Prof. Guttorp - TA Aaron Zimmerman To get you started, remember that that a q-q-plot plots (Fn 1 (p), F0 1 (p)) for p (0, 1), where Fn 1 (p) = inf{y : F n (y) p}, where F
More informationSummary of Information from Recapitulation Report Submittals (DR-489 series, DR-493, Central Assessment, Agricultural Schedule):
County: Martin Study Type: 2014 - In-Depth The department approved your preliminary assessment roll for 2014. Roll approval statistical summary reports and graphics for 2014 are attached for additional
More informationStatistics and Probability
Statistics and Probability Continuous RVs (Normal); Confidence Intervals Outline Continuous random variables Normal distribution CLT Point estimation Confidence intervals http://www.isrec.isb-sib.ch/~darlene/geneve/
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes. Standardizing normal distributions The Standard Normal Curve
6.1 6.2 The Standard Normal Curve Standardizing normal distributions The "bell-shaped" curve, or normal curve, is a probability distribution that describes many reallife situations. Basic Properties 1.
More informationDescriptive Statistics
Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations
More informationM249 Diagnostic Quiz
THE OPEN UNIVERSITY Faculty of Mathematics and Computing M249 Diagnostic Quiz Prepared by the Course Team [Press to begin] c 2005, 2006 The Open University Last Revision Date: May 19, 2006 Version 4.2
More informationMidterm Exam. b. What are the continuously compounded returns for the two stocks?
University of Washington Fall 004 Department of Economics Eric Zivot Economics 483 Midterm Exam This is a closed book and closed note exam. However, you are allowed one page of notes (double-sided). Answer
More information1. Distinguish three missing data mechanisms:
1 DATA SCREENING I. Preliminary inspection of the raw data make sure that there are no obvious coding errors (e.g., all values for the observed variables are in the admissible range) and that all variables
More informationthe display, exploration and transformation of the data are demonstrated and biases typically encountered are highlighted.
1 Insurance data Generalized linear modeling is a methodology for modeling relationships between variables. It generalizes the classical normal linear model, by relaxing some of its restrictive assumptions,
More informationHIGHER SECONDARY I ST YEAR STATISTICS MODEL QUESTION PAPER
HIGHER SECONDARY I ST YEAR STATISTICS MODEL QUESTION PAPER Time - 2½ Hrs Max. Marks - 70 PART - I 15 x 1 = 15 Answer all the Questions I. Choose the Best Answer 1. Statistics may be called the Science
More informationProbability & Statistics Modular Learning Exercises
Probability & Statistics Modular Learning Exercises About The Actuarial Foundation The Actuarial Foundation, a 501(c)(3) nonprofit organization, develops, funds and executes education, scholarship and
More informationDescribing Uncertain Variables
Describing Uncertain Variables L7 Uncertainty in Variables Uncertainty in concepts and models Uncertainty in variables Lack of precision Lack of knowledge Variability in space/time Describing Uncertainty
More informationLecture 2 Describing Data
Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms
More informationSampling Distributions
Section 8.1 119 Sampling Distributions Section 8.1 C H A P T E R 8 4Example 2 (pg. 378) Sampling Distribution of the Sample Mean The heights of 3-year-old girls are normally distributed with μ=38.72 and
More information1 Describing Distributions with numbers
1 Describing Distributions with numbers Only for quantitative variables!! 1.1 Describing the center of a data set The mean of a set of numerical observation is the familiar arithmetic average. To write
More information2 Exploring Univariate Data
2 Exploring Univariate Data A good picture is worth more than a thousand words! Having the data collected we examine them to get a feel for they main messages and any surprising features, before attempting
More informationReview of commonly missed questions on the online quiz. Lecture 7: Random variables] Expected value and standard deviation. Let s bet...
Recap Review of commonly missed questions on the online quiz Lecture 7: ] Statistics 101 Mine Çetinkaya-Rundel OpenIntro quiz 2: questions 4 and 5 September 20, 2011 Statistics 101 (Mine Çetinkaya-Rundel)
More informationLecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution
More information* Source:
Problem: A recent report from Gallup stated that most teachers don t want to be armed in school. Gallup asked K-12 teachers if they would be willing to be trained so they could carry a gun at school. Eighteen
More informationTopic 8: Model Diagnostics
Topic 8: Model Diagnostics Outline Diagnostics to check model assumptions Diagnostics concerning X Diagnostics using the residuals Diagnostics and remedial measures Diagnostics: look at the data to diagnose
More informationData screening, transformations: MRC05
Dale Berger Data screening, transformations: MRC05 This is a demonstration of data screening and transformations for a regression analysis. Our interest is in predicting current salary from education level
More informationNumerical Descriptions of Data
Numerical Descriptions of Data Measures of Center Mean x = x i n Excel: = average ( ) Weighted mean x = (x i w i ) w i x = data values x i = i th data value w i = weight of the i th data value Median =
More informationHydrology 4410 Class 29. In Class Notes & Exercises Mar 27, 2013
Hydrology 4410 Class 29 In Class Notes & Exercises Mar 27, 2013 Log Normal Distribution We will not work an example in class. The procedure is exactly the same as in the normal distribution, but first
More informationECON 214 Elements of Statistics for Economists
ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education
More informationLecture 1: Empirical Properties of Returns
Lecture 1: Empirical Properties of Returns Econ 589 Eric Zivot Spring 2011 Updated: March 29, 2011 Daily CC Returns on MSFT -0.3 r(t) -0.2-0.1 0.1 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996
More informationChapter 11 Part 6. Correlation Continued. LOWESS Regression
Chapter 11 Part 6 Correlation Continued LOWESS Regression February 17, 2009 Goal: To review the properties of the correlation coefficient. To introduce you to the various tools that can be used to decide
More informationEpidemiology Principle of Biostatistics Chapter 7: Sampling Distributions (continued) John Koval
Principle of Biostatistics Chapter 7: Sampling Distributions (continued) John Koval Department of Epidemiology and Biostatistics University of Western Ontario Next want to look at histogram of sample statistics
More informationSimple Descriptive Statistics
Simple Descriptive Statistics These are ways to summarize a data set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency
More informationContents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali
Part I Descriptive Statistics 1 Introduction and Framework... 3 1.1 Population, Sample, and Observations... 3 1.2 Variables.... 4 1.2.1 Qualitative and Quantitative Variables.... 5 1.2.2 Discrete and Continuous
More informationTable of Contents. New to the Second Edition... Chapter 1: Introduction : Social Research...
iii Table of Contents Preface... xiii Purpose... xiii Outline of Chapters... xiv New to the Second Edition... xvii Acknowledgements... xviii Chapter 1: Introduction... 1 1.1: Social Research... 1 Introduction...
More informationLesson 12: Describing Distributions: Shape, Center, and Spread
: Shape, Center, and Spread Opening Exercise Distributions - Data are often summarized by graphs. We often refer to the group of data presented in the graph as a distribution. Below are examples of the
More informationStatistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron
Statistical Models of Stocks and Bonds Zachary D Easterling: Department of Economics The University of Akron Abstract One of the key ideas in monetary economics is that the prices of investments tend to
More informationReview. What is the probability of throwing two 6s in a row with a fair die? a) b) c) d) 0.333
Review In most card games cards are dealt without replacement. What is the probability of being dealt an ace and then a 3? Choose the closest answer. a) 0.0045 b) 0.0059 c) 0.0060 d) 0.1553 Review What
More informationThe Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s).
We will look the three common and useful measures of spread. The Range, the Inter Quartile Range (or IQR), and the Standard Deviation (which we usually denote by a lower case s). 1 Ameasure of the center
More information4. Basic distributions with R
4. Basic distributions with R CA200 (based on the book by Prof. Jane M. Horgan) 1 Discrete distributions: Binomial distribution Def: Conditions: 1. An experiment consists of n repeated trials 2. Each trial
More information23.1 Probability Distributions
3.1 Probability Distributions Essential Question: What is a probability distribution for a discrete random variable, and how can it be displayed? Explore Using Simulation to Obtain an Empirical Probability
More informationChapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)
Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop
More informationMathematics 1000, Winter 2008
Mathematics 1000, Winter 2008 Lecture 4 Sheng Zhang Department of Mathematics Wayne State University January 16, 2008 Announcement Monday is Martin Luther King Day NO CLASS Today s Topics Curves and Histograms
More informationChapter 15: Sampling distributions
=true true Chapter 15: Sampling distributions Objective (1) Get "big picture" view on drawing inferences from statistical studies. (2) Understand the concept of sampling distributions & sampling variability.
More information3.1 Measures of Central Tendency
3.1 Measures of Central Tendency n Summation Notation x i or x Sum observation on the variable that appears to the right of the summation symbol. Example 1 Suppose the variable x i is used to represent
More informationCHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS
CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS Note: This section uses session window commands instead of menu choices CENTRAL LIMIT THEOREM (SECTION 7.2 OF UNDERSTANDABLE STATISTICS) The Central Limit
More information5.1 Mean, Median, & Mode
5.1 Mean, Median, & Mode definitions Mean: Median: Mode: Example 1 The Blue Jays score these amounts of runs in their last 9 games: 4, 7, 2, 4, 10, 5, 6, 7, 7 Find the mean, median, and mode: Example 2
More informationSoftware Tutorial ormal Statistics
Software Tutorial ormal Statistics The example session with the teaching software, PG2000, which is described below is intended as an example run to familiarise the user with the package. This documented
More informationRegression and Simulation
Regression and Simulation This is an introductory R session, so it may go slowly if you have never used R before. Do not be discouraged. A great way to learn a new language like this is to plunge right
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao The binomial: mean and variance Recall that the number of successes out of n, denoted
More information