Analysis of Cross- Sectional Data Exercise WS 2017/18

Size: px
Start display at page:

Download "Analysis of Cross- Sectional Data Exercise WS 2017/18"

Transcription

1 Analysis of Cross- Sectional Data Exercise WS 2017/18 Exercise III: Variables October 25 / 26, 2017 czymara@wiso.uni-koeln.de

2 Remember So far: types of data Experimental vs. observational Cross-section, panel, time series, pooled cross-section Today Types of variables Their description & analysis

3 Quantitative data Non-metric (distance between scale values can t be measured) vs. metric nominal ordinal discrete: can only take particular (but possibly many) values continuous (dt. stetig): can occupy any value over a continuous range. Practically, an underlying continuous construct is sufficient

4 Levels of measurement Categorical Continuous Level of measurement Nominal Characteristics Identification No intrinsic order Examples Gender, country of origin Ordinal Order (<, =, >) Level of education, Likert scales Interval Ratio Order Difference Arbitrary zero point Like interval but absolute zero points Temperature in Celsius, year dates Temperature in Kelvin, income, age Absolute Like ratio but fixed units Rooms in a house, inhabitants in a country

5 Odds & odds ratios Group 1 Group 2 Category 1 a b N(cat. 1) Category 2 c d N(cat. 2) N(group 1) N(group 2) Odds: Relations, e. g. from cat. 1 to cat. 2: N(cat. 1) / N(cat. 2) Odds ratios: Relations of relations, e. g. from cat. 1 to cat. 2 for group 1 to group 2: (a/c) / (b/d)

6 Reading odds ratios! Group 1 Group 2 Category 1 a b Category 2 c d Formula Odds ratios OR >1: odds are higher in first group (compared to second); <1: odds are smaller in first group (compared to second) But NOT The probability!! The odds (chances) of being in category 1 rather than in category 2 are [OR] times larger (smaller) for group 1 than for group 2. Alternatively: The odds (chances) of being in category 1 rather than in category 2 are [(OR-1)*100] percent larger (smaller) for group 1 than for group 2.

7 Odds vs. probability Simple example: flipping a coin Odds of heads (vs. tails): 1:1 Probability of heads: 50% But both mathematically related: probability heads = odds(heads) 1 + odds(heads) Cf.

8 Odds & odds ratios in a nutshell Gender (x) Passanger survivied (Y) Yes (b1) No (b2) Total Female (a1) 344(h11) 126 (h12) 470 (h1.) Male (a2) 367 (h21) 1364 (h22) 1731 (h2.) Total 711 (h.1) 1490 (h.2) 2201 (n) O surv., no surv. female = survival female no survival female O b1, b2 X = a 1 = h 11 h 12 O surv., no surv. male = survival male no survival male O b1, b2 X = a 2 = h 21 h 22 OR = Odds survival, no survival female Odds survival, no survival male OR = O b1, b2 X = a 1 O b1, b2 X = a 2 = h 11/h 12 h 21 /h 22

9 Exercise III: Variables (p. 75 f.)

10 Help for exercise 3: No 2 (p. 75 f.) a) Variable Type Description Scale age Continuous?? sex? 1=male, 2=female nominal degree??? papres80??? b) Use tab command e) Use options:,by & discrete & percent i) Tab degree fpresqu

11 2a solution Variable Type Description Scale age continuous Age (in years) ratio sex categorical 1=male, 2=female nominal degree categorical 5 categories ordinal papres80 continuous Prestige scale ordinal / interval

12 Multiple choice questions

13 + = + =

14 Percentage men: 50/632= Percentage women: 31/876= Difference: =0.0437

15

16

17 Happy holiday (so next week no lecture & no exercises)

18 In addition: Stata commands

19 DESCRIPTION Frequency table Calculator (odds) Cross-table COMMAND tab varname dis a/b tab varname1 varname2 (rows) (columns) Histogram hist varname1, by(varname2) discrete percent Quantiles (creates equal sized groups out of a continuous variable) xtile newvar = oldvar, n(#) # = Number of quantiles Universität zu Köln Folie: 8

20 DESCRIPTION Summarize with extra details Cross-table Histogram Box-plot Box-plots for different groups COMMAND sum varname, detail tab varname1 varname2 (rows) (columns) hist varname graph box varlist graph box varlist, over(varname) (group) Universität zu Köln Folie: 10

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data Summarising Data Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Today we will consider Different types of data Appropriate ways to summarise these data 17/10/2017

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Stemplots (or Stem-and-leaf plots) Stemplot and Boxplot T -- leading digits are called stems T -- final digits are called leaves STAT 74 Descriptive Statistics 2 Example: (number

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Ani Manichaikul amanicha@jhsph.edu 16 April 2007 1 / 40 Course Information I Office hours For questions and help When? I ll announce this tomorrow

More information

**The chart below shows the amount of leisure time enjoyed by men and women of different employment status.

**The chart below shows the amount of leisure time enjoyed by men and women of different employment status. Bar Graph **The chart below shows the amount of leisure time enjoyed by men and women of different employment status. Write a report for a university lecturer describing the information shown below. Leisure

More information

Introduction to Descriptive Statistics

Introduction to Descriptive Statistics Introduction to Descriptive Statistics 17.871 Types of Variables ~Nominal (Quantitative) Nominal (Qualitative) categorical Ordinal Interval or ratio Describing data Moment Non-mean based measure Center

More information

Assessing Normality. Contents. 1 Assessing Normality. 1.1 Introduction. Anthony Tanbakuchi Department of Mathematics Pima Community College

Assessing Normality. Contents. 1 Assessing Normality. 1.1 Introduction. Anthony Tanbakuchi Department of Mathematics Pima Community College Introductory Statistics Lectures Assessing Normality Department of Mathematics Pima Community College Redistribution of this material is prohibited without written permission of the author 2009 (Compile

More information

Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes

Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes MDM 4U Probability Review Properties of Probability Experimental Probability - probability measured by performing an experiment for a number of n trials and recording the number of outcomes Theoretical

More information

23.1 Probability Distributions

23.1 Probability Distributions 3.1 Probability Distributions Essential Question: What is a probability distribution for a discrete random variable, and how can it be displayed? Explore Using Simulation to Obtain an Empirical Probability

More information

One Proportion Superiority by a Margin Tests

One Proportion Superiority by a Margin Tests Chapter 512 One Proportion Superiority by a Margin Tests Introduction This procedure computes confidence limits and superiority by a margin hypothesis tests for a single proportion. For example, you might

More information

Prof. Thistleton MAT 505 Introduction to Probability Lecture 3

Prof. Thistleton MAT 505 Introduction to Probability Lecture 3 Sections from Text and MIT Video Lecture: Sections 2.1 through 2.5 http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-041-probabilistic-systemsanalysis-and-applied-probability-fall-2010/video-lectures/lecture-1-probability-models-and-axioms/

More information

Rules so far. Probability with Tables. Likely Voter. Handout of Class Data. Union of Events A and B. Tom Ilvento STAT 200

Rules so far. Probability with Tables. Likely Voter. Handout of Class Data. Union of Events A and B. Tom Ilvento STAT 200 Rules so far Probability with Tables Tom Ilvento STAT 200 Probability of A Union A B) = A) + B) A B) A B) Conditional A B) = Probability B) Probability of an P ( A B) = B) A B) Intersection Handout of

More information

Lecture 2 Describing Data

Lecture 2 Describing Data Lecture 2 Describing Data Thais Paiva STA 111 - Summer 2013 Term II July 2, 2013 Lecture Plan 1 Types of data 2 Describing the data with plots 3 Summary statistics for central tendency and spread 4 Histograms

More information

Essential Question: What is a probability distribution for a discrete random variable, and how can it be displayed?

Essential Question: What is a probability distribution for a discrete random variable, and how can it be displayed? COMMON CORE N 3 Locker LESSON Distributions Common Core Math Standards The student is expected to: COMMON CORE S-IC.A. Decide if a specified model is consistent with results from a given data-generating

More information

1 Variables and data types

1 Variables and data types 1 Variables and data types The data in statistical studies come from observations. Each observation generally yields a variety data which produce values for different variables. Variables come in two basic

More information

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative STAT:10 Statistical Methods and Computing Normal Distributions Lecture 4 Feb. 6, 17 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowa.edu 1 2 Using density curves to describe the distribution of values of

More information

II. Random Variables

II. Random Variables II. Random Variables Random variables operate in much the same way as the outcomes or events in some arbitrary sample space the distinction is that random variables are simply outcomes that are represented

More information

Statistics for Business and Economics: Random Variables:Continuous

Statistics for Business and Economics: Random Variables:Continuous Statistics for Business and Economics: Random Variables:Continuous STT 315: Section 107 Acknowledgement: I d like to thank Dr. Ashoke Sinha for allowing me to use and edit the slides. Murray Bourne (interactive

More information

Section M Discrete Probability Distribution

Section M Discrete Probability Distribution Section M Discrete Probability Distribution A random variable is a numerical measure of the outcome of a probability experiment, so its value is determined by chance. Random variables are typically denoted

More information

Name: Homework Assignment Six Due Friday, Feb. 27 th

Name: Homework Assignment Six Due Friday, Feb. 27 th Name: Homework Assignment Six Due Friday, Feb. 27 th Suggest Reading: Chapter 5 Section 1 3, Chapter 6 Section 1- Section 2 1. List the probability model for the below situations. (Remember to list the

More information

NCSS Statistical Software. Reference Intervals

NCSS Statistical Software. Reference Intervals Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and

More information

The normal distribution is a theoretical model derived mathematically and not empirically.

The normal distribution is a theoretical model derived mathematically and not empirically. Sociology 541 The Normal Distribution Probability and An Introduction to Inferential Statistics Normal Approximation The normal distribution is a theoretical model derived mathematically and not empirically.

More information

Probability, Odds Ratio and Risk Ratio. Dr. Abbas Adigun (PhD) Biostatistician 19 th May 2017

Probability, Odds Ratio and Risk Ratio. Dr. Abbas Adigun (PhD) Biostatistician 19 th May 2017 Probability, Odds Ratio and Risk Ratio Dr. Abbas Adigun (PhD) Biostatistician 19 th May 2017 Probability Probability is a measure of the chance of getting some outcome of interest from some event. The

More information

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 5 Student Lecture Notes 5-1. Department of Quantitative Methods & Information Systems. Business Statistics Chapter 5 Student Lecture Notes 5-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 5 Discrete Probability Distributions QMIS 120 Dr. Mohammad Zainal Chapter Goals

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

PROBABILITY and BAYES THEOREM

PROBABILITY and BAYES THEOREM PROBABILITY and BAYES THEOREM From: http://ocw.metu.edu.tr/pluginfile.php/2277/mod_resource/content/0/ ocw_iam530/2.conditional%20probability%20and%20bayes%20theorem.pdf CONTINGENCY (CROSS- TABULATION)

More information

IPUMS Int.l Extraction and Analysis

IPUMS Int.l Extraction and Analysis Minnesota Population Center Training and Development IPUMS Int.l Extraction and Analysis Exercise 1 OBJECTIVE: Gain an understanding of how the IPUMS dataset is structured and how it can be leveraged to

More information

Lecture 07: Measures of central tendency

Lecture 07: Measures of central tendency Lecture 07: Measures of central tendency Ernesto F. L. Amaral September 21, 2017 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015. Statistics: A Tool for Social Research. Stamford:

More information

Lecture Data Science

Lecture Data Science Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Statistics Foundations JProf. Dr. Claudia Wagner Learning Goals How to describe sample data? What is mode/median/mean?

More information

Today's Agenda Hour 1 Correlation vs association, Pearson s R, non-linearity, Spearman rank correlation,

Today's Agenda Hour 1 Correlation vs association, Pearson s R, non-linearity, Spearman rank correlation, Today's Agenda Hour 1 Correlation vs association, Pearson s R, non-linearity, Spearman rank correlation, Hour 2 Hypothesis testing for correlation (Pearson) Correlation and regression. Correlation vs association

More information

Lecture 3: Review of Probability, MATLAB, Histograms

Lecture 3: Review of Probability, MATLAB, Histograms CS 4980/6980: Introduction to Data Science c Spring 2018 Lecture 3: Review of Probability, MATLAB, Histograms Instructor: Daniel L. Pimentel-Alarcón Scribed and Ken Varghese This is preliminary work and

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

DATA SUMMARIZATION AND VISUALIZATION

DATA SUMMARIZATION AND VISUALIZATION APPENDIX DATA SUMMARIZATION AND VISUALIZATION PART 1 SUMMARIZATION 1: BUILDING BLOCKS OF DATA ANALYSIS 294 PART 2 PART 3 PART 4 VISUALIZATION: GRAPHS AND TABLES FOR SUMMARIZING AND ORGANIZING DATA 296

More information

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics.

Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Week 1 Variables: Exploration, Familiarisation and Description. Descriptive Statistics. Convergent validity: the degree to which results/evidence from different tests/sources, converge on the same conclusion.

More information

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods 1 SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 Lecture 10: Multinomial regression baseline category extension of binary What if we have multiple possible

More information

institution Top 10 to 20 undergraduate

institution Top 10 to 20 undergraduate Appendix Table A1 Who Responded to the Survey Dynamics of the Gender Gap for Young Professionals in the Financial and Corporate Sectors By Marianne Bertrand, Claudia Goldin, Lawrence F. Katz On-Line Appendix

More information

Logistic Regression Analysis

Logistic Regression Analysis Revised July 2018 Logistic Regression Analysis This set of notes shows how to use Stata to estimate a logistic regression equation. It assumes that you have set Stata up on your computer (see the Getting

More information

Exploring the Scope of Neurometrically Informed Mechanism Design. Ian Krajbich 1,3,4 * Colin Camerer 1,2 Antonio Rangel 1,2

Exploring the Scope of Neurometrically Informed Mechanism Design. Ian Krajbich 1,3,4 * Colin Camerer 1,2 Antonio Rangel 1,2 Exploring the Scope of Neurometrically Informed Mechanism Design Ian Krajbich 1,3,4 * Colin Camerer 1,2 Antonio Rangel 1,2 Appendix A: Instructions from the SLM experiment (Experiment 1) This experiment

More information

ECON 214 Elements of Statistics for Economists 2016/2017

ECON 214 Elements of Statistics for Economists 2016/2017 ECON 214 Elements of Statistics for Economists 2016/2017 Topic The Normal Distribution Lecturer: Dr. Bernardin Senadza, Dept. of Economics bsenadza@ug.edu.gh College of Education School of Continuing and

More information

Chapter 9 Chapter Friday, June 4 th

Chapter 9 Chapter Friday, June 4 th Chapter 9 Chapter 10 Sections 9.1 9.5 and 10.1 10.5 Friday, June 4 th Parameter and Statisticti ti Parameter is a number that is a summary characteristic of a population Statistic, is a number that is

More information

Objectives. 1. Learn more details about the cohort study design. 2. Comprehend confounding and calculate unbiased estimates

Objectives. 1. Learn more details about the cohort study design. 2. Comprehend confounding and calculate unbiased estimates Abortion Week 6 1 Objectives 1. Learn more details about the cohort study design 2. Comprehend confounding and calculate unbiased estimates 3. Critically evaluate how abortion is related to issues that

More information

Full file at Chapter 2 Descriptive Statistics: Tabular and Graphical Presentations

Full file at   Chapter 2 Descriptive Statistics: Tabular and Graphical Presentations Descriptive Statistics: Tabular and Graphical Presentations Learning Objectives 1. Learn how to construct and interpret summarization procedures for qualitative data such as : frequency and relative frequency

More information

Quantile Regression in Survival Analysis

Quantile Regression in Survival Analysis Quantile Regression in Survival Analysis Andrea Bellavia Unit of Biostatistics, Institute of Environmental Medicine Karolinska Institutet, Stockholm http://www.imm.ki.se/biostatistics andrea.bellavia@ki.se

More information

2. Modeling Uncertainty

2. Modeling Uncertainty 2. Modeling Uncertainty Models for Uncertainty (Random Variables): Big Picture We now move from viewing the data to thinking about models that describe the data. Since the real world is uncertain, our

More information

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA Session 178 TS, Stats for Health Actuaries Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA Presenter: Joan C. Barrett, FSA, MAAA Session 178 Statistics for Health Actuaries October 14, 2015 Presented

More information

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment MBEJ 1023 Planning Analytical Methods Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment Contents What is statistics? Population and Sample Descriptive Statistics Inferential

More information

Lab #7. In previous lectures, we discussed factorials and binomial coefficients. Factorials can be calculated with:

Lab #7. In previous lectures, we discussed factorials and binomial coefficients. Factorials can be calculated with: Introduction to Biostatistics (171:161) Breheny Lab #7 In Lab #7, we are going to use R and SAS to calculate factorials, binomial coefficients, and probabilities from both the binomial and the normal distributions.

More information

Applications of Data Analysis (EC969) Simonetta Longhi and Alita Nandi (ISER) Contact: slonghi and

Applications of Data Analysis (EC969) Simonetta Longhi and Alita Nandi (ISER) Contact: slonghi and Applications of Data Analysis (EC969) Simonetta Longhi and Alita Nandi (ISER) Contact: slonghi and anandi; @essex.ac.uk Week 2 Lecture 1: Sampling (I) Constructing Sampling distributions and estimating

More information

Table 1. Summary of Faculty Salary Data for Fall Mean Salary Males. Mean Salary Females. Median Salary Males

Table 1. Summary of Faculty Salary Data for Fall Mean Salary Males. Mean Salary Females. Median Salary Males Report to the UTK Faculty Senate from the Senate Budget and Planning Committee Analysis of Faculty Salary Data based upon Gender using Data from Fall 2015 Draft August 31, 2016 Louis J. Gross, Chair, Faculty

More information

The following content is provided under a Creative Commons license. Your support

The following content is provided under a Creative Commons license. Your support MITOCW Recitation 6 The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free. To make

More information

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a

Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Homework: Due Wed, Nov 3 rd Chapter 8, # 48a, 55c and 56 (count as 1), 67a Announcements: There are some office hour changes for Nov 5, 8, 9 on website Week 5 quiz begins after class today and ends at

More information

Descriptive Statistics in Analysis of Survey Data

Descriptive Statistics in Analysis of Survey Data Descriptive Statistics in Analysis of Survey Data March 2013 Kenneth M Coleman Mohammad Nizamuddiin Khan Survey: Definition A survey is a systematic method for gathering information from (a sample of)

More information

Lecture 6 Probability

Lecture 6 Probability Faculty of Medicine Epidemiology and Biostatistics الوبائيات واإلحصاء الحيوي (31505204) Lecture 6 Probability By Hatim Jaber MD MPH JBCM PhD 3+4-7-2018 1 Presentation outline 3+4-7-2018 Time Introduction-

More information

Business Statistics. Chapter 5 Discrete Probability Distributions QMIS 120. Dr. Mohammad Zainal

Business Statistics. Chapter 5 Discrete Probability Distributions QMIS 120. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 5 Discrete Probability Distributions QMIS 120 Dr. Mohammad Zainal Chapter Goals After completing this chapter, you should

More information

Hydrology 4410 Class 29. In Class Notes & Exercises Mar 27, 2013

Hydrology 4410 Class 29. In Class Notes & Exercises Mar 27, 2013 Hydrology 4410 Class 29 In Class Notes & Exercises Mar 27, 2013 Log Normal Distribution We will not work an example in class. The procedure is exactly the same as in the normal distribution, but first

More information

Probability Distributions. Definitions Discrete vs. Continuous Mean and Standard Deviation TI 83/84 Calculator Binomial Distribution

Probability Distributions. Definitions Discrete vs. Continuous Mean and Standard Deviation TI 83/84 Calculator Binomial Distribution Probability Distributions Definitions Discrete vs. Continuous Mean and Standard Deviation TI 83/84 Calculator Binomial Distribution Definitions Random Variable: a variable that has a single numerical value

More information

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw

MAS1403. Quantitative Methods for Business Management. Semester 1, Module leader: Dr. David Walshaw MAS1403 Quantitative Methods for Business Management Semester 1, 2018 2019 Module leader: Dr. David Walshaw Additional lecturers: Dr. James Waldren and Dr. Stuart Hall Announcements: Written assignment

More information

Getting to know a data-set (how to approach data) Overview: Descriptives & Graphing

Getting to know a data-set (how to approach data) Overview: Descriptives & Graphing Overview: Descriptives & Graphing 1. Getting to know a data set 2. LOM & types of statistics 3. Descriptive statistics 4. Normal distribution 5. Non-normal distributions 6. Effect of skew on central tendency

More information

STAB22 section 1.3 and Chapter 1 exercises

STAB22 section 1.3 and Chapter 1 exercises STAB22 section 1.3 and Chapter 1 exercises 1.101 Go up and down two times the standard deviation from the mean. So 95% of scores will be between 572 (2)(51) = 470 and 572 + (2)(51) = 674. 1.102 Same idea

More information

Categorical. A general name for non-numerical data; the data is separated into categories of some kind.

Categorical. A general name for non-numerical data; the data is separated into categories of some kind. Chapter 5 Categorical A general name for non-numerical data; the data is separated into categories of some kind. Nominal data Categorical data with no implied order. Eg. Eye colours, favourite TV show,

More information

Problem Set 2. PPPA 6022 Due in class, on paper, March 5. Some overall instructions:

Problem Set 2. PPPA 6022 Due in class, on paper, March 5. Some overall instructions: Problem Set 2 PPPA 6022 Due in class, on paper, March 5 Some overall instructions: Please use a do-file (or its SAS or SPSS equivalent) for this work do not program interactively! I have provided Stata

More information

Descriptive Statistics Bios 662

Descriptive Statistics Bios 662 Descriptive Statistics Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-08-19 08:51 BIOS 662 1 Descriptive Statistics Descriptive Statistics Types of variables

More information

Probability & Statistics Modular Learning Exercises

Probability & Statistics Modular Learning Exercises Probability & Statistics Modular Learning Exercises About The Actuarial Foundation The Actuarial Foundation, a 501(c)(3) nonprofit organization, develops, funds and executes education, scholarship and

More information

Lecture 2 INTERVAL ESTIMATION II

Lecture 2 INTERVAL ESTIMATION II Lecture 2 INTERVAL ESTIMATION II Recap Population of interest - want to say something about the population mean µ perhaps Take a random sample... Recap When our random sample follows a normal distribution,

More information

The Labour Market and Economic Growth / Standard of Living. Mark Wooden

The Labour Market and Economic Growth / Standard of Living. Mark Wooden The Labour Market and Economic Growth / Standard of Living Mark Wooden A Simple Identity GDP GDP H * POP H POP GDP = Real Gross Domestic Product Pop = Total population H = Aggregate hours of work Trends,

More information

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions:

Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: Chapter 17 Inference about a Population Mean Conditions for inference Previously, when making inferences about the population mean, μ, we were assuming the following simple conditions: (1) Our data (observations)

More information

Bidding Decision Example

Bidding Decision Example Bidding Decision Example SUPERTREE EXAMPLE In this chapter, we demonstrate Supertree using the simple bidding problem portrayed by the decision tree in Figure 5.1. The situation: Your company is bidding

More information

Exploring Data and Graphics

Exploring Data and Graphics Exploring Data and Graphics Rick White Department of Statistics, UBC Graduate Pathways to Success Graduate & Postdoctoral Studies November 13, 2013 Outline Summarizing Data Types of Data Visualizing Data

More information

Numerical summary of data

Numerical summary of data Numerical summary of data Introduction to Statistics Measures of location: mode, median, mean, Measures of spread: range, interquartile range, standard deviation, Measures of form: skewness, kurtosis,

More information

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley Outline: 1) Review of Variation & Error 2) Binomial Distributions 3) The Normal Distribution 4) Defining the Mean of a population Goals:

More information

MATHEMATICAL LITERACY

MATHEMATICAL LITERACY MATBUS JUNE 2013 EXAMINATION DATE: 7 JUNE 2013 TIME: 14H00 16H00 TOTAL: 100 MARKS DURATION: 2 HOURS PASS MARK: 40% (UC-02) MATHEMATICAL LITERACY THIS EXAMINATION PAPER CONSISTS OF 9 QUESTIONS: ANSWER ALL

More information

Statistical Disclosure Control for Tabular Data. Tabular data protection theory. Tabular data protection: Introduction. Theory and Methods (part I)

Statistical Disclosure Control for Tabular Data. Tabular data protection theory. Tabular data protection: Introduction. Theory and Methods (part I) Statistical Disclosure Control for Tabular Data Theory and Methods (part I) Federal Statistical Office of Germany, C 1 Mathematical Statistical Methods 03.06.2013 Folie 1 Tabular data protection theory

More information

LABOUR FORCE SURVEY 2017 MAIN RESULTS

LABOUR FORCE SURVEY 2017 MAIN RESULTS LABOUR FORCE SURVEY 2017 MAIN RESULTS In 2017 the number of economically active population aged 15-64 was 3 277.5 thousand and represented 71.3% of population in the same age group. The activity rate (15-64

More information

Business Statistics 41000: Probability 4

Business Statistics 41000: Probability 4 Business Statistics 41000: Probability 4 Drew D. Creal University of Chicago, Booth School of Business February 14 and 15, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office:

More information

A useful modeling tricks.

A useful modeling tricks. .7 Joint models for more than two outcomes We saw that we could write joint models for a pair of variables by specifying the joint probabilities over all pairs of outcomes. In principal, we could do this

More information

Event p351 An event is an outcome or a set of outcomes of a random phenomenon. That is, an event is a subset of the sample space.

Event p351 An event is an outcome or a set of outcomes of a random phenomenon. That is, an event is a subset of the sample space. Chapter 12: From randomness to probability 350 Terminology Sample space p351 The sample space of a random phenomenon is the set of all possible outcomes. Example Toss a coin. Sample space: S = {H, T} Example:

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Midterm Exam ٩(^ᴗ^)۶ In class, next week, Thursday, 26 April. 1 hour, 45 minutes. 5 questions of varying lengths.

More information

12.1 One-Way Analysis of Variance. ANOVA - analysis of variance - used to compare the means of several populations.

12.1 One-Way Analysis of Variance. ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

Diploma in Financial Management with Public Finance

Diploma in Financial Management with Public Finance Diploma in Financial Management with Public Finance Cohort: DFM/09/FT Jan Intake Examinations for 2009 Semester II MODULE: STATISTICS FOR FINANCE MODULE CODE: QUAN 1103 Duration: 2 Hours Reading time:

More information

Finance 527: Lecture 35, Psychology of Investing V2

Finance 527: Lecture 35, Psychology of Investing V2 Finance 527: Lecture 35, Psychology of Investing V2 [John Nofsinger]: Welcome to the second video for the psychology of investing. In this one, we re going to talk about overconfidence. Like this little

More information

Outline. Unit 3: Descriptive Statistics for Continuous Data. Outline. Reminder: the library metaphor

Outline. Unit 3: Descriptive Statistics for Continuous Data. Outline. Reminder: the library metaphor Unit 3: Descriptive Statistics for Continuous Data Statistics for Linguists with R A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 2 1 Center for Mind/Brain Sciences (CIMeC) University of Trento,

More information

A random variable is a quantitative variable that represents a certain

A random variable is a quantitative variable that represents a certain Section 6.1 Discrete Random Variables Example: Probability Distribution, Spin the Spinners Sum of Numbers on Spinners Theoretical Probability 2 0.04 3 0.08 4 0.12 5 0.16 6 0.20 7 0.16 8 0.12 9 0.08 10

More information

UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certificate of Education Ordinary Level STATISTICS 4040/01

UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certificate of Education Ordinary Level STATISTICS 4040/01 UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certificate of Education Ordinary Level STATISTICS 4040/01 Paper 1 Additional Materials: Answer Booklet/Paper Graph paper (2 sheets) Mathematical

More information

Total number of balls played

Total number of balls played Class IX - NCERT Maths Exercise (15.1) Question 1: In a cricket math, a batswoman hits a boundary 6 times out of 30 balls she plays. Find the probability that she did not hit a boundary. Solution 1: Number

More information

ECON 214 Elements of Statistics for Economists

ECON 214 Elements of Statistics for Economists ECON 214 Elements of Statistics for Economists Session 7 The Normal Distribution Part 1 Lecturer: Dr. Bernardin Senadza, Dept. of Economics Contact Information: bsenadza@ug.edu.gh College of Education

More information

Class 11. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 11. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 11 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 2017 by D.B. Rowe 1 Agenda: Recap Chapter 5.3 continued Lecture 6.1-6.2 Go over Eam 2. 2 5: Probability

More information

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS A random variable is the description of the outcome of an experiment in words. The verbal description of a random variable tells you how to find or calculate

More information

The Binomial Probability Distribution

The Binomial Probability Distribution The Binomial Probability Distribution MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Objectives After this lesson we will be able to: determine whether a probability

More information

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives

9/17/2015. Basic Statistics for the Healthcare Professional. Relax.it won t be that bad! Purpose of Statistic. Objectives Basic Statistics for the Healthcare Professional 1 F R A N K C O H E N, M B B, M P A D I R E C T O R O F A N A L Y T I C S D O C T O R S M A N A G E M E N T, LLC Purpose of Statistic 2 Provide a numerical

More information

Project for the Regional Advancement of Statistics in the Caribbean - PRASC

Project for the Regional Advancement of Statistics in the Caribbean - PRASC Project for the Regional Advancement of Statistics in the Caribbean - PRASC Descriptive Statistical Tools for Data Analysis Analysis Workshop - Module 2 2 March 21-24, 2016 Kingstown, Saint Vincent and

More information

Master of Science in Strategic Management Degree Master of Science in Strategic Supply Chain Management Degree

Master of Science in Strategic Management Degree Master of Science in Strategic Supply Chain Management Degree CHINHOYI UNIVERSITY OF TECHNOLOGY SCHOOL OF BUSINESS SCIENCES AND MANAGEMENT POST GRADUATE PROGRAMME Master of Science in Strategic Management Degree Master of Science in Strategic Supply Chain Management

More information

6.1 Discrete & Continuous Random Variables. Nov 4 6:53 PM. Objectives

6.1 Discrete & Continuous Random Variables. Nov 4 6:53 PM. Objectives 6.1 Discrete & Continuous Random Variables examples vocab Objectives Today we will... - Compute probabilities using the probability distribution of a discrete random variable. - Calculate and interpret

More information

Frequency Distributions

Frequency Distributions Frequency Distributions January 8, 2018 Contents Frequency histograms Relative Frequency Histograms Cumulative Frequency Graph Frequency Histograms in R Using the Cumulative Frequency Graph to Estimate

More information

SEX DISCRIMINATION PROBLEM

SEX DISCRIMINATION PROBLEM SEX DISCRIMINATION PROBLEM 5. Displaying Relationships between Variables In this section we will use scatterplots to examine the relationship between the dependent variable (starting salary) and each of

More information

Fundamentals of Statistics

Fundamentals of Statistics CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct

More information

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82

Homework: Due Wed, Feb 20 th. Chapter 8, # 60a + 62a (count together as 1), 74, 82 Announcements: Week 5 quiz begins at 4pm today and ends at 3pm on Wed If you take more than 20 minutes to complete your quiz, you will only receive partial credit. (It doesn t cut you off.) Today: Sections

More information

We will use an example which will result in a paired t test regarding the labor force participation rate for women in the 60 s and 70 s.

We will use an example which will result in a paired t test regarding the labor force participation rate for women in the 60 s and 70 s. Now let s review methods for one quantitative variable. We will use an example which will result in a paired t test regarding the labor force participation rate for women in the 60 s and 70 s. 17 The labor

More information

Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny.

Random variables The binomial distribution The normal distribution Sampling distributions. Distributions. Patrick Breheny. Distributions September 17 Random variables Anything that can be measured or categorized is called a variable If the value that a variable takes on is subject to variability, then it the variable is a

More information

STAT 201 Chapter 6. Distribution

STAT 201 Chapter 6. Distribution STAT 201 Chapter 6 Distribution 1 Random Variable We know variable Random Variable: a numerical measurement of the outcome of a random phenomena Capital letter refer to the random variable Lower case letters

More information

the number of correct answers on question i. (Note that the only possible values of X i

the number of correct answers on question i. (Note that the only possible values of X i 6851_ch08_137_153 16/9/02 19:48 Page 137 8 8.1 (a) No: There is no fixed n (i.e., there is no definite upper limit on the number of defects). (b) Yes: It is reasonable to believe that all responses are

More information

Full Web Appendix: How Financial Incentives Induce Disability Insurance. Recipients to Return to Work. by Andreas Ravndal Kostøl and Magne Mogstad

Full Web Appendix: How Financial Incentives Induce Disability Insurance. Recipients to Return to Work. by Andreas Ravndal Kostøl and Magne Mogstad Full Web Appendix: How Financial Incentives Induce Disability Insurance Recipients to Return to Work by Andreas Ravndal Kostøl and Magne Mogstad A Tables and Figures Table A.1: Characteristics of DI recipients

More information