PDQ-Notes Reynolds Farley

PDQ-Note 7 Quantiles and Medians

The mean of a distribution is an excellent measure of central tendency. If we sum the years of age reported by all persons living in a state and then divide by the number of people in the computation, we obtain the average age, that is, the mean number of years of life lived by people in that state. However, the median is the other very common measure of central tendency. This is the number that divides a distribution into its upper and lower halves. For example, if you obtain the median age for residents in a state, you will know that one-half of the population reported younger ages than the median age while the other half reported older ages.

For many economic indicators, the median is used as a measure of central tendency rather than the mean. This is because persons with very high incomes or earnings substantially raise the mean of a distribution of incomes or earnings. However, their great incomes or earnings have very little impact upon the median. If economic polarization occurs such that the rich get richer over time while the poor stay about the same in income or earnings, the mean of the income or earnings distribution may increase, even increase rapidly over time, while the median hardly changes.

After selecting the 1990 PUMS 5% data set and bringing the Query Setup window to your screen, move your cursor to the down arrow in the far right corner of the Query Type box. Click there, and you will find Quantile as a mode. Highlight that Quantile mode to obtain a number of quantile points in the distribution of any quantitative data item.

When you highlight the Quantile mode and look toward the bottom of your Query Setup window, you will notice that two new boxes have appeared. One of these, Quantile Expression, is the box where you enter the data item whose median or quantile points you wish to learn. You may use a data item from the data set such as age or a created data item such as 1.39*rpincome.
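
The contrast between the mean and the median described at the start of this note can be checked with a few numbers. The sketch below is only an illustration in Python, not part of PDQ-Explore: the nine incomes are made up, and only the two highest incomes change between the two periods.

```python
from statistics import mean, median

# Made-up annual incomes for nine people at two points in time.
# Only the two highest incomes differ between the periods.
incomes_before = [18000, 22000, 25000, 30000, 34000, 40000, 47000, 90000, 150000]
incomes_after = [18000, 22000, 25000, 30000, 34000, 40000, 47000, 180000, 600000]


def summarize(incomes):
    """Return the mean and the median of a list of incomes."""
    return {"mean": mean(incomes), "median": median(incomes)}


before = summarize(incomes_before)
after = summarize(incomes_after)

# The mean more than doubles, while the median stays at 34000.
print("before:", before)
print("after: ", after)
```

In this made-up case the middle person is unaffected by what happens at the top, so the median does not move even though the mean rises sharply.
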

To the immediate left of the Quantile Expression box is a Quantile Order box containing 10 as a default value. This is where you type in the number of slices in the distribution you wish to analyze. If you were interested in the median only, you would enter 2, since you are slicing the distribution into halves to find the median. If you were interested in the quartile points of the distribution, that is, the 25th percentile point, the median, and the 75th percentile point, you would enter 4. That is, you want to slice the distribution into four units, each with the same number of observations. If you enter 5 into the Quantile Order box, you will obtain the quintile points of the distribution by slicing the distribution into five parts (but none of these would represent the precise median). If you were interested in knowing what amount of earnings distinguishes the top 1 percent of earnings from the bottom 99 percent, you would enter 100 into the Quantile Order box.

Please note that the Quantile Order box has up and down arrows providing you with choices of quantile points. If you wish just the median, scroll down until 2 appears in the Quantile Order box. If you wish to obtain 20 quantile points, please scroll down to 20.

Page 1 2001-12-18
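
The relationship between the Quantile Order value and the cut points it yields can be sketched outside PDQ-Explore. The Python example below is an illustration under stated assumptions: the ages are made up, the helper name `cut_points` is hypothetical, and it relies on the standard library's `statistics.quantiles`, whose interpolation rule may differ from the one PDQ-Explore applies.

```python
from statistics import median, quantiles


def cut_points(values, order):
    """Return the order - 1 interior points that split values into `order` slices."""
    return quantiles(values, n=order)


# Made-up ages between 60 and 90, for illustration only.
ages = [60 + (i * 7) % 31 for i in range(200)]

median_only = cut_points(ages, 2)   # one point: the median
quartiles = cut_points(ages, 4)     # three points: 25th, 50th, 75th percentiles
quintiles = cut_points(ages, 5)     # four points: 20th, 40th, 60th, 80th
deciles = cut_points(ages, 10)      # nine points: 10th through 90th

print("median:", median_only)
print("quartiles:", quartiles)
print("quintiles:", quintiles)
print("deciles:", deciles)
```

An order of n gives n - 1 interior cut points. The Python function does not report a value for the top of the distribution, which PDQ-Explore lists as the 100.00 N-tile.
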

Example 1 Distribution of Men and Women Over Age 59 by Deciles

At this point, let's try an example of using PDQ-Explore in the Quantile mode. You might be interested in the decile points of the age distributions of men and women who were at least 60 at the last census date. If so, you would type the following into the Universe/Selection box in the Expert Query window:

    age>59

To obtain these decile points for men and women, type the following into the Repeat For Each (Dimension 3) box:

    sex

Because your interest is in the distribution by age, you should type the following into the Quantile Expression box:

    age

Because you wish to obtain decile points in the distribution, type the following into the Quantile Order box:

    10

Example 2 Distribution of Income for Young Male and Female Physicians

You might be interested in the quintile points in the distribution of total income for young men and women who were physicians and who reported at least some income. For this run with PDQ-Explore, you might type the following into the Universe/Selection box:

    age>29 & age<50 & occup=84 & rpincome<>0

Note that occup=84 selects persons who reported physician as their occupation and rpincome<>0 selects physicians who had positive or negative incomes. Once again, you wish to compare men and women, so you would type the following into the Repeat For Each (Dimension 3) box:

    sex

For this analysis, the data item whose quintile points you wish to obtain will be:

    rpincome

Since we are interested in the quintile points of income distributions, we type the following into the Quantile Order box or use the scroll bar to scroll down to 5:

    5

After you make the appropriate entries into the boxes in the Quantile mode, please click the Results tab. In just a few seconds, you will see the quintile points you selected for whatever data item is in the Quantile Expression box.
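
The logic of Example 2 (select a universe, repeat for each sex, then take quintile points of rpincome) can be sketched in Python. This is an illustration rather than a description of how PDQ-Explore works internally: the records are generated, the sex codes 1 and 2 are assumptions, and the helper names `in_universe` and `quantiles_by_group` are hypothetical.

```python
from collections import defaultdict
from statistics import quantiles

# Made-up person records. Field names mirror the data items in the text;
# the values and the sex codes (1 and 2) are illustrative assumptions.
people = [
    {
        "age": 25 + i % 30,
        "sex": 1 + i % 2,
        "occup": 84 if i % 3 else 20,
        "rpincome": (i % 7 - 1) * 15000,
    }
    for i in range(210)
]


def in_universe(person):
    """Mirror the selection age>29 & age<50 & occup=84 & rpincome<>0."""
    return (
        person["age"] > 29
        and person["age"] < 50
        and person["occup"] == 84
        and person["rpincome"] != 0
    )


def quantiles_by_group(records, repeat_for_each, expression, order):
    """Group the selected records, then compute cut points within each group."""
    groups = defaultdict(list)
    for record in records:
        if in_universe(record):
            groups[record[repeat_for_each]].append(record[expression])
    return {key: quantiles(values, n=order) for key, values in sorted(groups.items())}


quintiles_by_sex = quantiles_by_group(people, "sex", "rpincome", 5)
for sex_code, points in quintiles_by_sex.items():
    print(sex_code, points)
```

Because the selection uses "not equal to zero" rather than "greater than zero", persons with negative incomes remain in the universe, as the note on rpincome<>0 states.
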

Quantile Results

The results screen for a query in the Quantile mode produces five columns of data for the data item listed in the Quantile Expression box:

N-tile: This reports the quantile whose value is shown to the immediate right. If you asked for the decile points of a distribution, you will see 10th, 20th, 30th, and so forth on your screen. The 50th N-tile is the median.

Cutoff: This is the numerical value in the distribution of the data item associated with the N-tile point shown to its left. The numerical value associated with the 20th N-tile, for example, separates the bottom 20 percent of the distribution from the top 80 percent.

Percent of Aggregate: This shows the percent of the total values of the data item you have analyzed that are held by the slice of the distribution identified by the number in the N-tile column to the left. Recall that in the Quantile mode, identical percents of the total number of people, households, families, or housing units are in each slice. If you are dealing with deciles of the distribution, every slice, that is, every decile, will include exactly one-tenth of the number of people or households. However, the bottom 10 percent of a distribution does not ordinarily have or receive 10 percent of the data item whose distribution you are analyzing. The lowest 10 percent of families typically obtain much less than 10 percent of total income received by all families. And the youngest 10 percent of the population does not have 10 percent of the total years of age reported by a group you are studying. The Percent of Aggregate column reports the share of the total values of the data item received or held by the slice of the distribution under consideration.

Cumulative Percent: Numbers in this column show the cumulative percent of the values on the data item under consideration received by or held by the slice of the distribution under consideration and by every lower slice. The cumulative percent associated with the median, that is, the 50th N-tile, is the share of the total of income, age, or whatever quantity is being studied that is received by or held by the lower half of the distribution.

Cumulative Aggregate: These numbers are similar to those in the Cumulative Percent column but are expressed in the units of the data item, such as years for age and dollars for rpincome or income1.
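
The five result columns can be reproduced on a small made-up distribution. The Python sketch below is an approximation, not PDQ-Explore's own algorithm. It assumes the number of observations divides evenly by the quantile order and that the total is positive. It takes each cutoff to be the first value of the next slice, and for the top slice uses the largest value plus 1, following the top-slice rule stated in the Cautions section. The function name `quantile_table` is hypothetical.

```python
def quantile_table(values, order):
    """Build rows resembling the N-tile, Cutoff, Percent of Aggregate,
    Cumulative Percent and Cumulative Aggregate columns."""
    data = sorted(values)
    count = len(data)
    total = sum(data)  # assumed to be positive
    rows = []
    running = 0
    for k in range(1, order + 1):
        start = (k - 1) * count // order
        end = k * count // order
        slice_sum = sum(data[start:end])
        running += slice_sum
        # Assumption: cutoff is the first value above the slice;
        # for the top slice it is the largest value plus 1.
        cutoff = data[end] if end < count else data[-1] + 1
        rows.append(
            {
                "ntile": 100 * k / order,
                "cutoff": cutoff,
                "pct_of_aggregate": 100 * slice_sum / total,
                "cumulative_pct": 100 * running / total,
                "cumulative_aggregate": running,
            }
        )
    return rows


# Ten made-up incomes that sum to 1000, sliced into quintiles.
incomes = [10, 20, 30, 40, 50, 60, 70, 80, 90, 550]
table = quantile_table(incomes, 5)
for row in table:
    print(row)
```

Each quintile here holds two of the ten observations, yet the bottom quintile holds 3 percent of the total and the top quintile holds 64 percent, which is the distinction the Percent of Aggregate column is meant to show.
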

Cautions when Using the Quantile Mode

Please think carefully about the data items you enter in the Repeat For Each (Axis 3) box. If you enter data items such as occup (for occupation), pob (for place of birth), industry (for industry of employment), or ancstry1 (for first reported ancestry), you will be asking for the quantiles of several hundred distributions in separate tables. You will not be able to readily interpret such an extensive amount of output.

Please also think about the number you enter in the Quantile Order box. If you are interested in just the median or just the decile points of the distribution, you will produce unnecessarily elaborate and cluttered results if you enter 100 into the Quantile Order box.

Many census data items are top coded. The number in the Cutoff column associated with the 100.00 N-tile in Quantile results is equal to the largest reported value plus 1 for the data item specified in the analysis. For example, age was top coded at 90 years in 1980 and 1990, so Quantile queries using this data item will show 91 as the cutoff value for the 100.00 N-tile. Of course, some people reported ages above 90 years in the census, but their ages were top coded at 90. Earnings and income data items were also top coded, so the numbers associated with the 100.00 N-tile in the Cutoff column equal the maximum reported earnings or income (subject to possible top coding) plus one dollar.

All census data items have numerical codes, but not all of those codes may be interpreted quantitatively. You may enter any data item you wish into the Quantile Expression box on the Expert Query window and obtain the median or decile points of its distribution. But numerical codes for states were assigned on an alphabetical basis, so knowing the median or deciles of that distribution tells you nothing useful.
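
The effect of top coding on the cutoff for the 100.00 N-tile can be illustrated with a few made-up ages. This Python sketch assumes only what the text states: age is top coded at 90 and the top cutoff equals the largest recorded value plus 1. The helper names `top_code` and `top_cutoff` are hypothetical.

```python
TOP_CODE_AGE = 90  # per the text, age was top coded at 90 in 1980 and 1990


def top_code(value, limit):
    """Record any value above the limit as the limit itself."""
    return min(value, limit)


def top_cutoff(values):
    """Cutoff for the 100.00 N-tile: the largest recorded value plus 1."""
    return max(values) + 1


# Made-up true ages; two of them exceed the top code.
true_ages = [67, 72, 88, 93, 101]
recorded_ages = [top_code(age, TOP_CODE_AGE) for age in true_ages]

print(recorded_ages)              # [67, 72, 88, 90, 90]
print(top_cutoff(recorded_ages))  # 91
```

The cutoff of 91 therefore describes the coding scheme, not the age of the oldest person in the group.
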