Introduction to Population Modeling

Similar documents
σ e, which will be large when prediction errors are Linear regression model

Financial Applications Involving Exponential Functions

Answers to Exercise 8

AP Statistics Chapter 6 - Random Variables

Topic #1: Evaluating and Simplifying Algebraic Expressions

Biol 356 Lab 7. Mark-Recapture Population Estimates

This homework assignment uses the material on pages ( A moving average ).

Jacob: What data do we use? Do we compile paid loss triangles for a line of business?

Jacob: The illustrative worksheet shows the values of the simulation parameters in the upper left section (Cells D5:F10). Is this for documentation?

IOP 201-Q (Industrial Psychological Research) Tutorial 5

Jaime Frade Dr. Niu Interest rate modeling

Questions 3-6 are each weighted twice as much as each of the other questions.

Multivariate Statistics Lecture Notes. Stephen Ansolabehere

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Continuous random variables

STATISTICS 110/201, FALL 2017 Homework #5 Solutions Assigned Mon, November 6, Due Wed, November 15

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

MAKING SENSE OF DATA Essentials series

Chapter 6. Transformation of Variables

Risk Analysis. å To change Benchmark tickers:

The Simple Regression Model

Algebra 1 Unit 3: Writing Equations

You may be given raw data concerning costs and revenues. In that case, you ll need to start by finding functions to represent cost and revenue.

Multiple Regression. Review of Regression with One Predictor

Rand Final Pop 2. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Applications of Exponential Functions Group Activity 7 Business Project Week #10

MVE051/MSG Lecture 7

Tests for the Difference Between Two Linear Regression Intercepts

Math of Finance Exponential & Power Functions

The Simple Regression Model

Math 140 Introductory Statistics. First midterm September

The Delta Method. j =.

Problem Set 5 Answers. ( ) 2. Yes, like temperature. See the plot of utility in the notes. Marginal utility should be positive.

Linear Modeling Business 5 Supply and Demand

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Intro to GLM Day 2: GLM and Maximum Likelihood

Chapter 6 Analyzing Accumulated Change: Integrals in Action

Some history. The random walk model. Lecture notes on forecasting Robert Nau Fuqua School of Business Duke University

Frequency Distribution Models 1- Probability Density Function (PDF)

Trends in currency s return

Statistics 101: Section L - Laboratory 6

Linear regression model

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

The misleading nature of correlations

Models of Patterns. Lecture 3, SMMD 2005 Bob Stine

Stat3011: Solution of Midterm Exam One

starting on 5/1/1953 up until 2/1/2017.

MATH 10 INTRODUCTORY STATISTICS

Section 4.3 Objectives

Name Date

Math 1526 Summer 2000 Session 1

Name Name. To enter the data manually, go to the StatCrunch website ( and log in (new users must register).

What Practitionors Nood to Know...

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

Non-linearities in Simple Regression

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

Maximum Likelihood Estimation

Appendix A (Pornprasertmanit & Little, in press) Mathematical Proof

MATH 10 INTRODUCTORY STATISTICS

BACKGROUND KNOWLEDGE for Teachers and Students

M249 Diagnostic Quiz

Systems of Ordinary Differential Equations. Lectures INF2320 p. 1/48

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

The line drawn for part (a) will depend on each student s subjective choice about the position of the line. For this reason, it has been omitted.

Chapter 13 Return, Risk, and Security Market Line

MLLunsford 1. Activity: Mathematical Expectation

Robust Critical Values for the Jarque-bera Test for Normality

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

Business Statistics: A First Course

Chapter 8 Statistical Intervals for a Single Sample

Homework Assignment Section 3

Review Exercise Set 13. Find the slope and the equation of the line in the following graph. If the slope is undefined, then indicate it as such.

Generalized Linear Models

Regression. Lecture Notes VII

CHAPTER 8 Estimating with Confidence

Prentice Hall Connected Mathematics, Grade 7 Unit 2004 Correlated to: Maine Learning Results for Mathematics (Grades 5-8)

ELEMENTS OF MONTE CARLO SIMULATION

Mark-recapture models for closed populations

Software Tutorial ormal Statistics

Chapter 14. Descriptive Methods in Regression and Correlation. Copyright 2016, 2012, 2008 Pearson Education, Inc. Chapter 14, Slide 1

Estimating Beta. The standard procedure for estimating betas is to regress stock returns (R j ) against market returns (R m ): R j = a + b R m

Lesson 21: Comparing Linear and Exponential Functions Again

Math 122 Calculus for Business Admin. and Social Sciences

Considerations for Planning and Scheduling Part 3 Blending the Planned Maintenance Program and Reactive Maintenance Plan

Washington University Fall Economics 487. Project Proposal due Monday 10/22 Final Project due Monday 12/3

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

Statistical Methods in Practice STAT/MATH 3379

Order Making Fiscal Year 2018 Annual Adjustments to Transaction Fee Rates

Exponential Functions with Base e

Case 2: Motomart INTRODUCTION OBJECTIVES

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron

3: Balance Equations

Function Transformation Exploration

Market Volatility and Risk Proxies

Advanced Topics in Derivative Pricing Models. Topic 4 - Variance products and volatility derivatives

3.3 rates and slope intercept form ink.notebook. October 23, page 103. page 104. page Rates and Slope Intercept Form

Game Theory and Economics Prof. Dr. Debarshi Das Department of Humanities and Social Sciences Indian Institute of Technology, Guwahati

Algebra I EOC 10-Day STAAR Review. Hedgehog Learning

Chapter 3: Answers to Questions and Problems

Transcription:

Introduction to Population Modeling In addition to estimating the size of a population, it is often beneficial to estimate how the population size changes over time. Ecologists often uses models to create a mathematical description of a population. A linear model can be written as N(t) = a + b* t. The linear model states that whenever the time interval t increases by one, the population size will increase by a fixed amount, b. Instead of using N to represent population size, the term N(t) will be used to represent the population at a particular time, t. While the linear model is very simple, most animal populations do not grow linearly. Exponential models are often used to model population growth. These models assume the population size will increase by a fixed percent each year and can be written as N(t) = N(0)*R t In this expression, N(0) is the starting population size at time 0 and R is called the growth rate of the population. If R is less than one, the population is decreasing and if R is greater than one, the population is increasing. Example: In the initial year of a study (time t = 0), the population of a certain species of fish starts at 1000 and is expected to increase by 10% of each year. Thus the number in the population at time t is equal to the number at time (t-1) multiplied by the growth rate of the population. We can calculate the population for following years as follows: Population at year 1 = N(1) Population at year 2 = N(2) = N(0)+N(0)*10% = N(0)(1+.1) = 1000*1.1 = 1100 = N(1) + N(1)*10% = N(1)(1.1) = N(0)(1.1)(1.1) = 1000*1.1 2 = 1210 Population at year 20 = N(20) = N(0)*R 20 = 1000(1.1) 20 = 6727.50 In the initial year example, N(0) = 1000 and R = 1.1.

1) Assume that an initial population started at 1000 species and the population declines by 5% each year. What are N(0), R, and N(20)? 2) Use software to create 5 columns of data. The first column should number t = 0,1,2 30. The next four columns should be four exponential models. All four models have an initial population size of 1000. The growth rate for the four models should be 0.8, 1, 1.1, and 1.4 respectively. Plot all four models on one graph. Rescale the graph so that the y-axis ranges from 0 to 5000. Describe any patterns that you see. Using Regression to Estimate a Population Growth Rate Most introductory statistics textbooks describe how to use least squares techniques to calculate the slope and y-intercept in simple linear regression models. These techniques do not work directly for modeling population over time because the terms in the model, N(t) = N(0)*R t, are not additive. However, a simple natural log transformation can be made to this model in order for it to look similar to a simple linear regression model. ln N(t) = ln{n(0)*r t } = ln{n(0)} + ln{ R t } = ln{n(0) + tln{r} = a + tb 3) Repeat the previous question, however instead of plotting N(t) verses t, plot ln{n(t)}versus t. Describe any patterns that you see. 4) For the first model with R = 0.8, use simple linear regression to calculate a and b. Give a clear explanation of how you can use b to estimate the growth rate in the exponential model. 5) Try a few other transformations on the first model with R = 0.8 (such as the square root or cubed root of N(t)) and conduct a simple linear regression. For each model, create a residuals verses t plot. Explain any patterns you see in these residual plots. 6) Go to the Great Fish Pond game located at www.cs.grinnell.edu/~kuipers/statsgames/capturerecapture and select the modeling option. Play the game and collect 20 years of data. Copy and paste the data into a software package. Then plot the data for the two ponds. Note that the

same population model is used for all data you collect from your sample. However, if you restart the game, the population model will change. 7) Transform the data, then use simple linear regression techniques to estimate a and b for both the fish food and natural pond models. Finally, use b to estimate R for both models. Submit your two exponential models. 8) Use the models in the previous problem to make your estimates in the Great Fish Pond game. Determining if Two Slopes are Significantly Different Assumptions: Before conducting a hypothesis test for regression models, it is necessary to check the model assumptions. The simple linear regression model assumes that: 1) the parameters are constant (the true slope and y-intercept do not change throughout the entire time period of the study) 2) the terms in the model are additive 3) the error terms sum to zero 4) the error terms are independent and identically distributed 5) the variability of the error terms in the regression model does not depend on x, and 6) the error terms follow a normal probability distribution. Details of these assumptions are discussed in most introductory text books and will not be given here. However, we will focus on the 5 th assumption. 9) Refer to the graph of your original Great Fish Pond data in question 6). Notice that the points appear to be much more spread out as t increases. Recall that the data from your game came from capture-recapture estimates. Assuming that the capture-recapture sample sizes are the same for all 20 years, explain why estimates tend to be more variable as the true population increases. 10) Conduct a simple linear regression on the original game data and create a residuals verses t plot. Explain the patterns you see in the residual plots. 11) Conduct a simple linear regression on the transformed game data, ln{n(t)}, and create a residuals verses t plot. Explain the patterns you see in the residual plots. Specifically, does the variability of the residuals still appear to depend on x?

The natural log function is one type of variance stabilizing transformation. In essence, if the variability of N(t) increases as N(t) increases, a variance stabilizing transformation is often helpful in assuring that assumption 5) is met. Hypothesis Tests and Confidence Intervals: Once the model assumptions are met, most introductory statistics textbooks show that b follows a normal distribution. Using the subscript f to represent the fish food model and n to represent the natural model it can be shown that: 12) Any linear combination of independent normal random variables is also normally distributed. Use the equations shown above to find the mean and variance of b f - b n. 13) Give the formula for the confidence interval for b f - b n assuming that f and n are known. 14) Give the formula for the confidence interval for b f - b n assuming that f and n are unknown. In other words, use s f and s n as estimates for f and n and use the t distribution degrees of freedom instead of the normal distribution. Following similar logic as the previous questions, it can be shown that the test statistic for the difference of two slopes is given by With degrees of freedom and

15) Use the confidence interval you created in the previous question to determine if the growth rates for the two ponds are significantly different. Discuss the impact of random sampling and random allocation when you state your conclusions.