Topic 5 Lecture 3 Estimating Policy Effects via the Simple Linear. Regression Model (SLRM) and the Ordinary Least Squares (OLS) Method

Similar documents
Sharif University of Technology. Graduate School of Management and Economics. Econometrics I. Fall Seyed Mahdi Barakchian

Lecture 2. Review of Linear Regression I Statistics Statistical Methods II. Presented January 9, 2018

Statistics and Quantitative Analysis U4320. Segment 8 Prof. Sharyn O Halloran

LECTURE 6: HETEROSKEDASTICITY

Chapter 5 ESTIMATION OF MAINTENANCE COST PER HOUR USING AGE REPLACEMENT COST MODEL

Relating your PIRA and PUMA test marks to the national standard

Relating your PIRA and PUMA test marks to the national standard

Getting Started with Correlated Component Regression (CCR) in XLSTAT-CCR

Technical Papers supporting SAP 2009

Using MATLAB/ Simulink in the designing of Undergraduate Electric Machinery Courses

Improving CERs building

ESSAYS ESSAY B ESSAY A and 2009 are given below:

arxiv:submit/ [math.gm] 27 Mar 2018

PARTIAL LEAST SQUARES: WHEN ORDINARY LEAST SQUARES REGRESSION JUST WON T WORK

Abstract. 1. Introduction. 1.1 object. Road safety data: collection and analysis for target setting and monitoring performances and progress

When the points on the graph of a relation lie along a straight line, the relation is linear

PREDICTION OF FUEL CONSUMPTION

Houghton Mifflin MATHEMATICS. Level 1 correlated to Chicago Academic Standards and Framework Grade 1

Multiple Imputation of Missing Blood Alcohol Concentration (BAC) Values in FARS

State of Health Estimation for Lithium Ion Batteries NSERC Report for the UBC/JTT Engage Project

Some Robust and Classical Nonparametric Procedures of Estimations in Linear Regression Model

The Mechanics of Tractor Implement Performance

Sample Reports. Overview. Appendix C

POLLUTION PREVENTION AND RESPONSE. Application of more than one engine operational profile ("multi-map") under the NOx Technical Code 2008

North Carolina End-of-Grade ELA/Reading Tests: Third and Fourth Edition Concordances

ECONOMICS-ECON (ECON)

Stat 301 Lecture 30. Model Selection. Explanatory Variables. A Good Model. Response: Highway MPG Explanatory: 13 explanatory variables

Chapter 2 & 3: Interdependence and the Gains from Trade

d / cm t 2 / s 2 Fig. 3.1

namibia UniVERSITY OF SCIEnCE AnD TECHnOLOGY FACULTY OF HEALTH AND APPLIED SCIENCES DEPARTMENT OF MATHEMATICS AND STATISTICS MARKS: 100

Testing for seasonal unit roots in heterogeneous panels using monthly data in the presence of cross sectional dependence

University Of California, Berkeley Department of Mechanical Engineering. ME 131 Vehicle Dynamics & Control (4 units)

The Degrees of Freedom of Partial Least Squares Regression

34.5 Electric Current: Ohm s Law OHM, OHM ON THE RANGE. Purpose. Required Equipment and Supplies. Discussion. Procedure

Aeronautical Engineering Design II Sizing Matrix and Carpet Plots. Prof. Dr. Serkan Özgen Dept. Aerospace Engineering Spring 2014

PVP Field Calibration and Accuracy of Torque Wrenches. Proceedings of ASME PVP ASME Pressure Vessel and Piping Conference PVP2011-

C67_2_27_Investigation 5. February 27, Rewrite in slope intercept form. Homework: Page 78, 2 Page 80, 13 & 14

The following output is from the Minitab general linear model analysis procedure.

Burn Characteristics of Visco Fuse

Preface... xi. A Word to the Practitioner... xi The Organization of the Book... xi Required Software... xii Accessing the Supplementary Content...

Introduction. Materials and Methods. How to Estimate Injection Percentage

18/10/2018. Mr Peter Adams General Manager, Wholesale Markets Australian Energy Regulator. By

Department of Economics

IMA Preprint Series # 2035

Effect of driving patterns on fuel-economy for diesel and hybrid electric city buses

Welcome to the SEI presentation on the basics of electricity

Integrating remote sensing and ground monitoring data to improve estimation of PM 2.5 concentrations for chronic health studies

Mechanical Drives

Mandatory Experiment: Electric conduction

Introduction. Kinematics and Dynamics of Machines. Involute profile. 7. Gears

Common position by FR and CEMA on mechanical couplings for towed vehicles 28/9/2015

INVESTIGATION ONE: WHAT DOES A VOLTMETER DO? How Are Values of Circuit Variables Measured?

Tactical Vehicle Cons & Reps Cost Estimating Relationship (CER) Tool

TRUTH AND LIES: CONSUMER PERCEPTION VS. DATA

Electric Motors and Drives

Professor Dr. Gholamreza Nakhaeizadeh. Professor Dr. Gholamreza Nakhaeizadeh

Academic Course Description

Optimizing Energy Consumption in Caltrain s Electric Distribution System Nick Tang

The Truth About Light Trucks

A Battery Smart Sensor and Its SOC Estimation Function for Assembled Lithium-Ion Batteries

Embedded system design for a multi variable input operations

Table 2: Ethnic Diversity and Local Public Goods (Kenya and Tanzania) Annual school spending/ pupil, USD

CRASH RISK RELATIONSHIPS FOR IMPROVED SAFETY MANAGEMENT OF ROADS

PERFORMANCE AND ACCEPTANCE OF ELECTRIC AND HYBRID VEHICLES

Important Formulas. Discrete Probability Distributions. Probability and Counting Rules. The Normal Distribution. Confidence Intervals and Sample Size

Regression Analysis of Count Data

Write or Identify a Linear Equation. Rate of Change. 2. What is the equation for a line that passes through the points (3,-4) and (-6, 20)?

Investigation of Relationship between Fuel Economy and Owner Satisfaction

Empirical comparative analysis of energy efficiency indicators for ships

Optimal Policy for Plug-In Hybrid Electric Vehicles Adoption IAEE 2014

Cost-Efficiency by Arash Method in DEA

Vehicle Scrappage and Gasoline Policy. Online Appendix. Alternative First Stage and Reduced Form Specifications

Application Notes. Calculating Mechanical Power Requirements. P rot = T x W

Embedded Torque Estimator for Diesel Engine Control Application

Regression Models Course Project, 2016

Multinational enterprise groups in the EU Dissemination from the EGR

Motor Trend MPG Analysis

Graphically Characterizing the Equilibrium of the Neoclassical Model

The Mark Ortiz Automotive

TABLE OF CONTENTS. Table of contents. Page ABSTRACT ACKNOWLEDGEMENTS TABLE OF TABLES TABLE OF FIGURES

EEEE 524/624: Fall 2017 Advances in Power Systems

tool<-read.csv(file="d:/chilo/regression 7/tool.csv", header=t) tool

Optimal Power Flow Formulation in Market of Retail Wheeling

Application of claw-back

Investigation in to the Application of PLS in MPC Schemes

correlated to the Virginia Standards of Learning, Grade 6

Lab 3 : Electric Potentials

Suburban bus route design

Linking the Indiana ISTEP+ Assessments to NWEA MAP Tests

London calling (probably)

MTH 127 OVERALL STUDENT LEARNING OUTCOMES (SLOs) RESULTS (including data from all tests & the final exam)

TABLE 4.1 POPULATION OF 100 VALUES 2

Motor Trend Yvette Winton September 1, 2016

Linking the Florida Standards Assessments (FSA) to NWEA MAP

Politics Philosophy Economics Undergraduate Degree Plan Curriculum Map New Plan Proposal: Appendix C

Effect of driving pattern parameters on fuel-economy for conventional and hybrid electric city buses

Problem Set 3 - Solutions

HASIL OUTPUT SPSS. Reliability Scale: ALL VARIABLES

Petrol consumption towards unsustainable development: Iranian case study

The State of Charge Estimation of Power Lithium Battery Based on RBF Neural Network Optimized by Particle Swarm Optimization

Transcription:

Econometrics for Health Policy, Health Economics, and Outcomes Research Topic 5 Lecture 3 Estimating Policy Effects via the Simple Linear Regression Model (SLRM) and the Ordinary Least Squares (OLS) Method Copyright Joseph V. Terza, Ph.D. 2007 All Rights Reserved 3. Estimating Policy Effects in the Context of the Simple Linear Regression Model via the Ordinary Least Squares Estimator Recall the specific form of the sampling model in the SLRM is y i = â 1 + xipâ 2 + e i (5-22) where the specific form of the regression error e i (as given in Definition 5.2) is defined as e i = y i -(â 1 + xipâ 2).

Under assumptions (a) through (e), w have fully characterized the aspect of the factual joint distribution of y (the dependent variable) and x p (the policy variable) that is relevant to the analysis the systematic part of the regression model E[y x p] = â 1 + xpâ 2. If we could observe the entire joint population of y and x p, we would know the desired conditional mean relationship. Unfortunately, taking a census typically imposes prohibitive costs. 2

We must rely on an appropriate estimation method, applied to a subset of the population (i.e., a sample), for estimating the values of the parameters â 1 and â 2. The ordinary least squares (OLS) estimator is one such estimation method which: is based on the analogy principal has its desirable statistical properties in particular, OLS is both unbiased and efficient 3

The population parameters â 1 and â 2 are the intercept and slope of the linear relationship between E[y x ] ( p ) and x, respectively p intercept : the point at which the line cuts the vertical axis of the graph slope: the rate of change in E[y x p] ( ) per one unit change in x p. Geometrically the OLS estimator finds the intercept and the slope of the line that best fits the two-dimensional plot of the xp-y data pairs. 4

(5.6) DEF: For any estimators of â 1and â 2, denoted as and, 1) the estimated regression line is defined as 2) the predicted value of y is and 3) the residual is defined as. 5

Figure 5-1 depicts all of the concepts defined in (5.6) Figure 5-1 Here Note that, and that is the vertical distance between the estimated regression line and the observed value of y. 6

In developing an estimation method we adhere to the analogy principle as we did in motivating the DOM estimator. There we noted that our object of interest, the policy effect, is the difference between two population averages. By analogy, we proposed the DOM which is the difference between two sample averages. 7

Similarly, in the present context we know that the fundamental component of the policy effect is the linear relationship between x p and y in the population. By analogy, we seek to plot an appropriate linear relationship in the sample. The problem, of course, is that there are an infinity of possible ways to fit a linear relationship to a sample of (x, y ) pairs generically represented in Figure 5-1 as the dashed estimated regression line. pi i 8

From among the infinity of possible methods we choose the method of least squares (ordinary least squares OLS) which chooses as the estimate values of â 1 and â 2, those values that in a very specific sense minimize the vertical distance between the estimated regression line and the observed data points. 9

For example, to each of the six data points in Figure 5-1 there corresponds a residual (i.e. the vertical distance between the point and the line). The least squares method positions the line so that the sum of the squared vertical distances (residuals) is the smallest. Another way to say this is that the least squares method chooses and so as to minimize the sum of squared residuals. 10

The formal definition of the Ordinary Least Squares (OLS) estimators of â 1 and â 2 is given in the following. (5.7) DEF: The Ordinary Least Squares (OLS) estimators of â 1and â 2are defined to be the minimizing values of for (5-23) where y and x denote the observed values of the outcome and policy i iip variable for the ith sample member, and i = 1,..., n. 11

It is easy to show that the solution to this optimization problem is (5-24) (5-25) and. It should also be noted that if x p is binary, the DOM estimator of the policy effect is identical to the OLS estimator of â 2. 12

(5.8) DEF: For the OLS estimators of â 1 and â 2, denoted as and, 1) the OLS estimated regression line is defined as 2) the OLS predicted value of y is and 3) the OLS residual is defined as. 13

For example, suppose we want to estimate the effect of a person s age on her yearly health care expenditure. We draw the following sample (y) EXPEND [$] (x p) AGE [yrs] 1200 50 500 35 650 25 300 30 450 18 2000 60 1500 65 350 21 200 20 1300 45 1100 40 250 19 1900 64 500 23 750 54 1000 55 650 47 100 24 800 26 850 27 14

Now consider the plot of these data pictured in Figure 5-2. Figure 5-2 Here 15

Each point denotes a sample observation for a particular individual s age and yearly health care expenditure. The solid line represents the OLS estimated line, i.e. (5-26) where are the estimated values of the intercept (â 1) and slope (â 2), respectively. The OLS estimated line is a best fit in the sense that it minimizes the squared vertical distances of the plotted points from the line. 16

Formulation of the SLRM, sampling, and OLS estimation combine to constitute two of the components of the scientific method, viz. 1) hypothesis formulation, and 2) observation. Choice of the SLRM as the basis for analysis lies at the heart of the former component, while sampling and OLS estimation are the keys to the latter. 17

4. Desirable Properties and Sampling Distribution of the OLS Estimator in the Context of the SLRM 4.1 Desirable Properties Unbiasedness (5.10) THEOREM: The OLS estimator in the simple linear regression model is unbiased, i.e. and E[â 1] = â1 E[â 2] = â 2. 18

(5.9) THEOREM: OLS estimators in the simple linear regression model can be written as linear combinations of the regression errors e,..., e n. Specifically, (5-27) and. (5-27) 19

We now note an efficiency result for the OLS estimator relative to a certain class of estimators. (5.11) THEOREM: (The Gauss-Markov Theorem) OLS estimators of â 1 and â 2 in the SLRM are Best Linear Unbiased Estimators (BLUE). The relevant class of estimators are those that can be formed as a linear combination of the conditional errors as in Theorem 5.9, and are unbiased as in Theorem 5.10. 20