Some Robust and Classical Nonparametric Procedures of Estimations in Linear Regression Model

Similar documents
Improving CERs building

Statistics and Quantitative Analysis U4320. Segment 8 Prof. Sharyn O Halloran

Effect of Sample Size and Method of Sampling Pig Weights on the Accuracy of Estimating the Mean Weight of the Population 1

Lecture 2. Review of Linear Regression I Statistics Statistical Methods II. Presented January 9, 2018

Technical Papers supporting SAP 2009

Topic 5 Lecture 3 Estimating Policy Effects via the Simple Linear. Regression Model (SLRM) and the Ordinary Least Squares (OLS) Method

Robust alternatives to best linear unbiased prediction of complex traits

LECTURE 6: HETEROSKEDASTICITY

9.3 Tests About a Population Mean (Day 1)

Example #1: One-Way Independent Groups Design. An example based on a study by Forster, Liberman and Friedman (2004) from the

LET S ARGUE: STUDENT WORK PAMELA RAWSON. Baxter Academy for Technology & Science Portland, rawsonmath.

Linking the Virginia SOL Assessments to NWEA MAP Growth Tests *

Investigation of Relationship between Fuel Economy and Owner Satisfaction

Sharif University of Technology. Graduate School of Management and Economics. Econometrics I. Fall Seyed Mahdi Barakchian

ESSAYS ESSAY B ESSAY A and 2009 are given below:

Using Statistics To Make Inferences 6. Wilcoxon Matched Pairs Signed Ranks Test. Wilcoxon Rank Sum Test/ Mann-Whitney Test

Effect of driving patterns on fuel-economy for diesel and hybrid electric city buses

Investigation in to the Application of PLS in MPC Schemes

Linking the New York State NYSTP Assessments to NWEA MAP Growth Tests *

Linking the Georgia Milestones Assessments to NWEA MAP Growth Tests *

Linking the North Carolina EOG Assessments to NWEA MAP Growth Tests *

Linking the Kansas KAP Assessments to NWEA MAP Growth Tests *

Identify Formula for Throughput with Multi-Variate Regression

DRIVER SPEED COMPLIANCE WITHIN SCHOOL ZONES AND EFFECTS OF 40 PAINTED SPEED LIMIT ON DRIVER SPEED BEHAVIOURS Tony Radalj Main Roads Western Australia

Linking the Alaska AMP Assessments to NWEA MAP Tests

Effect of driving pattern parameters on fuel-economy for conventional and hybrid electric city buses

TRINITY COLLEGE DUBLIN THE UNIVERSITY OF DUBLIN. Faculty of Engineering, Mathematics and Science. School of Computer Science and Statistics

5. CONSTRUCTION OF THE WEIGHT-FOR-LENGTH AND WEIGHT-FOR- HEIGHT STANDARDS

Performance of the Mean- and Variance-Adjusted ML χ 2 Test Statistic with and without Satterthwaite df Correction

A REPORT ON THE STATISTICAL CHARACTERISTICS of the Highlands Ability Battery CD

Stat 301 Lecture 30. Model Selection. Explanatory Variables. A Good Model. Response: Highway MPG Explanatory: 13 explanatory variables

Cost-Efficiency by Arash Method in DEA

Linking the Mississippi Assessment Program to NWEA MAP Tests

Linking the Florida Standards Assessments (FSA) to NWEA MAP

Linking the Indiana ISTEP+ Assessments to NWEA MAP Tests

Professor Dr. Gholamreza Nakhaeizadeh. Professor Dr. Gholamreza Nakhaeizadeh

Appendix B STATISTICAL TABLES OVERVIEW

TABLE 4.1 POPULATION OF 100 VALUES 2

Antonio Olmos Priyalatha Govindasamy Research Methods & Statistics University of Denver

Data envelopment analysis with missing values: an approach using neural network

THERMOELECTRIC SAMPLE CONDITIONER SYSTEM (TESC)

Testing for seasonal unit roots in heterogeneous panels using monthly data in the presence of cross sectional dependence

Oregon DOT Slow-Speed Weigh-in-Motion (SWIM) Project: Analysis of Initial Weight Data

PVP Field Calibration and Accuracy of Torque Wrenches. Proceedings of ASME PVP ASME Pressure Vessel and Piping Conference PVP2011-

Some Experimental Designs Using Helicopters, Designed by You. Next Friday, 7 April, you will conduct two of your four experiments.

Linking the Indiana ISTEP+ Assessments to the NWEA MAP Growth Tests. February 2017 Updated November 2017

Voting Draft Standard

Modeling Ignition Delay in a Diesel Engine

INFLUENCE OF CROSS FORCES AND BENDING MOMENTS ON REFERENCE TORQUE SENSORS FOR TORQUE WRENCH CALIBRATION

Extracting Tire Model Parameters From Test Data

Regularized Linear Models in Stacked Generalization

Investigating the Concordance Relationship Between the HSA Cut Scores and the PARCC Cut Scores Using the 2016 PARCC Test Data

Regression Models Course Project, 2016

20th. SOLUTIONS for FLUID MOVEMENT, MEASUREMENT & CONTAINMENT. Do You Need a Booster Pump? Is Repeatability or Accuracy More Important?

Optimization of Seat Displacement and Settling Time of Quarter Car Model Vehicle Dynamic System Subjected to Speed Bump

ACCIDENT MODIFICATION FACTORS FOR MEDIAN WIDTH

Test-Retest Analyses of ACT Engage Assessments for Grades 6 9, Grades 10 12, and College

Vehicle Scrappage and Gasoline Policy. Online Appendix. Alternative First Stage and Reduced Form Specifications

CITY OF EDMONTON COMMERCIAL VEHICLE MODEL UPDATE USING A ROADSIDE TRUCK SURVEY

Prediction of Bias-Ply Tire Deflection Based on Contact Area Index, Inflation Pressure and Vertical Load Using Linear Regression Model

The Mark Ortiz Automotive

Analyzing Crash Risk Using Automatic Traffic Recorder Speed Data

Benchmarking Inefficient Decision Making Units in DEA

Published: 14 October 2014

Efficiency Measurement on Banking Sector in Bangladesh

Driver Speed Compliance in Western Australia. Tony Radalj and Brian Kidd Main Roads Western Australia

Chapter 5 ESTIMATION OF MAINTENANCE COST PER HOUR USING AGE REPLACEMENT COST MODEL

Stat 401 B Lecture 31

PREDICTION OF FUEL CONSUMPTION

2018 Linking Study: Predicting Performance on the NSCAS Summative ELA and Mathematics Assessments based on MAP Growth Scores

Civil Engineering and Environmental, Gadjah Mada University TRIP ASSIGNMENT. Introduction to Transportation Planning

TESTING FOR INVERTIBILITY IN UNIVARIATE ARIMA PROCESSES. Rafael Flores de Frutos (*) Miguel Jerez Méndez (*) May 2002

HOW MUCH DRIVING DATA DO WE NEED TO ASSESS DRIVER BEHAVIOR?

PUBLICATIONS Silvia Ferrari February 24, 2017

Post 50 km/h Implementation Driver Speed Compliance Western Australian Experience in Perth Metropolitan Area

Verification of Redfin s Claims about Superior Notification Speed Performance for Listed Properties

North Carolina End-of-Grade ELA/Reading Tests: Third and Fourth Edition Concordances

The Incubation Period of Cholera: A Systematic Review Supplement. A. S. Azman, K. E. Rudolph, D.A.T. Cummings, J. Lessler

Predicting Solutions to the Optimal Power Flow Problem

Analysis and evaluation of a tyre model through test data obtained using the IMMa tyre test bench

BAC and Fatal Crash Risk

VT2+: Further improving the fuel economy of the VT2 transmission

2018 Linking Study: Predicting Performance on the Performance Evaluation for Alaska s Schools (PEAKS) based on MAP Growth Scores

June Safety Measurement System Changes

2018 Linking Study: Predicting Performance on the TNReady Assessments based on MAP Growth Scores

Motor Trend Yvette Winton September 1, 2016

Linking the PARCC Assessments to NWEA MAP Growth Tests

Problem Set 3 - Solutions

Burn Characteristics of Visco Fuse

Product Loss During Retail Motor Fuel Dispenser Inspection

Quality Improvement during Camshaft Keyway Tightening Using Shainin Approach

Using MATLAB/ Simulink in the designing of Undergraduate Electric Machinery Courses

Application of DSS to Evaluate Performance of Work Equipment of Wheel Loader with Parallel Linkage

Traffic Signal Volume Warrants A Delay Perspective

Preliminary Study on Quantitative Analysis of Steering System Using Hardware-in-the-Loop (HIL) Simulator

Statistical Estimation Model for Product Quality of Petroleum

Chapter 9 Real World Driving

Accelerating the Development of Expandable Liner Hanger Systems using Abaqus

Statistics for Social Research

Preface... xi. A Word to the Practitioner... xi The Organization of the Book... xi Required Software... xii Accessing the Supplementary Content...

Transcription:

Some Robust and Classical Nonparametric Procedures of Estimations in Linear Regression Model F.B. Adebola, Ph.D.; E.I. Olamide, M.Sc. * ; and O.O. Alabi, Ph.D. Department of Statistics, Federal University of Technology, Akure, Nigeria. E-mail: fbadebola@futa.edu.ng eiolamide@futa.edu.ng * ooalabi@futa.edu.ng ABSTRACT We examined some robust and nonparametric procedures for a simple linear regression model when the error terms are drawn from unit normal, lognormal, Student t-0df, Cauchy, and exponential power via Monte Carlo simulation technique. The results showed that the nonparametric Theil s method demonstrates the strongest performance gains in many cases which imply they have negligible bias and the smallest Mean Square Error (MSE). The second best results are obtained from Least Trimmed Squares (LTS) Methods with relative reductions in MSE. Though Least Absolute Deviation (LAD) and weighted Theil s regressions gave the poorest slope estimates with negative root mean square error (RMSE) values in most cases, it is noticed that LAD is always more efficient than LSE whenever the error component is Cauchy distributed. The LSE proved to be more efficient under normality assumption where as LTS and Theil s estimators performed better under nonnormal data conditions. (Keywords: least absolute deviation, least trimmed squares Monte Carlo simulation, ordinary least squares, regression analysis Theil s non-parametric, weighted Theil s non-parametric procedures) INTRODUCTION Regression analysis is a statistical tool for the investigation of relationships between variables. Usually, the investigator seeks to ascertain the causal effect of one variable upon another the effect of a price increase upon demand, for example, or the effect of changes in the money supply upon the inflation rate. To explore such issues, the investigator assembles data on the underlying variables of interest and employs regression to estimate the quantitative effect of the causal variables upon the variable that they influence. The investigator also typically assesses the statistical significance of the estimated relationships, that is, the degree of confidence that the true relationship is close to the estimated relationship. The simple linear regression model, which is the simplest form of a linear regression model, containing only one independent variable, is the focus of this research work. The independent variables, x i s are assumed to be non-stochastic design values. Plus the standard theory assumption that the error terms come from normal distribution, we also deal with cases where the error terms have a lognormal distribution, Cauchy distribution, Exponential Error distribution and t0 (Student-t with 0df) distribution. It is recognized that in the presence of normally distributed errors and homoscedasticity, OLS estimation is the method of choice. For situations in which the underlying assumptions of OLS estimation are not tenable, however, the choice of method for parameter estimation is not clearly defined. This research work reviews and briefly describes two classical nonparametric approaches and two robust regression approaches to Simple Linear Regression. The slope and y-intercept coefficients, and for the model () I = α + βe + () are estimated by using the ordinary least squares (OLS), Least Absolute Deviation (LAD), Least Trimmed Squares and Theil s (and weighted The Pacific Journal of Science and Technology Volume 9. Number. May 208 (Spring)

Theil) non parametric estimation methods. Variance, bias, mean square error, and relative mean square error of the estimates and are used to evaluate estimators performances with respect to each method under various situations. Under normality, the solutions of the equations given below are the maximum likelihood (ML) estimators which are also equal to the OLS estimators: MATERIALS AND METHODS Ordinary Least Squares (OLS) Estimation Procedure One of the most popular methods to model the functional relationship between variables is the OLS estimation procedure which is very simple and straightforward to apply. The basic idea of ordinary least squares is to optimize the fit by minimizing the total sum of the squares of the errors (deviations) between the observed values y i and the estimated values + x. In other words, the OLS method estimates the parameters and that minimize the sum of squares of the residuals S(, ), which is given in Equation (). Taking the partial derivative of () with respect to and, and equating to zero we get the OLS normal equations, the values of and that satisfy the equations are given by:, For testing hypothesis and constructing confidence intervals, the random errors, i s are assumed to be normally and independently distributed which leads to normally distributed response variables, y i s (Rawlings et. al., 998). The properties of OLS estimators are stronger under the normality assumption. If the error terms are normal and identically and independently distributed (i.i.d) with zero mean and constant variance S 2, then the OLS estimators and attain uniformly minimum variance in the range of all unbiased estimators. Under these assumptions the likelihood function of i s is: = 0 These assumptions and several types of model deficiencies can be detected with the help of the residual analysis. Suppose that the distribution of the errors is not normal. If the errors are coming from a population that has a mean of 0, then the OLS estimates may not be optimal, but they at least have the property of being unbiased. If we further assume that the variance of the error population is finite, then the OLS estimates have the property of being consistent and asymptotically normal. However, under these conditions, the OLS estimates and tests may lose much of their efficiency and they can result in poor performance. To deal with these situations, two approaches can be applied. One is to try to correct non-normality, if non-normality is determined and the other is to use alternative regression methods, which do not depend on the assumption of the normality (Birkes and Dodge, 99). Alternative Methods to OLS In real life, it is difficult to find a data set that satisfies all the assumptions necessary to apply OLS method. It has a 0% breakdown value, which means that a small percentage of contamination can cause the estimators to take values from - to + (Rousseeuw and Leroy, 987). Hence, if the observations are not normally distributed or they contain outliers, the OLS method is no longer convenient. That is why robust regression procedures are needed to remove the adverse effect of these situations. An estimator is said to be robust if it is fully efficient (nearly so) under an assumed model but maintains high efficiency for possible alternatives. Any robust method must be reasonably efficient when compared to the least squares estimators; The Pacific Journal of Science and Technology 2 Volume 9. Number. May 208 (Spring)

if the underlying distribution of errors are independent normal, and substantially more efficient than least squares estimators, when there are outlying observations. In this study, we investigated four other alternatives to OLS; Least Absolute Deviation, Least Trimmed Squares, Theil s Pairwise-Median and weighted Theil s non parametric procedure. effect on the estimators (Nevitt and Tam, 998). Thus, the only difference between OLS and LTS estimation is that in LTS, the largest squared residuals are not used (n-h observations will not affect the estimator). It has been demonstrated that the best robustness properties are achieved when h is approximately n/2, in which case the breakdown point attains 0% (Rousseeuw and Leroy, 987). Least Absolute Deviation The Least Absolute Deviations regression (LAD regression) is one of the principal alternatives to the Ordinary Least Squares method when one seeks to estimate regression parameters. The goal of the LAD regression is to provide a robust estimator which minimized the sum of the absolute residuals min. For a fixed β, the α, which minimizes the f function below is the sample median of {y i - β x i }. f(α,β) = + β x i ), The Y-outliers have less impact on the LAD results, because it does not square the residuals, and then the outliers are not given as much weight as in OLS procedure. However, LAD regression estimator is just as vulnerable as least squares estimates to high leverage outliers (Xoutliers). Least Trimmed Squares (LTS) Estimator LTS is an estimation procedure which achieves the purpose of being insensitive to changes in small percentage of data points. It aims at minimizing - x i ) 2. To minimize the Function, we should choose a subsample of h observations and compute some α and β that minimize the sum of squared residuals for the selected subsample. By applying this procedure to all subsamples, we have estimates for both α and β and the estimate which makes the objective function smallest is the final estimate (Cizek and Visek, 2000). Unfortunately, it is very difficult to obtain all subsamples unless a very small sample is analyzed. In LTS procedure, data points corresponding to a specified percentage of the largest residuals under an initial OLS estimation are deleted. The outlying cases are deleted to reduce their adverse Theil s and Weighted Theil s Non-parametric Procedures The robust estimate of slope for nonparametric fitted line was first described by Theil (90). Theil s regression is a nonparametric method which is used as an alternative to robust methods for data sets with outliers. Although the nonparametric procedures perform reasonably well for almost any possible distribution of errors and they lead to robust regression lines, they require a lot of computation. It is proved to be useful when outliers are suspected, but when there are more than few variables, the application becomes difficult (Mutan, 2004). Theil (90) proposed two methods, namely, the complete (Theil) and the weighted Theil method. The complete Theil slope estimate is computed by comparing each data pair to all others in a pairwise fashion. A data set of n(x,y) pairs will result in N = = pairwise comparisons. For each of these comparisons a slope computed. The median of all possible pairwise slopes is taken as the nonparametric Thiel's slope estimate, β THEIL Where: = = xj xi; i j n All the x i are assumed to be distinct, and we will lose no generality that they are arranged in ascending order (Hussain and Sprent, 98). Conover's estimator assures that the fitted line goes through the point (X median,y median ). This is analogous to OLS, where the fitted line always goes through the point (x i,y i ) (Nadia and Amaa, 20). To reduce the effect of outlying observations, some modifications are applied to Theil s method and each of the pairwise slopes, b ij s, are weighted by some weighting procedures. The is The Pacific Journal of Science and Technology Volume 9. Number. May 208 (Spring)

weighted Theil slope estimator for the n observations in the sample data is the weighted median of these b ij s. Recall that = where = and represents pairs of integers i and j with i j n n. According to Birkes and Dodge (99) a weighted median can be calculated as follows: x i s are ordered in an increasing sequence, so that x < x 2 <... < x n. The index k is obtained such that: w + w 2 + + w k- < 0. or W + w 2 + + w k- + w k > 0. where the weights, w i s, are nonnegative and add up to. x k is the weighted median. Thus the weighted Theil slope estimator of β is the pairwise slopes b ij = (y i - y j ) / (x i - x j ), with weights w ij = x i - x j / x i - x j 2, and is the ordinary median of (y i - * x i ) Monte Carlo Simulation Suppose that a statistic T based on a sample x, x 2,, x n has been formulated for testing a certain hypothesis; the test procedure is to reject H 0 if T(x, x 2,, x n ) c. Since the exact distribution of T is unknown, the value of c has been determined such that the asymptotic type I error rate is α = 0.0, (say), i.e., Pr (T c H 0 is true) =.0 as n. We can study the actual small sample behavior of T by the following procedure: a. Generate x, x 2,, x n from the appropriate distribution, say, the Normal distribution and compute T j = T(x, x 2,,x n ). b. Repeat N times, yielding a sample T, T 2,, T N, and c. compute proportion of times T j c as an estimate of the error rate i.e., d. Note that α, the type I error rate. Results of such a study may provide evidence to support the asymptotic theory even for small samples. Similarly, we may study the power of the test under different alternatives using Monte Carlo sampling. The study of Monte Carlo methods involves learning about: (i) Methods available to perform step (a) above correctly and efficiently, and (ii) How to perform step (c) above efficiently, by incorporating variance reduction" techniques to obtain more accurate estimates. RESULTS AND DISCUSSIONS The study addresses the problem via computer (Monte-Carlo) simulation methods. All programming for the simulation study is developed using FORTRAN. The design variable X is generated using a sequential model of the form X t = t; t =, 2,,..., n, while the response variable Y is generated using the relationship Y t = α + βx t + u t (for α = 2 and β = ). The study design includes using N= (0000/n) replications of randomly generated samples each of three sample sizes (n = 0, 0, 00) crossed with five types of error distributions (unit normal, lognormal, t-0df, Cauchy and exponential power) from which the random component u t is drawn. Alternative forms of the error distribution (mixture, contaminated and outliers) model are also considered to test the sensitivity of each of the simple linear regression estimation methods under study to various degrees and forms of outliers in the response variable Y direction. Algorithm for drawing random deviates from each of the error distributions are described in Evans, Hastings, and Peacock (99). For the alternative forms of the error distribution, taking the normal distribution as example: Standard model is a standard variate N(0,). The Outliers Model is a mixture of N(0,) and N(0,9) with probability p. The Mixture Model is a mixture of N(0,) and N(,9) with probability p while the Contamination Model is a mixture of N(0,) and G(p) with probability p (where G(p) is a geometric random variate with pmf and probability 0.). In each of the cases defined, p= 0.80. For detailed information on this approach see Evans, Hastings, and Peacock (99). For each simulated data set, the estimators of α and β are calculated. The estimation techniques that were considered are OLS, LAD, 20% LTS, Theil s and weighted Theil s regression. Using The Pacific Journal of Science and Technology 4 Volume 9. Number. May 208 (Spring)

these procedures, the y-intercept α and slope β estimators are computed and for each estimator mean, variance, bias, mean square error (MSE) and relative mean square error (RMSE) are calculated. Effects of Sample Size Across sample sizes, estimator variances (and, to some lesser degree estimator bias) decreased with increasing sample size. For example, the variances for the LSE slope estimator under the standard unit normal distribution are 0.022, 0.0004, 0.000 and 0.0000 for sample sizes n = 0, 0, 0 and 00 respectively. Also, the variances for the Theil s slope estimator under the mixture model for the Cauchy distribution are 0.4, 0.002, 0.0008 and 0.0000 for sample sizes n = 0, 0, 0, and 00, respectively. This pattern of decreasing variance and bias holds for all estimators under all error distributions. The patterns seen in the variances are also exhibited in the estimator MSE values. Because the results for the n = 0 sample size are intermediate to those for the n = 0 and n = 0 sample sizes, they are not reported here. It should also be noted that the exponential power distribution is very close to the unit normal distribution when its location parameter a=0 and scale and shape parameter equals one (b=c=). Thus, the distribution was used only as a control for the normality assumption in LSE. Slope Estimators Performances Based On Bias Criterion: Table gives summary results for β across all cells of the simulation study. A close look at this table reveals that LSE, LTS and Theil s slopes estimators are approximately unbiased i.e. they have negligible bias. This pattern of performance is observed to improve consistently as sample size increases and across all error distributions and respective alternative models. However, a deeper look at Table reveals that LSE is always biased whenever the error term is Cauchy distributed regardless of sample size. LAD and Weighted Theil s estimators are biased estimators of the population slope parameter β as they consistently over estimated β across all cells of the simulation study. However, LAD is observed to converge only when the sample size is relatively very large (n=00) or very small (n=0) and the error term is non-normal. With the introduction of contamination into the dataset, it was observed that LSE, LTS and Theil s estimator still remained unbiased while LAD and Weighted Theil s estimators remained biased except under the Mixture Model where all the estimators are relatively biased across all error distributions for very small sample size (n=0). Under this condition, Theil s estimator stayed on top most of the time with the least bias value, followed closely by LTS and then LSE with LAD and Weighted Theil s estimators competing rigorously with each other at the tail end. Based On Variance and RMSE Criteria: Table 2 and give summary results for β based on the variance and Relative Mean Square Error criteria across all cells of the simulation study. A close look at this table reveals that LSE had the least variance and hence the least MSE under the normal distribution. As sample size increases, LSE gained precision i.e. the variance and MSE decreases with increasing sample size. For instance, at n=0, there was 99.8% decrease in both variance and MSE value of LSE and at n=00 there was 99.9% decrease in both variance and MSE value of LSE. Under the Student-t distribution, LSE gained precision with a 2.9% reduction in both variance and MSE when n=0 and 99.99% decrease in both variance and MSE when n=00 relative to normal distribution. But as the error term begins to deviate from normality, LSE was observed to consistently loose precision with almost 400% increase in MSE across all sample sizes under the lognormal distribution and for the Cauchy distribution, LSE gave an outrageously large value for MSE compare to other estimators. This pattern of behavior confirms the fact that deviations from normality cause OLS estimators to be poor estimators. The Theil s nonparametric slope estimator followed LSE closely under the normal distribution with respect to MSE values (0.080) but as sample size increases it gains precision and its MSE values (0.00004 for n=0 and 0.0000240 for n=00) were approximately equal with that of LSE when the sample is very large (n 0). Under the student-t distribution, it gained more precision with a 7.72% and 7.6% decrease in MSE relative to LSE when n=0 and n=00 respectively, thereby outperforming LSE. The same pattern of behavior was observed The Pacific Journal of Science and Technology Volume 9. Number. May 208 (Spring)

under the lognormal and Cauchy distribution. Hence, Theil s gave a positive RMSE values whenever the error term come from non-normal distributions (Table ). This confirms that Theil estimator has high small-sample efficiency compared to the OLS estimator when the error term is heteroscedastic (Wilcox, 998). The Least Trimmed Squares estimation method came next in view to Theil s with a MSE value 0.079 under the normal distribution for sample size n=0 and as has been usually observed, it experiences a consistent decrease in its MSE as sample size gets larger (0.000, n=0; 0.0000666, n=00). For the non-normal distribution cases, LTS compete so rigorously with Theil s with a 4.87% (n=0) and 4.82% (n=00) decrease in MSE relative to LSE under the student-t distribution, thus, outperforming LSE and hence giving a positive RMSE value. It was observed that the farther away the error distribution deviated from normality, the better LTS becomes as it gains precision (.% when n=0, 42.97% when n=00 under Lognormal distribution and 00% when n=0, 99.96% when n==00 under Cauchy distribution) following closely after Theil s while displacing LSE outrightly. A close look at the tables above showed that LAD is especially consistent with the other estimators (in terms of its variance) only when the sample size is either very small (n=0) or very large (n=00). However, its MSE does not really show a consistent pattern with regards to sample size due to the fact that LAD consistently underestimated the slope parameter β, thus giving a negative bias across all sample sizes and distribution. It is also worthy of note that LAD gave a bigger variance and MSE value than any other estimator (except weighted Theil) across all sample sizes and error distribution with the exception of Cauchy distribution under which LAD gave a MSE value that is significantly smaller than that of LSE (.80 vs 7978, 0.946 vs 88.87,.04 vs.90 for n=0, 0, and 00, respectively) and therefore having a favorable RMSE across all sample sizes only when the error term is Cauchy distributed. The weighted Theil s estimator, like LAD, is not a good competitor compared to other slope estimator since it gave an inconsistent estimate of variance and MSE across all sample sizes and error distributions. Unlike LAD though, it consistently loses precision as the error term deviates from normality and its MSE was always highest whenever the error term is Cauchy distributed. Effects of Contamination For the first case of contamination considered in this project (outliers case), It was observed that Theil s nonparametric slope estimator outperformed every other slope estimators regardless of sample size or distribution. LSE slope estimator was better than LTS when the sample size is small (n<0) and the error distribution is normal. LAD maintained its statusquo beating LSE only when the error distribution is Cauchy. Like LAD, weighted Theil s slope estimator remained inconsistent and also maintained its response pattern to sample size and error distribution. Unlike LAD and weighted Theil, LSE, LTS and Theil s slope estimators generally improved in their precision compared to the standard model case. More so, the intercept estimators performance pattern remains the same for all estimators except for LTS, which like Theil s, proved to be more efficient than LSE, LAD and weighted Theil regardless of sample size or distribution (Table ). It is clear from Table (which present results for RMSE) that as long as the error distribution is normal, LSE slope estimator win the game. But as the sample size gets larger (n 00), Theil s nonparametric slope estimator defeats LSE by a slim chance of 4.7% reduction in its MSE relative to LSE. However, when the error is nonnormal, Theil s always takes the crown followed closely by LTS and then LAD (as usual, whenever the error is Cauchy distributed). The general pattern of estimators behavior, for the contamination case is similar to that of outliers error model case, only that for small samples (n 0) and whenever the error term is normally distributed, LSE slope estimator is the most efficient of all. But whenever n 0, Theil s nonparametric slope estimator takes the stage irrespective of distribution type and under nonnormal error distribution situation, Theil s, LTS and LAD kept to their status-quo. The Pacific Journal of Science and Technology 6 Volume 9. Number. May 208 (Spring)

Table : Summary of Population Intercept (α) Estimators Performance (Based on Bias). ESTIMATION METHODS (0df) LOG (0df) LOG (0df) LOG ERROR MODEL TYPE SAMPLE SIZE (N=0) STANDARD MODEL OUTLIER MODEL MIXTURE MODEL CONTAMINATION MODEL THEIL (0.724), 2T LE (0.80) LTS (-0.00), 2 LSE (-0.0020) LSE (0.897), 4 LAD (7.70) THEIL (-0.0024), 4 LAD (7.00) wtheil (4.000) wtheil (.9600) LTS(0.0), 2 LSE (0.026) THEIL (-0.00), 2 LSE (-0.0049) THEIL (0.029), 4 LAD (6.690) LTS (-0.0049), 4 wtheil(6.2200) wtheil (6.70) LAD (6.620) LTS (-0.009), 2 LSE (-0.0026) THEIL (-0.0029), 4 LAD (7.280) wtheil (-.9800) THEIL (-0.004), 2 LTS (-0.00) LSE (-0.006), 4 wtheil (6.470) LAD (6.470) THEIL (.0), 2 LTS (.2040) LSE (.660), 4 LAD (8.8090) wtheil (2.000) THEIL (-0.0267), 2 LTS (-0.028) LSE (6.620), 4 LAD (7.20) wtheil (4.4000) THEIL, 2 LTS (-0.0002) LSE (-0.00), 4 LAD (.700) wtheil (8.000) LSE (-0.00), 2 THEIL (-0.004) LTS (-0.002), 4 LAD (28.700) wtheil (.0) THEIL (.0220), 2 LTS (.400) LSE (.60), 4 LAD (29.8600) wtheil (27.0000) THEIL (0.00), 2 LTS (0.08) LSE (-.0690), 4 LAD (0.4600) wtheil (870.4000) THEIL (-0.0002), 2 LTS (-0.0009) LSE (-0.0046), 4 LAD (8.6000) wtheil (994.2000) THEIL (0.0008), 2 LTS (0.007) LSE (0.0026), 4 LAD (60.2000) wtheil (6.6200) THEIL (.020), 2 LTS (.0) LSE (.670), 4 LAD (60.4200) wtheil (62.6000) THEIL (-0.02), 2 LTS (-0.04) LSE (-2.6980), 4 LAD (90.600) wtheil (20.0000) THEIL (-0.000), 2 LSE (-0.000) LTS (0.002), 4 LAD (7.900) wtheil (.2400) THEIL (-0.0022), 2 LTS (-0.0029) LSE (-0.000), 4 wtheil (4.940) LAD (7.020) THEIL (0.9788), 2 LTS (0.9488) LSE (.620), 4 LAD (8.9670) wtheil (22.4700) THEIL (-0.042), 2 LTS (0.224) LSE (-0.272), 4 LAD (8.040) wtheil (748.6000) SAMPLE SIZE (N=0) THEIL (-0.004), 2 LTS (-0.0049) LSE (-0.0078), 4 LAD (4.9800) wtheil (77.9000) THEIL (-0.0002), 2 LSE (-0.0007) LTS (-0.00), 4 wtheil(6.4000) LAD (2.400) LTS (0.929), 2 THEIL (0.996) LSE (.840), 4 LAD (2.9000) wtheil (26.000) THEIL (0.009), 2 LTS (0.6) LSE (-.9700), 4 LAD (4.00) wtheil (220.0000) SAMPLE SIZE (N=00) THEIL (-0.0008), 2 LSE (-0.0028) LTS (0.00), 4 LAD (Nill) wtheil (-7.000) LTS, 2 THEIL LSE (0.0002), 4 wtheil (26.800) LAD (4.0800) LTS (0.968), 2 THEIL (0.9986) LSE (.200), 4 LAD (46.6400) wtheil (77.8000) LTS (0.008), 2 THEIL (-0.089) LSE (-.20), 4 LAD (0.000) wtheil (790.0000) THEIL (.460), 2 LTS (.6040) LSE (2.2460), 4 LAD (9.2440) wtheil (4.900) THEIL (0.264), 2 LTS (.200) LAD (9.60), 4 LSE (78.4000) wtheil (260.0000) LTS (0.46), 2 THEIL (0.806) LSE (0.966), 4 LAD (-24.400) wtheil (90.000) LTS (0.004), 2 THEIL (0.0044) LSE (0.0074), 4 LAD (26.0600) wtheil (.200) THEIL (.2470), 2 LTS (.440) LSE (2.270), 4 LAD (2.9200) wtheil (2.8000) THEIL (0.006), 2 LTS (.400) LSE (8.7060), 4 LAD (27.2000) wtheil (8820.0000) LTS (0.48), 2 THEIL (0.788) LSE (0.89), 4 LAD (8.7000) wtheil (9.000) THEIL (0.0066), 2 LTS (0.0079) LSE (0.000), 4 LAD (48.4800) wtheil (6.700) THEIL (.2220), 2 LTS (.060) LSE (2.20), 4 LAD (2.000) wtheil (620.0000) THEIL (-0.02), 2 LTS (.980) LSE (.700), 4 LAD (48.900) wtheil (2600.0000) THEIL (.0260), 2 LTS (.0990) LSE (.80), 4 LAD (8.690) wtheil (26.00) THEIL (-0.209), 2 LTS (-0.084) LSE (-2.260), 4 LAD (6.60) wtheil (.9000) LTS (-0.000), 2 THEIL (0.0009) LSE (-0.004), 4 LAD (20.2900) wtheil (80.9800) LSE (-0.0024), 2 LTS (-0.00) THEIL (-0.00), 4 LAD (20.0700), wtheil (29.600) THEIL (0.87), 2 LTS (.020) LSE (.00), 4 LAD (22.00) wtheil (90.000) THEIL (-0.048), 2 LTS (-0.940) LSE (-4.0), 4 LAD (29.8200) wtheil (2928.0000) LTS, 2 THEIL (-0.0004) LSE (-0.0048), 4 LAD (2.4900) wtheil (78.000) THEIL (0.0006), 2 LTS (0.008) LSE (0.0027), 4 LAD (4.8280) wtheil (8.9800) THEIL (0.848), 2 LTS (.080) LSE (.70), 4 LAD (20.6400) wtheil (628.000) THEIL (-0.086), 2 LTS (-0.448) LSE (-4.9740), 4 LAD (-.9000) wtheil (40.0000) The Pacific Journal of Science and Technology 7 Volume 9. Number. May 208 (Spring)

Table 2: Summary Table for Population Intercept (α) Estimators Performance (Based on Variance). ESTIMATION METHODS (0df) LOG (0df) LOG (0df) LOG ERROR MODEL TYPE SAMPLE SIZE (N=0) STANDARD MODEL OUTLIER MODEL MIXTURE MODEL CONTAMINATION MODEL THEIL (0.8), 2 LSE (0.46) LTS (0.662), 4 LAD (.880) wtheil (8.0700) THEIL (0.2722), 2 LTS (0.2698) LSE (0.04), 4 LAD (9.740) wtheil (.2600) THEIL (0.68), 2 LTS (.080) LSE (2.70), 4 LAD (6.80) wtheil (489.7000) THEIL (.7770), 2 LTS (9.80) LAD (.00), 4 LSE (6000.00) wtheil (262700000.0000) LSE (0.088), 2 THEIL (0.0978) LTS (0.8), 4 wtheil (27.6000) LAD (460.7000) THEIL (0.0), 2 LTS (0.09) LSE (0.097), 4 LAD (0.000) wtheil (20.8000) THEIL (0.0798), 2 LTS (0.772) LSE (0.87), 4 LAD (0.000) wtheil (2270.0000) THEIL (0.2968), 2 LTS (2.7800) LAD (.7200), 4 LSE (8640.0000) wtheil (29060000000.0000) LSE (0.096), 2 THEIL (0.047) LTS (0.072), 4 LAD (8.6000) wtheil (994.2000) THEIL (0.000), 2 LTS (0.004) LSE (0.006), 4 LAD (67.2200) wtheil (8.2000) THEIL (0.080), 2 LTS (0.08) LSE (0.94), 4 LAD (2.270) wtheil (400.0000) THEIL (0.26), 2 LTS (.40) LAD (40.8000), 4 LSE (669.0000) wtheil (400000000.0000) THEIL (0.098), 2 LTS (0.88) LSE (0.978), 4 LAD (.90) wtheil (7.9700) LTS (0.020), 2 THEIL (0.0208) LSE (0.0466), 4 wtheil (4.640) LAD (8.7040) THEIL (0.092), 2 LTS (0.960) LSE (0.880), 4 LAD (.960) wtheil (426.6000) LAD (6.9700), 2 THEIL (24.2600) LTS (267.000), 4 LSE (826000.0000) wtheil (289600000.0000) SAMPLE SIZE (N=0) THEIL (0.008), 2 LTS (0.09) LSE (0.046), 4 wtheil (292.000) LAD (4.6000) THEIL (0.000), 2 LTS (0.0007) LSE (0.009), 4 LAD (2.000) wtheil (9.200) THEIL (0.004), 2 LTS (0.029) LSE (0.62), 4 LAD (40.4000) wtheil (204.0000) THEIL (.640), 2 LTS (29.7000) LSE (666400.00), 4 LAD (.0000) wtheil (400000000) SAMPLE SIZE (N=00) THEIL (0.0004), 2 LTS (0.009) LSE (0.079), 4 LAD (Nill) wtheil (8.2000) THEIL, 2 LTS (0.0002) LSE (0.000), 4 LAD (6.9000) wtheil (7.900) LAD, 2 THEIL (0.000) LTS (0.0), 4 LSE (0.0902) wtheil (4600.0000) THEIL (0.7097), 2 LAD (.0900) LTS (27.4200), 4 LSE (20.0000) wtheil (7470000000.0000) THEIL (0.8084), 2 LSE (0.6289) LTS (.0640), 4 LAD (.90) wtheil (9.4200) LTS (0.2699), 2 THEIL (0.279), LSE (0.0), 4 LAD (0.6700) wtheil (.000) THEIL (.6960), 2 LTS (.60) LSE (.4680), 4 LAD (6.40) wtheil (9.000) THEIL (7.8200), 2 LTS (4.8000) LAD (8.0000), 4 LSE (484000.), wtheil (20980000000.00) LSE (0.29), 2 THEIL (0.448) LTS (0.224), 4 wtheil (2.7000) LAD (7.0000) THEIL (0.02), 2 LTS (0.09) LSE (0.096), 4 LAD (.90) wtheil (202.7000) THEIL (0.642), 2 LTS (0.4) LSE (0.66), 4 LAD (7.9200) wtheil (790.0000) THEIL (0.642), 2 LTS (0.2.) LAD (4.000), 4 LSE (86700.00) wtheil (2268000000000.000) LSE (0.02), 2 THEIL (0.0692) LTS (0.), 4 LAD (7.8000) wtheil (87.9000) THEIL (0.000), 2 LTS (0.004) LSE (0.0062), 4 LAD (8.6240) wtheil (80.6000) THEIL (0.074), 2 LTS (0.49) LSE (0.090), 4 LAD (0.7400) wtheil (4000.0000) LAD, 2 THEIL (0.277) LTS (.2900), 4 LSE (4860.0000) wtheil (670600000000.0000) LSE (0.462), 2 THEIL (0.48) LTS (0.609), 4 LAD (.4290) wtheil (8.00) THEIL (0.276), 2 LTS (0.28) LSE (0.2726), 4 LAD (0.400) wtheil (0.8800) THEIL (.7980), 2 LTS (.70) LSE (2.2660), 4 LAD (6.907) wtheil (490.000) THEIL (.90), 2 LTS (.40) LAD (4.00), 4 LSE (4294.00.), wtheil (772000.00) THEIL (0.080), 2 LSE (0.086) LTS (0.06), 4 LAD (276.000) wtheil (68.6000) THEIL (0.0070), 2 LTS (0.0099) LSE (0.076), 4 wtheil (9.700) LAD (679.9000) THEIL (0.099), 2 LTS (0.220) LSE (0.4070), 4 LAD (7.9600) wtheil (22760.0000) THEIL (0.27), 2 LTS (0.992) LAD (6.8000), 4 LSE (8026.000) wtheil (02000000.0000) THEIL (0.0), 2 LSE (0.09) LTS (0.064), 4 LAD (0.6000) wtheil (997.000) THEIL (0.006), 2 LTS (0.0029) LSE (0.007), 4 wtheil (746.700) LAD (969.8000) THEIL (0.04), 2 LTS (0.208) LSE (0.202), 4 LAD (2.8000) wtheil (0900.0000) THEIL (0.027), 2 LTS (0.7406) LSE (646.00), 4 LAD (9400.00) wtheil (2490000000.0000) The Pacific Journal of Science and Technology 8 Volume 9. Number. May 208 (Spring)

Table : Summary Table for Population Intercept (α) Estimators Performance (Based on RMSE) ESTIMATION METHODS (0df) LOG (0df) LOG (0df) LOG ERROR MODEL TYPE SAMPLE SIZE (N=0) STANDARD MODEL OUTLIER MODEL MIXTURE MODEL CONTAMINATION MODEL LSE (-), 2 THEIL (-0.200) LTS (-0.426), 4 LAD (- 24.000.), wtheil (-46.000) LTS (0.4), 2 THEIL (0.04) 6.7000) wtheil (-76.9000) THEIL (0.96), 2 LTS (0.486) 6.700) wtheil (-24.2000) THEIL (.0000), 2 LTS (.0000) LAD (0.9999), 4 LSE (-) wtheil (-44.000) LSE (-), 2 THEIL (-0.67) LTS (-0.4), 4 LAD (- 20260.000), wtheil (-8640.0000) THEIL (0.4266), 2 LTS (0.294) 47400.0000) wtheil (-9070.0000) THEIL (0.66), 2 LTS (0.66) 27.000) wtheil (-280.0000) THEIL (.0000), 2 LTS (0.9999) LAD (0.9474), 4 LSE (-) wtheil (-62000.000) LSE (-), 2 THEIL (-0.976) LTS (-0.444), 4 LAD (- 2090.000), wtheil (-829600.0000) THEIL (0.220), 2 LTS (0.479) 90000.0000) wtheil (-77200.0000) THEIL (0.60), 2 LTS (0.9) 270.0000) wtheil (-87200.0000) THEIL (.0000), 2 LTS (0.9998) 0.240) wtheil (-6080000.0000) THEIL (0.040), 2 LTS (0.99) 288.7000) wtheil (-729.2000) LTS (0.607), 2 THEIL (0.49) LSE (-), 4 wtheil (- 62.6000) LAD (-2.0000) THEIL (0.288), 2 LTS (0.09) 7.6900) wtheil (-46.000) THEIL (.0000), 2 LAD (0.9999) LTS (0.9997), 4 LSE (-) wtheil (-0.2000) SAMPLE SIZE (N=0) THEIL (0.9468), 2 LTS (0.44) 420.0000) wtheil (-800.0000) THEIL (0.979), 2 LTS (0.69) 900.7000) wtheil (-800.0000) LTS (0.429), 2 THEIL (0.64) 4.7000) wtheil (-400.0000) THEIL (.0000), 2 LTS (0.9998) LAD (0.994), 4 LSE (-) wtheil (-72200.000) SAMPLE SIZE (N=00) THEIL (0.979), 2 LTS (0.4792) 4.9800) wtheil (-79000.0000) THEIL (0.9884), 2 LTS (0.680) LSE (-), 4 wtheil (- 48000.0000) LAD (-4247.0000) LTS (0.487), 2 THEIL (0.07) 4.0000) wtheil (-200.0000) THEIL (.0000), 2 LTS (0.9989) LAD (0.900), 4 LSE (-) wtheil (-2947000.0000) LSE (-), 2 THEIL (-0.20) LTS (-0.6494), 4 LAD (- 88.2000.), wtheil (-24.9000) LTS (0.09), 2 THEIL (0.09) 76.2000) wtheil (-78.4000) THEIL (0.00), 2 LTS (0.26) 9.790) wtheil (-88.6000) THEIL (.0000), 2 LAD (.0000) LTS (.0000), 4 LSE (-) wtheil (-4.000) LSE (-), 2 THEIL (-0.66) LTS (-0.607), 4 wtheil (- 470), LAD (-0900.0000) THEIL (0.4279), 2 LTS (0.299) 4740.0000) wtheil (-9460.0000) THEIL (0.7028), 2 LTS (0.6296).000) wtheil (-4060.0000) THEIL (.0000), 2 LTS (0.9998) LAD (0.9990), 4 LSE (-) wtheil (-268000.0000) LSE (-), 2 THEIL (-0.69) LTS (-0.448), 4 LAD (- 00), wtheil (-48800.0000) THEIL (0.24), 2 LTS (0.96) 7200.0000) wtheil (-74800.0000) THEIL (0.709), 2 LTS (0.682) 2.6000) wtheil (-97420.0000) THEIL (.0000), 2 LTS (0.999) LAD (0.972), 4 LSE (-) wtheil (-2460000.0000) LSE (-), 2 THEIL (-0.8) LTS (-0.408), 4 LAD (- 26.000.), wtheil (-47.000) THEIL (0.2020), 2 LTS (0.984) LSE (-), 4 wtheil (-80.8000) LAD (-9.000) THEIL (0.6007), 2 LTS (0.488) 6.8000) wtheil (-22.4000) THEIL (0.9997), 2 LTS (0.9992) LAD (0.9802), 4 LSE (-) wtheil (-88.7000) THEIL (0.0070), 2 LSE (-) LTS (-0.227), 4 LAD (- 92.000), wtheil (-8720.0000) THEIL (0.6004), 2 LTS (0.479) 6600.0000) wtheil (-60860.0000) THEIL (0.6876), 2 LTS (0.48) 2.8000) wtheil (-29280.0000) THEIL (.0000), 2 LTS (0.9999) LAD (0.8689), 4 LSE (-) wtheil (-8000.0000) THEIL (0.008), 2 LSE (-) LTS (-0.4260), 4 LAD (- 9400.00), wtheil (-800.0000) THEIL (0.744), 2 LTS (0.4977) 74700.0000) wtheil (-7400.0000) THEIL (0.697), 2 LTS (0.48) 26.0000) wtheil (-22900.0000) THEIL (.0000), 2 LTS (0.9998).4800) wtheil (-299000.0000) The Pacific Journal of Science and Technology 9 Volume 9. Number. May 208 (Spring)

Table 4: Summary Table for Population Slope (β) Estimators Performance (Based on Bias). ESTIMATION METHODS (0df) LOG (0df) LOG (0df) LOG ERROR MODEL TYPE SAMPLE SIZE (N=0) STANDARD MODEL OUTLIER MODEL MIXTURE MODEL CONTAMINATION MODEL LTS, 2 LSE (0.000) THEIL (0.000), 4 LAD (- 0.9967) wtheil (-2.780) THEIL (0.0006), 2 LSE (0.0009) LTS (0.0009), 4 LAD (-0.970) wtheil (-.900) THEIL (0.0002), 2 LTS (-0.0004) LSE (0.0020), 4 LAD (-0.980) wtheil (-4.70) THEIL (0.0042), 2 LTS (0.0066) LSE (-0.8266), 4 LAD (-.0040) wtheil (-74.8900) LTS, 2 THEIL LSE (0.000), 4 wtheil (-.770) LAD (NILL) LSE (0.000), 2 THEIL(0.000) LTS (0.0002), 4 wtheil (-.280) LAD (NILL) THEIL (0.000), 2 LTS (0.0002) LSE (0.000), 4 LAD (-0.992) wtheil (-9.200) THEIL (-0.0000), 2 LTS (- 0.0006) LSE (0.229), 4 LAD (-0.9648) wtheil (-4.2000) LTS (-0.0000), 2 THEIL (0.000) LSE, 4 LAD (-0.96) wtheil (-.60) THEIL (-0.0000), 2 LTS (- 0.0000) LSE (-0.0000), 4 LAD (-.690) wtheil (-.220) THEIL (0.000), 2 LTS (0.000) LSE (0.0002), 4 LAD (-.690) wtheil (-2.2800) THEIL (0.0002), 2 LTS (0.0004) LSE (0.07), 4 LAD (-.020) wtheil (-497.000) LSE (-0.0002), 2 THEILS (- 0.0002) LTS (-0.000), 4 LAD (-0.992) wtheil (-2.0440) LTS (0.000), 2 THEIL (0.000), LSE (-0.000),, 4 wtheil (- 0.8987) LAD (-0.928) THEIL (0.070), 2 LTS (0.08) LSE (-0.0647), 4 LAD (-.000) wtheil (-.780) THEIL (0.0076), 2 LTS (-0.04) LSE (0.0908), 4 LAD (-.280) wtheil (-.8000) SAMPLE SIZE (N=0) THEIL (0.000), 2 LTS (0.000) LSE (0.000), 4 wtheil (-.040) LAD (Nill) THEIL, 2 LSE LTS, 4 LAD (-0.9788) wtheil (-0.64) THEIL (0.000), 2 LTS (0.0069) LSE (0.00), 4 LAD (-0.987) wtheil (-8.490) THEIL (-0.000), 2 LTS (-0.0029) LSE (0.904), 4 LAD (-0.8946) wtheil (-9.8000) THEIL (0.724), 2 LTS (0.80) LSE (0.897), 4 LAD (7.70) wtheil (4.000) LTS (0.0), 2 LSE (0.026), THEIL (0.029), 4 LAD (6.690) wtheil (6.70) THEIL (.460), 2 LTS (.6040) LSE (2.2460), 4 LAD (9.2440) wtheil (4.900) THEIL (0.264), 2 LTS (.200) LAD (9.60), 4 LSE (78.4000) wtheil (260.0000) THEIL (-0.0000), 2 LSE LTS (-0.000), 4 LAD (0.2472) wtheil (-.260) LTS, 2 THEIL (-0.0000) LSE (-0.000), 4 LAD (-0.9997) wtheil (-.2200) THEIL, 2 LSE (0.000) LTS (0.000), 4 LAD (-0.742) wtheil (-9.7820) THEIL (-0.000), 2 LTS (0.0042) LAD (-0.7), 4 LSE (.700) wtheil (-206.0000) SAMPLE SIZE (N=00) LTS (-0.0000), 2 THEIL THEIL (0.000), 2 LSE (0.0002) LSE (0.000), 4 wtheil (- LTS (0.0002), 4 LAD (-0.0920).420) wthwil (-.8690) LAD (Nill) LSE (-0.0000), 2 LTS (-0.0000) THEIL (-0.0000), 4 wtheil (- 0.226) LAD (-0.9044) THEIL (0.000), 2 LTS (0.00) LSE (0.006), 4 LAD (-0.8967) wtheil (-.400) LTS (-0.000), 2 THEIL (0.000) LSE (0.092), 4 LAD (-0.949) wtheil (-02.0000) THEIL (-0.000), 2 LTS (- 0.000) LSE (-0.000), 4 LAD (-0.460) wtheil (-.2220) THEIL (0.000), 2 LTS (0.000) LSE (-0.0006), 4 LAD (-0.779) wtheil (-2.200) THEIL (0.0002), 2 LTS (0.000) LSE (0.2744), 4 LAD (-0.902) wtheil (-2498.0000) THEIL (-0.0000), 2 LSE, LTS (-0.0002), 4 LAD (-0.9899), wtheil (2.740) THEIL (0.000), 2 LSE (0.0007) LTS (0.0008), 4 LAD (-0.886) wtheil (-.0) THEIL (-0.000), 2 LSE (0.00) LTS (-0.004), 4 LAD (-0.987) wtheil (-4.4740) THEIL (0.000), 2 LTS (0.008) LSE (-0.82), 4 LAD (-0.999) wtheil (-28.400) THEIL, 2 LTS LSE (0.000), 4 LAD (-0.420) wtheil (-.70) LSE (0.000), 2 THEIL (0.000) LTS (0.000), 4 LAD (-0.87) wtheil (-.620) THEIL (0.000), 2 LTS (0.0002) LSE (0.000), 4 LAD (-0.7084) wtheil (-9.8) THEIL (-0.0000), 2 LTS (-0.000) LSE (0.078), 4 LAD (-0.98) wtheil (-4.9000) LTS (-0.0000), 2 THEIL LSE (0.000), 4 LAD (-0.894) wtheil (-.680) THEIL (-0.0000), 2 LTS (-0.0000) LSE (-0.0000), 4 LAD (0.902) wtheil (-.680) LTS, 2 THEIL (-0.0000) LSE, 4 LAD (.600) wtheil (-2.400) LTS, 2 THEIL (0.000) LSE (0.029), 4 LAD (-0.28) wtheil (-284.000) The Pacific Journal of Science and Technology 20 Volume 9. Number. May 208 (Spring)

Table : Summary Table for Population Slope (β) Estimators Performance (Based on Variance). ESTIMATION METHODS (0df) LOG (0df) LOG (0df) LOG ERROR MODEL TYPE SAMPLE SIZE (N=0) STANDARD MODEL OUTLIER MODEL MIXTURE MODEL CONTAMINATION MODEL LSE (0.022), 2 THEIL (0.08) LTS (0.074), 4 LAD (0.07) wtheil (0.808) THEIL (0.0047), 2 LTS (0.0049) LSE (0.007), 4 LAD (0.4) wtheil (0.70) THEIL (0.07), 2 LTS (0.026) LSE (0.09), 4 LAD (0.00) wtheil (.000) THEIL (0.07), 2 LTS (0.22) LAD (0.00), 4 LSE (7978.0) wtheil (867000.0) LSE, 2 THEIL (0.000) LTS (0.000), 4 wtheil (0.422) LAD (NILL) THEIL, 2 LTS LSE, 4 wtheil (0.0) LAD (NILL) THEIL (0.0008), 2 LTS (0.0002) LSE (0.000), 4 LAD (0.0008) wtheil (4.9200) THEIL (0.0004), 2 LTS (0.008) LAD (0.0240), 4 LSE (88.8200) wtheil (44690000) LSE, 2 THEIL LTS, 4 LAD (0.009) wtheil (0.886) THEIL, 2 LTS LSE, 4 LAD (0.0) wthwil (0.8) THEIL, 2 LTS LSE (0.000), 4 LAD (0.002) wtheil (9.00) THEIL, 2 LTS (0.0004) LAD (0.0028), 4 LSE (.870) wtheil (70000) THEIL (0.006), 2 LSE (0.0070) LTS (0.007), 4 LAD (0.086) wtheil (0.764) THEIL (0.0009), 2 LTS (0.0009) LSE (0.00), 4 LAD (0.26) wtheil (0.6) THEIL (0.0077), 2 LTS (0.06) LSE (0.029), 4 LAD (0.0) wtheil (.400) LAD (0.277), 2 THEIL (0.4928) LTS (4.720), 4 LSE (900) wtheil (960000.00) SAMPLE SIZE (N=0) THEIL, 2 LTS (0.000) LSE (0.000), 4 wtheil (0.4490) LAD (NILL) THEIL, 2 LTS LSE, 4 LAD (0.008) wtheil (0.0298) THEIL, 2 LTS (0.000) LSE (0.000), 4 LAD (0.008) wtheil (.000) THEIL (0.00), 2 LAD (0.08) LTS (0.00), 4 LSE (97.4000) wtheil (7700000) SAMPLE SIZE (N=00) THEIL, 2 LTS LSE, 4 wtheil (0.4046) LAD (Nill) THEIL, 2 LTS LSE, 4 LAD (0.0007) wtheil (0.049) LAD, 2 THEIL LTS, 4 LSE wtheil (7.0800) THEIL (0.0002), 2 LAD (0.0027) LTS (0.00), 4 LSE (.200) wtheil (2800000.0000) LSE (0.064), 2 THEIL (0.090 LTS (0.0276), 4 LAD (0.0820) wtheil (0.698) THEIL (0.0047), 2 LTS 0.0049) LSE (0.007), 4 LAD (0.620) wtheil (0.74) THEIL (0.04), 2 LTS (0.0820) LSE (0.090), 4 LAD (0.028) wtheil (2.800) THEIL (0.4), 2 LAD (0.800) LTS (7.9060), 4 LSE (6000) wtheil (692400000.00) LSE (0.000), 2 THEIL (0.000) LTS (0.000), 4 wtheil (0.88) LAD (7.220) THEIL, 2 LTS LSE, 4 LAD (0.006) wtheil (0.6) THEIL (0.000), 2 LTS (0.0004) LSE (0.0007), 4 LAD (0.076) wtheil (27.200) THEIL (0.0004), 2 LAD (0.06) LTS (0.96), 4 LSE (668.4000) wtheil (488000000) THEIL, 2 LSE LTS, 4 wtheil (0.99) LAD (.060) THEIL, 2 LTS LSE, 4 LAD (0.24) wtheil (0.79) THEIL, 2 LTS LSE (0.000), 4 LAD (0.) wtheil (.000) LAD, 2 THEIL, LTS (0.0096), 4 LSE (6.00), wtheil (26000000.0000) LSE (0.02), 2 THEIL (0.06) LTS (0.07), 4 LAD (0.094) wtheil (0.82) THEIL (0.009), 2 LTS (0.0040) LSE (0.00), 4 LAD (0.628) wtheil (0.74) THEIL (0.080), 2 LTS (0.0298) LSE (0.069), 4 LAD (0.007) wtheil (.00) THEIL (0.040), 2 LTS (0.082) LAD (0.47), 4 LSE (96.4000) wtheil (2600.00) THEIL (0.000), 2 LSE (0.000) LTS (0.000), 4 LAD (0.77) wtheil (0.4246) THEIL, 2 LTS LSE, 4 LAD (2.490) wtheil (0.2948) THEIL (0.000), 2 LTS LSE, 4 LAD (2.490) wtheil (0.2948) THEIL (0.0002), 2 LTS (0.00) LAD (0.409), 4 LSE (6.420) wtheil (694000) THEIL, 2 LSE LTS, 4 LAD (0.066) wtheil (0.899) THEIL, 2 LTS LSE, 4 wtheil (0.2928) LAD (2.80) THEIL, 2 LTS LSE (0.000), 4 LAD (2.2480) wtheil (9.00) THEIL, 2 LTS (0.0002) LAD (0.267), 4 LSE (0.77) wtheil (2740000.0000) The Pacific Journal of Science and Technology 2 Volume 9. Number. May 208 (Spring)

Table 6: Summary Table for Population Slope (β) Estimators Performance (Based on RMSE). ERROR MODEL TYPE ESTIMATION SAMPLE SIZE (N=0) METHODS STANDARD MODEL OUTLIER MODEL MIXTURE MODEL CONTAMINATION MODEL (0df) LOG (0df) LOG (0df) LOG LSE (-), 2 THEIL (-0.) LTS (-0.422), 4 LAD (- 86.400) wtheil (-4.000) THEIL (0.774), 2 LTS (0.489) 7.8000) wtheil (-09.9000) THEIL (0.74), 2 LTS (0.) 6.8900) wtheil (-7.7000) THEIL (.0000), 2 LTS (.0000) LAD (0.9999), 4 LSE (-) wtheil (-087.0) LSE (-), 2 THEIL (-0.02) LTS (-0.287), 4 wtheil (- 0000.0000), LAD (NILL) THEIL (0.4962), 2 LTS (0.60) LSE (-), 4 wtheil (- 0700.0000) LAD (NILL) THEIL (0.840), 2 LTS (0.9) 2068.0000) wtheil (-22200.0000) THEIL (.0000), 2 LTS (.0000) LAD (0.989), 4 LSE (-) wtheil (-04200.0000) LSE (-), 2 THEIL (-0.07) LTS (-0.4427), 4 LAD (- 78740.00) wtheil (-6000.00) THEIL (0.76), 2 LTS (0.482) 00000.0000) wtheil (-2000.0000) THEIL (0.8468), 2 LTS (0.4297) 24670.0000) wtheil (-79000.000) THEIL (.0000), 2 LTS (0.9996) LAD (0.29), 4 LSE (-) wtheil (-40000.0000) THEIL (0.68), 2 LSE (-) LTS (-0.09), 4 LAD (- 4.000) wtheil (-64.7000) THEIL (0.999), 2 LTS (0.9) LSE (-), 4 wtheil (- 866.9000) LAD (-907.000) THEIL (0.8002), 2 LTS (0.668) 26.800) wtheil (-68.7000) THEIL (.0000), 2 LAD (0.9999) LTS (0.9996), 4 LSE (-) wtheil (-80.6000) SAMPLE SIZE (N=0) THEIL (0.4), 2 LTS (0.0298) 670.0000) wtheil (-800.00) THEIL (0.62), 2 LTS (0.282) LSE (-), 4 wtheil (- 200.0000) LAD (-000.0000) THEIL (0.946), 2 LTS (0.682) 2082.0000) wtheil (-2900.0000) THEIL (.0000), 2 LTS (0.9998) LAD (0.998), 4 LSE (-) wtheil (-29400.0000) SAMPLE SIZE (N=00) THEIL (0.6820), 2 LTS (0.0429) 4000.0000) wtheil (-62000.0000) THEIL (0.776), 2 LTS (0.2628) LSE (-), 4 wtheil (- 28000.0000) LAD (-6789000.0000) THEIL (0.9728), 2 LTS (0.667) 060.0000) wthwil (-2476000.0000) THEIL (.0000), 2 LTS (0.9989) LAD (0.826), 4 LSE (-) wtheil (-67000.0000) LSE (-), 2 THEIL (-0.7) LTS (-0.6800), 4 LAD (- 62.600) wtheil (-42.0000) THEIL (0.76), 2 LTS (0.47) 64.000) wtheil (-2.000) THEIL (0.696), 2 LTS (0.84) 0.4700) wtheil (-08.000) THEIL (.0000), 2 LAD (.0000) LTS (.0000), 4 LSE (-) wtheil (-097.0000) LSE (-), 2 THEIL (-0.022) LTS (-.0200), 4 LAD (- 400.0000) wtheil (-9480.0000) THEIL (0.4982), 2 LTS (0.) 6070.0000) wtheil (-0800.00) THEIL (0.8449), 2 LTS (0.49) 60.0000) wtheil (-66400.00) THEIL (.0000), 2 LTS (.0000) LAD (0.9999), 4 LSE (-) wtheil (-22700.0000) THEIL (0.047), 2 LSE (-) LTS (-.280), 4 LAD (- 660.00), wtheil (- 96400.00) THEIL (0.729), 2 LTS (0.4082) 07400.0000) wtheil (-8000.0000) THEIL (0.8782), 2 LTS (0.406) 80.0000) wtheil (-280000.0000) THEIL (.0000), 2 LTS (0.9997) LAD (0.977), 4 LSE (-) wtheil (-74000.0000) LSE (-), 2 THEIL (-0.22) LTS (-0.46), 4 LAD (- 87.00) wtheil (-46.7000) THEIL (0.2426), 2 LTS (0.229) 82.000) wtheil (-7.2000) THEIL (0.709), 2 LTS (0.77) 6.2700) wtheil (-69.8000) THEIL (0.9998), 2 LTS (0.9996) LAD (0.9942), 4 LSE (-) wtheil (-64.000) THEIL (0.0276), 2 LSE (-) LTS (-0.24), 4 LAD (- 78.0000) wtheil (-000.0000) THEIL (0.607), 2 LTS (0.478) LSE (-), 4 wtheil (- 0000.0000) LAD (-4600.00) THEIL (0.888), 2 LTS (0.474).0000) wtheil (-24800.00) THEIL (.0000), 2 LTS (0.9998) LAD (0.82), 4 LSE (-) wtheil (-24800.0000) THEIL (0.064), 2 LSE (-) LTS (-0.4270), 4 LAD (- 66240.00) wtheil (-000.00) THEIL (0.6968), 2 LTS (0.402) 2822000.0000) wtheil (-000.0000) THEIL (0.846), 2 LTS (0.8) 860.0000) wtheil (-77000.0000) THEIL (.0000), 2 LTS (0.9997) LAD (0.28), 4 LSE (-) wtheil (-690000.0000) The Pacific Journal of Science and Technology 22 Volume 9. Number. May 208 (Spring)

Y-Intercept Estimators Performance: The performances of the population y-intercept estimators for each of the methods were found to follow the same pattern as those of the slope estimators, but for a few significant variations (Table 4). First, it was noticeable that unlike the LAD and weighted Theil s slope estimators which consistently underestimated the slope parameter β, the intercept estimators for both procedures gave a positive bias values across all cells of the simulation (Table 4). It was also observed that the variance and MSE values of LAD intercept estimators are generally consistent with those of other intercept estimators than those of its slope estimators (Tables 4 and ). Furthermore, the variance and MSE for LSE, LTS and Theil s intercept estimators are significantly larger than those of their slope estimators, especially as sample size increases. For the contamination model, the patterns of estimators behavior for the intercept estimators are similar to the outliers case. However, estimates of variance and MSE are as usual larger than those of the slope estimators. It is worthy of note, also, that the LSE intercept estimator as long as normality assumptions hold, is by far much more efficient than every other intercept estimators regardless of sample size. But under non-normal distribution situations, Theil s, LTS and LAD take the stage as usual. Nevertheless, estimates of variance and MSE are larger and more consistent than the outliers case for all estimators across all cells of the simulation (Tables 4,, 6). CONCLUSION This study shows that for simple linear regression model with the error terms distributed as described earlier: Theil estimator has high small-sample efficiency compared to the OLS estimator when the error term is heteroscedastic (Wilcox, 998). More so, Theil s nonparametric estimation technique has the strongest performance and most reliable results and can be used in varying circumstances. LTS and LAD are most especially applicable when the error term comes from a heavy tailed distribution (Mutan, 2004). However, LTS proved to be more robust than LAD in most cases. LAD is not very robust and should be used with cautions, especially when there are uncertainties as regards the nature of the data in question. LSE method is only reliable As long as the normality assumption holds. RECOMMENDATION Application of statistical regression analysis is of great importance in the study of economy, though, most economic theories do not imply specific functional forms. It is therefore important that Statisticians and Econometricians focus more on the use as well as development of nonparametric methods and median-based estimators for econometric analyses, since it has been demonstrated that non-parametric procedures exhibit the strongest performance and most reliable results under varying data conditions. REFERENCES. Agullo, J.,200. A New Algorithm for Computing the Least Trimmed Squares Regression Estimator. Computational Statistics and Data Analysis. 6(4): 42-49. 2. Birkes, D. and Y. Dodge. 99. Alternative Methods of Regression. Wiley: New York.. Brown, B.M. 980: Median Estimates in Simple Linear Regression. Austral. J. Statist. 22(2):4-6. 4. Castillo, E., A.S. Hadi, and B. Lacruz. 200. Regression Diagnostics for the Least Absolute Deviations and the Minimax Methods. Communications in Statistics- Theory and Methods. 0(6): 97-22.. Cizek, P. and J.A. Visek. 2000. Least Trimmed Squares. Quantifikation und Simulation Okonomisher Prozesse. Humboldt Universitat zu Berlin with number. 00-. 6. Conover, W.J. 980. Practical Nonparametric Statistics (2nd edition). Wiley: New York, NY. The Pacific Journal of Science and Technology 2 Volume 9. Number. May 208 (Spring)

7. Rawlings, J.O., S.G. Pantula, and D.A. Dickey. 998. Applied Regression Analysis: A Research Tool. Springer: New York, NY. 8. Rivest, L. 994: Statistical Properties of Winsorized Means for Skewed Distributions. Biometrika. 8(2): 7-8. 9. Rousseeuw, P.J., and B.C. Van Zomeren. 990. Unmasking Multivariate Outliers and Leverage Points. Journal of American Statistical Association. 8:6-69. SUGGESTED CITATION Adebola, F.B., E.I. Olamide, and O.O. Alabi. 208. Some Robust and Classical Nonparametric Procedures of Estimations in Linear Regression Model. Pacific Journal of Science and Technology. 9():-24. Pacific Journal of Science and Technology 0. Talwar, P. 99. A Simulation Study of Some Nonparametric Regression Estimators. Computational Statistics and Data Analysis. (): 09-27.. Theil, H. 90: A Rank-Invariant Method of Linear and Polynomial Regression Analysis. Indagationes Mathematicae. 2: 8-9. ABOUT THE AUTHORS Dr. F.B. Adebola, is a Senior Lecturer in the Department of Statistics, Federal University of Technology, Akure. He holds B.Sc., M.Sc. and Ph.D. degrees in Statistics from the University of Ilorin. He is a member of the Royal Statistical Society. His research interests are in the areas of sample survey methods and its applications and stochastic processes. E.I. Olamide, is an Assistant Lecturer in the Department of Statistics, Federal University of Technology, Akure. He holds B.Sc. degree in Statistics from the University of Ado-Ekiti and M.Sc. degree in Statistics from the University of Ibadan. He is currently doing his Ph.D. program at the Federal University of Technology, Akure. His research interests are in the areas of Design and Analysis of Experiments and Statistical Modeling. Dr. O.O. Alabi, is a Senior Lecturer in the Department of Statistics, Federal University of Technology, Akure. He holds B.Sc., M.Sc. and Ph.D. degrees in Statistics from the University of Ilorin. His research interests are in the areas of Econometric Theories and applications. The Pacific Journal of Science and Technology 24 Volume 9. Number. May 208 (Spring)