tool<-read.csv(file="d:/chilo/regression 7/tool.csv", header=t) tool

Similar documents
delivery<-read.csv(file="d:/chilo/regression 4/delivery.csv", header=t) delivery

Regression Models Course Project, 2016

Drilling Example: Diagnostic Plots

R-Sq criterion Data : Surgical room data Chap 9

fruitfly fecundity example summary Tuesday, July 17, :13:19 PM 1

Motor Trend Yvette Winton September 1, 2016

Motor Trend MPG Analysis

AIC Laboratory R. Leaf November 28, 2016

. Enter. Model Summary b. Std. Error. of the. Estimate. Change. a. Predictors: (Constant), Emphaty, reliability, Assurance, responsive, Tangible

Exercises An Introduction to R for Epidemiologists using RStudio SER 2014

HASIL OUTPUT SPSS. Reliability Scale: ALL VARIABLES

The PRINCOMP Procedure

Lampiran IV. Hasil Output SPSS Versi 16.0 untuk Analisis Deskriptif

EXST7034 Multiple Regression Geaghan Chapter 11 Bootstrapping (Toluca example) Page 1

TRY OUT 25 Responden Variabel Kepuasan / x1

UJI VALIDITAS DAN RELIABILIAS VARIABEL KOMPENSASI

TRY OUT 30 Responden Variabel Kompetensi/ x1

Booklet of Code and Output for STAD29/STA 1007 Final Exam

Stat 401 B Lecture 31

female male help("predict") yhat age

Stat 301 Lecture 30. Model Selection. Explanatory Variables. A Good Model. Response: Highway MPG Explanatory: 13 explanatory variables

Getting Started with Correlated Component Regression (CCR) in XLSTAT-CCR

Guatemalan cholesterol example summary

Type 'demo()' for some demos, 'help()' for on-line help, or 'help.start()' for an HTML browser interface to help. Type 'q()' to quit R.

5. CONSTRUCTION OF THE WEIGHT-FOR-LENGTH AND WEIGHT-FOR- HEIGHT STANDARDS

TABLE 4.1 POPULATION OF 100 VALUES 2

Model Information Data Set. Response Variable (Events) Summe Response Variable (Trials) N Response Distribution Binomial Link Function

Topic 5 Lecture 3 Estimating Policy Effects via the Simple Linear. Regression Model (SLRM) and the Ordinary Least Squares (OLS) Method

The Coefficient of Determination

Modeling Ignition Delay in a Diesel Engine

LAMPIRAN I FORMULIR SURVEI

Lampiran 1. Data Perusahaan

Stat 302 Statistical Software and Its Applications Graphics

: ( .

Universitas Sumatera Utara

Important Formulas. Discrete Probability Distributions. Probability and Counting Rules. The Normal Distribution. Confidence Intervals and Sample Size

THERMOELECTRIC SAMPLE CONDITIONER SYSTEM (TESC)

Statistics and Quantitative Analysis U4320. Segment 8 Prof. Sharyn O Halloran

CHAPTER V CONCLUSION, SUGGESTION AND LIMITATION. 1. Independent commissioner boards proportion does not negatively affect

Stat 301 Lecture 26. Model Selection. Indicator Variables. Explanatory Variables

DATA SAMPEL TAHUN 2006

Basic SAS and R for HLM

LAMPIRAN I Data Perusahaan Sampel kode DPS EPS Ekuitas akpi ,97 51,04 40,

Catalytic effects of period iv transition metal in the oxidation of biodiesel

Master of Applied Statistics Applied Statistics Comprehensive Exam Computer Output January 2018

LAMPIRAN 1. Tabel 1. Data Indeks Harga Saham PT. ANTAM, tbk Periode 20 Januari Februari 2012

Stat 401 B Lecture 27

Chapter 5 ESTIMATION OF MAINTENANCE COST PER HOUR USING AGE REPLACEMENT COST MODEL

Road Surface characteristics and traffic accident rates on New Zealand s state highway network

Sharif University of Technology. Graduate School of Management and Economics. Econometrics I. Fall Seyed Mahdi Barakchian

Quality Control in Mineral Exploration

The Dynamics of Plug-in Electric Vehicles in the Secondary Market

LAMPIRAN 1. Lampiran Nama dan Kondisi Perusahaan Textile No Kode Nama Perusahaan Hasil z-score FD Non-FD

Author s Accepted Manuscript

Robust alternatives to best linear unbiased prediction of complex traits

Daftar Sampel Perusahaan Pertambangan. 4 BORN Borneo Lumbung Energy & Metal, Tbk

Daftar Sampel Perusahaan

Fuel Consumption Models for Tractor Test Reports

LAMPIRAN A. Tabulasi Data Perusahaan Sample

CEMENT AND CONCRETE REFERENCE LABORATORY PROFICIENCY SAMPLE PROGRAM

Identify Formula for Throughput with Multi-Variate Regression

Review of Upstate Load Forecast Uncertainty Model

Lampiran 1. Penjualan PT Honda Mandiri Bogor

DATA PENELITIAN 1. CAR CAR (%)

Voting Draft Standard

LECTURE 6: HETEROSKEDASTICITY

Appendix B STATISTICAL TABLES OVERVIEW

LAMPIRAN DAFTAR SAMPEL PENELITIAN. Kriteria No. Nama Perusahaan. Sampel Emiten

R Introductory Session

MODUL PELATIHAN SEM ANANDA SABIL HUSSEIN, PHD

sweetgum1.r: AIC(m1.t5,m2.t5,m3.t5,m4.t5,m5.t5,m6.t5,m7.t5) summary(m3.t5)

a. Uji kenormalan data model sebaran suhu pada band 7 citra tahun 2001 b. Uji kenormalan data model sebaran suhu pada band 4 citra tahun 2006

namibia UniVERSITY OF SCIEnCE AnD TECHnOLOGY FACULTY OF HEALTH AND APPLIED SCIENCES DEPARTMENT OF MATHEMATICS AND STATISTICS MARKS: 100

Lampiran i Jadwal Penelitian

MOTORCYCLE ACCIDENT MODEL ON THE ROAD SECTION OF HIGHLANDS REGION BY USING GENELARIZED LINEAR MODEL

Some Robust and Classical Nonparametric Procedures of Estimations in Linear Regression Model

Team project 2017 Dony Pratidana S. Hum Bima Agus Setyawan S. IIP

Improving CERs building

LAMPIRAN. Lampiran 1 Data Sampel Penelitian

SEM over time. Changes in Structure, changes in Means

Lampiran 1. Tabel Sampel Penelitian

PREDICTION OF FUEL CONSUMPTION

Lampiran 1. Uji Validitas dan Reliability Variabel Kualitas Pelayanan

QUALITY ASSURANCE & LAB ACCREDITATION

Technical Papers supporting SAP 2009

THE PHYSICS OF THE PINEWOOD DERBY

Lampiran 1. Daftar Sampel Perusahaan

Script mod4s3b: Serial Correlation, Hotel Application

Oregon DOT Slow-Speed Weigh-in-Motion (SWIM) Project: Analysis of Initial Weight Data

Rata-Rata Nilai Debt to Equity Ratio (DER) Perusahaan Otomotif yang 0, ,97 0, ,44 1,9 1,6 1,4 1,7 1,65

LET S ARGUE: STUDENT WORK PAMELA RAWSON. Baxter Academy for Technology & Science Portland, rawsonmath.

Interstate Freight in Australia,

Introduction. Traffic data collection. Introduction. Introduction. Traffic stream parameters

Component Mass Study Statistical Benchmarking

Data Nama Perusahaan Perbankan Yang Terdaftar Di BEI.

ACCIDENT MODIFICATION FACTORS FOR MEDIAN WIDTH

ENGINE VARIABLE IMPACT ANALYSIS OF FUEL USE AND EMISSIONS FOR HEAVY DUTY DIESEL MAINTENANCE EQUIPMENT

DEFECT DISTRIBUTION IN WELDS OF INCOLOY 908

EVALUATION OF CO-INJECTION MOLDING: AN ALTERNATIVE TO RECYCLING SCRAP PAINTED BUMPERS. Emily Cortright. Senior Honors Thesis

Statistics for Social Research

Transcription:

Regression nalysis lab 7 1 Indicator variables 1.1 Import data tool<-read.csv(file="d:/chilo/regression 7/tool.csv", header=t) tool life speed type 1 18.73 610 2 14.52 950 3 17.43 720 4 14.54 840 5 13.44 980 6 24.39 530 7 13.34 680 8 22.71 540 9 12.68 890 10 19.32 730 11 30.16 670 12 27.09 770 13 25.40 880 14 26.05 1000 15 33.49 760 16 35.62 590 17 26.07 910 18 36.78 650 19 34.95 810 20 43.67 500 2 scatter plot by group attach(tool) plot(speed, life, pch=16, col=c("red","blue")[type], xlab="lathe speed, x(rpm)", ylab="tool life, y(hours)", main="scatter plot") 1

Scatter plot tool life, y(hours) 15 20 25 30 35 40 500 600 700 800 900 1000 lathe speed, x(rpm) plot(speed, life, pch=16, type="n", xlab="lathe speed, x(rpm)", ylab="tool life, y(hours)", main="scatter plot") text(speed, life, type) 2

Scatter plot tool life, y(hours) 15 20 25 30 35 40 500 600 700 800 900 1000 lathe speed, x(rpm) 3 model 1 with y and x1 attach(tool) The following objects are masked from tool (position 3): tfit1 <- lm(life ~ speed, data=tool) summary(tfit1) Call: lm(formula = life ~ speed, data = tool) 3

Residuals: Min 1Q Median 3Q Max -12.973-7.300-0.928 7.233 12.777 Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 43.6167 9.6032 4.54 0.00025 *** speed -0.0254 0.0125-2.03 0.05760. --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Residual standard error: 8.44 on 18 degrees of freedom Multiple R-squared: 0.186,djusted R-squared: 0.141 F-statistic: 4.11 on 1 and 18 DF, p-value: 0.0576 tfit1$fit # fitted values 1 2 3 4 5 6 7 8 9 10 11 12 28.09 19.44 25.30 22.24 18.68 30.13 26.31 29.88 20.97 25.04 26.57 24.02 13 14 15 16 17 18 19 20 21.22 18.17 24.28 28.60 20.46 27.08 23.00 30.89 tfit1$res # residuals 1 2 3 4 5 6 7 8 9-9.364-4.922-7.865-7.702-5.239-5.740-12.973-7.166-8.289 10 11 12 13 14 15 16 17 18-5.721 3.593 3.067 4.176 7.880 9.213 7.017 5.610 9.704 19 20 11.945 12.777 summary(tfit1) Call: lm(formula = life ~ speed, data = tool) Residuals: Min 1Q Median 3Q Max -12.973-7.300-0.928 7.233 12.777 Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 43.6167 9.6032 4.54 0.00025 *** speed -0.0254 0.0125-2.03 0.05760. --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 4

Residual standard error: 8.44 on 18 degrees of freedom Multiple R-squared: 0.186,djusted R-squared: 0.141 F-statistic: 4.11 on 1 and 18 DF, p-value: 0.0576 anova(tfit1) nalysis of Variance Table Response: life Df Sum Sq Mean Sq F value Pr(>F) speed 1 293 293.0 4.11 0.058. Residuals 18 1282 71.2 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 confint(tfit1, level=0.95) 2.5 % 97.5 % (Intercept) 23.44108 6.379e+01 speed -0.05181 9.121e-04 R2_1<-summary(tfit1)$r.squared R2_1 [1] 0.186 R2.adj_1<-summary(tfit1)$adj.r.squared R2.adj_1 [1] 0.1408 sigmahat_1<-summary(tfit1)$sigma sigmahat_1 [1] 8.44 sigmahat2_1<-sigmahat_1^2 MSE_1<-sigmahat2_1 MSE_1 [1] 71.23 plot(life ~ speed, pch=16, col=c("red","blue")[type], main="scatter plot") abline(reg=tfit1) 5

Scatter plot life 15 20 25 30 35 40 500 600 700 800 900 1000 speed plot(tfit1$fit,tfit1$res, pch=16, col=c("red","blue")[type], xlab="fitted",ylab="residuals", main="model 1 residual plot") 6

Model 1 residual plot Residuals 10 5 0 5 10 18 20 22 24 26 28 30 Fitted 4 model 2 with y and x1, x2 attach(tool) The following objects are masked from tool (position 3): The following objects are masked from tool (position 4): tfit2 <- lm(life ~ speed + type, data=tool) summary(tfit2) 7

Call: lm(formula = life ~ speed + type, data = tool) Residuals: Min 1Q Median 3Q Max -5.553-1.787-0.002 1.839 4.984 Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 36.98560 3.51038 10.54 7.2e-09 *** speed -0.02661 0.00452-5.89 1.8e-05 *** type 15.00425 1.35967 11.04 3.6e-09 *** --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Residual standard error: 3.04 on 17 degrees of freedom Multiple R-squared: 0.9,djusted R-squared: 0.889 F-statistic: 76.7 on 2 and 17 DF, p-value: 3.09e-09 tfit2$fit # fitted values 1 2 3 4 5 6 7 8 9 10 11 12 20.76 11.71 17.83 14.64 10.91 22.88 18.89 22.62 13.31 17.56 34.16 31.50 13 14 15 16 17 18 19 20 28.58 25.38 31.77 36.29 27.78 34.70 30.44 38.69 tfit2$res # residuals 1 2 3 4 5 6 7 8-2.02519 2.81127-0.39840-0.09553 2.52948 1.50623-5.55268 0.09230 9 10 11 12 13 14 15 16-0.62517 1.75768-4.00301-4.41228-3.17549 0.66738 1.72164-0.67159 17 18 19 20-1.70727 2.08485 4.51200 4.98376 summary(tfit2) Call: lm(formula = life ~ speed + type, data = tool) Residuals: Min 1Q Median 3Q Max -5.553-1.787-0.002 1.839 4.984 Coefficients: Estimate Std. Error t value Pr(> t ) 8

(Intercept) 36.98560 3.51038 10.54 7.2e-09 *** speed -0.02661 0.00452-5.89 1.8e-05 *** type 15.00425 1.35967 11.04 3.6e-09 *** --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Residual standard error: 3.04 on 17 degrees of freedom Multiple R-squared: 0.9,djusted R-squared: 0.889 F-statistic: 76.7 on 2 and 17 DF, p-value: 3.09e-09 anova(tfit2) nalysis of Variance Table Response: life Df Sum Sq Mean Sq F value Pr(>F) speed 1 293 293 31.7 3.0e-05 *** type 1 1125 1125 121.8 3.6e-09 *** Residuals 17 157 9 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 confint(tfit2, level=0.95) 2.5 % 97.5 % (Intercept) 29.57934 44.39186 speed -0.03614-0.01707 type 12.13560 17.87290 R2_2<-summary(tfit2)$r.squared R2_2 [1] 0.9003 R2.adj_2<-summary(tfit2)$adj.r.squared R2.adj_2 [1] 0.8886 sigmahat_2<-summary(tfit2)$sigma sigmahat_2 [1] 3.039 sigmahat2_2<-sigmahat_2^2 MSE_2<-sigmahat2_2 MSE_2 9

[1] 9.239 beta.0<-summary(tfit2)$coef[1,1] beta.1<-summary(tfit2)$coef[2,1] beta.2<-summary(tfit2)$coef[3,1] beta.0 [1] 36.99 beta.1 [1] -0.02661 beta.2 [1] 15 plot(life ~ speed, pch = 16,, col=c("red","blue")[type], main="scatter plot") abline(beta.0, beta.1, lty=2, col="red") abline(beta.0+beta.2, beta.1, lty=1, col="blue") legend(950, 42, c("", ""), col = c("red", "blue"), text.col = c("red", "blue"), lty = c(2, 1), pch = c(16, 16), merge = TRUE) 10

Scatter plot life 15 20 25 30 35 40 500 600 700 800 900 1000 speed plot(tfit2$fit,tfit2$res, pch=16, col=c("red","blue")[type], xlab="fitted",ylab="residuals", main="model 2 residual plot") 11

Model 2 residual plot Residuals 4 2 0 2 4 10 15 20 25 30 35 Fitted t1<-rstudent(tfit2) t1 1 2 3 4 5 6 7 8-0.70850 1.03025-0.13423-0.03249 0.93903 0.54385-2.12781 0.03283 9 10 11 12 13 14 15 16-0.21614 0.59815-1.44449-1.59924-1.13270 0.24381 0.58543-0.23421 17 18 19 20-0.59899 0.72245 1.64823 2.05498 qqnorm(t1) qqline(t1) 12

Normal Q Q Plot Sample Quantiles 2 1 0 1 2 2 1 0 1 2 Theoretical Quantiles 5 model 3 with y and x1, x2, x1x2 attach(tool) The following objects are masked from tool (position 3): The following objects are masked from tool (position 4): The following objects are masked from tool (position 5): tfit3 <- lm(life ~ speed + type + speed*type, data=tool) 13

summary(tfit3) Call: lm(formula = life ~ speed + type + speed * type, data = tool) Residuals: Min 1Q Median 3Q Max -5.175-1.500 0.485 1.783 4.865 Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 32.77476 4.63347 7.07 2.6e-06 *** speed -0.02097 0.00607-3.45 0.0033 ** type 23.97059 6.76897 3.54 0.0027 ** speed:type -0.01194 0.00884-1.35 0.1955 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Residual standard error: 2.97 on 16 degrees of freedom Multiple R-squared: 0.91,djusted R-squared: 0.894 F-statistic: 54.3 on 3 and 16 DF, p-value: 1.32e-08 tfit3$fit # fitted values 1 2 3 4 5 6 7 8 9 10 11 12 19.98 12.85 17.68 15.16 12.22 21.66 18.52 21.45 14.11 17.47 34.69 31.40 13 14 15 16 17 18 19 20 27.78 23.83 31.73 37.33 26.79 35.35 30.08 40.29 tfit3$res # residuals 1 2 3 4 5 6 7 8 9-1.2529 1.6670-0.2462-0.6198 1.2161 2.7295-5.1750 1.2592-1.4313 10 11 12 13 14 15 16 17 18 1.8535-4.5328-4.3114-2.3808 2.2189 1.7595-1.7059-0.7234 1.4289 19 20 4.8652 3.3818 summary(tfit3) Call: lm(formula = life ~ speed + type + speed * type, data = tool) Residuals: Min 1Q Median 3Q Max 14

-5.175-1.500 0.485 1.783 4.865 Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 32.77476 4.63347 7.07 2.6e-06 *** speed -0.02097 0.00607-3.45 0.0033 ** type 23.97059 6.76897 3.54 0.0027 ** speed:type -0.01194 0.00884-1.35 0.1955 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Residual standard error: 2.97 on 16 degrees of freedom Multiple R-squared: 0.91,djusted R-squared: 0.894 F-statistic: 54.3 on 3 and 16 DF, p-value: 1.32e-08 anova(tfit3) nalysis of Variance Table Response: life Df Sum Sq Mean Sq F value Pr(>F) speed 1 293 293 33.25 2.9e-05 *** type 1 1125 1125 127.68 4.9e-09 *** speed:type 1 16 16 1.82 0.2 Residuals 16 141 9 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 confint(tfit3, level=0.95) 2.5 % 97.5 % (Intercept) 22.95224 42.597281 speed -0.03385-0.008094 type 9.62101 38.320175 speed:type -0.03069 0.006800 R2_3<-summary(tfit3)$r.squared R2_3 [1] 0.9105 R2.adj_3<-summary(tfit3)$adj.r.squared R2.adj_3 [1] 0.8937 sigmahat_3<-summary(tfit3)$sigma sigmahat_3 15

[1] 2.968 sigmahat2_3<-sigmahat_3^2 MSE_3<-sigmahat2_3 MSE_3 [1] 8.811 tool<-tool[type=="",] tool life speed type 1 18.73 610 2 14.52 950 3 17.43 720 4 14.54 840 5 13.44 980 6 24.39 530 7 13.34 680 8 22.71 540 9 12.68 890 10 19.32 730 tool<-tool[type=="",] tool life speed type 11 30.16 670 12 27.09 770 13 25.40 880 14 26.05 1000 15 33.49 760 16 35.62 590 17 26.07 910 18 36.78 650 19 34.95 810 20 43.67 500 plot(life ~ speed, pch = 16,, col=c("red","blue")[type], main="scatter plot") abline(lm(tool$life ~ tool$speed), lty=2, col="red") abline(lm(tool$life ~ tool$speed), lty=1, col="blue") legend(950, 42, c("", ""), col = c("red", "blue"), text.col = c("red", "blue"), lty = c(2, 1), pch = c(16, 16), merge = TRUE) 16

Scatter plot life 15 20 25 30 35 40 500 600 700 800 900 1000 speed plot(tfit3$fit,tfit3$res, pch=16, col=c("red","blue")[type], xlab="fitted",ylab="residuals", main="model 3 residual plot") 17

Model 3 residual plot Residuals 4 2 0 2 4 15 20 25 30 35 40 Fitted 6 partial F tests 6.1 test whether two regression lines are identical attach(tool) The following objects are masked from tool (position 3): The following objects are masked from tool (position 4): The following objects are masked from tool (position 5): 18

The following objects are masked from tool (position 6): tfit1 <- lm(life ~ speed, data=tool) # Reduced model tfit3 <- lm(life ~ speed + type + speed*type, data=tool) # Full model anova(tfit1,tfit3) nalysis of Variance Table Model 1: life ~ speed Model 2: life ~ speed + type + speed * type Res.Df RSS Df Sum of Sq F Pr(>F) 1 18 1282 2 16 141 2 1141 64.8 2.1e-08 *** --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 6.2 test whether the slopes of two regression lines are equal attach(tool) The following objects are masked from tool (position 3): The following objects are masked from tool (position 4): The following objects are masked from tool (position 5): The following objects are masked from tool (position 6): The following objects are masked from tool (position 7): tfit2 <- lm(life ~ speed + type, data=tool) # Reduced model tfit3 <- lm(life ~ speed + type + speed*type, data=tool) # Full model anova(tfit2,tfit3) nalysis of Variance Table 19

Model 1: life ~ speed + type Model 2: life ~ speed + type + speed * type Res.Df RSS Df Sum of Sq F Pr(>F) 1 17 157 2 16 141 1 16.1 1.82 0.2 20