Motor Trend Yvette Winton September 1, 2016

Similar documents
Regression Models Course Project, 2016

Motor Trend MPG Analysis

AIC Laboratory R. Leaf November 28, 2016

tool<-read.csv(file="d:/chilo/regression 7/tool.csv", header=t) tool

Stat 301 Lecture 30. Model Selection. Explanatory Variables. A Good Model. Response: Highway MPG Explanatory: 13 explanatory variables

Stat 401 B Lecture 31

delivery<-read.csv(file="d:/chilo/regression 4/delivery.csv", header=t) delivery

Drilling Example: Diagnostic Plots

Subsetting Data in R. Data Wrangling in R

Investigation of Relationship between Fuel Economy and Owner Satisfaction

Appendix B STATISTICAL TABLES OVERVIEW

Important Formulas. Discrete Probability Distributions. Probability and Counting Rules. The Normal Distribution. Confidence Intervals and Sample Size

Modeling Ignition Delay in a Diesel Engine

Lampiran IV. Hasil Output SPSS Versi 16.0 untuk Analisis Deskriptif

fruitfly fecundity example summary Tuesday, July 17, :13:19 PM 1

9.3 Tests About a Population Mean (Day 1)

HASIL OUTPUT SPSS. Reliability Scale: ALL VARIABLES

Booklet of Code and Output for STAD29/STA 1007 Final Exam

Power and Fuel Economy Tradeoffs, and Implications for Benefits and Costs of Vehicle Greenhouse Gas Regulations

Stat 401 B Lecture 27

Studying the Factors Affecting Sales of New Energy Vehicles from Supply Side Shuang Zhang

ggplot2: easy graphics with R

MVT SOLUTIONS CERTIFIED TM FUEL ECONOMY TEST ECO FLAPS WITHOUT TRAILER SKIRTS MPG WITH TRAILER SKIRTS MPG

The PRINCOMP Procedure

Stat 301 Lecture 26. Model Selection. Indicator Variables. Explanatory Variables

Lecture 2. Review of Linear Regression I Statistics Statistical Methods II. Presented January 9, 2018

Type 'demo()' for some demos, 'help()' for on-line help, or 'help.start()' for an HTML browser interface to help. Type 'q()' to quit R.

R-Sq criterion Data : Surgical room data Chap 9

Relating your PIRA and PUMA test marks to the national standard

Using Statistics To Make Inferences 6. Wilcoxon Matched Pairs Signed Ranks Test. Wilcoxon Rank Sum Test/ Mann-Whitney Test

Relating your PIRA and PUMA test marks to the national standard

TABLE 4.1 POPULATION OF 100 VALUES 2

Tactical Vehicle Cons & Reps Cost Estimating Relationship (CER) Tool

. Enter. Model Summary b. Std. Error. of the. Estimate. Change. a. Predictors: (Constant), Emphaty, reliability, Assurance, responsive, Tangible

5. CONSTRUCTION OF THE WEIGHT-FOR-LENGTH AND WEIGHT-FOR- HEIGHT STANDARDS

Technical Papers supporting SAP 2009

Chapter 5 ESTIMATION OF MAINTENANCE COST PER HOUR USING AGE REPLACEMENT COST MODEL

Identify Formula for Throughput with Multi-Variate Regression

FutureMetrics LLC. 8 Airport Road Bethel, ME 04217, USA. Cheap Natural Gas will be Good for the Wood-to-Energy Sector!

PLUG ASSIST MATERIALS FOR IMPROVED FORMING OF TRANSPARENT POLYPROPYLENE

CASE STUDY 1612C FUEL ECONOMY TESTING

# of tests Condition g/mile ± g/mile ± g/mile ± (miles/gal) ± Impact of Diesel Extreme on emissions and fuel economy USDS results:

Improving CERs building

Objectives. Materials TI-73 CBL 2

Article: Sulfur Testing VPS Quality Approach By Dr Sunil Kumar Laboratory Manager Fujairah, UAE

Momentum, Energy and Collisions

Non-Obvious Relational Awareness for Diesel Engine Fluid Consumption

Vehicle Scrappage and Gasoline Policy. Online Appendix. Alternative First Stage and Reduced Form Specifications

TRINITY COLLEGE DUBLIN THE UNIVERSITY OF DUBLIN. Faculty of Engineering, Mathematics and Science. School of Computer Science and Statistics

When the points on the graph of a relation lie along a straight line, the relation is linear

PREDICTION OF FUEL CONSUMPTION

Investigating the Concordance Relationship Between the HSA Cut Scores and the PARCC Cut Scores Using the 2016 PARCC Test Data

A REPORT ON THE STATISTICAL CHARACTERISTICS of the Highlands Ability Battery CD

Reed Switch Life Characteristics under Different Levels of Capacitive Loading

Introduction. Materials and Methods. How to Estimate Injection Percentage

Effect of Sample Size and Method of Sampling Pig Weights on the Accuracy of Estimating the Mean Weight of the Population 1

CASE STUDY 1612B FUEL ECONOMY TESTING

Large Sample Ecodriving Experiment Preliminary Results

sponsoring agencies.)

DEPARTMENT OF STATISTICS AND DEMOGRAPHY MAIN EXAMINATION, 2011/12 STATISTICAL INFERENCE II ST232 TWO (2) HOURS. ANSWER ANY mree QUESTIONS

Lampiran 1. Data Perusahaan

PERFORMANCE AND ACCEPTANCE OF ELECTRIC AND HYBRID VEHICLES

Product Loss During Retail Motor Fuel Dispenser Inspection

September 21, Introduction. Environmental Protection Agency ( EPA ), National Highway Traffic Safety

Driver Personas. New Behavioral Clusters and Their Risk Implications. March 2018

Problem Set 3 - Solutions

Ride Smoothness Measurement and Specification Issues. Nicholas Vitillo, Ph. D. Manager, Bureau of Research New Jersey Department of Transportation

SAN PEDRO BAY PORTS YARD TRACTOR LOAD FACTOR STUDY Addendum

Topic 5 Lecture 3 Estimating Policy Effects via the Simple Linear. Regression Model (SLRM) and the Ordinary Least Squares (OLS) Method

DRIVER SPEED COMPLIANCE WITHIN SCHOOL ZONES AND EFFECTS OF 40 PAINTED SPEED LIMIT ON DRIVER SPEED BEHAVIOURS Tony Radalj Main Roads Western Australia

Oregon DOT Slow-Speed Weigh-in-Motion (SWIM) Project: Analysis of Initial Weight Data

Analyzing Crash Risk Using Automatic Traffic Recorder Speed Data

Ricardo-AEA. Passenger car and van CO 2 regulations stakeholder meeting. Sujith Kollamthodi 23 rd May

UMTRI An Examination of the Michigan 2010 Motor Vehicle Traffic Crash Fatality Increase

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN

Preface... xi. A Word to the Practitioner... xi The Organization of the Book... xi Required Software... xii Accessing the Supplementary Content...

Hydraulic Drive Head Performance Curves For Prediction of Helical Pile Capacity

Burn Characteristics of Visco Fuse

Guatemalan cholesterol example summary

Lab 9: Faraday s and Ampere s Laws

LET S ARGUE: STUDENT WORK PAMELA RAWSON. Baxter Academy for Technology & Science Portland, rawsonmath.

Index. Calculated field creation, 176 dialog box, functions (see Functions) operators, 177 addition, 178 comparison operators, 178

WHITE PAPER. Preventing Collisions and Reducing Fleet Costs While Using the Zendrive Dashboard

The Mutagenic Effects of Crude Oil Fuels on Cell Mutation. Michael Bushnell Pittsburgh Central Catholic High School 9th Grade

A DIFFERENCE IN ROLLOVER FREQUENCY BETWEEN CHEVROLET AND GMC TRUCKS. Hans C. Joksch. The University of Michigan Transportation Research Institute

A Cost Benefit Analysis of Faster Transmission System Protection Schemes and Ground Grid Design

Impact Evaluation of 2004 Compressed Air Prescriptive Rebates

TRY OUT 25 Responden Variabel Kepuasan / x1

Antonio Olmos Priyalatha Govindasamy Research Methods & Statistics University of Denver

Risk factors, driver behaviour and accident probability. The case of distracted driving.

Draft Project Deliverables: Policy Implications and Technical Basis

OPTIMIZATION OF BIODIESEL PRODCUTION FROM TRANSESTERIFICATION OF WASTE COOKING OILS USING ALKALINE CATALYSTS

Consumer Satisfaction with New Vehicles Subject to Greenhouse Gas and Fuel Economy Standards

Missouri Seat Belt Usage Survey for 2017

Quality Control in Mineral Exploration

Optimizing diesel engine parameters for low emissions using Taguchi method: variation risk analysis approach Part I

Nondestructive Detection and Quantification of Blueberry Bruising using Near-infrared (NIR) Hyperspectral Reflectance Imaging

UJI VALIDITAS DAN RELIABILIAS VARIABEL KOMPENSASI

Class 8 Truck Tractor Braking Performance Improvement Study

CCQTA TVP Project Update

Transcription:

Motor Trend Yvette Winton September 1, 2016 Executive Summary Objective In this analysis, the relationship between a set of variables and miles per gallon (MPG) (outcome) is explored from a data set of a collection of cars. The objective of the analysis is to find out if an automatic or manual transmission is more fuel effecient. The MPG difference between automatic and manual transmissions will also be quantified. Conclusion There is statistically significant difference between the single fitted model of mpg ~ am (transmission type) as well as the best fitted multivariate model of mpg ~ am + wt + qsec (transmission type + weight of vehicle + 1/4 mile time). In the single fitted model, mean fuel efficiency for manual transmission is 24.4 MPG, this is 7.2 MPG higher than that of automatic transmission which is 17.1 MPG. In best fitted multivariate model, mean fuel efficency for manual transmission is only 2.9 mpg higher than that of automatic transmission while holding weight and 1/4 mile time constant. There are uncertainty to both models. The single fitted model adjusted R-square is 0.339 and multivariate fitted model adjusted R-square is 0.833. Multivariate fitted model can better predict mpg but can only explain the mpg variability 83.3% of the time. Analysis To explore the fuel effciency difference, if any, between automatic and manual transmission. Fuel effciency is plotted by transmission type. (See appendix) T-test is done for fuel efficiency between the tansmission types. It is significantly different between the fuel effeciency of automatic transmission vs manual transmission because P-value is < 0.05 and confidence interval does not contain 0, thus null is rejected. Mean Fuel Efficiency for manual transmission is 24.4 MPG, this is 7.2 MPG higher than that of automatic transmission which is 17.1 MPG. t.test(mpg ~ transm, data=mtcars1) Welch Two Sample t-test data: mpg by transm t = -3.7671, df = 18.332, p-value = 0.001374 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: -11.280194-3.209684 sample estimates: mean in group automatic mean in group manual 17.14737 24.39231 1

However, there are other variables that fuel efficiency has strong correlation to. In order to find the best fitted multivariate model, the following is being done. Please refer to appendix for details. 1. Start with a fitted model of fuel efficiency vs transmission type (interest of this analysis) 2. After fitting fuel efficiency vs all variables in the dataset, variables are ranked in ascending order of their P-values. 3. Each variable (with P-values in ascending order) is added to fitted model of fuel efficiency vs transmission type, if P value of added variable is > 0.05, meaning newly added variable has little significance to the model, newly added variable will not be included in the model. 4. The above is repeated until all variables that are significant to the fitted model with P value < 0.05 are added. The best fitted model was found to be fuel effeciency vs transmission type + weight + 1/4mile time (lm(mpg ~ am+wt+qsec)). Anova is performed P value for best multivariate model is very small, thus all 3 variables am, wt and qsec have significant influence in the model predicting fuel efficency. In this mulivariate model, the mean of fuel efficiency of manual transmission is only 2.9 mpg higher than that of automatic transmission when holding wt and qsec constant. fitam <-lm(mpg~ am, data=mtcars) fitbest<-lm(mpg~ am + wt + qsec, data=mtcars) anova(fitam, fitbest) Analysis of Variance Table Model 1: mpg ~ am Model 2: mpg ~ am + wt + qsec Res.Df RSS Df Sum of Sq F Pr(>F) 1 30 720.90 2 28 169.29 2 551.61 45.618 1.55e-09 *** --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 summary(fitbest)$coeff (Intercept) 9.617781 6.9595930 1.381946 1.779152e-01 am 2.935837 1.4109045 2.080819 4.671551e-02 wt -3.916504 0.7112016-5.506882 6.952711e-06 qsec 1.225886 0.2886696 4.246676 2.161737e-04 Adjusted R square for single fitted model is 0.339 and adjusted R square for the multivariate fitted model is 0.833. summary(fitam)$adj.r.squared [1] 0.3384589 summary(fitbest)$adj.r.squared [1] 0.8335561 There is slight curve to the residual plot of the best fitted multivariate model lm(mpg ~ am + wt + qsec), but there is no obvious trend, residuals are random. From QQ plot, the residuals are normally distributed. (See appendix) 2

Appendix: library(ggplot2) ggplot(aes(x=transm, y=mpg), data=mtcars1) + geom_boxplot(aes(fill=transm)) + xlab("transmission Type") 35 Miles/Gallon vs Transmission Type 30 Miles/(US)gallon 25 20 transm automatic manual 15 10 automatic Transmission Type manual summary(lm(mpg ~., data=mtcars))$coeff (Intercept) 12.30337416 18.71788443 0.6573058 0.51812440 cyl -0.11144048 1.04502336-0.1066392 0.91608738 disp 0.01333524 0.01785750 0.7467585 0.46348865 hp -0.02148212 0.02176858-0.9868407 0.33495531 drat 0.78711097 1.63537307 0.4813036 0.63527790 wt -3.71530393 1.89441430-1.9611887 0.06325215 qsec 0.82104075 0.73084480 1.1234133 0.27394127 vs 0.31776281 2.10450861 0.1509915 0.88142347 am 2.52022689 2.05665055 1.2254035 0.23398971 gear 0.65541302 1.49325996 0.4389142 0.66520643 carb -0.19941925 0.82875250-0.2406258 0.81217871 summary(lm(mpg~ am + wt, data=mtcars))$coeff (Intercept) 37.32155131 3.0546385 12.21799285 5.843477e-13 am -0.02361522 1.5456453-0.01527855 9.879146e-01 wt -5.35281145 0.7882438-6.79080719 1.867415e-07 3

summary(lm(mpg~ am + wt + qsec, data=mtcars))$coeff (Intercept) 9.617781 6.9595930 1.381946 1.779152e-01 am 2.935837 1.4109045 2.080819 4.671551e-02 wt -3.916504 0.7112016-5.506882 6.952711e-06 qsec 1.225886 0.2886696 4.246676 2.161737e-04 summary(lm(mpg~ am + wt + qsec + hp, data=mtcars))$coeff (Intercept) 17.44019110 9.3188688 1.871492 0.072149342 am 2.92550394 1.3971471 2.093913 0.045790788 wt -3.23809682 0.8898986-3.638726 0.001141407 qsec 0.81060254 0.4388703 1.847021 0.075731202 hp -0.01764654 0.0141506-1.247052 0.223087932 summary(lm(mpg~ am + wt + qsec + disp, data=mtcars))$coeff (Intercept) 6.442378425 8.25723071 0.7802105 0.4420535035 am 3.310153929 1.51241278 2.1886577 0.0374482129 wt -4.588282099 1.16677426-3.9324506 0.0005290227 qsec 1.416958261 0.39148853 3.6194119 0.0012000578 disp 0.007689836 0.01053478 0.7299473 0.4717085406 summary(lm(mpg~ am + wt + qsec + drat, data=mtcars))$coeff (Intercept) 7.6277466 8.2102682 0.9290496 3.610954e-01 am 2.5728751 1.6225267 1.5857213 1.244465e-01 wt -3.8039824 0.7592452-5.0102160 2.963097e-05 qsec 1.1958078 0.2995350 3.9922141 4.517773e-04 drat 0.6429296 1.3551408 0.4744375 6.390028e-01 par(mfrow=c(2,2)) plot(fitbest) 4

Residuals 4 0 4 Residuals vs Fitted 10 15 20 25 30 1 1 Normal Q Q 2 1 0 1 2 Fitted values Theoretical Quantiles 0.0 1.0 Scale Location 10 15 20 25 30 1 1 Residuals vs Leverage Chrysler Fiat 128 Imperial 0.5 Merc 230 Cook's distance 0.00 0.10 0.20 0.30 Fitted values Leverage 5