Domain-invariant Partial Least Squares (di-pls) Regression: A novel method for unsupervised and semi-supervised calibration model adaptation

Similar documents
The Degrees of Freedom of Partial Least Squares Regression

Large Engines Competence Center

PLS score-loading correspondence and a bi-orthogonal factorization

Professor Dr. Gholamreza Nakhaeizadeh. Professor Dr. Gholamreza Nakhaeizadeh

Method for the estimation of the deformation frequency of passenger cars with the German In-Depth Accident Study (GIDAS)

Investigation in to the Application of PLS in MPC Schemes

Online Appendix for Subways, Strikes, and Slowdowns: The Impacts of Public Transit on Traffic Congestion

Regularized Linear Models in Stacked Generalization

MB3600-CH30 Laboratory FT-NIR analyzer for biodiesel applications Suitable for production optimization and product quality assessment

Lecture 2. Review of Linear Regression I Statistics Statistical Methods II. Presented January 9, 2018

Optimal Gasoline Blending

Statistical Learning Examples

Preface... xi. A Word to the Practitioner... xi The Organization of the Book... xi Required Software... xii Accessing the Supplementary Content...

Integrating remote sensing and ground monitoring data to improve estimation of PM 2.5 concentrations for chronic health studies

Supervised Learning to Predict Human Driver Merging Behavior

Workshop on Frame Theory and Sparse Representation for Complex Data June 1, 2017

Real-life emission of automatically stoked biomass boilers

Analysis of Partial Least Squares for Pose-Invariant Face Recognition

Investigating the Concordance Relationship Between the HSA Cut Scores and the PARCC Cut Scores Using the 2016 PARCC Test Data

Comparing FEM Transfer Matrix Simulated Compressor Plenum Pressure Pulsations to Measured Pressure Pulsations and to CFD Results

Module: Mathematical Reasoning

PARTIAL LEAST SQUARES: APPLICATION IN CLASSIFICATION AND MULTIVARIABLE PROCESS DYNAMICS IDENTIFICATION

Moment-Based Relaxations of the Optimal Power Flow Problem. Dan Molzahn and Ian Hiskens

Aviation in Austria. Austrian Ministry for Transport, Innovation and Technology

Zürich Testing on Fuel Effects and Future Work Programme

PREDICTION OF FUEL CONSUMPTION

Integrated System Design Optimisation: Combining Powertrain and Control Design

Article: Sulfur Testing VPS Quality Approach By Dr Sunil Kumar Laboratory Manager Fujairah, UAE

MB3600-HP10 Laboratory FT-NIR analyzer for hydrocarbon applications Pre-calibrated for blended gasoline, diesel, reformate and naphtha

Improving Analog Product knowledge using Principal Components Variable Clustering in JMP on test data.

Practical Applications of Compact High-Resolution 60 MHz Permanent Magnet NMR Systems for Reaction Monitoring and Online Process Control

Sum of ranking differences (SRD) to ensemble multivariate calibration model merits for tuning parameter selection and comparing calibration methods

Data envelopment analysis with missing values: an approach using neural network

Remote Process Analysis for Process Analysis and Optimization

On Using Storage and Genset for Mitigating Power Grid Failures

Robust alternatives to best linear unbiased prediction of complex traits

Säkerhet och självkörande personbilar Lotta Jakobsson och Trent Victor, Volvo Cars

Improving CERs building

White Paper.

Fuzzy Architecture of Safety- Relevant Vehicle Systems

13.10 How Series and Parallel Circuits Differ

CHAPTER 1 INTRODUCTION

Efficiency Measurement on Banking Sector in Bangladesh

Experimental Investigation of a 40K Single Stage High Frequency Pulse Tube Cryocooler

Smart Operation for AC Distribution Infrastructure Involving Hybrid Renewable Energy Sources

Online Learning and Optimization for Smart Power Grid

Arcing prevention by dry clean optimization at Shallow Trench Isolation (STI) Etch in AMAT MxP by use of plasma parameters

Ballard Power Systems

The use of PARAFAC in the analysis of CDOM fluorescence

Study of Fuel Economy Standard and Testing Procedure for Motor Vehicles in Thailand

Does V50 Depend on Armor Mass?

Performance Based Design for Bridge Piers Impacted by Heavy Trucks

TYPES OF BLENDING PROCESS

Forced vibration frequency response for a permanent magnetic planetary gear

Research in use of fuel conversion adapters in automobiles running on bioethanol and gasoline mixtures

PARTIAL LEAST SQUARES: WHEN ORDINARY LEAST SQUARES REGRESSION JUST WON T WORK

Estimation Procedure for Following Vapor Pressure Changes

/CENELEC Phase 3/Generic Preliminary Hazard Analysis Template

Rapid Measurement of Diesel Engine Oil Quality By Near Infrared Spectroscopy (NIRS)

Highly dynamic control of a test bench for highspeed train pantographs

SUPERVISED AND UNSUPERVISED CONDITION MONITORING OF NON-STATIONARY ACOUSTIC EMISSION SIGNALS

TECHNICAL REPORTS from the ELECTRONICS GROUP at the UNIVERSITY of OTAGO. Table of Multiple Feedback Shift Registers

Scaling of Betweenness Centrality in Weighted Complex Networks

Braking Performance Improvement Method for V2V Communication-Based Autonomous Emergency Braking at Intersections

Supporting Information

Study on V2V-based AEB System Performance Analysis in Various Road Conditions at an Intersection

Test rig for rod seals contact pressure measurement

Article: The Formation & Testing of Sludge in Bunker Fuels By Dr Sunil Kumar Laboratory Manager VPS Fujairah 15th January 2018

Featured Articles Utilization of AI in the Railway Sector Case Study of Energy Efficiency in Railway Operations

Accelerating the Development of Expandable Liner Hanger Systems using Abaqus

Successive Approximation Time-to-Digital Converter with Vernier-level Resolution

Integrated Operations Knut Hovda UiO, May 20th 2011 ABB Industry Examples Calculations and engineering software. ABB Group June 17, 2011 Slide 1

Optimization of Chromatogram Alignment Using A Class Separability Criterion

BAC and Fatal Crash Risk

Optimal Vehicle to Grid Regulation Service Scheduling

Dependence of Shaft Stiffness on the Crack Location

Regression Models Course Project, 2016

Treball Final de Grau

From Patient to Plate

Can Vehicle-to-Grid (V2G) Revenues Improve Market for Electric Vehicles?

FUZZY CONTROL OF INVERTED PENDULUM USING REAL-TIME TOOLBOX

Track Based Fuel and Lap Time Engine Optimization. ESTECO Academy Design Competition 2016/2017. In partnership with: APRILIA RACING & GTI Software

CFD on Cavitation around Marine Propellers with Energy-Saving Devices

DOC design & sizing using GT-SUITE European GT Conference Gauthier QUENEY 09/10/2017

DIRECT TORQUE CONTROL OF A THREE PHASE INDUCTION MOTOR USING HYBRID CONTROLLER. RAJESHWARI JADI (Reg.No: M070105EE)

AGENT-BASED MODELING, SIMULATION, AND CONTROL SOME APPLICATIONS IN TRANSPORTATION

A Large Modern High Speed Reciprocating Compressor

Synergies of EP/Anti Wear Additives for Metal Working Fluids. Dover Chemical Corporation John Nussbaumer Technical Service Manager-Metalworking

ASAM ATX. Automotive Test Exchange Format. XML Schema Reference Guide. Base Standard. Part 2 of 2. Version Date:

An Adaptive Nonlinear Filter Approach to Vehicle Velocity Estimation for ABS

Are You Prepared For Tier 3 Gasoline Standards? Presented by: Shaun Spiro and Leslie Johnson

Background Electric power source which is connected directly to the distribution network or on the customer site of the meter (Ackermann, 2001).

Estimation and Optimization of Vessel Fuel Consumption

Predicted response of Prague residents to regulation measures

Determination of Phenolic Antioxidant DBPC and DBP Levels in Electrical Insulating Oil

SOME ISSUES OF THE CRITICAL RATIO DISPATCH RULE IN SEMICONDUCTOR MANUFACTURING. Oliver Rose

Influence of Cylinder Bore Volume on Pressure Pulsations in a Hermetic Reciprocating Compressor

2.810 Manufacturing Processes and Systems. Quiz II (November 19, 2014) Open Book, Open Notes, Computers with Internet Off 90 Minutes

HELLENIC REPUBLIC MINISTRY OF DEVELOPMENT DIRECTORATE-GENERAL FOR ENERGY DIRECTORATE FOR RENEWABLE ENERGY SOURCES AND ENERGY-SAVING EXTENSIVE SUMMARY

Antonio Aguilar (antonioa), Justin Luke (jthluke), Robert Spragg (spragg)

Transcription:

Domain-invariant Partial Least Squares (di-pls) Regression: A novel method for unsupervised and semi-supervised calibration model adaptation R. Nikzad-Langerodi W. Zellinger E. Lughofer T. Reischer 2 S. Saminger-Platz Department of Knowledge-Based Mathematical Systems Johannes Kepler University Linz, Austria Fuzzy Logic Laboratory, Linz-Hagenberg 2 Metadynea GmbH, Krems, Austria th Winter Symposium on Chemometrics Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- / 8

Introduction/Motivation Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 2 / 8

Introduction/Motivation Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 2 / 8

Introduction/Motivation Instrument Standardization (Cargill Corn Data Set) X T =X T F ===== F=X T X S DS,PDS,GLSW,SST... Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 2 / 8

Introduction/Motivation Instrument Standardization (Cargill Corn Data Set) X T =X T F ===== F=X T X S DS,PDS,GLSW,SST... Why not align distributions implicitly? Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 2 / 8

Previous Work on Domain Adaptation Maximum Mean Discrepancy MMD(P, Q) = E XS P[φ(X S )] E XT Q[φ(X T )] H φ : X H Transfer Component Analysis (TCA) Pan et al. 20 min φ MMD s.t. Maximize Variance Scatter Component Analysis (SCA) Ghifary et al. 206 Requires non-linear Kernels to align higher order moments Deep/Transfer Learning Correlation Alignment (Corral) Sun et Saenko 206 Central Moment Discrepancy (CMD) Zellinger et al. 207 Unsupervised/Difficult to Optimize Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 3 / 8

Our Approach - Domain Regularization Domain Invariant Principle Component Analysis (PCA) min t,p tpt 2 F + λf (t S, t T ) }{{} Domain Regularizer Penalize Difference between Source and Target Distributions in LV Space Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 4 / 8

Our Approach - Domain Regularization Domain Invariant Principle Component Analysis (PCA) min t,p tpt 2 F + λf (t S, t T ) }{{} Domain Regularizer =0 {}}{ f (t S, t T ) = E[t S ] E[t T ] + E[tS] 2 E[tT 2 ] = n S pt X T S X S p n T pt X T T X T p Penalize Difference between Source and Target Distributions in LV Space Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 4 / 8

Our Approach - Domain Regularization Domain Invariant Principle Component Analysis (PCA) min t,p tpt 2 F + λf (t S, t T ) }{{} Domain Regularizer var = E[(X E[X ]) ] = E[X 2 ] }{{} 0 { =0 }} { f (t S, t T ) = E[t S ] E[t T ] + E[tS] 2 E[tT 2 ] µ = E[X ] = 0 }{{} local mean centering = n S pt X T S X S p n T pt X T T X T p Penalize Difference between Source and Target Distributions in LV Space Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 4 / 8

Domain-Invariant PCA L(t, p) = X tp T 2 F + λ n S pt X T S X S p n T pt X T T X T p p L = 0 Unconstrained Solution p T = tt X t T t [ I + ( λ 2t T t n S XT S X S n T XT T X T )] Identity Matrix (J J) (Deflated) Source and Target Covariance Matrices Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 5 / 8

Domain-Invariant PLS L(w) = X yw T 2 F + λ n S wt X T S X S w n T wt X T T X T w w L = 0 Unconstrained Solution w T = yt X y T y [ I + ( λ 2y T y n S XT S X S n T XT T X T )] Identity Matrix (J J) (Deflated) Source and Target Covariance Matrices Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 6 / 8

Proof of Concept λ = 0 λ = 00 Domain Regularization Aligns Covariance Structure of Source and Target Data Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 7 / 8

Let s Take a Closer Look Domain-Invariant PLS w T = yt X y T y [ I + ( λ 2y T y n S XT S X S n T XT T X T )] Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 8 / 8

Let s Take a Closer Look Domain-Invariant PLS w T = yt X y T y [ I + ( λ 2y T y n S XT S X S n T XT T X T )] X = X S Unsupervised Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 8 / 8

Let s Take a Closer Look Domain-Invariant PLS w T = yt X y T y [ I + ( λ 2y T y n S XT S X S n T XT T X T )] X = X S Unsupervised X = [X S ; X T ] Semi-Supervised Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 8 / 8

Let s Take a Closer Look Domain-Invariant PLS w T = yt X y T y [ I + ( λ 2y T y n S XT S X S n T XT T X T )] X = X S Unsupervised X = [X S ; X T ] Semi-Supervised X S, X T / X Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 8 / 8

How to Set λ Choosing λ too high aligns the Noise (Go for Pareto Optimal Point) Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 9 / 8

Component-Wise Model Selection Optimize λ i for i =,..., A LVs separately Largest effect usually for the first LV For NIR data 0 8 λ 0 9 λ >> 0 9 tends to shrink w T X T Xw Alternate between Optimization and Deflation Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 0 / 8

Case Study - Melamine Formaldehyde (MF) Condensation Monitoring of Condensation by FT-NIR Spectroscopy Recipe Changes often require Adaptation/Recalibration of PLS Models Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- / 8

Case Study - Melamine Formaldehyde (MF) Condensation Results Unsupervised Adaptation - Unlabeled Data from 3 Batches of Different Recipe Scenario RMSECV d B (T S, T T ) RMSEP PLS di-pls PLS di-pls PLS di-pls di-pls (Best) 562 568 2.30 0.0059 0.0037 2.76 2.47 2.45 562 86 2.34 2.8 0.026 0.09 3.29 3.28 n.s. 2.85 562 862 2.8 0.08 0.06 2.2 2.23 n.s. 2.29 568 562 2.30 n.s. 0.007 0.006 2.67 2.58 2.59 568 86 2.34 2.9 0.07 0.0 2.90 2.92 n.s. 2.58 568 862 2.2 0.08 0.06 2.45 2.38 2.37 86 562 2.5 0.020 0.09 3.53 3.29 3.4 86 568 2.64 2.47 0.03 0.02 3.23 3.0 3.04 86 862 2.32 0.049 0.04 3.0 2.8 2.69 862 562 2.22 0.023 0.022 3.60 3.52 n.s. 2.95 862 568 2.29 2.4 0.023 0.09 4.2 4. n.s. 4.03 862 86 2.3 0.057 0.054 4.92 4.90 n.s. 4.65 Improvement in Source (/2) and Target (6/2) Domain Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 2 / 8

Case Study - Melamine Formaldehyde (MF) Condensation Results Semi-Supervised Adaptation - Unlabeled Data from 3 Batches + 25 Labeled Samples Scenario RMSECV d B (T S, T T ) RMSEP PLS di-pls PLS di-pls PLS di-pls 562 568 2.44 2.7 0.8 0.3 2.69 2.64 n.s. 562 86 2.38 2.8 0. 0.02 3.6 2.68 562 862 2.40 2.25 0.42 0.03 2.57 2.22 568 562 2.34 2.26 n.s. 0.008 0.008 2.60 2.63 n.s. 568 86 2.40 2.40 n.s. 0.06 0.008 3.22 2.82 568 862 2.33 2.29 n.s. 0.40 0.04 2.48 2.32 86 562 2.65 2.44 0.29 0.08 3.39 3.04 86 568 2.90 2.67 0.36 0.26 3.24 2.93 86 862 2.65 2.44 0.39 0.06 2.96 2.75 862 562 2.8 2.52 0.45 0.0 3.27 2.83 862 568 2.25 2.0 0.60 0.34 2.65 2.3 862 86 2.70 2.43 0.9 0. 3.86 3.33 Outperformance of Standard PLS with Calibration Set Augmentation in 0/2 Scenarios Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 3 / 8

Case Study - Melamine Formaldehyde (MF) Condensation Unsupervised Model Adaptation Can improve predictions in target domain if P(X S ) P(X T ) P(y X S ) P(y X T ) (Mismatch in Marginal Distributions) (Conditionals are Similar) Semi-Supervised Model Adaptation is required if P(X S ) P(X T ) P(y X S ) P(y X T ) (Mismatch in Marginal Distributions) (Conditionals are Different) How to find out remains an open question! Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 4 / 8

Summary and Conclusion Domain Invariant Extensions to PCA and PLS Implicit distribution alignment Unsupervised/Semi- Supervised Adaptation (Component-Wise) Model Selection Tested on Real-World FT-NIR Dataset Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 5 / 8

Open Problems and Future Perspectives Open Problems Convexity {}}{ f (t S, t T ) = w T ( X T S X S X T T X T )w f = S 0 S + = QΛ + Q T Numerical Problems no guarantee that d(t S, t T ) gets smaller as λ is increased Future Perspectives Extension to Multiple Domains (i.e. X, X 2,...,X d X d+ ) S Heterogeneous Transfer Learning/Data Integration (i.e. X S X T ) Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 6 / 8

Acknowledgments This work was funded by the Austrian research funding association (FFG) under the scope of the COMET programme within the research project Industrial Methods for Process Analytical Chemistry - From Measurement Technologies to Information Systems (impacts) (contract # 843546). This programme is promoted by BMVIT, BMWFW, the federal state of Upper Austria and the federal state of Lower Austria. Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 7 / 8

Thank You! Ramin Nikzad-Langerodi (JKU Linz, Austria) Domain-Invariant PLS WSC- 8 / 8