KNIME Software Pieces KNIME.com AG. All Rights Reserved. 1

Similar documents
What s Cooking. Bernd Wiswedel KNIME KNIME.com AG. All Rights Reserved.

What s new. Bernd Wiswedel KNIME.com AG. All Rights Reserved.

What s Cooking. Bernd Wiswedel KNIME KNIME AG. All Rights Reserved.

What s Cooking. Bernd Wiswedel KNIME KNIME AG. All Rights Reserved.

What s Cooking. Bernd Wiswedel KNIME KNIME AG. All Rights Reserved.

What s cooking. Bernd Wiswedel KNIME.com AG. All Rights Reserved.

What s New. Bernd Wiswedel KNIME KNIME AG. All Rights Reserved.

What s New. Bernd Wiswedel KNIME KNIME AG. All Rights Reserved.

What s new. Bernd Wiswedel KNIME.com AG. All Rights Reserved.

KNIME Server Workshop

The Session.. Rosaria Silipo Phil Winters KNIME KNIME.com AG. All Right Reserved.

KNIME Spring Summit Opening -

Query Engines for Hive: MR, Spark, Tez with LLAP Considerations!

ANALYSIS OF TRAFFIC SPEEDS IN NEW YORK CITY. Austin Krauza BDA 761 Fall 2015

Survey Report Informatica PowerCenter Express. Right-Sized Data Integration for the Smaller Project

Informatica Powercenter 9 Designer Guide Pdf

Introduction to Abaqus/CAE. Abaqus 2018

License Model Schedule Actuate License Models for the Open Text End User License Agreement ( EULA ) effective as of November, 2015

Statistical Learning Examples

David A. Ostrowski Global Data Insights and Analytics

Veritas CloudPoint Release Notes. Ubuntu

Software for Data-Driven Battery Engineering. Battery Intelligence. AEC 2018 New York, NY. Eli Leland Co-Founder & Chief Product Officer 4/2/2018

BLUECAT ENTERPRISE DNS

Multi-level Feeder Queue Dispatch based Electric Vehicle Charging Model and its Implementation of Cloud-computing

Informatica Powercenter 9 Transformation Guide Pdf

Applied Data Science, Big Data and The PI System

FULL THROTTLE ANALYTICS

FALL 2007 MBA EXIT SURVEY (Sample size of 29: 15 responses from the San Marcos location and 14 responses from the RRHEC location)

MetaXpress PowerCore System Installation and User Guide

Professor Dr. Gholamreza Nakhaeizadeh. Professor Dr. Gholamreza Nakhaeizadeh

Scaling industrial control technologies for food & beverage industry

Open Source Big Data Management for Connected Vehicles

Porting Applications to the Grid

Training Course Catalog

Journal of Emerging Trends in Computing and Information Sciences

dcache, agile adoption of storage technologies

Towards Realizing Autonomous Driving Based on Distributed Decision Making for Complex Urban Environments

Substructures and Submodeling with Abaqus. About this Course

DYNA4 Open Simulation Framework with Flexible Support for Your Work Processes and Modular Simulation Model Library

Highly dynamic control of a test bench for highspeed train pantographs

2015 The MathWorks, Inc. 1

Release Enhancements GXP Xplorer GXP WebView

Data Mining Approach for Quality Prediction and Improvement of Injection Molding Process

GEODE Workshop on Smart Grids Projects Brussels, 10 th of May 2017.

Index COPYRIGHTED MATERIAL

LIGHT Battery. New technologies for an efficient BMS. LION Smart GmbH

Videosystem CAR-READER

PRODUCT DESCRIPTIONS AND METRICS

Global Grid Reliability Advances

EMPACK MECHELEN, 11 OCTOBER 2017, STAF SEURINCK, ABB BENELUX Upcoming digital solutions and services.

Issue 06 / StoraXe PowerBooster Compact outdoor battery system in distribution networks

Caliber: Road Quality Profiling

Barrie D. Fitzgerald Senior Research Analyst, Valdosta State University Sarah E. Hough Research Analyst, Valdosta State University Tiffany S.

Scaling Document Clustering in the Cloud. Robert Gillen Computer Science Research Cloud Futures 2011

WHITE PAPER. Informatica PowerCenter 8 on HP Integrity Servers: Doubling Performance with Linear Scalability for 64-bit Enterprise Data Integration

CHANGE OF IT THROUGH DIGITALIZATION. KLAUS STRAUB, CIO BMW GROUP

ABB Innovation & Technology Day

Stihl Technical Reference Guide

National Grid New Energy Solutions (NES)

DARS v2.10 New Features & Enhancements

Harris Geospatial Solutions

TechniCity Final Project: An Urban Parking Solution for Columbus, OH

QNX Automotive Overview Senthil Kumar, Application Engineer

Optimal Vehicle to Grid Regulation Service Scheduling

ADAS Solutions. Maintenance of advanced driver assistance systems.

DOWNLOAD B767 ENGINE RUN UP CHECKLIST - HOME4APK.COM

Wireless Monitoring of Airport Fuel Tank Farms to Optimize Operations

Smart Mobility in Berlin: Innovations in electric, connected, automated and intermodal Mobility

Lesson 1: Introduction to PowerCivil

Modeling Contact with Abaqus/Standard. Abaqus 2018

Sinfonia: a new paradigm for building scalable distributed systems

Remote Process Analysis for Process Analysis and Optimization

Intelligent Transportation Systems. Secure solutions for smart roads and connected highways. Brochure Intelligent Transportation Systems

Survey123 for ArcGIS smarter forms, smarter workfields

Testbeds for Reproducible Research

More power to manufacturers. Improving electric vehicle production processes

Battery Fingerprint Technologies

IN SPRINTS TOWARDS AUTONOMOUS DRIVING. BMW GROUP TECHNOLOGY WORKSHOPS. December 2017

Repeatable perfection comes to Wide Format

World premiere at Hannover Messe: ZF s highly automated forklift can see, think and act

DATA & ANALYSIS 2018 TOEIC. Program. Table of contents

Index. Calculated field creation, 176 dialog box, functions (see Functions) operators, 177 addition, 178 comparison operators, 178

THIELE TECHNOLOGIES, INC.

MSC/Flight Loads and Dynamics Version 1. Greg Sikes Manager, Aerospace Products The MacNeal-Schwendler Corporation

From Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT. Full book available for purchase here.

GUI Customization with Abaqus. Abaqus 2017

SOLUTION BRIEF MACHINE DATA ANALYTICS FOR EV CHARGING STATIONS. SOLUTION BRIEF Machine Data Analytics for the EV Charging Stations Industry

PRODUCT DESCRIPTIONS AND METRICS

Storage. A Watt smarter!

Metal Forming with Abaqus. Abaqus 2017

Mini-Lab Gas Turbine Power System TM Sample Lab Experiment Manual

The future grid. Engineering Dreams

Battery Aging Analysis

Measurement made easy. Predictive Emission Monitoring Systems The new approach for monitoring emissions from industry

ESTECO DESIGN COMPETITION 2018 RULES AND REGULATIONS

InnoTrans SEPTEMBER BERLIN

Understanding KPI trade-offs - key challenges of modelling architectures and data acquisition Gurtner, G. and Cook, A.J.

Use of the ERD for administrative monitoring of Theta:

GPP PGS2 PARKING GUIDANCE SYSTEM

SHRP 2 RID. Roadway Information Database

Transcription:

KNIME Software Pieces 2017 KNIME.com AG. All Rights Reserved. 1

A Peek into KNIME Big Data Labs The Big Data Team KNIME 2017 KNIME.com AG. All Rights Reserved.

KNIME Big Data Connectors Package required drivers/libraries for HDFS, Hive, Impala access Runs on Hadoop Preconfigured connectors Hive Cloudera Impala (secured) HDFS, webhdfs, httpfs Support for Kerberos secured cluster Extends the open source database and remote file handling integration 2017 KNIME.com AG. All Rights Reserved. 3

KNIME Spark Executor Based on Spark MLlib Scalable machine learning library Runs on Hadoop Algorithms for Classification (decision tree, naïve bayes, ) Regression (logistic regression, linear regression, ) Clustering (k-means) Collaborative filtering (ALS) Dimensionality reduction (SVD, PCA) Supports Spark version 1.2, 1.3, 1.5 and 1.6 Support for Kerberos secured cluster 2017 KNIME.com AG. All Rights Reserved. 4

The Question Wouldn t it be great to know if your flight will be delayed? https://pixabay.com/en/staircase-airport-modern-technology-1149599/ 2017 KNIME.com AG. All Rights Reserved. 5

The Answer Of course, so let s learn a model that does! https://pixabay.com/en/banner-yes-no-decision-choice-1183407/ 2017 KNIME.com AG. All Rights Reserved. 6

The Airport Chicago O Hare International Airport https://commons.wikimedia.org/wiki/file:o%27hare_with_aa_plane.jpg Foto Ad Meskens 2017 KNIME.com AG. All Rights Reserved. 7

The Airport Flughafen Berlin Brandenburg https://de.wikipedia.org/wiki/datei:bbi_2010-07-23_5.jpg 2017 KNIME.com AG. All Rights Reserved. 8

The Data Historical Flight Data Airport and City Information Geo Coordinates Airplane Data Radar Images Textual Weather Reports https://commons.wikimedia.org/wiki/file:world-airline-routema p-2009.png 2017 KNIME.com AG. All Rights Reserved. 9

The Challenges Many Data Sources and Formats Analyze the Data Computing Constraints Large Unstructured Data What s new What s cooking 2017 KNIME.com AG. All Rights Reserved. 10

The Challenges Many Data Sources and Formats Analyze the Data Computing Constraints Large Unstructured Data What s new What s cooking 2017 KNIME.com AG. All Rights Reserved. 11

New Spark Reader and Writer Nodes Read and write various data formats from scalable storage e.g. HDFS Data preview in the node dialog 2017 KNIME.com AG. All Rights Reserved. 12

Virtual Data Warehouse 2017 KNIME.com AG. All Rights Reserved. 13

Different Data Sources and Formats 2017 KNIME.com AG. All Rights Reserved. 14

The Challenges Many Data Sources and Formats Analyze the Data https://pixabay.com/en/ball-binary-magnifying-glass-hand-958950/ What s new Computing Constraints Large Unstructured Data What s cooking 2017 KNIME.com AG. All Rights Reserved. 15

Model Learning on Spark 2017 KNIME.com AG. All Rights Reserved. 16

Do you speak SQL? Spark SQL Query node with syntax highlighting and query completion 2017 KNIME.com AG. All Rights Reserved. 17

Ad-hoc Analysis on Spark 2017 KNIME.com AG. All Rights Reserved. 18

Ad-hoc Analysis and Model Learning on Spark 2017 KNIME.com AG. All Rights Reserved. 19

The Challenges Many Data Sources and Formats Analyze the Data What s new https://www.flickr.com/photos/76657755@n04/7027596629 Computing Constraints Large Unstructured Data What s cooking 2017 KNIME.com AG. All Rights Reserved. 20

Moving to the Cloud Amazon EMR Cluster support Microsoft Azure HDInsight support Support for Cloud Connectors 2017 KNIME.com AG. All Rights Reserved. 21

Automatic Cluster Management Run the cluster only when it is needed https://commons.wikimedia.org/wiki/file:stopwatch_a.jpg 2017 KNIME.com AG. All Rights Reserved. 22

Automatic Cluster Management 2017 KNIME.com AG. All Rights Reserved. 23

The Challenges Many Data Sources and Formats Analyze the Data What s new https://pixabay.com/en/files-paper-office-paperwork-stack-1614223/ Computing Constraints Large Unstructured Data What s cooking 2017 KNIME.com AG. All Rights Reserved. 24

Image Data 2017 KNIME.com AG. All Rights Reserved. 25

Image Data -> KNIME Image Processing 2017 KNIME.com AG. All Rights Reserved. 26

Textual Data https://pixabay.com/en/emotions-man-happy-sad-face-adult-371238/ 2017 KNIME.com AG. All Rights Reserved. 27

Textual Data -> KNIME Text Processing 2017 KNIME.com AG. All Rights Reserved. 28

Chemical Data 2017 KNIME.com AG. All Rights Reserved. 29

Chemical Data -> Chemistry Extensions 2017 KNIME.com AG. All Rights Reserved. 30

More than 1500 native KNIME Nodes 2017 KNIME.com AG. All Rights Reserved. 31

What if you could use all KNIME nodes on Spark? 2017 KNIME.com AG. All Rights Reserved. 32

You Could Analyse Radar Data! 2017 KNIME.com AG. All Rights Reserved. 33

KNIME Image Processing on Spark 2017 KNIME.com AG. All Rights Reserved. 34

You Could Do Sentiment Analysis! https://pixabay.com/en/emotions-man-happy-sad-face-adult-371238/ 2017 KNIME.com AG. All Rights Reserved. 35

KNIME Text Processing on Spark 2017 KNIME.com AG. All Rights Reserved. 36

You Could Mine Chemical Structures! 2017 KNIME.com AG. All Rights Reserved. 37

Go to Greg s Talk Tomorrow 2017 KNIME.com AG. All Rights Reserved. 38

Behind the Scene Cluster Worker Node Cluster Worker Node Spark Executor JVM Spark Executor JVM Input RDD RDD Partition RDD Partition KNIME Workflow Execute KNIME workflow on Spark (OSGI) (OSGI) KNIME Workflow KNIME Workflow KNIME Analytics Platform KNIME Server Output RDD RDD Partition RDD Partition Workflow Replica 2017 KNIME.com AG. All Rights Reserved. 39

What about the cluster? 2017 KNIME.com AG. All Rights Reserved. 40

Let KNIME handle it! 2017 KNIME.com AG. All Rights Reserved. 41

The KNIME trademark and logo and OPEN FOR INNOVATION trademark are used by KNIME.com AG under license from KNIME GmbH, and are registered in the United States. KNIME is also registered in Germany. 2017 KNIME.com AG. All Rights Reserved. 42