Machine Learning for (JVM) Developers

Machine learning for (JVM) developers

Mateusz Dymczyk Software Engineer

H2O.ai

11th May 2016

Say who?

• Software Engineer @ H2O.ai • Ph.D. drop-out (AGH in Krakow) • ex Fujitsu Laboratories research trainee

Say what?

• Status quo of data • Why Machine Learning? • Intro to Machine Learning • Machine Learning and the JVM • Machine Learning Demo

The state of data

Exponential growth

Data source

Data collection Data storage

Simple analytics

Data processing

• Alerting from real time data • Similarity search

Retail

Healthcare

Insurance/banking

• Recommendations • Store layout • Ad targetting

• Stock price predictions • Anomaly/fraud detection • Automatic investments

https://www.kaggle.com/wiki/DataScienceUseCases

Machine Learning

Def ini t ion

“The field of machine learning is concerned with the question of how to construct computer programs that automatically improve with experience.” — Mitchell, Tom M., “Machine Learning”

Simply speaking…

• Subfield of Artificial Intelligence which… • Tries to find patterns in data using… • Math, statistics, probability, optimisation theory etc. to

create… • Model which can be used to predict values or cluster • Theoretical concept with many implementations

Basic terminology

Observations are objects which are used for learning and evaluation. Anything that can be described using quantitative features.

Observations

{"title":"Emailschema","type":"object","properties":{"age":{"type":"float"},"rooms":{"type":"int"},"size":{"type":"float"},"location":{"type":"string"}}}

Feature is a quantitative trait that (partially) represents an observation.

Feature vector is an n-dimentional vector of features that represents an observation.

Feature extraction vs. feature selection

Feature

{"title":"Emailschema","type":"object","properties":{"age":{"type":"float"},"rooms":{"type":"int"},"size":{"type":"float"},"location":{"type":"string"}}}

[5,3,60.5]

• System is a set of related objects forming a complex whole (e.g. set of all possible distinct observations) • In our case set of all possible houses

System

• Model is the description of a system using mathematical concepts/language. • Result of a machine learning technique • Can be used for predictions/clustering • Online or offline

Supervised Learning

• User needs to know: • the structure of the data • possible outputs

• Sample data has to be labeled for training

Classif ication

• Required: • all possible labels • already labeled samples

• Output: predicted label for new inputs • Examples: • spam classification based on email content • gender classification based on physical features

Regression

• Required: • samples with actual values associated

• Output: predicted values for new inputs • Examples: • price prediction based on historical prices

Unsupervised Learning

• Doesn’t require the user to know what should be the output

• No labelling necessary by the user • Useful for finding structure in data • Examples: • grouping users (clustering)

Cluster ing

• Required: • data, no labelling necessary

• Output: data grouped into clusters • Examples: • grouping users with similar tastes

Types of machine learning

eg. regression, when you want to predict

a real number

eg. clustering, when you want to cluster or have too much data

eg. classification, when you want to assign to a

Machine Learning for (JVM) Developers

Software

Transcript of Machine Learning for (JVM) Developers

Machine Learning Basics for Web Application Developers

The Java Virtual Machine Martin Schöberl. The Java virtual machine2 Overview Review Java/JVM JVM Bytecodes Short bytecode examples Class information Parameter.

Windows Azure Virtual Machine Services for Developers

Machine Learning for Developers - Pop-up Loft Tel Aviv

Info Architettura, JVM Installazione JDK (Windows, …homes.di.unimi.it/frosio/Lessons/AY2008-2009-LabProgr/...2008/10/13 · JVM Java Virtual Machine (JVM) → macchina astratta

Splunking the JVM (Java Virtual Machine)

Machine learning for java developers

Enterprise Development Trends 2016 - Cloud, Container and Microservices Insights from 2100 JVM Developers

The Java Virtual Machine Mike Brunt. What is the JVM? Main JVM Suppliers ColdFusion and the JVM Java J2EE – Java EE Servlet Containers Where.

HELIX: Holistic Optimization for Accelerating Iterative ... · neering. This interoperability allows developers to seamlessly inte-grate existing JVM machine learning libraries [69,

Machine Learning for Developers

Building Skynet: Machine Learning for Software Developers

JVM Memory Model and GC - Meetupfiles.meetup.com/3189882/JVM_MemoryManagement.pdf · 2016-08-22 · The Java Virtual Machine (JVM) is an abstract computing machine. This way, Java

Java and Java Virtual Machine Security - LSD-PL · Java and Java Virtual Machine Security ... During that time we have learned about JVM internals, ... Next we present JVM security

Amazon Machine Learning: Empowering Developers to Build Smart Applications

JVM Memory Model and GC - files.meetup.com · What is JVM? Oracle Confidential – Internal/Restricted/Highly Restricted 4 The Java Virtual Machine (JVM) is an abstract computing

Programming with Java RMI - pk.orgpk.org/rutgers/notes/content/rb-rmi.pdf · 2002-02-27 · resides on a different JVM: – remote machine – same machine, different JVM (different

Reactive Machine Learning On and Beyond the JVM

Code Security Gordon College Stephen Brinton. Virtual Machine Security Building a fence around your code –JVM – Java Virtual Machine Originally developed.

Java Puzzle Ball - Oracle · Machine (JVM) A Java Virtual Machine (JVM) interprets the bytecode, allowing the program to run on any machine with a Java Runtime Environment (JRE) installed.