Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

14
Introducing Apache PredictionIO (incubating) http://predictionio.incubator.apache.org Donald Szeto Tech Lead @ Salesforce First Git Commit @ PredictionIO

Transcript of Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Page 1: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

IntroducingApache PredictionIO (incubating)http://predictionio.incubator.apache.org

Donald Szeto

Tech Lead @ SalesforceFirst Git Commit @ PredictionIO

Page 2: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Agenda

Future {

println(roadmap)

}

?

(Source: benbrandt22, https://redd.it/383edw)

Page 3: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Why PredictionIO?

ML problem 1 Data collection Modeling Serving/scoring

Evaluation

ML problem 2 Data collection Modeling Serving/scoring

Evaluation

Is there a common factor?(Not his actual words :))

Page 4: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

What is PredictionIO?

A machine learning server for developers and ML engineers

PredictionIO API

Engines EnginesEngines

Page 5: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)
Page 6: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Quick Demo

Page 7: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Digging Deeper

What is DASE?Data, Algorithm, Serving, Evaluation

What are engine instances?

What are engine variants?

Page 8: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Engine Instances

Engine(Scala/Java code)

Data

Engine Parameters

Algorithm Hyperparameters

Environment

+ Engine Instance=

Page 9: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Digging Deeper

What is DASE?Data, Algorithm, Serving, Evaluation

What are engine instances?

What are engine variants?

Page 10: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Engine Variants

Engine(Scala/Java code)

Data

Engine Parameters

Algorithm Hyperparameters

Environment

+ Engine Instance=

Data

Engine Parameters

Algorithm Hyperparameters

Environment

Engine Instance

Page 11: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Current Development ( <= 0.10.0 )

Migrating to ASF infrastructure

Merging forksSliding window event data source

Installation fixes

Engine templates and SDKs migration

Docker-based Integration Test Infrastructure

Page 12: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Travis CI Worker

Travis CI Worker

Travis CI Worker

Travis CI Worker

Docker-based Integration Test Infrastructure

Running many test engines in different environments, in parallel

Page 13: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

Future Roadmap ( > 0.10.0 )

Cross-building with Spark 1.x and Spark 2.x

Better native support of Spark ML Pipeline and DataSet

Multi-engine serving

Admin API w/ CLI Refactoring

Testing infrastructure for community engine templates

Further separation-of-concernsAllow engines to use different parts of PredictionIO independently

Page 14: Introducing apache prediction io (incubating) (bay area spark meetup at salesforce)

WE NEED YOUR HELP!!!Please subscribe to [email protected]

For usage questions please subscribe to [email protected]

donald (at) apache.org

Thank you!