Josh Wills, MLconf 2013
-
Upload
sessionsevents -
Category
Technology
-
view
1.097 -
download
0
description
Transcript of Josh Wills, MLconf 2013
1
From The Lab to the Factory Building A Produc8on Machine Learning Infrastructure Josh Wills, Senior Director of Data Science Cloudera
About Me
2
Data Science: Another Defini8on
3
Data Scien8sts Build Data Products.
4
All* Products Become Data Products
5
Iden8fying the BoHlenecks
6
Oryx: Model Building and Serving
• Algorithms • ALS Recommenders • K-‐Means Parallel • RDF
• Batch model building via MapReduce
• Server for real-‐8me scoring and updates
• PMML 4.1 Models
7
Gertrude: Evalua8on via Experiments
• Mul8variate Tes8ng • Define and explore a space of parameters
• Overlapping Experiments • Tang et al. (2010) • Runs mul8ple independent experiments on every request
8
Planning For The Future
9
Josh Wills, Director of Data Science, Cloudera @josh_wills
Thank you!