OpenPOWER Summit Driverless AI · H2O.ai is a Leader in the 2018 Gartner Data Science and...
![Page 1: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/1.jpg)
Jo-fai (Joe) Chow, Data Science Evangelist, H2O.ai
[email protected] | @h2oai
Join the Conversation #OpenPOWERSummit
Accelerating AI Deployment with H2O Driverless AI on IBM Power9
![Page 2: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/2.jpg)
CONFIDENTIAL
H2O.ai Overview
Company: Founded in Silicon Valley in 2012. Funded: $75M. Investors: Wells Fargo, NVIDIA, Nexus Ventures, Paxion Ventures
Products:
• H2O Open Source Machine Learning (14,000 organizations)
• H2O Driverless AI – Automatic Machine Learning
Leadership: Leader in Gartner MQ Machine Learning and Data Science Platform
Team: 120 AI experts (Kaggle Grandmasters, Distributed Computing, Visualization)
Global: Mountain View, London, Prague, India
![Page 3: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/3.jpg)
A Growing Customer Base
“H2O.ai's reference customers gave it the highest overall score for sales relationship and overall service and support” - Gartner MQ 2018
Financial; Insurance; Media & Marketing; Telcos; Industrial; Retail; Healthcare; Advisory, Accounting & Government
![Page 4: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/4.jpg)
Growing Worldwide Open Source Community
14,000 Companies using H2O
155,000 data scientists, 116K Meetup Members
H2O World NYC, London, SF
Thousands attending live and online
![Page 5: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/5.jpg)
H2O.ai is a Leader in the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant
• Technology leader with most completeness of vision
• Recognized for the mindshare, partner network and status as a quasi-industry standard for machine learning and AI
• H2O.ai customers gave the highest overall score among all the vendors for sales relationship and account management, customer support (onboarding, troubleshooting, etc.) and overall service and support
Get the Gartner Magic Quadrant here
![Page 6: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/6.jpg)
“Confidential and property of H2O.ai. All rights reserved”
Partner Ecosystem
Strategic Partners
Cloud Providers, HW Vendors, System Integrators
Value Added Resellers
Data Stores
![Page 7: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/7.jpg)
H2O.ai Product Suite
• In-Memory, Distributed Machine Learning Algorithms with H2O Flow GUI
• H2O AI Open Source Engine Integration with Spark
• Lightning Fast machine learning on GPUs
• Automatic feature engineering, machine learning and interpretability

Open Source
• 100% open source – Apache V2 licensed
• Built for data scientists – interface using R, Python on H2O Flow (interactive notebook interface)
• Enterprise Support subscriptions

• Enterprise software
• Built for domain users, analysts & data scientists – GUI based interface for end-to-end data science
• Fully automated machine learning from ingest to deployment
• User licenses on a per seat basis (annual subscription)
![Page 8: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/8.jpg)
Why Driverless AI?
![Page 9: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/9.jpg)
Driverless AI: Automates Data Science and ML Workflows
Driverless AI
![Page 10: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/10.jpg)
H2O Team
Origin of R Package `ggplot2`
![Page 11: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/11.jpg)
Automatic Visualization
Automatic Scagnostics and other visualizations to generate the most relevant visualizations for each dataset
![Page 12: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/12.jpg)
H2O Team
Kaggle Grandmasters (and their Highest Rank): 1st, 4th, 13th, 25th, 33rd, 48th
About 80,000 Kagglers
![Page 13: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/13.jpg)
H2O Team
1st, 4th, 13th, 25th, 33rd, 48th
181st: Hoping to get closer to them at some point…
![Page 14: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/14.jpg)
Secret Sauce: 1) Grandmaster Feature Engineering
Numerical/Categorical Interactions, Target Encoding, Clustering, Dimensionality Reduction, Weight of Evidence, etc.
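As an illustration of one technique named above, here is a minimal sketch of out-of-fold target encoding. It is not Driverless AI's implementation; the function name, the fold assignment by row index, and the `smoothing` parameter are assumptions made for this example.

```python
from collections import defaultdict


def out_of_fold_target_encode(categories, targets, n_folds=3, smoothing=1.0):
    """Encode each category by the mean target seen in the *other* folds."""
    n = len(categories)
    global_mean = sum(targets) / n
    encoded = [0.0] * n
    for fold in range(n_folds):
        sums = defaultdict(float)
        counts = defaultdict(int)
        # Accumulate statistics from rows outside the current fold only.
        for i in range(n):
            if i % n_folds != fold:
                sums[categories[i]] += targets[i]
                counts[categories[i]] += 1
        # Encode rows inside the fold, shrinking toward the global mean.
        for i in range(n):
            if i % n_folds == fold:
                c = categories[i]
                encoded[i] = (sums[c] + smoothing * global_mean) / (counts[c] + smoothing)
    return encoded


# Toy data (made up for illustration).
cities = ["a", "a", "b", "b", "a", "b"]
clicked = [1, 0, 1, 1, 1, 0]
encoded_cities = out_of_fold_target_encode(cities, clicked)
```

Computing the statistics out of fold keeps a row's own target out of its encoded value, which is one common way to limit leakage.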
Time-Series: Lags and historical aggregates with causality constraints
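The causality constraint on lags and historical aggregates can be sketched as follows: each feature at time t uses only values strictly before t. The function name and the choice of a rolling mean as the aggregate are assumptions for this example, not the product's own code.

```python
def lag_and_rolling_mean(series, lag=1, window=3):
    """Return (lag value, mean of the previous `window` values) per time step.

    Only observations strictly before time t are used, so no future
    information leaks into the features; missing history yields None.
    `lag` is assumed to be at least 1.
    """
    features = []
    for t in range(len(series)):
        lagged = series[t - lag] if t - lag >= 0 else None
        history = series[max(0, t - window):t]  # excludes series[t]
        rolling = sum(history) / len(history) if history else None
        features.append((lagged, rolling))
    return features


# Toy series (made up for illustration).
sales = [10, 12, 9, 15, 11]
sales_features = lag_and_rolling_mean(sales)
```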
![Page 15: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/15.jpg)
Secret Sauce: 2) Grandmaster Pipeline Tuning + Validation
Example: Driverless AI BNP Paribas on 3-GPU workstation
• 19,000 features tested
• 1,000 models trained
• evolutionary strategies
• massively parallel processing (multi-CPU, multi-GPU)
• reliable generalization estimates (overfitting avoidance)
• 1 final optimal scoring pipeline
DOI:10.1126/science.aaa9375
MTV
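To make the idea of evolutionary strategies concrete, the sketch below runs a simple (1 + λ) search over boolean feature masks. It only illustrates the general technique; the scorer is a made-up stand-in for a validated model metric, and none of the names come from Driverless AI.

```python
import random


def evolve_feature_mask(n_features, score, generations=30, offspring=8, seed=0):
    """(1 + lambda) evolutionary search over boolean feature masks."""
    rng = random.Random(seed)
    best = [rng.random() < 0.5 for _ in range(n_features)]
    best_score = score(best)
    for _ in range(generations):
        for _ in range(offspring):
            # Mutate: flip each bit with probability 1 / n_features.
            child = [bit != (rng.random() < 1.0 / n_features) for bit in best]
            child_score = score(child)
            # Keep the parent unless a child strictly improves on it.
            if child_score > best_score:
                best, best_score = child, child_score
    return best, best_score


# Hypothetical scorer: features 0 and 2 help, every other feature costs a little.
def toy_score(mask):
    useful = {0, 2}
    return sum(1.0 if i in useful else -0.1 for i, on in enumerate(mask) if on)


best_mask, best_mask_score = evolve_feature_mask(5, toy_score)
```

In practice the scorer would be an out-of-sample estimate, so the search does not simply reward overfitting.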
![Page 16: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/16.jpg)
Statistical Learning vs Deep Learning - We Do Both!
Statistical Learning (https://web.stanford.edu/~hastie/Papers/ESLII.pdf)
• Typically better for structured data (CSV, SQL, Transactional)
• GLM/CART/RF/GBM/XGBoost, K-Means/PCA/SVD
Deep Learning (http://www.deeplearningbook.org)
• Typically better for unstructured data (Images, Video, Audio, Text)
• TensorFlow Deep Learning
![Page 17: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/17.jpg)
Accuracy
• Automatic feature engineering to increase accuracy - AlphaGo for AI
• Automatic Kaggle Grandmaster recipes in a box for solving a wide variety of use cases
• Automatic machine learning to find and tune the right ensemble of models
![Page 18: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/18.jpg)
Interpretability
• Interpretability for debugging, not just for regulators
• Get reason codes and model interpretability in plain English
• K-LIME, LOCO, partial dependence and more
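Two of the techniques listed above can be sketched in a few lines. The partial dependence function averages predictions while one feature is forced to each grid value. The second function is a simplified stand-in for LOCO: it replaces a feature with a baseline value rather than refitting the model without it. The toy model and all names are assumptions for illustration; K-LIME is not covered here.

```python
def partial_dependence(predict, rows, feature, grid):
    """Average prediction when `feature` is forced to each grid value."""
    curve = []
    for value in grid:
        preds = [predict({**row, feature: value}) for row in rows]
        curve.append(sum(preds) / len(preds))
    return curve


def loco_contribution(predict, row, feature, baseline):
    """Change in one row's prediction when `feature` is replaced by a baseline."""
    return predict(row) - predict({**row, feature: baseline})


# Hypothetical scoring function standing in for a trained model.
def toy_model(row):
    return 0.5 * row["age"] + 2.0 * row["owns_home"]


people = [{"age": 20, "owns_home": 0}, {"age": 40, "owns_home": 1}]
pd_curve = partial_dependence(toy_model, people, "age", [20, 30, 40])
home_effect = loco_contribution(toy_model, people[1], "owns_home", 0)
```

Per-row contributions of this kind are what a reason code summarizes: which features moved this particular prediction, and by how much.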
![Page 19: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/19.jpg)
Deployment: Auto Generated Pipelines
Driverless AI = AI to do AI
![Page 20: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/20.jpg)
Binary Classification
Live Demo
https://www.kaggle.com/c/bnp-paribas-cardif-claims-management
![Page 21: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/21.jpg)
Binary Classification
Live Demo
![Page 22: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/22.jpg)
Driverless AI Experiment – Live Demo
![Page 23: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/23.jpg)
Deployment: Scoring Pipeline Example
Pipelines generated from Driverless AI experiment
valid license
New data (raw features only, no target)
Fast, practical scoring speed in ms (including all feature engineering and scoring steps)
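The shape of such a pipeline can be sketched generically: raw features go in, feature engineering and scoring run in one call, and the elapsed time is measured in milliseconds. This is not the generated Driverless AI pipeline or its API; the transformations, weights, and field names are made up for illustration.

```python
import math
import time


def engineer_features(raw):
    """Hypothetical feature-engineering step applied to one raw record."""
    return {
        "log_amount": math.log1p(raw["amount"]),
        "is_weekend": 1.0 if raw["day_of_week"] >= 5 else 0.0,
    }


def score(features):
    """Hypothetical fitted model: a logistic function with made-up weights."""
    z = 0.8 * features["log_amount"] - 1.2 * features["is_weekend"] - 2.0
    return 1.0 / (1.0 + math.exp(-z))


def scoring_pipeline(raw):
    """Raw features in, probability out; no target column is needed."""
    return score(engineer_features(raw))


# New data: raw features only (values made up for illustration).
new_rows = [{"amount": 120.0, "day_of_week": 2}, {"amount": 15.5, "day_of_week": 6}]
start = time.perf_counter()
predictions = [scoring_pipeline(row) for row in new_rows]
elapsed_ms = (time.perf_counter() - start) * 1000.0
```

Bundling the feature engineering with the model means the caller never has to reproduce training-time transformations by hand.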
![Page 24: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/24.jpg)
Python API: Running Driverless AI with a Script
![Page 25: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/25.jpg)
Driverless AI on IBM Power
docs.h2o.ai
![Page 26: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/26.jpg)
Driverless AI on IBM Power
![Page 27: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/27.jpg)
Driverless AI Delivers “Expert Data Scientist in a Box”
• Created and supported by world renowned AI experts
• Empowers companies to accomplish AI and ML with a single platform
• Performs the function of an expert data scientist and adds more power to both novice and expert teams
• Details and highlights insights and interpretability with easy to understand results and visualizations
21 day free trial for Driverless AI
![Page 28: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/28.jpg)
Our Flagship Community Event – H2O AI World is finally coming to London!
29th & 30th Oct, London
More real-world use cases
+ All H2O Kaggle Grandmasters
+ Hands-on Training
![Page 29: openpower summit driverless ai compressed · H2O.ai is a Leaderin the 2018 Gartner Data Science and Machine Learning Platforms Magic Quadrant • Technology leader with most completeness](https://reader033.fdocuments.net/reader033/viewer/2022042220/5ec5e6028570db7987671965/html5/thumbnails/29.jpg)
• More Info, Code, and Slides
  • bit.ly/h2o_meetups
• Contact
  • [email protected]
  • @matlabulous
  • github.com/woobe
Thanks!