Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most...
Transcript of Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most...
![Page 1: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/1.jpg)
AutomatedMachineLearning(AutoML)andPentahoCaio MorenodeSouzaPentahoSeniorConsultant,HitachiVantara
![Page 2: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/2.jpg)
Agenda
WewilldiscusshowAutomatedMachineLearning(AutoML)andPentaho,together,canhelpcustomerssavetimeintheprocessofcreatingamodelanddeployingthismodelintoproduction.
• BusinessCaseforAutomatedMachineLearning(AutoML)andPentaho;
• HighleveloverviewaboutAutomatedMachineLearning(AutoML);
• Demonstrations(Pentaho+AutoML).
![Page 3: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/3.jpg)
ThePerfectModelDoesNotExist
“Allmodelsarewrong,butsomeareuseful.”
– GEORGEBOX,1919-2013
![Page 4: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/4.jpg)
BusinessCaseforAutoMLandPentaho
• Findingthecorrectmachinelearningalgorithmisnotaneasytask.
• YouneedtofindabalancebetweenthetimeyouwouldneedtospendandthetimeyoucanactuallyspendontheMLproblem.
• Tocreateagoodmodelyouwillneedtoknowverywelltheproblem,thevariables(instances),preparethedata,featureengineeringandtestdifferentalgorithms.
• SomedatascientistswillalsosaytoaddalittlebitofMAGICJ.
• Adding,ofcourse,inmostcases,alotofcomputerpower.
![Page 5: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/5.jpg)
MachineLearningHigh-LevelOverview
![Page 6: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/6.jpg)
WhatisAutomatedMachineLearning(AutoML)?
IllustrationbyShyam Sundar Srinivasan
![Page 7: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/7.jpg)
WhatisAutomatedMachineLearning(AutoML)?
“Machinelearningisverysuccessful,butitssuccessescruciallyrelyonhumanmachinelearningexperts,whoselectappropriateMLarchitectures(deeplearningarchitecturesormoretraditionalMLworkflows)andtheirhyperparameters.Asthecomplexityofthesetasksisoftenbeyondnon-experts,therapidgrowthofmachinelearningapplicationshascreatedademandforoff-the-shelfmachinelearningmethodsthatcanbeusedeasilyandwithoutexpertknowledge.WecalltheresultingresearchareathattargetsprogressiveautomationofmachinelearningAutoML.”https://sites.google.com/site/automl2016/
![Page 8: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/8.jpg)
WhyAutomatedMachineLearning(AutoML)?
• Thedemandformachinelearningexpertshasoutpacedthesupply.Toaddressthisgap,therehavebeenbigstridesinthedevelopmentofuser-friendlymachinelearningsoftwarethatcanbeusedbynon-expertsandexperts,alike.
• AutoMLsoftwarecanbeusedforautomatingalargepartofthemachinelearningworkflow,whichincludesautomatictrainingandtuningofmanymodelswithinauser-specifiedtime-limit.
![Page 9: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/9.jpg)
WhatisNOTAutomatedMachineLearning(AutoML)?
• AutoML isnotautomateddatascience;
• AutoML willnotreplaceDataScientist;– Allthemethodsofautomatedmachinelearningaredevelopedtosupportdatascientists,nottoreplacethem.– AutoML istofreedatascientistsfromtheburdenofrepetitiveandtime-consumingtasks(e.g.,machinelearningpipelinedesignandhyperparameteroptimization)sotheycanbetterspendtheirtimeontasksthataremuchmoredifficulttoautomate.
![Page 10: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/10.jpg)
AutoMLTools
• AutoWeka(OpenSource)– http://www.cs.ubc.ca/labs/beta/Projects/autoweka/
• H2o.aiAutoML(OpenSource)– https://www.h2o.ai/
• TPOT(OpenSource)– https://github.com/rhiever/tpot
• AutoSklearn(OpenSource)– https://github.com/automl/auto-sklearn– http://automl.github.io/auto-sklearn/stable/
• machineJS (OpenSource)– https://github.com/ClimbsRocks/machineJS
![Page 11: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/11.jpg)
PDI+AutoML
![Page 12: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/12.jpg)
MachineLearningwithPentahoin4Steps
http://www.pentaho.com/blog/4-steps-machine-learning-pentaho
![Page 13: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/13.jpg)
CRISP-DM
http://www.pentaho.com/blog/4-steps-machine-learning-pentaho
BusinessUnderstanding
DataUnderstanding
DataPreparation
Modeling
Evaluation
Deployment
Data
![Page 14: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/14.jpg)
UseCase:AutoML+Pentaho
• OurusershaveawelldefinedMLproblemandtheinitialversionofthedataset(trainandtest).
• Unfortunately,theyhaven’tcreatedaMLmodelyet.
• Also,theyhavenoideahowtocreateit.• AndtheywantustohelpthemtocreateitassoonaspossibleusingonlyOpenSourcetools.
![Page 15: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/15.jpg)
TheJourney
• Ifyouembarkinthisjourney,youcanstickinthisproblemforever…
…oryoucanfindquickwaystodoitinaspecifiedtime.
• CustomerscanthenspendenoughtimelatertoimprovetheircurrentModel.
• Thenextstepswillbe:– Hireadatascientistorateamofdatascientists;– Hireadomainexpertinthatproblem.
![Page 16: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/16.jpg)
OurGoal
• Inthisspecificscenario,ourgoalwillbetohelpthemtostarttheprocessofcreatingadummymodelusingAutoML.
![Page 17: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/17.jpg)
CreateYourFirstMLModel
1. Definetheproblem;
2. Analyzeandpreparethedata;
3. Selectalgorithms(startsimple);
4. Runandevaluatethealgorithms;
5. Improvetheresultswithfocusedexperiments;
6. Finalizeresultswithfinetuning.
![Page 18: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/18.jpg)
SampleDataset
• Moredataisbetter,butmoredatameansmorecomplexity.
• Moredatameansmoretimethatyouwillhavetospendinyourproblem.
• Whynotcreateasampledataset?!– Create1to20datasetstotestyourproblemandcreateyourmodels;
![Page 19: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/19.jpg)
DemoAutoML+Pentaho
• ThispresentationaimstodemotheprocessofhowAutoML opensourcetoolsandPentaho,together,canhelpcustomerssavetimeintheprocessofcreatingamodelanddeployingthismodelintoproduction.
![Page 20: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/20.jpg)
ThePowerofPDI
• PDI(PentahoDataIntegration)willhelpdatascientistanddataengineerswithdataonboarding,datapreparation,datablending,modelorchestration(modelandpredict),savingandvisualizingthedata.
![Page 21: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/21.jpg)
DataOnboarding,DataPreparationandDataBlending
• BelowwecanseeaDataPreparationProcessusingPDI(PentahoDataIntegration);• MLdatasetoutput:ARFFFile(WekaFile),CSV(Python,RandApacheSparkMLlib)andHadoopOutputtosavethetxtfiletotheDataLake;
![Page 22: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/22.jpg)
PredictingNewValuesUsingYourModel
![Page 23: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/23.jpg)
Demonstration
![Page 24: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/24.jpg)
DemoAgenda
Whatwewillcoverinthedemo:
• DataPreparationwithPDI;• ModelcreationusingAutoML Tool;
• ModelDeploymentwithPDI;
![Page 25: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/25.jpg)
PentahoDataIntegration+H2OAutoML
![Page 26: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/26.jpg)
Summary
Whatwecoveredtoday:
• BusinessCaseforAutomatedMachineLearning(AutoML)andPentaho;
• HighleveloverviewaboutAutomatedMachineLearning(AutoML);
• Demonstrations(Pentaho+AutoML).
![Page 27: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/27.jpg)
NextSteps
Wanttolearnmore?
• TalktomeduringPentahoWorld2017orsendmeane-mailcaio.moreno@HitachiVantara.com;
• Meet-the-Experts:– https://www.pentahoworld.com/meet-the-experts
![Page 28: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/28.jpg)
![Page 29: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/29.jpg)
Appendices
![Page 30: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/30.jpg)
TopPredictionAlgorithms
• AccordingtoDataiku,thetoppredictionalgorithmsaretheonesexplainedintheimageontherightside.
• Thisimagealsoexplains(resumes)theadvantagesanddisadvantagesofeachalgorithm.
Source:https://blog.dataiku.com/machine-learning-explained-algorithms-are-your-friend
![Page 31: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/31.jpg)
Algorithms
REXERanalyticsdatasciencesurvey*givesusagoodideaaboutwhichalgorithmshavebeenusedovertheyears.
*SpecialthankstoMarkHall(Pentaho)forsharingthisdocumentwithme.Documentavailableat:http://www.rexeranalytics.com/data-science-survey.html
![Page 32: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/32.jpg)
CoreAlgorithms
Source: http://www.rexeranalytics.com/files/Rexer_Data_Science_Survey_Highlights_Apr-2016.pdf
![Page 33: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/33.jpg)
Tools
• Thehugeamountoftoolsincreasesthecomplexity.
Source: http://www.rexeranalytics.com/files/Rexer_Data_Science_Survey_Highlights_Apr-2016.pdf
![Page 34: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/34.jpg)
AutoWeka
• AutoWeka– providesautomaticselectionofmodelsandhyperparametersfor WEKA.– http://www.cs.ubc.ca/labs/beta/Projects/autoweka/
• OpendatasetsforAutoWeka– http://www.cs.ubc.ca/labs/beta/Projects/autoweka/datasets/
![Page 35: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/35.jpg)
AutoSklearn
• AutoWekainspiredtheauthorsofAutoSklearn;
• AutoSklearn– auto-sklearnisanautomatedmachinelearningtoolkitandadrop-inreplacementforascikit-learnestimator.– https://github.com/automl/auto-sklearn– http://automl.github.io/auto-sklearn/stable/
![Page 36: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/36.jpg)
TypesofMLProblemswith(AutoML)
• ThetypesofMachineLearningproblemsthatwecansolveusingAutoWekaandAutoSklearn areClassification,RegressionandClustering:– ClassificationandRegressionarealreadysupportedinAuto-sklearn&Auto-WEKA.– Forclustering,youcanuseaslongasyouhaveanobjectivefunctiontooptimize.
![Page 37: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/37.jpg)
AutomatedbyTPOT
• TPOTwillautomatethemosttediouspartofmachinelearningbyintelligentlyexploringthousandsofpossiblepipelinestofindthebestoneforyourdata.
https://github.com/rhiever/tpot
![Page 38: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/38.jpg)
AutoMLToolsInstallation
![Page 39: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/39.jpg)
InstallingAutoWeka
• ToinstallAutoWeka,gotoWekaPackageManager>SearchforAuto-WEKAandclickthe“Install”button.
![Page 40: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/40.jpg)
InstallingTPOT
• CommandtoinstallTPOT– $pipinstalltpot
• Learnmore:– http://rhiever.github.io/tpot/installing/
![Page 41: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/41.jpg)
InstallingAutoSklearnonUbuntu
• Usethedocumentationbelowtohelpyou:– http://automl.github.io/auto-sklearn/stable/
• Runthiscommandonubuntuterminal:– $condainstallgccswig– $curlhttps://raw.githubusercontent.com/automl/auto-sklearn/master/requirements.txt|xargs-n1-L1pipinstall– $sudoapt-getinstallbuild-essentialswig– $pipinstall–Uauto-sklearn
![Page 42: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/42.jpg)
ErrorAutoSklearnonUbuntu
• ErrorreportedonJune,14th 2017.Solutionsentonthesameday.
• ChecktheGitHublinkbelowtofindthesolution:https://github.com/automl/auto-sklearn/issues/308
![Page 43: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/43.jpg)
InstallingH20.ai
• ToinstallH20.aiAutoMLvisitthewebsites:– https://blog.h2o.ai/2017/06/automatic-machine-learning/– https://www.h2o.ai/
![Page 44: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/44.jpg)
AutoMLDemonstration
![Page 45: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/45.jpg)
UsingAutoWeka
• timeLimit=Youcandefinethetimeinminutesthat youwantAutoWekatousetorunandfindthebestoption.– Moretime=betterresults
![Page 46: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/46.jpg)
UsingAutoWeka
• YoucanrunAutoWekafromtheWekaExplorerUserInterface
![Page 47: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/47.jpg)
UsingAutoWeka
• Forbetterperformance,trygivingAuto-WEKAmoretime
![Page 48: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/48.jpg)
UsingAutoWeka
• AutoWekaoutputresults
![Page 49: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/49.jpg)
TestingAutoSklearn
• OpenSpyderandtestthecodebelow:
Sourcecode:http://automl.github.io/auto-sklearn/stable/
![Page 50: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/50.jpg)
TestingAutoSklearn withIrisDataset
![Page 51: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/51.jpg)
TestingH2o.aiAutoML
TotestH2oAutoMLisnecessarytoinstalltheversion3.11.0.3888orsuperior.http://h2o-release.s3.amazonaws.com/h2o/rel-vapnik/1/index.html
https://github.com/caiomsouza/machine-learning-orchestration/blob/master/AutoML/src/r/h2o-automl/H20_AutoML_Example.R
aml<- h2o.automl(x=x,y=y,training_frame=train,leaderboard_frame=test,max_runtime_secs=30)
#ViewtheAutoMLLeaderboardlb<- aml@leaderboardlb
![Page 52: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/52.jpg)
DemoAutoML(AutoWeka)+Pentaho
• UsingAutoWekafromtheWekaUserInterfacewecreatedafirst“dummy”modelin15minutes.
![Page 53: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/53.jpg)
• AutoWekawilloutputthebestmodelcreatedinthetimespecified,thismodelcanthenbeusedtopredictnewvalues.
AutoWekaoutput
![Page 54: Automated Machine Learning (AutoML) and Pentaho - Presentation · •Adding, of course, in most cases, a lot of computer power. Machine Learning High-Level Overview. What is Automated](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec5e5ff8570db798767195c/html5/thumbnails/54.jpg)
NoFreeLunchTheorem
https://ti.arc.nasa.gov/m/profile/dhw/papers/78.pdf
http://www.no-free-lunch.org/
http://philosophy.wisc.edu/forster/papers/Krakow.pdf