Towards Autonomic e-Science Ecosystems

47
Towards autonomic e‐science ecosystems Cécile Germain‐Renaud Laboratoire de Recherche en Informa<que Université Paris‐Sud ‐ CNRS ‐ INRIA

description

 

Transcript of Towards Autonomic e-Science Ecosystems

Page 1: Towards Autonomic e-Science Ecosystems

Towardsautonomice‐scienceecosystems

CécileGermain‐Renaud

LaboratoiredeRechercheenInforma<queUniversitéParis‐Sud‐CNRS‐INRIA

Page 2: Towards Autonomic e-Science Ecosystems

Outline

☀ Computa<onalecosystems

☂ TheClouds☂ Challenges☀ Autonomics

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 3: Towards Autonomic e-Science Ecosystems

Therequirementsofe‐science

“Cyberinfrastructureintegrateshardwareforcompu6ng,dataandnetworks,digitally‐enabledsensors,observatoriesandexperimentalfacili6es,andaninteroperablesuiteofso=wareandmiddlewareservicesandtools…”

NSF’sCyberinfrastructureVisionfor21stCenturyDiscovery

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 4: Towards Autonomic e-Science Ecosystems

Anolddream

«Acomputa6onalgridisahardwareandso=wareinfrastructurethatprovidesdependable,consistent,pervasive,andinexpensiveaccesstohighcomputa6onalcapabili6es.»I.Foster,C.Kesselman,TheGrid,1998

UCLApressreleaseonthecrea<onofARPANET,1969

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 5: Towards Autonomic e-Science Ecosystems

Gridsareareality

•  Severallargedeploymentsinrou<neproduc<on•  UKNa<onalGridService(NGS)•  EuropeanGridInfrastructure(EGI‐EGEE)•  TeraGrid•  OpenScienceGrid(OSG)•  DEISA•  …

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 6: Towards Autonomic e-Science Ecosystems

TheEGEE/EGIgrid

LHCisthe•  Largest(26km),•  Fastest(14TeV)•  Coldest(1.9K)•  Emp<est(10−13

atm)machine.

EGEE/EGIisthe•  Largest(40KCPUs),•  Mostdistributed(250

sites),•  Mostused(300K

jobs/day)Computersystem

AtlasCollabora<on(oneinfour)

•  3000scien<sts•  38countries

•  174universi<esandlabs

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 7: Towards Autonomic e-Science Ecosystems

Cyberinfrastructure => Cyber-Ecosystems

Source: M. Parashar eSI visitor Seminar /www.nesc.ac.uk/action/esi/

Page 8: Towards Autonomic e-Science Ecosystems

Cyberinfrastructure => Cyber-Ecosystems

21st century Science and Engineering: New Paradigms & Practices

•  Fundamentally data-driven/data intensive

•  Fundamentally collaborative

Source: M. Parashar eSI visitor Seminar /www.nesc.ac.uk/action/esi/

Page 9: Towards Autonomic e-Science Ecosystems

UnprecedentedOpportuni<es

Forscienceandengineering

•  Knowledge‐based,informa<on/data‐driven,context/content‐awarecomputa<onallyintensive,pervasive,..

•  Holis<capplica<ons:integrateon‐demandcomputa<ons,experiments,observa<ons,data,…

•  Tomanage,control,predict,adapt,op<mize,…

•  Newparadigmsandprac<cesforexis<nggoalsornewthinking

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 10: Towards Autonomic e-Science Ecosystems

e‐scienceecosystems

•  AmajorrequirementisPervasive:On‐demand,integrated,transparent

•  Con<nuity,notrevolu<on–Wemustlearnfromtheexperience

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 11: Towards Autonomic e-Science Ecosystems

ExperiencewiththeEGEE/EGIgrid

EGEECPUusage

0.10%

1.00%

10.00%

100.00%

AA CC ES F HEP INF LS MV OTH UNK

Y0(%)

Y1(%)

Y2(%)

Source:ReportonU<liza<onofEGEEsupportservicesandinfrastructure,May2010

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 12: Towards Autonomic e-Science Ecosystems

e‐scienceecosystems

•  AmajorrequirementisPervasive:On‐demand,integrated,transparent

•  Con<nuity,notrevolu<on–Wemustlearnfromtheexperience

•  Organizedscien<ficcommuni<esarecommimedtoglobalizedhomogeneoussystems.Individualizedscienceisnot(yet?).Heterogeneoushigh‐levelsystemsares<llinthedesignstate.

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 13: Towards Autonomic e-Science Ecosystems

Outline

•  Computa<onalecosystems

•  TheClouds•  Challenges•  Autonomics

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 14: Towards Autonomic e-Science Ecosystems

Amorepervasivetechnology

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 15: Towards Autonomic e-Science Ecosystems

Source:WilliamVambenepe'sKeynoteatCloudConnect2010hmp://stage.vambenepe.com/archives/1355

Page 16: Towards Autonomic e-Science Ecosystems

SaaS:SopwareasaService

Howtodeliver/consume/managesuchservices

•  Cloudprovidesincreasedinfrastructureflexibility,excellentbutnotthebomleneck

•  Applica<onoruser‐orientedflexibility•  Controlandorchestra<onoftheholis<capplica<onsacrossspecialized

andheterogeneouscomponents,whetherlocal,inagridorinacloud•  Agilityasthecapacitytoreconfigure,reorganizetheinternalprocesses

«TheboComlineisthatanydis6nc6onbetweenSaaSandPOWA(PlainOldWebApplica6ons)isatworstarbitraryandatbestconcernedwiththebusinessrela6onshipbetweentheproviderandtheconsumerratherthantechnicalaspectsoftheapplica6on.»Samesource

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 17: Towards Autonomic e-Science Ecosystems

TheGridexperience

«Gridaredefinedbycoordinatedresourcesharingandproblemsolvingindynamic,mul6‐ins6tu6onalvirtualorganiza6ons.Thesharingisnecessarily,highlycontrolled,withresourceprovidersandconsumersdefiningclearlyandcarefullyjustwhatisshared,whoisallowedtoshare,andthecondi6onsunderwhichsharingoccurs»IanFoster,2000

«Acomputa6onalgridisahardwareandso=wareinfrastructurethatprovidesdependable,consistent,pervasive,andinexpensiveaccesstohighcomputa6onalcapabili6es.»I.Foster,C.Kesselman,TheGrid,1998

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 18: Towards Autonomic e-Science Ecosystems

Consumers

Differentusersandrequirementsacrossandwithinthecollobara<ons30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 19: Towards Autonomic e-Science Ecosystems

Providers

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 20: Towards Autonomic e-Science Ecosystems

WhataboutGPUs?

•  Anewdigitaldivide,HPCandpersonalcomputersembarkingintoGPUs,businessande‐scienceintoclouds?

•  GridsmightbeamenabletoGPUs,virtualizedGPUsisanascentresearcharea/technology

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 21: Towards Autonomic e-Science Ecosystems

Themessage

•  DEFINITELYNOT“Cloudisabuzzword”•  Atechnology,notasilverbullet•  Bothe‐scienceandbusinessrequire

•  Efficientintegra<onoflargedatasetswithcompu<ng

•  Pervasiveness•  e‐sciencehasspecificrequirements

•  Organizedsharing:dataandfunding–technicalandpoli<calissues

•  Performance:notalways,butastrongculturalbias/feature.

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 22: Towards Autonomic e-Science Ecosystems

Outline

•  Computa<onalecosystems

•  TheClouds•  Challenges•  Autonomics

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 23: Towards Autonomic e-Science Ecosystems

Thecomplexitycrisis

source:IDC2008,retrievedfromhmp://www.vmware.com/files/pdf/Virtualiza<on‐applica<on‐based‐cost‐model‐WP‐EN.pdf

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 24: Towards Autonomic e-Science Ecosystems

Thecomplexitycrisisinac<on

Source:hmp://www.teach‐ict.com/news/news_stories/news_computer_failures.htm

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 25: Towards Autonomic e-Science Ecosystems

Implemen<ngPervasivenessandSharing

Mul<‐scalefeedbacks30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 26: Towards Autonomic e-Science Ecosystems

Implemen<ngPervasivenessandSharing

Mul<‐scalefeedbacks30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 27: Towards Autonomic e-Science Ecosystems

Configuringthemiddleware

Source:JamesCasey’stalkatEGEE’0930/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 28: Towards Autonomic e-Science Ecosystems

Runningthemiddleware

gLitepredic<onerrorforqueuing<me30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 29: Towards Autonomic e-Science Ecosystems

Usersbehavior

Users/filegroups/hostswithAVIZGraphDice

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 30: Towards Autonomic e-Science Ecosystems

Usersbehavior

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 31: Towards Autonomic e-Science Ecosystems

ComplexityANDuncertainty

•  Asadistributedsystem•  Componentsandcommunica<onscomeandgo

•  Fordynamic(P2P),butformanagedsystemsaswell

•  CAP(Brewer’s)theorem:atmosttwooftheConsistency,Availability,Par<<ontolerancecanbeguaranteed

•  Asadynamic(al)system•  En<<eschangebehaviorasaneffectofunexpectedfeedbacks,

emergentbehavior•  Organizedself‐cri<cality,minoritygames,...

•  Lackofcompleteandcommonknowledge–Informa<onuncertainty•  Monitoringisdistributedtoo•  Resolu<onandcalibra<on•  Seman<csandontologies

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 32: Towards Autonomic e-Science Ecosystems

ComplexityANDuncertainty

Forapplica<onstoo

•  Opportunis<cbehaviors•  Space‐<me,accuracy,andmoregenerallyobjec<veadap<vity

•  Context‐awarenessasrequiredbyaCAP‐proneenvironement

•  Dynamicandcomplexcouplingandinterac<ons•  mul<‐physics,mul<‐model,mul<‐resolu<on,…

•  Trustindataandsopware•  NotonlyforP2Psystems

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 33: Towards Autonomic e-Science Ecosystems

ChallengesSummary

•  Currentlevelsofscale,complexityanddynamismmakeitinfeasibleforhumanstoeffec<velymanageandcontrolsystemsandapplica<ons

•  Compu<ngecosystems,withtheirverylargenumbersofhardwareandsopwarecomponentsinterac<ngwithverylargedata,arecomplexsystemsthatarecurrentlyverydifficulttoprogram

•  Compu<ngecosystemsaredifficulttomanagebecauseoftheheterogeneityofworkflows,datasetsandopera<ngenvironment.

•  Theabilityofanapplica<ontoself‐adaptbyincorpora<ngdynamicinputsalongitsexecu<onneedstobeformulatedthroughageneralandprincipledprogrammingmodel

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 34: Towards Autonomic e-Science Ecosystems

Outline

•  Computa<onalecosystems

•  TheClouds•  Challenges•  Autonomics

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 35: Towards Autonomic e-Science Ecosystems

WhatisAutonomicCompu<ng?

“Compu6ngsystemsthatmanagethemselvesinaccordancewithhigh‐levelobjec6vesfromhumans”KephartandChess,AVisionofAutonomicCompu<ng,IEEEComputer,2003

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 36: Towards Autonomic e-Science Ecosystems

Milestones•  IBMVisionandManifesto2001

•  J.O.KephartandD.M.Chess.Thevisionofautonomiccompu<ng.IEEEComputer,36(1),2003

•  IEEEInterna<onalConferenceonAutonomicCompu<ngseriessince2004

•  IEEETaskForceonAutonomousandAutonomicSystems2006

•  ECMLPKDD2006Tutorial/Workshop:AutonomicCompu<ng:ANewChallengeforMachineLearning,I.RishandG.Tesauro

•  ACMTransac<onsonAutonomousandAdap<veSystems(TAAS),2006

•  AutonomicCompu6ng:Concepts,InfrastructureandApplica6onsM.ParasharandS.Hariri(Ed.),CRCPress,2006

•  TheNSFCenterforAutonomicCompu<ng,2008

•  Interna<onalJournalofAutonomicCompu<ng(IJAC),IntersciencePublishers,2009

•  Panelatthe1stGMACworkshop:TheconvergenceofGrids,CloudsandAutonomics,2009

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 37: Towards Autonomic e-Science Ecosystems

Self‐management

•  Self‐ConfiguraDonAutomatedconfigura<onofcomponents,systemsaccordingtohigh‐levelpolicies;restofsystemadjustsseamlessly.

•  Self‐HealingAutomateddetec<on,diagnosis,andrepairoflocalizedsopware/hardwareproblems.

•  Self‐OpDmizaDonAutoma<candcon<nualadap<vetuningofhundredsofparameters(databaseparams,serverparams,…)affec<ngperformance&efficiency

•  Self‐ProtecDonAutomateddefenseagainstmaliciousamacksorcascadingfailures;useearlywarningtoan<cipateandpreventsystem‐widefailures.

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 38: Towards Autonomic e-Science Ecosystems

TheAutonomicNervousSystem

•  Themostsophis<catedexampleofautonomicbehavior.

•  Regulatesandmaintainshomeostasis:maintainsstructureandfunc<onsbymeansofamul<plicityofdynamicequilibriumsthatarerigorouslycontrolledbyinterdependentregula<onmechanisms.

30/01/11 ASSYSTmee<ng:OpeningtheCloud

•  Notallparametershavethesameurgency,essen<alparametersaremonitoredmoreclosely.

Page 39: Towards Autonomic e-Science Ecosystems

Ashby’sUltrastableSystem

Source: “Autonomic Computing: An Overview, ” M. Parashar, and S. Hariri, UPP 2004, Mont Saint-Michel, France, Editors: J.-P. Banâtre et al. LNCS, Springer Verlag, Vol. 3566, pp. 247 – 259, 2005.

Acontroltheoryvision

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 40: Towards Autonomic e-Science Ecosystems

And/orSelf‐awareness

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 41: Towards Autonomic e-Science Ecosystems

TheMAPE‐Kloop

ManagedElement

ES

Monitor

Analyze

Execute

Plan

Knowledge

AutonomicManagerES

Environmentsensors

Networkinstrumenta<on

Userscontext

Applica<onrequirements

High‐dimensional,high‐volume‘raw’data

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 42: Towards Autonomic e-Science Ecosystems

TheMAPE‐Kloop

ManagedElement

ES

Monitor

Analyze

Execute

Plan

Knowledge

AutonomicManagerES

State‐SpaceandDataAbstracDon

Streaming:

On‐linedatamining,clustering,..

Dimensionalityreduc<on

Ac<velearning

Ontologicalinference

High‐dimensional,high‐volume‘raw’data

Compressed,‘informa<ve’data30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 43: Towards Autonomic e-Science Ecosystems

TheMAPE‐Kloop

ManagedElement

ES

Monitor

Analyze

Execute

Plan

Knowledge

AutonomicManagerES

LearnpredicDvemodels

Classifica<on,regression,<meseries,MCMC

Decision‐making

Explora<onvsExploita<onGametheory,Riskanalysis

ReinforcementLearning,bandits

Compressed,‘informa<ve’data

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 44: Towards Autonomic e-Science Ecosystems

TheMAPE‐Kloop

ManagedElement

ES

Monitor

Analyze

Execute

Plan

Knowledge

AutonomicManagerES

Knowledge–basedegontologies,a‐priorimodels,intelligentini<alisa<on

Or

Tabula‐rasaKnowledge

‐Avoidsknowledge‐intensivemodelbuilding

Criteria

‐IndepedentKnowledgeandlearning

‐Theore<calguaranteesofimprovement30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 45: Towards Autonomic e-Science Ecosystems

Technicalissues:exampleforRL

NeedenhancementtoVanillaReinforcementLearning•  Observa<onuncertainty•  Historicaldependenciesmayexist:MDPmightnotbeanexactmodel

•  Convergencenotguaranteed•  Lackofsta<onarity,•  Con<nuousstate‐ac<onspacerequiresapproxima<ons•  Localvsgloballearning,becauseofcurseofdimensionality

•  Explora<onpenal<esmightbeexcessiveAnindepthexplora<onoftheseissues:GeraldTesauroetal.OntheUseofHybridReinforcementLearningforAutonomicResourceAlloca<on.ClusterCompu<ng,10(3):287‐99,2007.

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 46: Towards Autonomic e-Science Ecosystems

Transversalissues

•  Limits•  Biologicalself‐*(awareness,healing…)may/willul<matelyfail,plusunforeseentreats

•  Overheads•  Designing,programming,execu<ng,provisioning

•  Valida<on•  Extremeevents:revisittradi<onalcriteriaegRMSE

•  Benchmarkingunderuncertainty•  Availabilityofreferencedatasets

www.grid‐observatory.org

CGermain‐Renaudetal.TheGridObservatory,toappearIEEE/ACMCCGRID'11

30/01/11 ASSYSTmee<ng:OpeningtheCloud

Page 47: Towards Autonomic e-Science Ecosystems

30/01/11 ASSYSTmee<ng:OpeningtheCloud