From Damaris to CALCioM - Inria · ICS 2011 M. Dorier, G. Antoniu, F. Cappello, M. Snir, L. Orf....

From Damaris to CALCioM
Mitigating I/O Interference in HPC Systems

Matthieu Dorier, ENS Rennes, IRISA, Inria Rennes, KerData project team
Joint work with Rob Ross, Dries Kimpe, Gabriel Antoniu, Shadi Ibrahim

Within the Data@Exascale associated team

10th workshop of the Joint Lab for Petascale Computing, Urbana-Champaign, November 2013

Outline

•  Damaris: After 3 years of collaboration…
•  CALCioM: Towards cross-application coordination

Damaris after 3 years…

Overview
Originating from the “Shared Buffering System” designed in 2010 during an internship at NCSA, Damaris proposes to dedicate cores in multicore SMP nodes to data management, i.e. storage, in situ analysis and visualization.

Implementation
•  Version 0.7.3 available
•  http://damaris.gforge.inria.fr
•  Version 1.0 for summer 2014
•  15095 lines of code
•  API for C, C++ and Fortran simulations
•  Easy configuration with XML
•  In situ visualization with VisIt
•  Python and C++ plugins

Damaris after 3 years…

People involved
Matthieu Dorier, Gabriel Antoniu, Lokman Rahmani, Roberto Sisneros, Dave Semeraro, Bob Wilhelmson, Rob Ross, Tom Peterka, Dries Kimpe, Marc Snir, Franck Cappello, Leigh Orf

Evaluated on
Blue Waters, Intrepid, Kraken, Jaguar, Grid’5000, BluePrint, Surveyor

Evaluated with
CM1, Nek5000, OLAM

Publications
•  M. Dorier, advised by G. Antoniu. Damaris - Using Dedicated I/O Cores for Scalable Post-petascale HPC Simulations. ICS 2011.
•  M. Dorier, G. Antoniu, F. Cappello, M. Snir, L. Orf. Damaris: How to Efficiently Leverage Multicore Parallelism to Achieve Scalable, Jitter-free I/O. In Proc. of IEEE CLUSTER 2012.
•  M. Dorier, advised by G. Antoniu. Efficient I/O using Dedicated Cores in Large-Scale HPC Simulations. PhD forum of IPDPS 2013.
•  M. Dorier, R. Sisneros, T. Peterka, G. Antoniu, D. Semeraro. Damaris/Viz, a Nonintrusive, Adaptable and User-Friendly In Situ Visualization Framework. In Proc. of IEEE LDAV 2013.

Mitigating I/O Interference in HPC Systems

Introduction to cross-application interference

Interference: performance degradation observed by an application in contention with other applications for access to a shared resource.
•  How often does I/O interference occur?
•  What is the effect of I/O interference?
•  How do we quantify and visualize it?
•  How to mitigate it?

How often does I/O interference occur?

How often does interference occur?
“Intrepid has a really weird workload compared to most other systems, because of the large number of large jobs.”

Narayan Desai (ANL)

How often does interference occur?
I am an application, I start writing: what is the probability that at least one other application is also accessing the file system?

$$ P(\text{another is doing I/O}) = 1 - \sum_{n \ge 0} P(X = n)\,(1 - E(\mu))^n $$

where X is the number of running applications (random variable) and μ is the ratio of I/O time vs. computation time of the applications (random variable), assuming independence between X and μ.

On Intrepid: assuming E(μ) = 5%, P(another is doing I/O) = 64%.
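The probability above can be evaluated numerically once a distribution for X is available. The following is a minimal Python sketch, not the authors' code: the function name `prob_other_io` and the job-count distribution are hypothetical, and E(μ) is treated as the probability that a given application is in an I/O phase at a random instant. The example distribution is made up and is not the Intrepid workload.

```python
def prob_other_io(x_dist, mean_mu):
    """P(at least one other application is doing I/O).

    x_dist:  dict mapping n (number of running applications) to P(X = n).
    mean_mu: E(mu), assumed here to be the probability that one
             application is in an I/O phase at a random instant.
    """
    if abs(sum(x_dist.values()) - 1.0) > 1e-9:
        raise ValueError("P(X = n) must sum to 1")
    # Probability that none of the n running applications is doing I/O,
    # averaged over the distribution of n.
    none_active = sum(p * (1.0 - mean_mu) ** n for n, p in x_dist.items())
    return 1.0 - none_active


# Hypothetical job-count distribution (illustration only).
example_dist = {0: 0.1, 5: 0.3, 10: 0.4, 20: 0.2}
p = prob_other_io(example_dist, 0.05)
print(f"P(another is doing I/O) = {p:.2f}")
```

With a real trace, `x_dist` would be estimated from the scheduler logs, for example as the fraction of time during which exactly n jobs were running.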

What is the effect of I/O interference?

IOR running on 336 cores, writing every 10 seconds in a 35-server PVFS file system on Grid’5000

A second instance is started on 336 other cores, writing the same amount of data every 7 seconds

I/O interference has a large impact on caching mechanisms

How do we quantify and visualize I/O interference?

Interference factor
•  The user is interested in the factor by which interference increases the I/O time:

$$ I_X = \frac{T_X}{T_X(\text{alone})} > 1 $$

•  Considering n applications, we could (for example) want to minimize the sum of access times:

$$ f = \sum_{X \in \text{app}} T_X $$

•  These metrics can be adapted to anything (energy consumption, CPU cycles, etc.): f can be generalized as a metric for machine-wide efficiency.
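As a small illustration of the two metrics on this slide, the sketch below computes the interference factor of each application and the machine-wide sum of access times. The function names and the timing values are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def interference_factor(t_contended, t_alone):
    """I_X = T_X / T_X(alone); values above 1 mean the access was slowed down."""
    if t_alone <= 0:
        raise ValueError("t_alone must be positive")
    return t_contended / t_alone


def total_access_time(access_times):
    """f = sum of T_X over all applications."""
    return sum(access_times.values())


# Hypothetical access times in seconds (not measurements from the talk).
alone = {"A": 10.0, "B": 2.0}
contended = {"A": 11.0, "B": 6.0}

factors = {app: interference_factor(contended[app], alone[app]) for app in alone}
f = total_access_time(contended)
print(factors, f)
```

In this made-up case the small application B is slowed down three times while A barely notices, which is why a per-application factor and a machine-wide metric can point to different problems.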

Delta-graph
[Figure: App A’s access and App B’s access, with B starting dt after A]

Results on Surveyor (2x2048 cores), each core writes 8 MB contiguously. The graph represents the point of view of one of the 2 applications.

[Figure annotations: I/O time when the application is alone; performance degradation due to interference]

Bad luck for small applications

Experiment on Grid’5000, App B on 24 cores, App A on 744, writing 8 MB per process

The smallest app observes up to a 14x decrease in performance! The biggest one does not even see it!

How to mitigate I/O interference?
The CALCioM approach

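The CALCioM architecture
Cross-Application Layer for Coordinated I/O Management

[Figure: two application stacks (I/O library over MPI I/O), each issuing Read/Write requests to the shared file system, with CALCioM providing coordination between them. Labels: App A, CALCioM, I/O library, MPI I/O, Read/Write, File System, Coordination.]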

CALCioM’s API

CALCioM_Init(MPI_Comm c)
CALCioM_Prepare(MPI_Comm c, MPI_Info i)
CALCioM_Ask()
CALCioM_Check(int* status)
CALCioM_Wait()
CALCioM_Release()
CALCioM_Complete()
CALCioM_Finalize()

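Possible coordination strategies

“First come first served” (FCFS) serialization

Interruption

[Figure: timelines of the two applications for each strategy, with App B arriving dt after App A]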
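The two strategies can be compared with a very simple timing model. The sketch below is an assumption-laden illustration, not the CALCioM implementation: it supposes that App A starts at time 0, App B arrives dt later, an interruption pauses A for exactly the duration of B's access, and switching costs nothing. The function names are hypothetical.

```python
def fcfs_times(t_a, t_b, dt):
    """Serialization: B waits until A has finished, then writes.

    t_a, t_b: access times of A and B when running alone.
    dt:       delay between the start of A's access and B's arrival.
    Returns (T_A, T_B), the access times each application observes.
    """
    wait = max(0.0, t_a - dt)   # time B spends waiting for A to finish
    return t_a, wait + t_b


def interruption_times(t_a, t_b, dt):
    """Interruption: A pauses while B writes, then resumes."""
    if dt >= t_a:               # A has already finished: nothing to interrupt
        return t_a, t_b
    return t_a + t_b, t_b


# Hypothetical values in seconds.
print(fcfs_times(10.0, 2.0, 3.0))          # A unaffected, B waits then writes
print(interruption_times(10.0, 2.0, 3.0))  # A delayed by B's access, B unaffected
```

Under these assumptions, serialization puts the whole penalty on the late arrival while interruption puts it on the application that was already writing; which one is preferable depends on the metric being minimized.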

How to choose a coordination strategy
Q: Given application A with expected access time T_A and application B with expected access time T_B, starting dt time units after application A’s access,

should A be interrupted in favor of B? Or should B wait for A to terminate its access?

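Example: if neither A nor B has something else to do, optimizing global performance, i.e. minimizing an interference effect given by

$$ f = \frac{T_A}{T_A(\text{alone})} + \frac{T_B}{T_B(\text{alone})} $$

tells us that B should interrupt A if and only if

$$ dt < \frac{T_A(\text{alone})^2 - T_B(\text{alone})^2}{T_A(\text{alone})} $$

With the metric

$$ f = T_A + T_B $$

the condition becomes

$$ dt < T_A(\text{alone}) - T_B(\text{alone}) $$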
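The two thresholds on this slide can be recovered from a simple timing model. The derivation below is a sketch under assumptions that the slide does not state explicitly: A starts first, B arrives dt later while A is still writing, an interruption pauses A for exactly T_B(alone), and switching between applications is free. The subscripts "ser" and "int" (serialization, interruption) are notation introduced here.

```latex
% Assumed timing model (not stated on the slide):
% serialization:  T_A = T_A(alone),              T_B = T_A(alone) - dt + T_B(alone)
% interruption:   T_A = T_A(alone) + T_B(alone), T_B = T_B(alone)
\begin{align*}
\text{With } f = T_A + T_B:\quad
  & f_{\text{int}} = T_A(\text{alone}) + 2\,T_B(\text{alone}), \\
  & f_{\text{ser}} = 2\,T_A(\text{alone}) + T_B(\text{alone}) - dt, \\
  & f_{\text{int}} < f_{\text{ser}} \iff dt < T_A(\text{alone}) - T_B(\text{alone}). \\[1ex]
\text{With } f = \frac{T_A}{T_A(\text{alone})} + \frac{T_B}{T_B(\text{alone})}:\quad
  & f_{\text{int}} = 2 + \frac{T_B(\text{alone})}{T_A(\text{alone})}, \\
  & f_{\text{ser}} = 2 + \frac{T_A(\text{alone}) - dt}{T_B(\text{alone})}, \\
  & f_{\text{int}} < f_{\text{ser}} \iff dt < \frac{T_A(\text{alone})^2 - T_B(\text{alone})^2}{T_A(\text{alone})}.
\end{align*}
```

In both cases the threshold is positive only when T_A(alone) > T_B(alone): under this model, interrupting pays off only when the arriving application has the shorter access and arrives early enough.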

Integration in MPICH
•  MPI_Init and MPI_Finalize overwritten in libcalciom.a
•  MPI_File_open(“myfile”)
   → MPI_File_open(“calciom:myfile”)
•  MPI_File_open(“pvfs2:myfile”)
   → MPI_File_open(“calciom:pvfs2:myfile”)
•  Connection between applications: could be done through MPI_Comm_connect/accept (ideally would benefit from MPI_Comm_iconnect/iaccept) + interaction with the job scheduler

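Experimental evaluation

2x2048 cores on Surveyor
•  App A: 4 files, 4 MB per file per process, contiguous layout
•  App B: 1 file, 4 MB per file per process, contiguous layout

Metric: $f = T_A + T_B$; interruption condition: $dt < T_A(\text{alone}) - T_B(\text{alone})$

[Figure panels: App B (small I/O load), App A (big I/O load)]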

Example of application

App B arrives first, App A is serialized after B.

Example of application

App B arrives during the write of the first 3 files of App A. The condition indicates that A should be interrupted.

The level of interruption produces different patterns.

Example of application

App B arrives during the last write of App A. The condition dictates that B is serialized after A.

Synthesis

CALCioM manages to improve the computational efficiency of the set of applications by avoiding interference, and thus improves the efficiency of the entire machine.

Conclusion

Conclusion
•  Interference between applications impacts system efficiency
•  CALCioM:
   •  Communication layer between independent applications
   •  Cross-application coordination through exchange of knowledge on I/O patterns
   •  Several policies implemented: FCFS, interruption

real life interference

Thank you! Questions?
