A Deep Learning tool for fast detector simulation · A Deep Learning tool for fast detector...
Transcript of A Deep Learning tool for fast detector simulation · A Deep Learning tool for fast detector...
![Page 1: A Deep Learning tool for fast detector simulation · A Deep Learning tool for fast detector simulation 1GangneungWonjuU. & CERN, 2CERN, 3Caltech ... (geant4) Intel Xeon Platinum 8180](https://reader035.fdocuments.net/reader035/viewer/2022062600/5b3926b97f8b9a5a178e1ecb/html5/thumbnails/1.jpg)
FedericoCarminati2,Gulrukh Khattak2,SofiaVallecorsa1,Jean-Roch Vlimant3
ADeepLearningtoolforfastdetectorsimulation
1GangneungWonju U.&CERN,2CERN,[email protected]
Simulation of particle transport through matter is fundamental for understanding the physics of High Energy Physics(HEP) experiments, as the ones at the Large Hadron Collider (LHC) at CERN. Currently, most of LHC worldwidedistributed CPU budget – in the range of half a million CPU-years equivalent – is dedicated to simulation. A fasterapproach is to treat Monte Carlo simulation as a black-box and replace it by a deep learning algorithm trained ondifferent particle quantities. Our project intends to test several DL techniques to achieve a speedup of at least x100with respect to Monte Carlo techniques.
Generative models, such as Generative Adversarial Networks (GAN) are particularly suited to replace Monte Carlo: theygenerate realistic samples modelling complicated probability distributions. They allow multi-modal output, they can dointerpolation and they are robust againstmissing data.
We can use Monte Carlo simulation to train GANs to reproduce realistic detector output
• Motivation
GenerativeAdversarialNetworksforfastsimulation
Oneofthefirst3DGANimplementations!
DistributedtrainingforHPC
Training:3DGANarenot“out-of-the-box”networks• Complextrainingprocess
FirstphysicsresultslookverypromisingPerformdetailedvalidationagainststandardMonteCarlocomparinghighlevelquantities(energyshowershapes)anddetailedcalorimeterresponse(singlecellresponse)Agreementisremarkable (afewpercent)!Discriminatorauxiliaryenergyregressiontaskhas5%accuracyoverthewholeenergyrange
Startwiththemosttimeconsumingsimulations:highgranularitycalorimeters(CLICdetectorstudies)*.Singleparticlesdepositenergyinanarrayofcalorimetercellsandgeneratea“energyshower”,interpretedasa3Dimage.
TheideaAchievem
entsDevelopm
ent
200Computingcentersin20countries:>600kcores
@CERN(20%WLCG):65kprocessorcores;30PB
GenerativeAdversarialNetworkssimultaneouslytraintwomodels:aGeneratorG andaDiscriminatorD´G reproducesthedatadistributionstartingfromrandom noise´DestimatestheprobabilitythatasamplecamefromthetrainingdataratherthanG
´TrainingprocedureforGistomaximizetheprobabilityofDmakingamistake
References
Dataisessentiallya3D
image
πshowerinCLICcalorimetersimulatedwithMonteCarloapproach25
2525
Generator
Discriminator
´ Goodfellow etal.2014´ ConditionalGAN,arXiv:1411.1744´ AuxiliaryClassifierGAN,arXiv:1610.0958
Image from:https://medium.com/@devnag/generative-adversarial-networks-gans-in-50-lines-of-code-pytorch-e81b79659e3f
*http://cds.cern.ch/record/2254048#
ComputingresourcesAlltestsrunwithInteloptimised Tensorflow 1.4.1.+keras 2.1.2
Systems:[email protected](28physicalcores)NVIDIAGeForceGTX1080
Timetocreateanelectronshower
Method Machine Time/Shower (msec)
FullSimulation(geant4) IntelXeonPlatinum8180 17000
3dGAN(batchsize128) IntelXeonPlatinum8180 7
3dGAN(batchsize 128) GeForceGTX1080 0.04
3dGAN(batchsize 128)
Intel [email protected](MacBookPro) 66
Timetotrainfor30epochs
Method Machine Trainingtime(days)
3dGAN(batchsize 128)
Intel XeonPlatinum8180
(Inteloptimised TF)30
3dGAN(batchsize 128) GeForceGTX1080 1
Inference: Usingatrainedmodelisveryfast!• OrdersofmagnitudefasterthanstandardMonteCarlo
• TestinferenceonFPGAandintegratedaccelerators
ImplementdataparallelismandstudyscalingonclustersModifympi basedlibrary(mpi-learn(#))toparallelise adversarialtrainingprocessPreliminaryscalingmeasuredatCSCSSwissNationalSuperComputingCenter
Picturefrom:https://github.com/duanders/mpi_learn #https://github.com/duanders/mpi_learn
Howgenericournetworkcanbe?• Ourbaselineisanexampleofnextgeneration
calorimeterdetector• Extendtoothercalorimeters• Exploreoptimalnetworktopologyaccordingto
theproblemtosolve• Hyper-parametersscansandmeta-
optimization• Fasttraining
Generalisation
Longitudinalshowershapefor400GeVelectron
PrototypeoftheSemiDigital HadronicCalorimeterduringtestsattheCERNSPSfacility
GeneratorandDiscriminatorarebasedon3Dconvolutions.Theshower shape dependsonparticletypeandenergysoweconditiontrainingonparticleenergyAuxiliaryenergyregressiontaskfordiscriminator
ElasticAverageSGD
Calorimeterenergyresponse:GANpredictionstaysstablethrough20nodes!
MonteCarlo1GPUGAN5GPUGAN10GPUGAN20GPU
Partofthisworkwasconductedat"iBanks",theAIGPUclusteratCaltech.WeacknowledgeNVIDIA,SuperMicro andtheKavli Foundationfortheirsupportof“iBanks”
ThisworkispartiallyfundedbyIntelParallelComputingCenterprogram
Calorimeterresponseasafunctionofprimaryelectronenergy
3DGANforcalorimetersimulation
3DGANadversarialtrainingprocess Start
GenerateFakeImageBatch
TrainDiscriminatoronrealimages
TrainDiscriminatoronfakeimages
DiscriminatorLossisaverageofbothlosses
Traingeneratortwice
Epoch<num_epochs
Longitudinalshowershapefor100GeVelectron
Discriminatorenergyregressionaccuracyasafunctionofprimaryelectronenergy
Transverseshowershapefor100-500GeVpions
(Ep-E d
iscr)/E p
JHEP11(2017)047