Overview of EGEE and gLite-1.ppt - Indico · 2018. 11. 16. · Enabling Grids for E-sciencE NA3...
-
Enabling Grids for E-sciencE
Overview of the EGEE project and the gLite middleware
EGEE-III INFSO-RI-222667
www.eu-egee.org
-
Outline
• What is EGEE?
– The project
– The infrastructure
• gLite middleware
• EGEE applications
• Sources of further information
-
Defining the Grid
• A Grid is the combination of networked resources and the corresponding middleware, which provides services for the user.
Providing a Production Grid Infrastructure for Collaborative Science
-
The EGEE Project
• Aim of EGEE: “to establish a seamless European Grid infrastructure for the support of the European Research Area (ERA)”
• EGEE
– 1 April 2004 – 31 March 2006
– 71 partners in 27 countries, federated in regional Grids
• EGEE-II
– 1 April 2006 – 30 April 2008
– Expanded consortium
• EGEE-III
– 1 May 2008 – 30 April 2010
– Transition to sustainable model
-
EGEE working with related infrastructure projects
[Figure: EGEE and related infrastructure projects; the only label recoverable from the figure is GIN]
-
What is happening now?
Real Time Monitor
– Java tool
– Displays jobs running (submitted through RBs)
– Shows jobs moving around a world map in real time, along with changes in status
http://gridportal.hep.ph.ic.ac.uk/rtm/ (snapshot 16 January 2007)
-
EGEE Infrastructures
• Production service
– Scaling up the infrastructure with resource centres around the globe
– Stable, well-supported infrastructure, running only well-tested and reliable middleware
• Pre-production service
– Run in parallel with the production service (restricted number of sites)
– First deployment of new versions of the gLite middleware
– Test-bed for applications and other external functionality
• T-Infrastructure (Training & Education)
– Complete suite of Grid elements and application (Testbed, CA, VO, monitoring, support, …)
– 20 sites on 3 continents
– Everyone can register and use GILDA for training and testing
-
NA3 activity: User training and induction
• Expand portfolio of training materials & courses
• Train a wide variety of EGEE users (internal/external)
• Develop effective mechanisms for training end-users of the EGEE infrastructure
• Collaborate in cross-activity initiatives
– ICEAGE Project Digital Library
– http://library.iceage-eu.org/
– Videos, MP3 talks on grid computing
• http://www.egee.nesc.ac.uk/
– Training events
– Training material repository
• http://egee.lib.ed.ac.uk/
– EGEE Digital Library
– Repository of training materials
29 active partners, ~29 FTEs, 89 individuals, 6 federations
[Pie chart: percentage share by activity: NA1 2%, NA2 5%, NA3 8%, NA4 19%, NA5 1%, SA1 49%, SA2 2%, SA3 9%, JRA1 5%]
-
NA4 Activity: Application identification and support
• Application Identification and Support (NA4)
– 25 countries, 40 partners, 280+ participants, 1000s of users
• Support the large and diverse EGEE user community:
– Promote dialog: Users’ Forums & EGEE Conferences
– Technical Aid: Porting support, procedural issues
– Liaison: Software and operational requirements
• Main activities:
– 5 application clusters: HEP, Life sciences, Astronomy & astrophysics, Earth science, Computational chemistry, Fusion, Grid observatory
– Support:
  Application porting support: www.lpds.sztaki.hu/gasuc
  VO support
  Direct user support: www.ggus.org
  Regional support
• http://egeena4.lal.in2p3.fr
-
EGEE Infrastructure
[Figure: map of countries participating in EGEE]
[Charts: number of cores and number of sites, April 2004 to April 2008]
> 200 sites in 40 countries
~ 38 000 CPUs
~ 5 PB storage
98k jobs/day
> 200 Virtual Organizations
⇨ The world’s largest multi-disciplinary Grid
-
Resource management: structure
• Operations Coordination Centre (OCC)
– management, oversight of all operational and support activities
• Regional Operations Centres (ROC)
– providing the core of the support infrastructure, each supporting a number of resource centres within its region
– Grid Operator on Duty
• Resource centres
– providing resources (computing, storage, network, etc.)
• Grid User Support (GGUS)
– At FZK, coordination and management of user support, single point of contact for users
-
VO concept
• gLite middleware runs on each shared resource to provide
– Data services
– Computation services
– Security service
• Resources and users form Virtual organisations: basis for collaboration
• Distributed services (both people and middleware) enable the grid
[Figure: shared resources connected through the Internet]
-
Grid middleware
• The Grid relies on advanced software, called middleware, which interfaces between resources and the applications
• The Grid middleware:
– Basic services
  Secure and effective access to resources
– High level services
  Optimal use of resources
  Authentication to the different sites that are used
  Job execution & monitoring of progress
  Problem recovery
  Transfer of results back to the user
-
Grid Middleware
• When using a PC or workstation you
– Login with a username and password (“Authentication”)
– Use rights given to you (“Authorisation”)
– Run jobs
– Manage files: create them, read/write, list directories
• Components are linked by a bus
• Operating system
• One admin. domain

• When using a Grid you
– Login with digital credentials – single sign-on (“Authentication”)
– Use rights given to you (“Authorisation”)
– Run jobs
– Manage files: create them, read/write, list directories
• Services are linked by the Internet
• Middleware
• Many admin. domains
-
EGEE Middleware: gLite
• gLite 3.0, gLite 3.1
⇨ Merger of LCG 2.7 and gLite 1.5
– Exploit experience and existing components from VDT (Condor, Globus), EDG/LCG, and others
– Develop a lightweight stack of generic middleware useful to EGEE applications (HEP and Biomedics are pilot applications)
  Should eventually deploy dynamically (e.g. as a Globus job)
  Pluggable components – cater for different implementations
– Focus is on providing a stable and usable infrastructure
-
Basic gLite use case: Job submission
[Diagram: job submission flow between the User Interface, the Resource Broker, the Information System, the File and Replica Catalog, Logging and bookkeeping, the VO Management Service (DB of VO users), and the Computing Element and Storage Element at Site X]
Labels recoverable from the diagram:
– create proxy
– Submit job (executable + small inputs)
– query (twice)
– Submit job
– publish state
– process
– Input file(s)
– Output file(s)
– Register file
– Job status
– Logging
– Retrieve output
– Retrieve status & (small) output files
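The "executable + small inputs" that the user submits are described to the workload management system in a JDL (Job Description Language) file. The sketch below is illustrative only: the helper names `jdl_list` and `make_jdl` and the file names are assumptions and are not part of gLite. It shows how the executable, the small input files (input sandbox) and the small output files (output sandbox) map onto common JDL attributes.

```python
def jdl_list(items):
    """Format a Python sequence as a JDL list, e.g. {"a","b"}."""
    return "{" + ",".join(f'"{item}"' for item in items) + "}"


def make_jdl(executable, arguments="", input_sandbox=(),
             output_sandbox=("std.out", "std.err")):
    """Build the text of a minimal JDL job description (hypothetical helper)."""
    lines = [f'Executable = "{executable}";']
    if arguments:
        lines.append(f'Arguments = "{arguments}";')
    # Standard output and error are captured in files on the worker node.
    lines.append('StdOutput = "std.out";')
    lines.append('StdError = "std.err";')
    if input_sandbox:
        # Small input files shipped with the job from the User Interface.
        lines.append(f"InputSandbox = {jdl_list(input_sandbox)};")
    # Small output files the user retrieves after the job has finished.
    lines.append(f"OutputSandbox = {jdl_list(output_sandbox)};")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(make_jdl("/bin/hostname"))
```

On a gLite 3.1 User Interface a file like this would typically be submitted with the WMS command-line tools after creating a proxy; the gLite user guide lists the exact commands and options for a given installation.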
-
Main components
User Interface (UI): The place where users log on to the Grid
Resource Broker (RB) (Workload Management System (WMS)): Matches the user requirements with the available resources on the Grid
Information System: Characteristics and status of CE and SE
File and replica catalog: Location of grid files and grid file replicas
Logging and Bookkeeping (LB): Log information of jobs
Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed
Storage Element (SE): Provides (large-scale) storage for files
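The matching step performed by the Resource Broker can be pictured as "filter, then rank". The following is a toy sketch, not the WMS algorithm: the `ComputingElement` fields, the `match` function, the site names and the ranking rule (most free CPUs first) are all assumptions chosen for illustration. In the real system, the characteristics and status of each CE come from the Information System.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ComputingElement:
    """Hypothetical summary of what an information system might publish for a CE."""
    name: str
    free_cpus: int
    max_wall_minutes: int
    supported_vos: frozenset


def match(ces, vo, min_wall_minutes):
    """Return the CEs that satisfy the job's requirements, best candidate first."""
    candidates = [
        ce for ce in ces
        if vo in ce.supported_vos                    # user's VO is authorised
        and ce.max_wall_minutes >= min_wall_minutes  # queue is long enough
        and ce.free_cpus > 0                         # something is free now
    ]
    # Assumed ranking: prefer the CE with the most free CPUs.
    return sorted(candidates, key=lambda ce: ce.free_cpus, reverse=True)


CES = [
    ComputingElement("ce.site-a.example", 12, 720, frozenset({"biomed", "atlas"})),
    ComputingElement("ce.site-b.example", 40, 120, frozenset({"biomed"})),
    ComputingElement("ce.site-c.example", 0, 2880, frozenset({"biomed"})),
    ComputingElement("ce.site-d.example", 30, 1440, frozenset({"atlas"})),
]

if __name__ == "__main__":
    for ce in match(CES, vo="biomed", min_wall_minutes=60):
        print(ce.name, ce.free_cpus)
```

With the sample data, a short "biomed" job matches site B and then site A; a job needing 600 minutes matches only site A.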
-
Main components
All of the components listed on the previous slide are built upon authorisation, authentication and security.
-
Who provides the resources?!
Service | Provider | Note
User interface | User / institute / VO | Computer with client SW
Resource Broker (WMS) | VOs | EGEE does not fund RBs
Information System | Grid operations | EGEE funded effort
File and replica catalog | VOs | EGEE does not fund catalogs
Logging and Bookkeeping | VOs | EGEE does not fund LB servers
Computing Element (CE) | VOs | EGEE does not fund CEs; VOs provide resources to match average need
Storage Element (SE) | VOs | EGEE does not fund SEs; VOs provide resources to match average need
External services | User / institute / VO | To extend the capabilities of the core infrastructure
-
Empowering VOs
Where computer science meets the application communities!
– Recommended External Software Packages for Egee CommuniTies (RESPECT)
– Current RESPECT tools: GridWay, P-GRADE Portal, GANGA, GRelC, i2glogin
– http://egeena4.lal.in2p3.fr/ “Grid software” menu
[Diagram: software layers labelled Application; Application toolkits; Command line & APIs; Higher-level gLite services (WMS, …); Basic gLite services: CE, SE, info, security. The production infrastructure contains these services.]
– Basic services: must be complete and robust; should not assume the use of higher-level Grid services
– High level services: help users build their computing infrastructure but should not be mandatory
-
EGEE Applications
• >270 VOs from several scientific domains
– Astronomy & Astrophysics
– Civil Protection
– Computational Chemistry
– Comp. Fluid Dynamics
– Computer Science/Tools
– Condensed Matter Physics
– Earth Sciences
– Fusion
– High Energy Physics
– Life Sciences
• Further applications under evaluation
Applications have moved from testing to routine and daily usage
~80-95% efficiency
-
Application families
• Simulation
– Large number of similar, independent jobs – parameter study
• Bulk Processing
– Widely-distributed input data, sophisticated data management
• Workflow
– Complex dependencies between individual tasks
• Legacy Applications
– Licenses: control access to software on the grid
– No recompilation ⇒ no direct use of grid APIs
• Parallel Jobs
– Many CPUs needed simultaneously, use of MPI libraries
– Limited support in gLite: MPI configuration is not uniform
• Responsive Apps.
– Short response time
– No real support in gLite ⇒ Interactive Grid FP6 project
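The "Simulation" family, a parameter study made of many similar, independent jobs, can be sketched as the Cartesian product of the parameter values, with one job per combination. This is a minimal illustration under assumed names: `parameter_study`, the `simulate.sh` executable and the parameters `energy` and `seed` are hypothetical and do not come from gLite or from any EGEE application.

```python
from itertools import product


def parameter_study(executable, grid):
    """Expand a dict of parameter -> list of values into independent job specs."""
    names = sorted(grid)  # fixed order so the job list is reproducible
    jobs = []
    for values in product(*(grid[name] for name in names)):
        arguments = " ".join(f"--{n}={v}" for n, v in zip(names, values))
        jobs.append({"executable": executable, "arguments": arguments})
    return jobs


if __name__ == "__main__":
    jobs = parameter_study("simulate.sh", {"energy": [10, 20, 50], "seed": [1, 2]})
    print(len(jobs))  # 3 energies x 2 seeds = 6 jobs
    for job in jobs:
        print(job["executable"], job["arguments"])
```

Because the jobs do not depend on each other, each one can be submitted separately and run at whichever site has free capacity.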
-
Further information, references
• EGEE
– http://www.eu-egee.org/
• gLite middleware
– http://www.glite.org
• gLite manuals, documentation
– http://glite.web.cern.ch/glite/documentation/ (gLite user guide)
• Recommended External Software Packages for Egee CommuniTies (RESPECT)
– http://egeena4.lal.in2p3.fr/
• Description of work of EGEE-III
– https://edms.cern.ch/document/886385/4
-
Summary
• EGEE is running the largest multi-VO grid in the world!
– Creating the “grid layer” in e-Infrastructure for research, public service and industry
• Key concepts for EGEE
– Sustainability – planning for the long-term
– Production quality
– User support
• EGEE’s middleware: gLite. Current version 3.1
– Basic middleware services
– High level middleware services
• External software to foster uptake of technology