EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE The gLite middleware distribution OSG...

17
EGEE-II INFSO-RI- 031688 Enabling Grids for E-sciencE www.eu-egee.org www.glite.org The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23 August 2006

Transcript of EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE The gLite middleware distribution OSG...

Page 1: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

EGEE-II INFSO-RI-031688

Enabling Grids for E-sciencE

www.eu-egee.orgwww.glite.org

The gLite middleware distribution

OSG Consortium Meeting

Seattle, 21-23 August 2006

Page 2: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 2

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Outline

• Background and approach adopted

• Architecture

• Software process

• Status

• Summary

Page 3: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 3

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Background & Approach

• gLite– Exploit experience and existing

components from VDT (Condor, Globus), EDG/LCG, AliEn, and others

– Develop a lightweight stack of generic middleware useful to EGEE applications (HEP and Biomedics are pilot applications). Pluggable components – cater for different implementations Follow SOA approach, WS-I compliant where possible

– Focus is on re-engineering and hardening

– Business friendly open source license Plan to switch to Apache-2

Page 4: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 4

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Service Oriented Architecture

• gLite follows a Service Oriented Architecture – Facilitate interoperability among Grid

services – Allow easier compliance with

upcoming standards– The services work together in a

concerted way but can also be deployed and used independently, allowing their exploitation in different contexts

• Services communicate through the exchange of messages– Slowly moving to WS-* interfaces– Still missing a real standard. Many

WS-* specifications– Activity inside GGF-GIN

GIN

Page 5: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 5

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Middleware structure

• Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware

• Higher-Level Grid Services are supposed to help the users building their computing infrastructure but should not be mandatory

• Foundation Grid Middleware will be deployed on the EGEE infrastructure– Must be complete and robust– Should allow interoperation

with other major grid infrastructures

– Should not assume the use of Higher-Level Grid Services

Foundation Grid Middleware

Security model and infrastructure

Computing (CE) and Storage Elements (SE)

Accounting

Information and Monitoring

Higher-Level Grid Services

Workload Management

Replica Management

Visualization

Workflow

Grid Economies

...

Applications

Page 6: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 6

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Workload ManagementData Management

SecurityInformation & Monitoring

Access

gLite Grid Middleware Services

API

ComputingElement

WorkloadManagement

MetadataCatalog

StorageElement

DataMovement

File & ReplicaCatalog

Authorization

Authentication

Information &Monitoring

Application

MonitoringAuditing

JobProvenance

PackageManager

CLI

Accounting

Site Proxy

Overview paper http://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-001.pdf

Page 7: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 7

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Grid foundation: Accounting

• Resource usage by VO, group or single user– Resource metering:

sensors running on resources to determine usage

– Pricing policies: associate a cost to resource usage if enabled allowed market-

based resource brokering

– privacy: access to accounting data granted only to authorized people (user, provider, VO manager)

• Basic functionality in APEL, full functionality in DGAS

Page 8: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 8

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Grid foundation: Computing Element

The CE software• accepts batch jobs (and job control

requests) through a gatekeeper LCG-CE (GT2 GRAM + GSI-enabled

Condor) gLite-CE (GSI-enabled Condor-C) CREAM (WS-I based interface)

• performs the necessary AAA operations and map to a local user through LCAS/LCMAPS and the GRAM or

glexec

• passes the job to a layer that interacts with the local resource manager BLAH

• monitors the status of the jobs and reports it to the client Condor CEMon (in CREAM)

Web service interface to the CE info

Client

LRMSWN

MonitoringComputing

Element

AAA andlocal mapping

JobController

Gri

dS

ite

Page 9: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 9

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Grid foundation: Storage Element

• Site File Name (SFN): identifies a Storage Element and the logical name of the file inside it

• Physical File Name (PFN): argument of file open• Storage Resource Manager (SRM)

– hides the storage system implementation (disk or active tape)– checks the access rights to the storage system and the files– translates SFNs to PFNs– disk-based: DPM, dCache; tape-based: Castor, dCache

• File I/O: posix-like access from local nodes or the gridGFAL

Page 10: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 10

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

High Level Services: EDS

• Encrypted Data Storage– encrypt and decrypt data on-the-fly– Key-store: Hydra

N instances: at least M (<N) need to be available for decryption

• fault tolerance and security

• Demonstrated with the SRM-DICOM demo at EGEE Pisa conference (Oct’05)

Will be GFAL

Will be LFC

Will be DPM

(now d-Cache)

Page 11: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 11

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

High Level Services: Workload Manag.

• Resource brokering, workflow management, I/O data managementWeb Service interface: WMProxy– Task Queue: keep non matched jobs– Information SuperMarket: optimized cache of information system– Match Maker: assigns jobs to resources according to user

requirements– Job submission & monitoring

Condor-GICE (to CREAM)

– External interactions: Information System Data Catalogs Logging&Bookkeeping Policy Management

system (G-PBox)

Page 12: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 12

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

High Level Services: Job Information

• Logging and Bookkeeping service– Tracks jobs during their lifetime (in terms of events)

• Job Provenance stores long term job information– Supports job rerun

Page 13: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 13

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

SA3 Testing & Certification

Functional Tests

Testbed Deployment

gLite Software Process

JRA1 Development

SoftwareError Fixing

SA3 Integration

Deployment Packages

Integration Tests

Installation Guide, Release Notes, etc

SA1 Pre-Production

Scalability Tests

Pre-Production Deployment

Fail

Fail

Fail

Pass

Pass

Pass

SA1 Production Infrastructure

Release

Problem

Serious problem

Directives

Page 14: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 14

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

gLite Software Process• Technical Coordination Group (TCG)

– gathers & prioritizes user requirementsfrom HEP, biomed, (industry), sites

– gLite development is client-driven!• Software from EGEE-JRA1 and other projects

– JRA1 preview testbed (currently being set up) early exposure to users of “uncertified” components

• SA3 Integration Team– Ensures components are deployable and work– Deployment Modules implemented high-level gLite node types

(WMS, CE, R-GMA Server, VOMS Server, FTS, etc)

– Build system now spun off into the ETICS project (Jan 2006)• SA3 Certification Team

– Merge of the JRA1 testing and SA1 certification teams– Dedicated testbed; test release candidates and patches– Develop test suites

• SA1 Pre-Production System– Scale tests by users

Page 15: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 15

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

gLite status

• Convergence of LCG 2.7.0 and gLite 1.5.0 in spring 2006– continuity on the production infrastructure

ensured usability by experiments– new features from gLite 1.5.0

• Current activities– Improve usability, efficiency and performance– Migration to VDT 1.3.11 (GT4 pre-WS)– Support for Scientific Linux 4 and 64-bit

Support for other platforms will follow– New data management components for Biomed

applications on the production infrastructure– Certify new components (CREAM, Job

Provenance, GPBox, ...)• Interoperation with other projects and

adherence to standards• Open source (Apache) license

LCG-2

prototyping

prototyping

product

20042004

20052005 product

gLite

20062006 gLite 3.0

Page 16: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 16

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

Summary

• gLite 3 is an important milestone in EGEE program– New components from gLite 1.5 being deployed for the first time

on the Production Infrastructure Address requirements in terms of functionality and scalability Components deployed for the first time need extensive testing!

– New organization in EGEE II more controlled software process development is client driven (TCG)

• Development is continuing to provide increased robustness, usability and functionality

• Collaboration with other projects for interoperability and definition/adoption of international standards

Page 17: EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE   The gLite middleware distribution OSG Consortium Meeting Seattle, 21-23.

OSG Consortium Meeting - Seattle - 21-23 August 2006 17

Enabling Grids for E-sciencE

EGEE-II INFSO-RI-031688

www.glite.org