Download - Storage and Data Movement at FNAL D. Petravick CHEP 2003.

Transcript
Page 1: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

Storage and Data Movement at FNAL

D. PetravickCHEP 2003

Page 2: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 2

FNAL Overall Goals

• Provide competent one copy permanent store to all the Lab’s experiments.

• Provide scalable and performant data flows– Tape and Disk– Local area and Wide area.

• Provide standard interfaces allowing for interoperation with other sites.

• Collaborate! (DESY > 5 yrs) GLOBUS, LBL, JLAB, CERN

Page 3: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 3

STK, ADIC libraries; STK, IBM/LTO tape drives

dCache: Buffering and Caching

Enstore: (staging files from tape)

PNFS Name Space

SRM (weak, kerberized,grid) FTP

AnywhereClents

dCap

“local” Clients

encp NFS (not data)

Local Clients

Page 4: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 4

Overview

• Central Storage Systems capacious and scalable enough to be a hub of a data intensive system.– Linux as a hardware platform.– Permanent and temporary semantics.

• Competent local and GRID interfaces.ENCP, dCapFTP(s) and SRM (Storage Resource Manager)investigating GLUE schema and monitoriing

Page 5: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 5

Page 6: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 6

Page 7: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 7

Simplified structure of Dcache

Read poolRead pool

Write poolWrite pool

User

Grid FTP SRM Kerberos FTPFTPDCCPDCAP APIDcachedoors

13March 19, 2003

Page 8: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 8

6/11/2002 D. Petravick -- THIC -- NCAR 18

Bonus Disk -- dFarm

• Exploit the excess disk + competent networks we are currently blessed with.

• Provide redundant, temporary storage, robust against failure of any farm node

Page 9: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 9

RunII Work Plan

• SAM– Grid FTP integration with SAM.– DCAP integration– SRM integration with SAM.

• SRM integration with legacy CDF AC++

• Restore rates for CDF experiment.• Provide hardware for D0 Wide area

transfers.

Page 10: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 10

US CMS Work Plan

• Use dCache as a vehicle for Grid File access locally.

• Evaluate Storage elements for Tier II centers (dCache, DRM, NeST EDG wp 5 , Dfarm).

• Follow, help formulate LCG requirements.• Detailed work to meet CMS Data Challenge

on CMS deadlines.• Monitor and Improve Network

– Pinger IEPM (co-project)– FAST and other h/p TCP stacks. (co-project)

Page 11: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 11

Page 12: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 12

Grid Side

Storage Element

Storage Element

Worker Side

Job-Aware, Replica-Aware MdwPermanent Store Replica Stores

FTP Inter-face

SRMInter-face

DirectFile Access Interface

Monitoring Interface

Framework + job

Page 13: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 13

Lattice Gauge Work Plan

• Support the Fermilab Lattice Computation Facility.

• Use GRID FTP, SRM to integrate with – JLAB (cluster) , BNL (QCDOC) facilities.

• Understand relationship of storage systems to QIO (community I/O package)

• Investigate utility of caching data at FNAL.• Investigate symmetries with earth science

type system.

Page 14: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 14

Other Work Elements

• Work on Grid FTP V1.1 specification.– Include work on a few scaling issues for

“seas of linux box” type systems.

• Work on SRM protocol, futures.– V2.1 (co – pi @ FNAL)

• Look at “Object Based Storage” in conjunction with US CMS.– Proposed investigation is root integration.

• Understand the requirements of analysis systems.

Page 15: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 15

Other work elements…

• Support Grid protocols for dFarm.– Make it a “storage element”– Another innovative package.

• Experimenting with Grid Authentication.• Prepare for special routing on the WAN

– E.g. lambda networking.

• Support the experimental community at FNAL.– Auger in GridFTP production.– Minos in Kerberized, Weak FTP.

Page 16: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 16

Page 17: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 17

Advanced Monitoring

• Animations were Crucial for debugging The RUN II LAN based data systems.

• Looking for analogous tools for grid based works

• Follow items such as GLUE.

Page 18: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 18

Advanced Network Integration

• FNAL proposes to have two paths off site.

• Path to Starlight will allow R&D on advanced network concepts (e.g. lambda network)

• Central data movement systems will be an early user.

Page 19: Storage and Data Movement at FNAL D. Petravick CHEP 2003.

3/25/2003 DLP -- CHEP 2003 19

Summary

• Very substantial and successful support of FNAL Program.

• Very proactive work on fabric side.

• Data in Movement are interesting.

• Goal :Routine – performant flows– on LAN and WAN– with interoperation