Ogf27 Ligo

14
Open Grid Forum 27 Banff, Canada Requirements for Future Cyber-Infrastructure Workshop Tuesday, October 13th, 2009 LIGO-G0900945

Transcript of Ogf27 Ligo

Page 1: Ogf27 Ligo

Open Grid Forum 27

Banff, Canada

Requirements for Future Cyber-Infrastructure Workshop

Tuesday, October 13th, 2009

LIGO-G0900945

Page 2: Ogf27 Ligo

Brief description of the project.

Where are the project participants from?

How is the e-infrastructure provided?

What level of interoperability is required: data, compute, service?

Page 3: Ogf27 Ligo

Scientific Mission: Directly detect the oscillations of space-time predicted by Einstein’s General Theory of Relativity

Two Observatory Sites: Hanford, WA & Livingston, LA L-shaped arms 4 kilometers long on a side.

Largest high vacuum system on the Earth’s surface.

Able to measure length changes on one one-thousandth the size of a proton over the 4 km distance

Instrumentation records roughly one and a quarter Terabytes per day

Funded by the National Science Foundation

Page 4: Ogf27 Ligo

Compact Binary (Black Holes, Neutron Stars) Coalescence Periodic (Pulsar) Sources

Unmodeled Bursts (Supernovae) Gravitational Stochastic (Big Bang) Background

Page 5: Ogf27 Ligo

LHO LLO

VIRGO GEO600

TAMA300 AIGO

Page 6: Ogf27 Ligo

The LIGO Scientific Collaboration (LSC) is a dynamic group of approximately 700 scientists from approximately 70 institutions worldwide who have joined together in the search for gravitational waves from the the most violent events in the universe.

The LIGO Data Grid (LDG) is a collection of computing resources owned and managed by the LSC Provides computing infrastructure necessary for analysis

of LIGO data in the search for gravitational waves.

Several sites are (will be) part of the Open Science Grid.

Page 7: Ogf27 Ligo

LIGO Hanford

LIGO Caltech

LIGO Livingston

LIGO MIT

Syracuse

UW Milwaukee

Cardiff

AEI Hannover

AEI Berlin

The LIGO Scientific Collaboration operates 9 clusters, with a total of approximately 17,000 cores in total.Condor is deployed on all LDG sites.

Page 8: Ogf27 Ligo

Architecturally, the LIGO Data Grid relies on a fundamental design choice: Replication & Cataloging of Science Data to all centers as a

service Users are not entangled by issues of getting data to their

jobs/workflows

Satellite tools are needed to provide integrity and interoperability Source Code Version Control and Repositories Data Analysis Tool Distributions compatible with user

platforms and skills Web servers, Wikis, Document Databases, Data Analysis

Databases, Audio/Video Conferencing, …

Page 9: Ogf27 Ligo

The LIGO Data Grid is scaled to provide full coverage of the Compact Binary Coalescence (CBC) analysis This doesn’t have head room for large new analysis

models As a baseline scope, this also supports Burst, Stochastic

and a limited (targeted) form of the Periodic analyses

LIGO also incorporates additional cyber-infrastructures to assure flexibility and new thinking BOINC Applications

Periodic searches could utilize every computer on Earth!

Open Science Grid Many common services & software with the LIGO Data Grid

Page 10: Ogf27 Ligo

Reaches out to the the masses with personal computers and idle compute cycles looking for an application

Einstein@Home is a World Year of Physics 2005 and an International Year of Astronomy 2009 project supported by the American Physical Society (APS) and by a number of international organizations.

Analysis is dominated by large FFTs.

Half a million registered participants.

Open Science Grid ranked #2 contributorthrough adaption of BOING to submissionmechanism onto grids.

Page 11: Ogf27 Ligo

LIGO is a key stakeholder in the OSG

OSG is used opportunistically by LIGO to gain additional computing resources for two of LIGO’sanalyses:

LIGO Applications

Opens Science Grid

Einstein@Home on the OSG Compact Binary Coalescence Workflows (60,000 DAG nodes)

Page 12: Ogf27 Ligo

LIGO’s Compact Binary Coalescence Analysis relies on in-house toolsto generate workflows with complex interdependencies between jobs and data. A typical analysis run involves tens to hundreds of thousands of DAG nodes requiring roughly a week to run each workflow.

Page 13: Ogf27 Ligo

The enormous variety of interfaces, services, tools making up the cyber-infrastructure used by the LIGO Scientific Collaboration introduces a challenging authentication/authorization problem Dozens of credentials are need by each participant to be

able to conduct daily science mission oriented tasks

LIGO is actively pursuing a single Auth(2) solution under the purview of the “Auth-Project” Still under active development Facing many challenges from existing infrastructure

policies, requiring adoption of redundant or home grown technologies to support user community

Page 14: Ogf27 Ligo

Rules that are easy to master.

Ability to distribute objects/apps transparently around the cyber-infrastructure.

Evolution when objects warrant.

Sufficiently sophisticated to support complex goals.

Have some fun on the way!