Computing and Storage Resources at the San Diego Supercomputer Center

SAN DIEGO SUPERCOMPUTER CENTER NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE Computing and Storage Resources at the San Diego Supercomputer Center Natasha Balac, Ph.D. UC Associates Program August 3, 2004


Transcript of Computing and Storage Resources at the San Diego Supercomputer Center

Page 1: Computing and Storage Resources at the San Diego Supercomputer Center

SAN DIEGO SUPERCOMPUTER CENTER

NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE

Computing and Storage Resources at the San Diego Supercomputer Center

Natasha Balac, Ph.D.
UC Associates Program

August 3, 2004

Page 2: Computing and Storage Resources at the San Diego Supercomputer Center

What is SDSC?

• Founded in 1985
• One of three NSF-funded supercomputer centers
• Provides resources to national academic and non-profit community above and beyond what an individual university can provide

• Peer-review proposal system, no cost to academics/non-profits

• But we are much more

Page 3: Computing and Storage Resources at the San Diego Supercomputer Center

• Employs nearly 400 researchers, staff and students

• Leading edge site for NSF’s National Partnership for Advanced Computational Infrastructure (NPACI)

• One of 5 sites of NSF’s TeraGrid/ETF project

• Home of many associated activities including
  – Protein Data Bank
  – Alliance for Cell Signaling
  – Cooperative Association for Internet Data Analysis (CAIDA)
  – High Performance Wireless Research and Education Network (HPWREN)
  – Geosciences Network (GEON)
  – Joint Center for Structural Genomics
  – Protein Kinase Resource, etc.

SDSC

Page 4: Computing and Storage Resources at the San Diego Supercomputer Center

SDSC’s Mission

• To develop and use technology to advance science

• We do this through our use of
  – Hardware
  – Software
  – Expertise / Personnel
  in computation, data management and visualization

Page 5: Computing and Storage Resources at the San Diego Supercomputer Center

A Range of Hardware Resources

• 14 Tflops of aggregate compute power
  – Nearing 100 Tflops across all NSF centers
• 5.7 TB aggregate memory
• 500 TB SAN file systems
• 6 Petabytes of tape archive
• Thousands of active users
• Part of U.S. TeraGrid initiative
• 40 Gb/s backbone connects the center to other supercomputer centers
• Wide range of HPC applications

Page 6: Computing and Storage Resources at the San Diego Supercomputer Center

New Powerful IBM System

DataStar
Peak speed:         ~10 Tflops
Processors:         11 32-way, 176 8-way; 1.7 and 1.5 GHz Power4
Memory:             ~4.2 TB
Disk:               540 TB, 5-10 GB/s
Latency/Bandwidth:  <10 µs, 2 GB/s
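As a rough cross-check of the DataStar figures, the short sketch below adds up the processor count from the node counts on the slide and estimates a peak rate. The node counts, node widths, and clock speeds come from the slide. Two things are assumptions made only for this illustration: that the 32-way nodes run at 1.7 GHz and the 8-way nodes at 1.5 GHz, and that each Power4 processor completes 4 floating-point operations per cycle.

```python
# Node counts, widths, and clock speeds are taken from the slide.
# ASSUMPTION: 32-way nodes run at 1.7 GHz and 8-way nodes at 1.5 GHz.
NODE_GROUPS = [
    {"nodes": 11, "cpus_per_node": 32, "ghz": 1.7},
    {"nodes": 176, "cpus_per_node": 8, "ghz": 1.5},
]

# ASSUMPTION: 4 floating-point operations per cycle per processor
# (two fused multiply-add units).
FLOPS_PER_CYCLE = 4


def total_processors(groups):
    """Sum processors over all node groups."""
    return sum(g["nodes"] * g["cpus_per_node"] for g in groups)


def peak_tflops(groups, flops_per_cycle=FLOPS_PER_CYCLE):
    """Theoretical peak in Tflops: processors x GHz x flops per cycle."""
    gflops = sum(
        g["nodes"] * g["cpus_per_node"] * g["ghz"] * flops_per_cycle
        for g in groups
    )
    return gflops / 1000.0


print(total_processors(NODE_GROUPS))       # 1760
print(round(peak_tflops(NODE_GROUPS), 2))  # 10.84
```

Under these assumptions the estimate lands close to the "~10 Tflops" quoted for the system.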

Page 7: Computing and Storage Resources at the San Diego Supercomputer Center

TeraGrid Compute Resources

[Figure: diagram of compute, storage, and visualization resources at the TeraGrid sites NCSA, Caltech, SDSC, PSC, and ANL. Labels include 2p Madison clusters on Myrinet, FCS SAN storage (230 TB and 500 TB), a 1.1 TF Power4 Federation system, 750 4p Alpha EV68 on Quadrics, 4 32p EV7 Marvel, 96 GeForce4 graphics pipes, a 100 TB DataWulf, and 4 lambdas.]

Page 8: Computing and Storage Resources at the San Diego Supercomputer Center

TeraGrid Application Targets

• Usage exemplars
  – “traditional” supercomputing made simpler
  – remote access to data archives and computers
  – distributed data archive access and correlation
  – remote rendering and visualization
  – remote sensor and instrument coupling

Page 9: Computing and Storage Resources at the San Diego Supercomputer Center

SDSC’s Mission

• To develop and use technology to advance science

• We do this through our use of
  – Hardware
  – Software
  – Expertise / Personnel
  in computation, data management and visualization

Page 10: Computing and Storage Resources at the San Diego Supercomputer Center

Production Software in many research areas

• www.npaci.edu/Applications
• Applications in a variety of research areas:
  – Biomolecular Structure
  – Molecular Mechanics/Dynamics
  – Quantum Chemistry
  – Eng. Structural Analysis
  – Finite Element Methods
  – Fluid Dynamics
  – Numerical Libraries
  – Linear Algebra
  – Differential Equations
  – Graphics/Scientific Visualization
  – Grid Computing
  – Data Mining and Analysis

Page 11: Computing and Storage Resources at the San Diego Supercomputer Center

Variety of scientific software installed and maintained

• BLAST
• CLUSTALW
• Biology WorkBench
• CNS
• NAMD
• Amber
• NWChem
• CHARMM
• Parallel MOPAC
• GAMESS
• Gaussian
• DataCutter
• We’re happy to install your favorite package

Page 12: Computing and Storage Resources at the San Diego Supercomputer Center

SDSC’s Mission

• To develop and use technology to advance science

• We do this through our use of
  – Hardware
  – Software
  – Expertise / Personnel
  in computation, data management and visualization

Page 13: Computing and Storage Resources at the San Diego Supercomputer Center

Expertise integrating technology and applications

• 400 personnel
• Experts in wide range of fields
  – Deploying the largest supercomputers and networks
  – Building clusters
  – Designing storage area networks
  – Portal design
  – Bioinformatics
  – Web Services
• Cross-disciplinary expertise
  – Ability to map applications onto hardware efficiently
  – Understanding of hardware and science

Page 14: Computing and Storage Resources at the San Diego Supercomputer Center

A Range of Personnel Resources

Expertise
• parallelizing/optimizing code
• portal-based access
• grid computing
• data mining
• web services
• visualization

• Peer-review process
  – www.paci.org
• Multi-year awards possible
• Database/Data collection hosting/persistent archiving

Page 15: Computing and Storage Resources at the San Diego Supercomputer Center

Visualization Services

Page 16: Computing and Storage Resources at the San Diego Supercomputer Center

Visualization Services
http://vis.sdsc.edu/

• Scalable Visualization Toolkit and Applications
• Visualization Service Grid
• Cancer Center Visualizations
• SAC Visualizations
• NPACI Visualization Software
• Scientific imaging and animation production
• Customized visualization solutions
• Gaming Grid for Research and Education
• Visualization Training
  – OpenDX workshops
  – Maya workshops and short courses

Page 17: Computing and Storage Resources at the San Diego Supercomputer Center

Visualization Services Grid

[Figure: components of the Visualization Services Grid. Labels: Workstation Alley, VGrid Gallery, SRB Archive, Grid Farm, VGrid Portal, SVT Grid Services.]

Page 18: Computing and Storage Resources at the San Diego Supercomputer Center

UCSD Cancer Center Visualizations

James R. Feramisco, Digital Imaging Resource Leader

Page 19: Computing and Storage Resources at the San Diego Supercomputer Center

Monuments and Dust: The Culture of Victorian London

Page 20: Computing and Storage Resources at the San Diego Supercomputer Center

Page 21: Computing and Storage Resources at the San Diego Supercomputer Center

Environmental Visualizations

Page 22: Computing and Storage Resources at the San Diego Supercomputer Center

Volume Visualization of the Orion Nebula

The San Diego Supercomputer Center and The American Museum of Natural History Hayden Planetarium

Hubble Space Telescope images of the Orion Nebula and the HST-10 proplyd.

Page 23: Computing and Storage Resources at the San Diego Supercomputer Center

Astronomical Visualization

Visualization of an Emission Nebula from 3 Terabytes of Simulation Data

Credits:
American Museum of Natural History
Dave Nadeau, SDSC
Erik Engquist, SDSC

Page 24: Computing and Storage Resources at the San Diego Supercomputer Center

Center for Visualization Prototypes

Contact [email protected] for details or go to http://cvp.sdsc.edu

Bringing physical prototyping technology into the visualization mainstream

Page 25: Computing and Storage Resources at the San Diego Supercomputer Center

DAKS – Data and Knowledge Systems

• DAKS creates data and knowledge cyber infrastructure for scalable, end-to-end knowledge discovery pipelines in data-intensive computing

• Integrated enabling technologies include:
  – data gathering and data grid tools
  – web services
  – massive storage
  – large-scale databases
  – data mining
  – knowledge integration
  – publishing in digital libraries
  – long-term preservation in persistent archives

Page 26: Computing and Storage Resources at the San Diego Supercomputer Center

Major Projects

• GEON: GEOsciences Network – Integration of multi-disciplinary Earth Science databases

• SEEK: Science Environment for Ecological Knowledge – Modeling frameworks, semantic integration, workflow systems for environmental modeling

• National Archives and Records Administration (NARA) – Persistent archives and electronic records management

• GeoGRID – Research on integrating geospatial information from multiple, heterogeneous sources, including studying the metadata necessary to describe the geospatial content & services, as well as accuracy-aware query processing techniques

• Keck Graduate Institute – Twin framework to analyze, model & design robust, complex networks using biological & computational principles

Page 27: Computing and Storage Resources at the San Diego Supercomputer Center

Major Projects

• Grid Benchmarks – Defining metrics to measure performance of Grid applications and architectures and to rate their functionality and efficiency

• I2T: Information Integration Testbed – Set of tools and technologies that are being developed to provide a testbed for information integration

• GriPhyN: Grid Physics Network – Develop & build production-scale data grids

• SciDAC/Scientific Data Management – Five-year initiative to develop scientific computing infrastructure for terascale computers to advance research programs in basic energy sciences, biological & environmental research, fusion energy sciences, and high-energy and nuclear physics

Page 28: Computing and Storage Resources at the San Diego Supercomputer Center

Major Projects

• Southern California Earthquake Data Center (SCEDC) – Primary archive of earthquake data for southern California

• Bridges – Integrated framework for health monitoring of highway bridges & civil infrastructure

• INGREIN – Integrated, Geo-Referenced Environmental Information Network

• National Virtual Observatory – Federation of over 100 terabytes of astronomical data from more than 50 collections

• NPACI Neuroscience – Infrastructure to support the study of brain structure

• SRB: Storage Resource Broker – Middleware providing a uniform API to access heterogeneous distributed storage resources
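The idea of a uniform API over heterogeneous storage can be illustrated with a small sketch. This is not the SRB interface: every class and method name below (`StorageBackend`, `DiskBackend`, `TapeBackend`, `Broker`, `register`, `put`, `get`) is hypothetical and the backends are in-memory stand-ins. The sketch only shows the general pattern, in which clients use logical names while a broker keeps a catalog of where each object physically lives.

```python
from abc import ABC, abstractmethod


class StorageBackend(ABC):
    """Common interface each storage system implements (hypothetical)."""

    @abstractmethod
    def write(self, physical_path, data):
        ...

    @abstractmethod
    def read(self, physical_path):
        ...


class DiskBackend(StorageBackend):
    """In-memory stand-in for a file system: random access by path."""

    def __init__(self):
        self._files = {}

    def write(self, physical_path, data):
        self._files[physical_path] = data

    def read(self, physical_path):
        return self._files[physical_path]


class TapeBackend(StorageBackend):
    """In-memory stand-in for a tape archive: append-only, read by scanning."""

    def __init__(self):
        self._records = []

    def write(self, physical_path, data):
        self._records.append((physical_path, data))

    def read(self, physical_path):
        # Scan from the newest record back to find the latest copy.
        for path, data in reversed(self._records):
            if path == physical_path:
                return data
        raise KeyError(physical_path)


class Broker:
    """Maps logical names to (resource, physical path) and hides the backend."""

    def __init__(self):
        self._resources = {}
        self._catalog = {}

    def register(self, name, backend):
        self._resources[name] = backend

    def put(self, logical_name, data, resource):
        physical_path = f"{resource}:{logical_name}"
        self._resources[resource].write(physical_path, data)
        self._catalog[logical_name] = (resource, physical_path)

    def get(self, logical_name):
        resource, physical_path = self._catalog[logical_name]
        return self._resources[resource].read(physical_path)


broker = Broker()
broker.register("disk", DiskBackend())
broker.register("tape", TapeBackend())
broker.put("/demo/run1.dat", b"simulation output", resource="disk")
broker.put("/demo/archive.dat", b"archived copy", resource="tape")

# The caller uses the same get() call regardless of where the data lives.
print(broker.get("/demo/run1.dat"))
print(broker.get("/demo/archive.dat"))
```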

Page 29: Computing and Storage Resources at the San Diego Supercomputer Center

Major Projects – DAKS Involvement

• TeraGrid – Multi-year effort to build and deploy the world's largest, most comprehensive distributed infrastructure for open scientific research

• BIRN: Biomedical Informatics Research Network – Standardizing imaging protocols, developing database schemas around their data, defining processing pipelines for upload & analysis of data, and assembling large imaging caches

• OptIPuter – Cyber “infostructure” to support data-intensive scientific research and collaboration

• LTER: Long-Term Ecological Research Network – Investigation of long-term ecological phenomena in U.S.

• WIISARD: Wireless Internet Information System for Medical Response in Disasters – Sophisticated wireless technology to coordinate and enhance care of mass casualties in a terrorist attack or natural disaster

Page 30: Computing and Storage Resources at the San Diego Supercomputer Center

Major Projects – DAKS Involvement

• BorderSafe – Infrastructure for sharing and evaluation of information between local law enforcement and the Department of Homeland Security

• Gene Regulatory Networks – Integrating data from disparate sources to create an interaction graph of gene and protein regulation. Performing graph queries to reveal interesting unknown interactions

• Cell-Centered Database – On-line resource for high resolution cell-centered data

• ROADNet: Real-time Observatories, Applications, and Data management Network – Integrated information management system & wireless networks to deliver seismic, oceanographic, ecological, hydrological, and physical data to end users in real-time
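To make the Gene Regulatory Networks item more concrete, here is a minimal sketch of merging edge lists from two sources into one interaction graph and running a simple graph query on it. The node names and edges are made-up placeholders, not real biological data, and the function names `build_graph` and `indirect_targets` are hypothetical. The query is a plain breadth-first search and is not meant to represent the project's actual query engine.

```python
from collections import defaultdict, deque

# Hypothetical (regulator, target) edges from two separate sources.
SOURCE_A = [("geneA", "proteinB"), ("proteinB", "geneC")]
SOURCE_B = [("geneC", "proteinD"), ("geneA", "proteinE")]


def build_graph(*sources):
    """Merge edge lists from several sources into one directed graph."""
    graph = defaultdict(set)
    for edges in sources:
        for regulator, target in edges:
            graph[regulator].add(target)
    return graph


def indirect_targets(graph, start):
    """Return nodes reachable from start in two or more hops, with hop counts."""
    distance = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbor in graph.get(node, ()):
            if neighbor not in distance:
                distance[neighbor] = distance[node] + 1
                queue.append(neighbor)
    return {node: hops for node, hops in distance.items() if hops >= 2}


graph = build_graph(SOURCE_A, SOURCE_B)
print(indirect_targets(graph, "geneA"))
```

In this toy data, `proteinD` is linked to `geneA` only through a path that crosses both sources, which is the kind of relationship that becomes visible only after integration.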

Page 31: Computing and Storage Resources at the San Diego Supercomputer Center

BIRN

NIH NCRR award to …

• Enhance access to 4T – 8T MRI & other imaging modalities for neuroscience studies

• Enhance telecommunications & telemedicine efforts for GCRC sites, co-located with NCRR imaging centers

• Develop bioinformatics tools & data fusion for PET, CAT, EEG, MRI

• Extend to all NCRR Resource sites & expand model to other areas

Partners
• National Institutes of Health (NIH)
• National Science Foundation (NSF)
• UCSD
• SDSC

Page 32: Computing and Storage Resources at the San Diego Supercomputer Center

BorderSafe

Enabling Intelligent, Policy-based Information Sharing Technology

Framework for Automated Sharing and Analysis
• Service Oriented Architecture for integrating new and legacy data and analytical resources
• Explore and implement strong auditing using evolving Web Service security standards

Crime Analysis Tool Evaluation
• Evaluation of current law enforcement analysis and data mining tools
• Exploration of intelligent data analysis tool extensions

Policy-based Information Sharing Research
• Leverage policy-based resource discovery and sharing research done in Grid Services and apply them to Homeland Security Domain

Inter-organization Law Enforcement Community Building
• Discovering and addressing data integration and analysis needs by working directly with domain experts

Page 33: Computing and Storage Resources at the San Diego Supercomputer Center

GEON

Enabling Integrated Views of the Earth System by

Finding Answers to Vital Questions
Estimating flood and landslide potential, groundwater problems, volcanic activity, and soil quality, all with the best data available

Accelerating science
Through the GEON Portal, researchers will be able to discover relationships of the type that led to plate tectonics in days instead of years

Democratizing Grid Technologies
Building cyberinfrastructure for a wide range of users, from scientists and educators to policymakers and engineers

Building Reusable Cyberinfrastructure
A model for the Earth Sciences and beyond

Page 34: Computing and Storage Resources at the San Diego Supercomputer Center

3D Earthquake Modeling in GEON

Page 35: Computing and Storage Resources at the San Diego Supercomputer Center

Advanced Database Projects

Research & system development
• Infrastructure for data mining, data warehousing, and query processing
• Grid data services to make data available to researchers via traditional methods and APIs which allow simple storage and retrieval of data regardless of type, size, and physical

[Figure: data federation architecture. Labels include the data sources AfCS, PDB, BIRN, JCSG, PKR, EOL, ToL, GenBank, and Blast; the storage systems DB2, Oracle, MSSQL, MySQL, BLAST, flat files, and XML files; Grid Data Service; Federation; FED_SDSC with vendor API and SQL access; and Source Specific Ontology, Master Ontology, and Federation Ontology.]

Federation issues
• Performance, reliability, authentication
• Data location
• Network connections between users & data
• Need for replication
• Google-like tool to search grid data services for data & access content

Page 36: Computing and Storage Resources at the San Diego Supercomputer Center

Advanced Query Processing

Modeling and integrating biological pathways

Volume algebra for Neuroscience

Research & system development
• Data Modeling services for scientific applications – volumetric data, multimedia, …

• Ontology construction and searching for large-scale problems – query evaluation techniques

• Techniques for management of graph-structured information – representation and query language development, browsing tools

• Data Warehousing for Interaction Networks – graph views over relational data

• Information Integration techniques over multiple data models (spatial, relational, graph…) with Ontologies

• Simulation of process networks using Hybrid Functional Petri Nets

Co-browsing ontologies, atlases, data

Page 37: Computing and Storage Resources at the San Diego Supercomputer Center

Distributed Data Management

• Data collecting
  – Sensor systems, object ring buffers and portals
• Data organization
  – Collections, manage data context
• Data sharing
  – Data grids, manage heterogeneity
• Data publication
  – Digital libraries, support discovery
• Data preservation
  – Persistent archives, manage technology evolution
• Data analysis
  – Processing pipelines, manage knowledge extraction

Storage Resource Broker Collections at SDSC

Project            Size (GB)       Files    Users  Agency
Digsky             42,786.00   6,076,982       69  NSF/ITR
NPACI               8,822.00   2,995,432      377  NSF/PACI
Hayden              7,835.00      60,001      168  AMNH
SLAC                2,108.00     294,149       43  NSF/NPACI
LDAS/SALK             824.00      13,016       66  NSF
TeraGrid           10,603.00     433,938    2,229  NSF
BIRN                  389.00   1,084,749      167  NIH (NCRR)
Digital Library
DigEmbryo             720.00      45,365       23  NLM
HyperLter             215.00       5,097       28  NSF/NPACI
Portal              1,244.00      34,094      352  NSF/NPACI
AfCS                  107.00      21,295       21  NIH
NSDL/SIO Exp          603.00      87,191       26  NSF/NSDL
TRA                    92.00       2,387       26  NSF/NPACI
SCEC               12,274.00   1,721,241       43  NSF/ITR
UCSDLib             1,085.00     138,421       29  UCSD
Persistent Archive
NARA/Collection        67.00      82,031       56  NARA
NSDL/CI               465.00   2,948,903      114  NSF/NSDL
TOTAL                  90 TB  16 million     3837
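The totals row of the collections table can be checked by summing the per-project rows. The sketch below copies the size, file, and user figures from the table; treating the size column as gigabytes is an assumption, made because the rows then add up to roughly the 90 TB shown in the total.

```python
# (project, size, files, users) copied from the table above.
# ASSUMPTION: the size column is in gigabytes.
COLLECTIONS = [
    ("Digsky", 42786, 6076982, 69),
    ("NPACI", 8822, 2995432, 377),
    ("Hayden", 7835, 60001, 168),
    ("SLAC", 2108, 294149, 43),
    ("LDAS/SALK", 824, 13016, 66),
    ("TeraGrid", 10603, 433938, 2229),
    ("BIRN", 389, 1084749, 167),
    ("DigEmbryo", 720, 45365, 23),
    ("HyperLter", 215, 5097, 28),
    ("Portal", 1244, 34094, 352),
    ("AfCS", 107, 21295, 21),
    ("NSDL/SIO Exp", 603, 87191, 26),
    ("TRA", 92, 2387, 26),
    ("SCEC", 12274, 1721241, 43),
    ("UCSDLib", 1085, 138421, 29),
    ("NARA/Collection", 67, 82031, 56),
    ("NSDL/CI", 465, 2948903, 114),
]

total_size_gb = sum(size for _, size, _, _ in COLLECTIONS)
total_files = sum(files for _, _, files, _ in COLLECTIONS)
total_users = sum(users for _, _, _, users in COLLECTIONS)

print(total_size_gb)  # 90239, about 90 TB if the unit is GB
print(total_files)    # 16044292, about 16 million
print(total_users)    # 3837
```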

Page 38: Computing and Storage Resources at the San Diego Supercomputer Center

Knowledge and Information Discovery

Research & system development
• Data mining and machine learning
• Services-based knowledge discovery infrastructure
• Analysis of complex data – real-time streams, sensor networks, remote sensing imagery, microarray data, large text collections

• Decision support systems – environmental monitoring, law enforcement, forensics, and homeland security

• Support for knowledge discovery projects at SDSC and beyond

In Environment (TeraBridge, LTER Network, PRAGMA)

In Homeland Defense (Border Safe)

Page 39: Computing and Storage Resources at the San Diego Supercomputer Center

Strategic Applications Collaborations

• SDSC staff paired with domain scientists for 3-12 month projects

• Past successes include
  – Biomedical imaging (U Michigan)
  – Brain mapping/computational anatomy (Johns Hopkins)
  – Computational modeling of the cochlea (U Michigan)
  – Molecular dynamics in large biomolecular systems (UCSF, TSRI)
  – SEQUEST (U Washington)
  – SCWRL (Fox Chase Cancer Center)
  – Protein Structure prediction (UCSD)
  – Protein fold recognition and classification (UCSD)

Page 40: Computing and Storage Resources at the San Diego Supercomputer Center

Academic Associate Program for UC Campuses

• Resources dedicated to UC campuses to support research endeavors

• Program provides University of California (UC) researchers access to the vast array of computational resources at the San Diego Supercomputer Center (SDSC)

• Any qualified UC researcher can request supercomputing time, free of charge, through the AAP administrator for his or her respective campus

Page 41: Computing and Storage Resources at the San Diego Supercomputer Center

Academic Associate Program for UC Campuses

• High-Performance Computing
• Storage
• Access to specialized Software, Databases and Archives
• Technical Support
• Training/Documentation
• Early access to new SDSC systems

Page 42: Computing and Storage Resources at the San Diego Supercomputer Center

Academic Associate Program for UC Campuses

Computing

• DataStar: 10 teraflops IBM Power4-based system with total memory of 4.2 terabytes

• TeraGrid: 4.3 teraflops IA-64 system

Storage
• Petabyte-scale archival storage system
• SAN disk array with a total capacity of more than 500 terabytes
• http://datacentral.sdsc.edu/

Page 43: Computing and Storage Resources at the San Diego Supercomputer Center

Academic Associate Program for UC Campuses

Access to specialized software/Databases/Archives
• A variety of powerful software applications covering a range of disciplines including
  – Medicine, bioscience, physics, astronomy, chemistry, etc.

• Large-scale data libraries, such as the
  – Protein Data Bank (PDB)
  – National Virtual Observatory (NVO)
  – 2-Micron All Sky Survey (2MASS)
  – User-friendly software for accessing large data collections

• NPACKage – mature middleware and applications for grid computing, communication and archiving

• A variety of data analysis, mining and visualization tools

Page 44: Computing and Storage Resources at the San Diego Supercomputer Center

Academic Associate Program for UC Campuses

Technical support
• From SDSC Scientific Computing Services, including consulting for parallel programming, optimization, porting, etc.

Training
• Quarterly workshops on parallel computing
• 1-2 day on-site workshops
• Priority seating at SDSC workshops
• Web-based training
• Special week-long summer institute

– Focused work on participant’s projects

– Student expenses paid

– Data-intensive and grid applications focus this year

Early access to new SDSC systems (DataStar)

Page 45: Computing and Storage Resources at the San Diego Supercomputer Center

Academic Associate Program for UC Campuses

• How to apply?

• Any UC researcher can request supercomputing time online

http://www.sdsc.edu/aap.html

• Campus representatives are available at each UC campus to help researchers with any questions or problems they may have regarding the Academic Associates program

Page 46: Computing and Storage Resources at the San Diego Supercomputer Center

Some collaboration ideas

• Development of new algorithms
• Optimization and parallelization of code
• Development of portal interfaces to applications and other resources
• Assistance writing successful proposals for SDSC resources
• Assistance with cluster setup and maintenance
• Hardware acquisitions, benchmarking and performance analysis
• Data services: databases, data collections and data mining

Page 47: Computing and Storage Resources at the San Diego Supercomputer Center

And still more

• Visualization assistance
• Automation of data collection from wet labs
  – expertise in sensor/instrument data collection, CAL IT2
• Hosting of visiting scientists and students
• Joint proposals
  – NSF’s ITR programs
  – TeraGrid participation
  – NIH initiatives

Page 48: Computing and Storage Resources at the San Diego Supercomputer Center

We’re interested in working together

• Q&A
• [email protected]
• http://www.sdsc.edu
• UC Academic Associates program
  – http://www.sdsc.edu/aap.html