parampadma-1

37
PARAM Padma SuperComputer Vishal Bakshi Aditya Polumetla

Transcript of parampadma-1

Page 1: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 1/37

Page 2: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 2/37

Topics Introduction Specifications

Machine Architecture Microker nel Architecture PARAMNet II System Software

Applications on PARAM

Page 3: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 3/37

PARAM

PARAM ± PARAllel Machine Developed by Center for Development of Advanced

Computing (C-DAC) India

PARAM Padma is the latest in the line of PARAMseries after PARAM 8000, PARAM 9000 andPARAM 10000

Introduced in 2003 Its a next generation high performance scalable

computing cluster  PARAM Padma's theoretical peak performance of 

complete configuration is 1.13 TFlops

Page 4: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 4/37

PARAM Padma

Page 5: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 5/37

PARAM Padma Configuration

Page 6: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 6/37

Specifications

Compute Nodes:

� 248 Power 4 1 Ghz processors

� 54 4-way SMP and 1 32-way SMP (SymmetricMultiProcessing) configuration

� OS ± AIX / LINUX 

AIX (Advanced Interactive eXecutive) is a

IBM's UNIX operating system� Aggregate Memory of 0.5 TeraBytes

Page 7: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 7/37

Specifications continued

Network:

� PARAMNet II (primary)

� GigaByte Ethernet (backup)

File Servers:

� 24 UltraSparc-III 900MHz processors

� 6 4-way SMP configuration

� OS ± Solaris

� File System ± QFS (Quantitative File System)

Page 8: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 8/37

Specifications continued

External Storage:

Storage array 5 TeraBytes� Tape Library 12 TeraBytes

System Software:

� C -DAC' s HP CC Suite of System Software

� It provides a high flexible software environment 

for the cluster 

Page 9: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 9/37

PARAM Padma System Layout

Page 10: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 10/37

PARAM Machine Ar chitectur e

Page 11: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 11/37

PARAM Machine Ar chitectur econtinued

� Cluster Personality

- Data Link Provider Interface (DLPI) driver 

allows mapping of common network protocols tothe underlying network MPP (Massively Parallel Processor) Personality- Architecture allows microkernel to be loaded on

compute nodes- implements custom designed Concurrent RuntimeEnvironment (COR E)

- Standard message passing interface

1. Parallel Virtual Machine (PVM)2. Messa e Passin Interface MPI

Page 12: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 12/37

PARAS Micr oker nel 

Ar chitectur e� Message for high MPP

� Supports multiple tasks with a paged virtual

memory space

� Multiple threads of execution within each task 

� Message based interprocess communication

� Communication between tasks by message passing

� Location independent interprocess communication

supported by port abstraction

Page 13: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 13/37

PARAS Micr oker nel 

Ar chitectur e continued

� Services:

- An executive which schedules priority based

threads

- Location transparent interprocess communication

mechanism

- Simple virtual memory model

- Low kernel hardware supervisor 

Page 14: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 14/37

PARAS Microkernel ± Block 

Diagram

Page 15: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 15/37

Page 16: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 16/37

Micr oker nel Abstractions

� Five basic abstractions supported

- Tasks

- Threads

- Ports

- Messages

- Regions

Page 17: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 17/37

Operating Envir onment

Page 18: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 18/37

Operating Envir onment continued

� Components of Operating Environment

- Partition Manager: global resource manager and

management of that partition's resources- Process Server: Spawning tasks on a node and

provides remote system call interface

- MicrokernelF

ile System Server: to serve UNIX filesystem calls

- Name Server: provides port naming services to user 

application

Page 19: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 19/37

PARAMNet II

� Developed by C-DAC

� System Area Network (SAN) switch to

interconnect the nodes of the system

� Provides a low latency and high bandwidth

interconnect

� data rate @ 2.5 Gigabits/sec

� message latency of 10 µ sec

� switch latency of 0.5 µ sec

Page 20: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 20/37

PARAMNet II continued

Major components are -

1. 8 / 16-port SAN switch

2. Network Interface Card (NIC) with CCP-III(C-DAC Co-Processor) communicationcoprocessor 

3. Application Programming Interface (C-VIPL)

C-DAC's Virtual Interface Provider Library

Page 21: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 21/37

PARAMNet II ± SAN Switch

� Non-blocking crossbar based architecture

� 2.5 Gbps full duplex bandwidth per port

� Distributed schedulers allow individual routing

tables per port, allowing for any network topology

� More than 8 / 16 ports supported using multi-level

switching

� Ethernet based management interface allows for 

remote access and control of multiple switches

Page 22: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 22/37

PARAMNet II ± SAN Switch 

continued

�Virtual channel based routing with 1 KB buffering

per port each at input and output

� Interval routing scheme based on 32-bit header 

(16-bit routing information)

� Group adaptive routing based on LRU algorithm to

ensure uniform bandwidth distribution in a group

� Point-to-point flow control with pause and resume

token

Page 23: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 23/37

PARAMNet II - NIC

� Provides interface to SANSW8 (8 port ) andSANSW16 (16 port) PARAMNet II switch

� Supports for connection oriented and

connectionless protocols

� Can be configured for other protocols

� Capable of performing I/O from paged virtual

memory

Page 24: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 24/37

PARAMNet II ± NIC continued

�Support for up to 1024 connections and up to 1024

completion queues

Support for different page sizes

� Packetization and reassembly done in hardware

� Error detection and recovery done in hardware

� Latency of 10 µ sec

Page 25: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 25/37

PARAMNet II - C-VIPL

Page 26: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 26/37

PARAMNet II ± C-VIPL continued

� C-VIPL is an application programming interface

for PARAMNet II

Major feature ± Thread safe implementation

� Supports AIX, Linux, Solaris and Windows OS

Page 27: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 27/37

System Level Block diagram for 

PARAMNet II

Page 28: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 28/37

System Level Block diagram for 

PARAMNet II continued

� PARAMNet II network comprises of N hostsconnected in non-blocking fat tree topology

� For more than 8 / 16 hosts multiple SANSW8 /

SANSW16 are required

� As switch latency is low multiple levels of switches

have less latency

Page 29: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 29/37

PARAM Padma and PARAMNet II

� PARAM Padma has 12 PARAMNet II switches connected

to two-level configuration to form 64 node CLOS network 

CLOS Network: multi-stage network topology thatprovides alternate paths between input and output making

it possible to eliminate the blocking that occurs in other 

networks

� Latency associated with packet routing are small for a non

-blocking topology

Page 30: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 30/37

HPCC Sof twar e

HPCC software on the PARAM Padma provides a

high performance flexible software environment

Page 31: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 31/37

HPCC Sof twar e continued

� HPCC Software consists of 

-program development tools

- system management tools

- software engineering tools

� Provides low overhead for communication

� Optimized Message Passing Interface (MPI)

Parallel file system with MPI-IO interface to

enable applications to scale on large cluster 

Page 32: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 32/37

HPCC Sof twar e continued

� C-FPS is a client-server and user level parallel file

system that addresses high I/O throughput

requirements� The Fortran programming environment consists of 

- Fortran 90 compiler,

- parallelizing compiler for Fortran 77/90 for SMP

- integrated development environment and Fortran

77 to Fortran 90 converter 

Page 33: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 33/37

HPCC Sof twar e continued

�The DIViA debugging environment comprises of a

coherent set of tools that help programmers in

debugging parallel programs� PARMON the system management tool, enables

the administrator to monitor activities and resource

utilization of various cluster components

� RMS resource management tool enables users to

optimally schedule jobs across the cluster nodes

Page 34: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 34/37

Storage Ar chitectur e

� Storage is provided by Storage array and Tape

library

Storage Area Network 

� PARAM Padma has a network centric storage

architecture

� Storage capacity of 5 TB scalable to 22 TB

� Use of Fibre Channel Arbitration Loop (FC-AL)

technology to interconnect storage subsystems

Page 35: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 35/37

Applications on PARAM Padma

� Bioinformatics

- stimulations of large bio molecules

- analysis of large databases for studying functions

of genes

� Computational Structural Mechanics

- fracture analysis

- stress analysis

- visualization for structural and fluid mechanics

- study of thin walled structures

Page 36: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 36/37

Applications on PARAM Padmacontt

� Seismic Data Processing

- for oil and natural gas exploration

Computational Chemistry- calculation of electronic structure and properties of 

molecules

- molecular dynamics simulation� Computational Fluid Dynamics

- simulation of external and internal flows

- simulation of hypersonic flows

Page 37: parampadma-1

8/7/2019 parampadma-1

http://slidepdf.com/reader/full/parampadma-1 37/37

Refer ences

1. http://cdac.in/html/parampma.asp

2.A Microkernel Based Operating System for PARAM 9000,

Mohan Ram, N et al, CDAC 1995 

3. PARAM Padma ± A Teraflops Computing System And 

High Performance Computing in India, Purohit, S.K., CDAC,

2003

4.Current State and Future Trends in High PerformanceComputing & Communications (HPCC) Research in India,

Sinha, P.K. et al, 10 th IEEE Internaltional Workshop on

FTDCS 2004.

5.Overview of recent supercomputers, Aad J. van der