ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu...

28
Copyright 2010 FUJITSU LIMITED ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010 Takashi Toyoshima Next Generation Technical Computing Unit Fujitsu Limited

Transcript of ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu...

Page 1: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Copyright 2010 FUJITSU LIMITED

ICC: An Interconnect Controllerfor the Tofu Interconnect Architecture

August 24, 2010

Takashi ToyoshimaNext Generation Technical Computing Unit

Fujitsu Limited

Page 2: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Background Requirements for Supercomputing Systems Low latency

• Communication latency limits the scalability of applications High bandwidth

• Increasing calculation FLOPS requires higher network bandwidth be balanced with FLOPS

RAS – Reliability, Availability and Serviceability• The risk of hardware faults in large systems increases along with the

increased number of nodes

Copyright 2010 FUJITSU LIMITED1

Page 3: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

6D Mesh/Torus Interconnect Architecture* Scalability Fault-tolerance

LSI Features Ten network links Four communication engines

Fujitsu’s New Interconnect Architecture

Copyright 2010 FUJITSU LIMITED

3D Torus

CPUTofu UnitLinks x4

3D TorusLinks x6

Router EngineComm.EngineComm.EngineComm.EngineComm.Engine

ICC

Tofu Unit

(*) “Tofu: A 6D Mesh/Torus Interconnect for Exascale Computers”, IEEE Computer, vol.42, no.11,Yuichiro Ajima, Shinji Sumimoto, Toshiyuki Shimizu

2

Page 4: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Copyright 2010 FUJITSU LIMITED

Implementation

• Implementation• Features

• Overview• Interface features for latency and throughput• Network features for network utilization

• Conclusion

3

Page 5: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Specifications Fujitsu’s 65nm CMOS Technology Die size

• 18.2mm×18.1mm Transistors

• 48M gates for logic• 12M-bit SRAM cells

I/O• 5GB/s Ports×16

• 6.25Gb/s×8 links / portMisc.

• ASIC design flow• 312.5MHz/625.0MHz

Copyright 2010 FUJITSU LIMITED4

Page 6: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Node Domain CPU bus bridge

• 20GB/s in each direction Communication engines×4

• 5GB/s in each direction• Barrier engine (Comm.#0 only)

PCIe 2.0 root complex×2• Isolated power domain

Router Domain Crossbar

• 14 ports 5GB/s in each direction Link ports×10

• 5GB/s in each direction

Floor Plan

Copyright 2010 FUJITSU LIMITED

Node Domain

Router Domain

5

Page 7: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Fault Domain Isolation Router continues to work on node faults

Error Protection Radiation-hardened FFs ECC protection

• RAM/Data path Parity error detection

• Control path CRC protection

• Data link/Transaction

RAS Features

Copyright 2010 FUJITSU LIMITED

Node Domain

Router Domain

6

Page 8: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Copyright 2010 FUJITSU LIMITED

FeaturesOverview

• Implementation• Features

• Overview• Interface features for latency and throughput• Network features for network utilization

• Conclusion

7

Page 9: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

ICC Features

Copyright 2010 FUJITSU LIMITED

Latency Throughput RAS

SystemMany NeighborsHop Reduction3D Torus View

Many NeighborsTrunking

Detour PathSubnet Partitioning

NetworkInterface

RDMA- Quick Start- Piggyback- Strong OrderStream OffloadBarrier Engine

GAP ControlMulti-Interfaces

- User Thread x2- Kernel Thread

Radiation-hardened FFECCParityCRC

RouterEngine

Cut-throughGrant PredictionStraight Bypath

Straight BypathNew VC Scheduling

Node Error IsolationRadiation-hardened FFECCParityCRC

: Unique Features Today’s topics are highlighted in red8

Page 10: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Copyright 2010 FUJITSU LIMITED

FeaturesInterface features

• Implementation• Features

• Overview• Interface features for latency and throughput• Network features for network utilization

• Conclusion

9

Page 11: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

RDMA: Remote Direct Memory Access Features

Low Latency and High Throughput Command supply throughput and latency Out-of-ordered I/O memory bus

Copyright 2010 FUJITSU LIMITED

Operation READ (Get) / WRITE (Put)Length ~16MB

MTU 256B~1920BVirtual Address Support (64K set)

ICCCPU

Mem.Network

ICC CPU

Mem.

10

Page 12: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

RDMA Optimizations Sender Techniques Direct descriptor

• Quick command supply Piggyback

• Command embedded communication payload• Short message sending without any DMA

Receiver Techniques Out-of-ordered I/O memory bus

• High throughput bus transaction Strong ordered store

• In order completion of DMA transactions for buffer polling

Copyright 2010 FUJITSU LIMITED

Throughput LatencyPIO ✓Good

DMA ✓GoodCommand Supply Performance

11

Page 13: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

ICCCPU

Mem.

ICCCPU

Mem.

Direct Descriptor Feature Normal Command Supply DMA fetching produces high throughput command supply

Direct Descriptor and DMA Command Supply DMA latency hiding by Block Store with first two commands

Copyright 2010 FUJITSU LIMITED

Network

ICC CPU

Mem.

Network

ICC CPU

Mem.

Cmd 1Cmd 2Cmd 3Cmd 4

Cmd 5Cmd 6Cmd 7Cmd 8

Cmd 3Cmd 4Cmd 5Cmd 6

Cmd 1Cmd 2

12

Page 14: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Normal Payload Supply User messages (payload) should be fetched by DMA

Piggyback Payload Supply User messages (payload) are embedded in commands

ICCCPU

Mem.

ICCCPU

Mem.

Piggyback Feature

Copyright 2010 FUJITSU LIMITED

Network

ICC CPU

Mem.

Network

ICC CPU

Mem.(4) Packet

(2) PacketPayloadCmd 1

13

Page 15: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

ICC CPU

Mem.

Completion Notification Polling ICC notifies after the completion of O-o-O DMA Stores

Buffer Polling with Strong Ordered IndicationMemory controller guarantees specified DMA ordering

ICCCPU

Mem.

ICCCPU

Mem.

Out-of-Ordered I/O Memory Bus

Copyright 2010 FUJITSU LIMITED

Network

ICC CPU

Mem.

Network(1) Packets

Mem.(1) Packets

14

Page 16: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Hardware Measured Results Piggyback achieves low latency in short message Strong ordered packet makes buffer polling possible

RDMA Performance

Copyright 2010 FUJITSU LIMITED

0.00

0.50

1.00

1.50

2.00

2.50

3.00

3.50

8 32 128 256 512 768 1024 1280 1792 1920

Rel

ativ

e La

tenc

y

Message Size [B]

Buffer Polling Notification Polling

Piggyback (dashed line)

48% cut

Strong Ordered DMA

0.37 pointin avg.

15

Page 17: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Copyright 2010 FUJITSU LIMITED

FeaturesNetwork Features

• Implementation• Features

• Overview• Interface features for latency and throughput• Network features for network utilization

• Conclusion

16

Page 18: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Network Utilization ProblemsGlobal Unfairness of Throughput Arbitrations with local fairness cause global unfairness

Non-uniform Application Traffic in Time and Space Bandwidth of idle links needs to be used effectively

Copyright 2010 FUJITSU LIMITED

ICCNode #0

ICCNode #1

ICCNode #2

ICC ICC ICC

ICC

ICC

ICCIdle

100% 50% 25%

Phase A Phase B

Idle

Bottleneck17

Page 19: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

#0 #1 #2100% 0% 0%

#0 #1 #250% 50% 0%

#0 #1 #225% 25% 50%

Global Unfairness of Throughput Local Fairness of arbitration cause global unfairness Arbiters on every junction treat all incoming traffic fairly

Copyright 2010 FUJITSU LIMITED

ICCNode #0

ICCNode #1

ICCNode #2

#0 #1 #233% 33% 33%

ICCNode #2

Ideal Throughput

halved halved

18

Page 20: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Software Specify the Inter-Packet GAP Parameter

Communication engine works to control injection rate• Insert temporal gaps between transmitting packets• Interval can be specified by the user

Injection Rate Control

Copyright 2010 FUJITSU LIMITED

GAP GAPTransport

LayerPackets

TofuDescriptorCommand

Packet Packet Packet

RDMA Message

#0 #1 #233% 0% 0%

#0 #1 #233% 33% 0%

#0 #1 #233% 33% 33%

ICCNode #0

ICCNode #1

ICCNode #2

GAP GAP GAP

19

Page 21: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Hardware Measured Results Software can specify fine grained GAP parameters: 0-255 GAP works to control throughput effectively

Throughput Performance

Copyright 2010 FUJITSU LIMITED20

Page 22: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Non-uniform Application Traffic Non-uniform Traffic in Space

Non-uniform Traffic in Time

Copyright 2010 FUJITSU LIMITED

ICC ICC

ICC

ICC

ICC

ICC

Idle

IdleBusy

Busy

ICC

ICC

ICC

ICCBottleneck

Idle

Phase A Phase B21

Page 23: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Trunking Independent Idle Paths Nodes have four neighborhoods in Tofu Unit

• Independent links and 3D-Torus networks Each node has four communication engines

• Up to×4 throughput

TofuUnit

TofuUnit

Independent

3D Torus Networks

Trunking Communication

Copyright 2010 FUJITSU LIMITED22

Page 24: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Trunking Performance Results Communication engines achieve good performance Trunking mechanisms scale up to four engines

Copyright 2010 FUJITSU LIMITED

0.0 1.0 2.0 3.0 4.0 5.0

Thro

ughp

ut [G

B/s

]

MTU [B]

Communication Engine

WRITE READ

0.0

5.0

10.0

15.0

20.0

x1 x2 x3 x4

Thro

ughp

ut [G

B/s

]

1-4 Engines Trunking

23

Page 25: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Copyright 2010 FUJITSU LIMITED

Conclusion

• Implementation• Features

• Overview• Interface features for latency and throughput• Network features for network utilization

• Conclusion

24

Page 26: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Concluding Remarks Tofu: A 6D mesh/torus interconnect architecture Interconnect for Fujitsu’s Peta/Exascale computing systems Low latency, High bandwidth and RAS

Features High-throughput and low-latency RDMA

• Direct Descriptor and Piggyback• Out of Ordered I/O Memory Bus

Network features for network utilization• Network injection rate control• Trunking up to four times throughput

Copyright 2010 FUJITSU LIMITED25

Page 27: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Thank You for ListeningThanks to…

Next Generation Technical Computing UnitAiichiro Inoue, Yuji Oinaga

Tofu Architecture TeamToshiyuki Shimizu, Yuichiro Ajima, Tomohiro Inoue, Shinya Hiramoto

ICC Design TeamTakeo Asakawa, Akira Asato, Takumi Maruyama,

Koichiro Takayama, Koichi Yoshimi, Osamu Moriyama,Masao Yoshikawa, Shinichi Iwasaki, Takekazu Tabata,

Yoshiro Ikeda, Yuzo Takagi,Yoshihito Matsushita, Toshihiko Kodama, Satoshi Nakagawa,

Masato Inokai, Shigekatsu Sagi, Ikuto Hosokawa,Yaroku Sugiyama, Takahide Yoshikawa

Copyright 2010 FUJITSU LIMITED26

Page 28: ICC: An Interconnect Controller - Fujitsu · ICC: An Interconnect Controller for the Tofu Interconnect Architecture August 24, 2010. Takashi Toyoshima. Next Generation Technical Computing

Copyright 2010 FUJITSU LIMITED27