Montpellier, France, April 10, 2008 - - TU Kaiserslautern · [email protected] 24 April 2008...

28
[email protected] 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 1 Reiner Hartenstein, TU Kaiserslautern, Germany http://hartenstein.de Reconfigurable Computing to solve the CS Education Dilemma (keynote address) Reiner Hartenstein TU Kaiserslautern Montpellier, France, April 10, 2008 RC education 2008 © 2008, [email protected] http://hartenstein.de TU Kaiserslautern 2 Demonstrating the intensive Impact (FPGA-based speed up and power saving) DES breaking [T. Elghazawi et al.: IEEE COMPUTER, Febr. 2008] Platform (compared to Beowulf cluster) Speed-up factor Power saving factor SDC-6 6757 856 Cray XD-1 12162 608 SGI Atix 4700 with RC 100 RASC 28514 3439 encryption on von Neumann getting unaffordable

Transcript of Montpellier, France, April 10, 2008 - - TU Kaiserslautern · [email protected] 24 April 2008...

Page 1: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 1

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

Reconfigurable Computing to solve the CS Education Dilemma

(keynote address)

Reiner Hartenstein

TU Kaiserslautern

Montpellier, France, April 10, 2008

RC education 2008

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

2

Demonstrating the intensive Impact

(FPGA-based speed up and power saving)

DES breaking [T. Elghazawi et al.: IEEE COMPUTER, Febr. 2008]

Platform (compared to Beowulf cluster)

Speed-up factor

Power saving factor

SDC-6 6757 856

Cray XD-1 12162 608

SGI Atix 4700 with RC 100 RASC 28514 3439

encryption on von Neumann getting unaffordable

Page 2: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 2

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

3

The clash of Paradigms (a simplified view)

Computation in time (procedural-only von Neumann mind set)

Computing in space (structural mind set)

Reconfigurable Computing model: supersystolic mind set -

data-stream-based

… you can teach programming to a hardware guy

… you can teach programming to a hardware guy

you can’t teach hardware to a programmer, although …

you can’t teach hardware to a programmer, although …

This is the fault of our CS curriculaThis is the fault of our CS curricula

traditional CS model : Software-only mindset -instruction-stream-based

sequential mindset parallel pipelining mindset

procedural structural

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

4

Tear down this Wall !

… to achieve a far-ranging paradigm shift, away from the monopoly-like dominance of the instruction-stream-based vN mind set,

… to clear away the wreckage of CS foundations having survived the mainfrage age, and, having collapsedunder the weight of its own absurdity.

over to a dichotomic dual paradigm basic model including the Reconfigurable Computing mind set.

Page 3: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 3

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

5

>> Outline <<

• Inefficient Microprocessors

• The von Neumann Syndrome

• The twin paradigm approach is needed

• The CS education dilemma

• Conclusions

http://www.uni-kl.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

6

Microprocessor Industry (1)

We are o.k. ? You are o.k. ?

go multi-threading? - transactional memory? – MPI? … ?

“… we have a tool package for manycore programmers…”

Leadership in the Future

We are witnessing a radical change of the entire industry

away from single-processor chips to new generations ofmulti-processor chips called manycores,

“intel has thrown a hail mary pass …”

[Dave Patterson]

Page 4: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 4

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

7

Climate Change in Computing

Wawrzynek missing

(my birthday)

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

8

What Climate change ?

Now intel goes toward 1024 chickens

intel„s strong oxen under Andy Grove and Craig Barret: up to 5 GHz clock speed [til 2005]

Page 5: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 5

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

9

Microprocessor Industry (2)

Gordon Moore indicating performance? (corrected twice)

Stop at >100 Watts per core

More performance by more transistors? -> imagination! -> phantasy!

-> la raison

La raison, c'est l'intelligence en exercice; … [Victor Hugo]

… seamless improvement of performance …… from generation to generation?

Leadership in the past

-> raison?

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

10

Rapid Decline of Computational Density

[BWRC, UC Berkeley, 2004]

1990 1995 2000 2005

200

100

0

50

150

75

25

125

175

SP

EC

fp2

00

0/M

Hz/B

illio

n T

ran

sis

tors

HP

alph

a: d

own

by

100

x in

6 y

rsIB

M:

dow

n by

20

x in

6 y

rs

stolen from Bob Colwell

memory wall, caches, ...

[Wawrzynek; Sep 8, 2005, GSRC Symposium (GSRC‟05) ]

architecture overhead

Page 6: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 6

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

11

Victor Hugo

La raison, c'est l'intelligence en exercice; …

…l'imagination c'est l'intelligence en érection. [Victor Hugo]

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

12

Justin Rattner‟s DAC keynote

Rattner predicts the impact of the industry's move toward the ubiquitous use of wireless communications and the challenges that this very high level of integration will have on design automation, from architecture to manufacturing.

… and, how design techniques will evolve, by necessity, to satisfy the demands of reconfigurable hardware and software programmability.

He will address how the combination of new radio architectures and mostly digital implementations will drive a new generation of design tools, and …

“EDA for Digital, Programmable, Multi-Radios.”

the demand elsewhere (outside intel) ???

Page 7: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 7

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

13

The biggest problem in CS history

Manycore requires a fundamental change in programming:drastically more complex than uniprocessor programming.

This requires to understand new computational principles,algorithms, and programming tools.

We need new directions in R&D

The Manycore Programing Crisis

Reconfigurable Computing is an inevitable ingredient of the crisis’s solution

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

14

>> Outline <<

• Inefficient Microprocessors

• The von Neumann Syndrome

• The twin paradigm approach is needed

• The CS education dilemma

• Conclusions

http://www.uni-kl.de

Page 8: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 8

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

15

A List of Massive Overhead Phenomena

overheadvon Neumann

machine

instruction fetch instruction stream

state address computation instruction stream

data address computation instruction stream

data meet PU + other overh. instruction stream

i/o to/from off-chip RAM instruction stream

Dijkstra 1968: The Goto considered harmful

R.H. & Koch 1975: The universal Bus considered harmful

Backus, 1978: Can programming be liberated from the von Neumann style?Arvind et al., 1983: A critique of Multiprocessing the von Neumann Style

all this overhead piling up to code sizes of astronomic dimensions

von Neumann Syndrome[C.V. “RAM” Ramamoorthy]

von Neumann Syndrome[C.V. “RAM” Ramamoorthy]

CPUCPU single CPU

von Neumann

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

16

Data meeting the Processing Unit (PU)

by Software

inefficiently

routing the data by memory-cycle-hungry instruction streams thru shared memory

the pianist is the „PU“

This is just one of many von Neumann overhead phenomena

Moving data

data

Page 9: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 9

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

17

manycore von Neumann: arrays

of massive overhead phenomena

overheadvon Neumann

machine

instruction fetch instruction stream

state address computation instruction stream

data address computation instruction stream

data meet PU + other overh. instruction stream

i / o to / from off-chip RAM instruction stream

proportionate to the number of processors

CPUCPU single CPU

von Neumann

17

Inter PU communication instruction stream

message passing overhead instruction stream

transactional memory overh. instruction stream

multithreading overhead etc. instruction stream

disproportionate to the number of processors

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

or, 16 core SoCexample: 16 CPUs running in parallel

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

18

Code sizes of astronomic dimensions

Fast on-chip memory cannot store such huge instruction code sizes

… no way around von Neumann bottlenecks

Page 10: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 10

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

19

Reconfigurable Computing

overheadvon Neumann

machinehardwired

anti machinereconfigurableanti machine

instruction fetch instruction stream none*

state address computation instruction stream none*

data address computation instruction stream none*

data meet PU + other overh. instruction stream none*

i / o to / from off-chip RAM instruction stream none*

Inter PU communication instruction stream none*

message passing overhead instruction stream none*

transactional memory overh. instruction stream none*

multithreading overhead etc. instruction stream none*

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPA: reconfigurable datapath arrayrDPA: reconfigurable datapath array

(coa

rse-

grai

ned

rec.

)(c

oars

e-gr

aine

d re

c.)

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

20

von Neumann overhead: just one example

overheadvon Neumann

machine

instruction fetch instruction stream

state address computation instruction stream

data address computation instruction stream

data meet PU + other overh. instruction stream

i / o to / from off-chip RAM instruction stream

CPUCPU single CPU

reconfigurable address generator (GAG): ~20x speed-uprDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

PISA DRC accelerator [ICCAD 1984]

(entire project: 15000x speed-up)

Page 11: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 11

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

21

Declining Programmer Productivity

At particular HPC application domains

massive parallelism requires 10 – 30

professionalists in multi-disciplinary

multi-insitutional teams for 5 - 10 years

[Douglass Post, DoD HPCMP, panelist at SC07]

The Law of More: programmer

productivity declines disproportionately

with increasing parallelism

Software done: machine obsolete

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

22

Dead Supercomputer Society

•ACRI •Alliant •American Supercomputer

•Ametek •Applied Dynamics •Astronautics •BBN •CDC•Convex•Cray Computer •Cray Research •Culler-Harris •Culler Scientific •Cydrome •Dana/Ardent/ Stellar/Stardent

•DAPP •Denelcor •Elexsi •ETA Systems •Evans and Sutherland•Computer•Floating Point Systems •Galaxy YH-1 •Goodyear Aerospace MPP •Gould NPL •Guiltech •ICL •Intel Scientific Computers •International Parallel Machines

•Kendall Square Research •Key Computer Laboratories

a wave of bankruptcies, consolidations, and, changing business objectives

•MasPar•Meiko •Multiflow •Myrias •Numerix •Prisma •Tera •Thinking Machines •Saxpy •Scientific Computer•Systems (SCS) •Soviet Supercomputers •Supertek •Supercomputer Systems •Suprenum •Vitesse Electronics

[Gordon Bell]

Page 12: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 12

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

23

>> Outline <<

• Inefficient Microprocessors

• The von Neumann Syndrome

• We need the twin paradigm approach

• The CS education dilemma

• Conclusions

http://www.uni-kl.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

24

The von-Neumann-only model

algorithms variable

resources fixed

instruction-stream-based computing:

µprocessorµprocessor

Software

instruction-stream

programmable

algorithms variable

Page 13: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 13

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

25

algorithms variable

resources fixed

instruction-stream-based computing:

algorithms variable

resources variable

data-stream-based reconfigurable computing:

Software

instruction-stream

acceleratorsacceleratorsµprocessorµprocessor

Configware

resources variable

programmable

algorithms variable

Nick Tredennick‟s view

algorithms variable

Flowware

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

26

algorithms variable

resources fixed

instruction-stream-based computing:

algorithms variable

resources variable

data-stream-based reconfigurable computing:

instruction-stream

resources variable

algorithms variable

Twin paradigm manycore model

algorithms variable

Configware

Flowware

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

Software

Page 14: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 14

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

27

algorithms variable

resources fixed

instruction-stream-based computing:

algorithms variable

resources variable

data-stream-based reconfigurable computing:

instruction-stream

resources variable

algorithms variable

Software/Configware Co-Compilation

algorithms variable

Configware

Flowware

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

Software

software/configwareco-compiler

C language source

Academic Academic demodemoimplementationimplementation ofof such a such a compilercompiler in in thethe midmid„ 90ies„ 90ies

Academic Academic demodemoimplementationimplementation ofof such a such a compilercompiler in in thethe midmid„ 90ies„ 90ies

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

28

The time has come …

Twin paradigm CS education is inevitable

All, even early, undergrad courses should include

reconfigurability and both kinds of parallelism

Page 15: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 15

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

29

Terminology clean-up

Software: for scheduling instruction streams

Flowware: for scheduling data streams

Configware: for configuring morphware

Programming sources:

vonNeumann

non-vonNeumann

Morphware: „soft hardware“, reconfigurable platforms

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

30

>> Outline <<

• Inefficient Microprocessors

• The von Neumann Syndrome

• We need the twin paradigm approach

• The anti machine

• The CS education dilemma

• Conclusionshttp://www.uni-kl.de

Page 16: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 16

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

31

The 2nd

paradigm: ready to go

the urgently need

datastream-based

machine paradigm:

pepared almost

30 years ago …

the urgently need

datastream-based

machine paradigm:

pepared almost

30 years ago …

xxx

xxx

xxx

|

||

x xx

x

xx

x x

x

- -

-xxxx

xx

xxx

---

---

---

---

xxx

xxx

xxx

|

|

|

|

|

|

|

|

|

|

|

| data streams

… with systolic arrays… with systolic arrays

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

32

DPA

xxx

xxx

xxx

|

||

x x

x

x

x

x

x x

x

- -

-

input data streams

xx

x

x

x

x

xx

x

--

-

-

-

-

-

-

-

-

-

-

xxx

xxx

xxx

|

|

|

|

|

|

|

|

|

|

|

|output data streams

time

port #

time

time

port #time

port #

Flowware defines: ... which data item at which time at which port

Data streams

(flowware)

(pipe network)

systolic array research: throughout the 80ies:

Mathematicians‘ hobby

Page 17: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 17

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

33

Data streams

The clearest and common elementary definition !

datastream ≠ dataflow

… be careful: a lot more definitions are around, derived from local perspectives

(the old “dataflow machine” is indeterministic! )

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

34

Early Systolic Synthesis

Mathematicians like to wax rhapsodic about the elegance, beauty and depth of their proofs.

John Horgan: „The Endof Science Revisited“

Mathematicians liked thebeauty and elegance of Systolic Arrays.

Due to a lack of depth in reasoning, their efforts yielded poor synthesis algorithms.

R. H.

Page 18: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 18

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

35

Algebraic Synthesis Method

algebraic tunnel view:

array[history] mapping

uniform only

shape resources

systolic [1979]

algebraiclinear only

uniform only

mathematicians‟ paradigm trap

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

36

array[history] mapping

uniform only suitable for what

applicationsshape resources

systolic [1979]

algebraiclinear only

uniform only

strictly regular data dependencies

only

Algebraic synthesis methodes: usable

only for a small minority of applications!

Algebraic: what applications?

Page 19: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 19

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

37

Pipe Networks

array[history] mapping

uniform only Suitable for what

applicationsshape resources

systolic [1979]

algebraiclinear only

uniform only

strictly regular data dependencies

only

rDPA: super-systolic

[1995]

simulated annealing no restrictions at all

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

38

Who generates the Data Streams?

xxx

xxx

xxx

|

||

x x

x

x

x

x

x x

x

- -

-

xx

x

x

x

x

xx

x

--

-

-

-

-

-

-

-

-

-

-

xxx

xxx

xxx

|

|

|

|

|

|

|

|

|

|

|

|

Without

a sequencer

it is not

a machine !

Page 20: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 20

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

39

Mathematics: Winner of the

“Not My Job” Award

„it„s not our job“

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

40

But there are Asymmetries

von Neumann paradigm Antimachine paradigm

Software EngineeringSoftware Engineering

Configware EngineeringConfigware Engineering

ASM

CPU

location of the

sequencer

Page 21: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 21

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

41

The counterpart of the

von Neumann machine

xxx

xxx

xxx

|

||

x x

x

x

x

x

x x

x

- -

-

xx

x

x

x

x

xx

x

--

-

-

-

-

-

-

-

-

-

-

xxx

xxx

xxx

|

|

|

|

|

|

|

|

|

|

|

|

(r)DPA

ASM

ASM

ASM

ASM

ASM

ASM

AS

M

AS

M

AS

M

AS

M

AS

M

AS

M

datacounter

GAG RAM

ASM: Auto-Sequencing

Memory

data counters instead of a program counter

data counters: located at memory(not at data path)

coarse-grained

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

42

Data meeting the Processing Unit

byConfigware

placement of the execution locality ...

… pipe network generated by configware compilation

Page 22: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 22

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

43

Toward hetero manycore chips

Each standard core should have a mode bit to select:•Running in CPU mode, or,•Running in DPU mode*

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

also CPU clusters routable

CPUCPU

DPUDPU

*) program sequencer switched off

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPUDPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPUDPUDPU

DPUDPUDPUDPUDPUDPUDPUDPU

•4 (or, 6) nearest neighbor connect

additional backplane interconnect fabrics

for legacy software and supervisory tasks

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

44

The twin paradigm approach

CPUCPU

CPUCPU

CPUCPU

CPUCPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

rDPUrDPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

heterogenousmanycore system

Software/ Configware

Co-compilation

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

DPUDPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

CPUCPU

Page 23: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 23

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

45

>> Outline <<

• Inefficient Microprocessors

• The von Neumann Syndrome

• We need the twin paradigm approach

• The CS education dilemma

• Conclusions

http://www.uni-kl.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

46

The CS education dilemma

On the way between hardware mind set and software mind set we find a major roadblock.

Removing this brick wall is very simple by using very old straight-forward wisdom.

This is due to a reductionist tunnel view, a paradigm trap, partly driven by arrogance.

Educators and curriculum task forces caused this brick wall by decades of ignorance.

Page 24: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 24

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

47

Software to Configware Migration

SNN filter on supersystolic array

rDPU not used used for routing only operator and routing port location markerLegend: backbus connect

rout thru only

not usedbackbus connect

example: brick wall in the brain

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

48

Brick wall in the Brain

After this talk* a VIP jumps up from the floor: „But you can„t implement decisions!“

This statement has been highly embarrassing since it came from a top level IT R&D manager, of a large electronics/computer industry group

*) RAW workshop, late 90ies at Orlando, Florida

We immediately see the brick wall in his brain

Page 25: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 25

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

49

„But you can„t implement decisions!“

S = R + (if C then A else B endif);

=1

+

ABR C

section of a very large pipe network:

C. G. Bell et al: IEEE Trans-C21/5, May 1972

W. A. Clark: 1967 SJCC, AFIPS Conf. Proc.

decision box turns

into a multiplexer*

Software to

Configware

Migration:

“That’s so simple! why did it take 30

years to find out?”

in the year 1971**:

**) the hardware description

languages community

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

50

>> Outline <<

• Inefficient Microprocessors

• The von Neumann Syndrome

• We need the twin paradigm approach

• The CS education dilemma

• Conclusions

http://www.uni-kl.de

Page 26: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 26

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

51

[Kornecki (& R.H.)]

CS curricula must represent today’s computing reality by focusing on parallel, reconfigurable, and distributed computing, not only introducing the loop in CS I

We need to change the Computer Architecture courses for introducing parallel, reconfigurable, and distributed computing in early undergraduate CS courses without involving complicated problems and in-depth analysis.

What matters most is that, from the start, we expect students to consider the world of computing as filled with hetero multiprocessor machines.

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

52

We need a dual paradigm approach

structural procedural

instruction-stream-based

conducted by

a program counter

data-stream-

based

conducted

by data counters

CPUCPU

progra

m

counter

DPUrDPUrDPU

DPUDPU(hardwired)(hardwired)

ASMASMdata

counter

Page 27: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 27

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

53

RC and parallelism

Nothing is more powerful than an idea whose time has come.

Rien n'est plus puissant qu'une idée dont le temps est venu.

Victor Hugo1802-1885

The time has come for parallelism and RC to be introduced in undergraduate CS and ECE courses

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

54

thank you for your patience

Page 28: Montpellier, France, April 10, 2008 - - TU Kaiserslautern · reiner@hartenstein.de 24 April 2008 The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008,

[email protected]

24 April 2008

The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 28

Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de

© 2008, [email protected] http://hartenstein.de

TU Kaiserslautern

55

END