Montpellier, France, April 10, 2008 - - TU Kaiserslautern · [email protected] 24 April 2008...
Transcript of Montpellier, France, April 10, 2008 - - TU Kaiserslautern · [email protected] 24 April 2008...
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 1
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
Reconfigurable Computing to solve the CS Education Dilemma
(keynote address)
Reiner Hartenstein
TU Kaiserslautern
Montpellier, France, April 10, 2008
RC education 2008
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
2
Demonstrating the intensive Impact
(FPGA-based speed up and power saving)
DES breaking [T. Elghazawi et al.: IEEE COMPUTER, Febr. 2008]
Platform (compared to Beowulf cluster)
Speed-up factor
Power saving factor
SDC-6 6757 856
Cray XD-1 12162 608
SGI Atix 4700 with RC 100 RASC 28514 3439
encryption on von Neumann getting unaffordable
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 2
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
3
The clash of Paradigms (a simplified view)
Computation in time (procedural-only von Neumann mind set)
Computing in space (structural mind set)
Reconfigurable Computing model: supersystolic mind set -
data-stream-based
… you can teach programming to a hardware guy
… you can teach programming to a hardware guy
you can’t teach hardware to a programmer, although …
you can’t teach hardware to a programmer, although …
This is the fault of our CS curriculaThis is the fault of our CS curricula
traditional CS model : Software-only mindset -instruction-stream-based
sequential mindset parallel pipelining mindset
procedural structural
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
4
Tear down this Wall !
… to achieve a far-ranging paradigm shift, away from the monopoly-like dominance of the instruction-stream-based vN mind set,
… to clear away the wreckage of CS foundations having survived the mainfrage age, and, having collapsedunder the weight of its own absurdity.
over to a dichotomic dual paradigm basic model including the Reconfigurable Computing mind set.
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 3
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
5
>> Outline <<
• Inefficient Microprocessors
• The von Neumann Syndrome
• The twin paradigm approach is needed
• The CS education dilemma
• Conclusions
http://www.uni-kl.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
6
Microprocessor Industry (1)
We are o.k. ? You are o.k. ?
go multi-threading? - transactional memory? – MPI? … ?
“… we have a tool package for manycore programmers…”
Leadership in the Future
We are witnessing a radical change of the entire industry
away from single-processor chips to new generations ofmulti-processor chips called manycores,
“intel has thrown a hail mary pass …”
[Dave Patterson]
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 4
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
7
Climate Change in Computing
Wawrzynek missing
(my birthday)
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
8
What Climate change ?
Now intel goes toward 1024 chickens
intel„s strong oxen under Andy Grove and Craig Barret: up to 5 GHz clock speed [til 2005]
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 5
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
9
Microprocessor Industry (2)
Gordon Moore indicating performance? (corrected twice)
Stop at >100 Watts per core
More performance by more transistors? -> imagination! -> phantasy!
-> la raison
La raison, c'est l'intelligence en exercice; … [Victor Hugo]
… seamless improvement of performance …… from generation to generation?
Leadership in the past
-> raison?
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
10
Rapid Decline of Computational Density
[BWRC, UC Berkeley, 2004]
1990 1995 2000 2005
200
100
0
50
150
75
25
125
175
SP
EC
fp2
00
0/M
Hz/B
illio
n T
ran
sis
tors
HP
alph
a: d
own
by
100
x in
6 y
rsIB
M:
dow
n by
20
x in
6 y
rs
stolen from Bob Colwell
memory wall, caches, ...
[Wawrzynek; Sep 8, 2005, GSRC Symposium (GSRC‟05) ]
architecture overhead
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 6
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
11
Victor Hugo
La raison, c'est l'intelligence en exercice; …
…l'imagination c'est l'intelligence en érection. [Victor Hugo]
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
12
Justin Rattner‟s DAC keynote
Rattner predicts the impact of the industry's move toward the ubiquitous use of wireless communications and the challenges that this very high level of integration will have on design automation, from architecture to manufacturing.
… and, how design techniques will evolve, by necessity, to satisfy the demands of reconfigurable hardware and software programmability.
He will address how the combination of new radio architectures and mostly digital implementations will drive a new generation of design tools, and …
“EDA for Digital, Programmable, Multi-Radios.”
the demand elsewhere (outside intel) ???
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 7
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
13
The biggest problem in CS history
Manycore requires a fundamental change in programming:drastically more complex than uniprocessor programming.
This requires to understand new computational principles,algorithms, and programming tools.
We need new directions in R&D
The Manycore Programing Crisis
Reconfigurable Computing is an inevitable ingredient of the crisis’s solution
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
14
>> Outline <<
• Inefficient Microprocessors
• The von Neumann Syndrome
• The twin paradigm approach is needed
• The CS education dilemma
• Conclusions
http://www.uni-kl.de
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 8
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
15
A List of Massive Overhead Phenomena
overheadvon Neumann
machine
instruction fetch instruction stream
state address computation instruction stream
data address computation instruction stream
data meet PU + other overh. instruction stream
i/o to/from off-chip RAM instruction stream
Dijkstra 1968: The Goto considered harmful
R.H. & Koch 1975: The universal Bus considered harmful
Backus, 1978: Can programming be liberated from the von Neumann style?Arvind et al., 1983: A critique of Multiprocessing the von Neumann Style
all this overhead piling up to code sizes of astronomic dimensions
von Neumann Syndrome[C.V. “RAM” Ramamoorthy]
von Neumann Syndrome[C.V. “RAM” Ramamoorthy]
CPUCPU single CPU
von Neumann
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
16
Data meeting the Processing Unit (PU)
by Software
inefficiently
routing the data by memory-cycle-hungry instruction streams thru shared memory
the pianist is the „PU“
This is just one of many von Neumann overhead phenomena
Moving data
data
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 9
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
17
manycore von Neumann: arrays
of massive overhead phenomena
overheadvon Neumann
machine
instruction fetch instruction stream
state address computation instruction stream
data address computation instruction stream
data meet PU + other overh. instruction stream
i / o to / from off-chip RAM instruction stream
proportionate to the number of processors
CPUCPU single CPU
von Neumann
17
Inter PU communication instruction stream
message passing overhead instruction stream
transactional memory overh. instruction stream
multithreading overhead etc. instruction stream
disproportionate to the number of processors
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
or, 16 core SoCexample: 16 CPUs running in parallel
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
18
Code sizes of astronomic dimensions
Fast on-chip memory cannot store such huge instruction code sizes
… no way around von Neumann bottlenecks
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 10
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
19
Reconfigurable Computing
overheadvon Neumann
machinehardwired
anti machinereconfigurableanti machine
instruction fetch instruction stream none*
state address computation instruction stream none*
data address computation instruction stream none*
data meet PU + other overh. instruction stream none*
i / o to / from off-chip RAM instruction stream none*
Inter PU communication instruction stream none*
message passing overhead instruction stream none*
transactional memory overh. instruction stream none*
multithreading overhead etc. instruction stream none*
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPA: reconfigurable datapath arrayrDPA: reconfigurable datapath array
(coa
rse-
grai
ned
rec.
)(c
oars
e-gr
aine
d re
c.)
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
20
von Neumann overhead: just one example
overheadvon Neumann
machine
instruction fetch instruction stream
state address computation instruction stream
data address computation instruction stream
data meet PU + other overh. instruction stream
i / o to / from off-chip RAM instruction stream
CPUCPU single CPU
reconfigurable address generator (GAG): ~20x speed-uprDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
PISA DRC accelerator [ICCAD 1984]
(entire project: 15000x speed-up)
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 11
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
21
Declining Programmer Productivity
At particular HPC application domains
massive parallelism requires 10 – 30
professionalists in multi-disciplinary
multi-insitutional teams for 5 - 10 years
[Douglass Post, DoD HPCMP, panelist at SC07]
The Law of More: programmer
productivity declines disproportionately
with increasing parallelism
Software done: machine obsolete
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
22
Dead Supercomputer Society
•ACRI •Alliant •American Supercomputer
•Ametek •Applied Dynamics •Astronautics •BBN •CDC•Convex•Cray Computer •Cray Research •Culler-Harris •Culler Scientific •Cydrome •Dana/Ardent/ Stellar/Stardent
•DAPP •Denelcor •Elexsi •ETA Systems •Evans and Sutherland•Computer•Floating Point Systems •Galaxy YH-1 •Goodyear Aerospace MPP •Gould NPL •Guiltech •ICL •Intel Scientific Computers •International Parallel Machines
•Kendall Square Research •Key Computer Laboratories
a wave of bankruptcies, consolidations, and, changing business objectives
•MasPar•Meiko •Multiflow •Myrias •Numerix •Prisma •Tera •Thinking Machines •Saxpy •Scientific Computer•Systems (SCS) •Soviet Supercomputers •Supertek •Supercomputer Systems •Suprenum •Vitesse Electronics
[Gordon Bell]
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 12
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
23
>> Outline <<
• Inefficient Microprocessors
• The von Neumann Syndrome
• We need the twin paradigm approach
• The CS education dilemma
• Conclusions
http://www.uni-kl.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
24
The von-Neumann-only model
algorithms variable
resources fixed
instruction-stream-based computing:
µprocessorµprocessor
Software
instruction-stream
programmable
algorithms variable
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 13
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
25
algorithms variable
resources fixed
instruction-stream-based computing:
algorithms variable
resources variable
data-stream-based reconfigurable computing:
Software
instruction-stream
acceleratorsacceleratorsµprocessorµprocessor
Configware
resources variable
programmable
algorithms variable
Nick Tredennick‟s view
algorithms variable
Flowware
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
26
algorithms variable
resources fixed
instruction-stream-based computing:
algorithms variable
resources variable
data-stream-based reconfigurable computing:
instruction-stream
resources variable
algorithms variable
Twin paradigm manycore model
algorithms variable
Configware
Flowware
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
Software
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 14
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
27
algorithms variable
resources fixed
instruction-stream-based computing:
algorithms variable
resources variable
data-stream-based reconfigurable computing:
instruction-stream
resources variable
algorithms variable
Software/Configware Co-Compilation
algorithms variable
Configware
Flowware
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
Software
software/configwareco-compiler
C language source
Academic Academic demodemoimplementationimplementation ofof such a such a compilercompiler in in thethe midmid„ 90ies„ 90ies
Academic Academic demodemoimplementationimplementation ofof such a such a compilercompiler in in thethe midmid„ 90ies„ 90ies
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
28
The time has come …
Twin paradigm CS education is inevitable
All, even early, undergrad courses should include
reconfigurability and both kinds of parallelism
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 15
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
29
Terminology clean-up
Software: for scheduling instruction streams
Flowware: for scheduling data streams
Configware: for configuring morphware
Programming sources:
vonNeumann
non-vonNeumann
Morphware: „soft hardware“, reconfigurable platforms
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
30
>> Outline <<
• Inefficient Microprocessors
• The von Neumann Syndrome
• We need the twin paradigm approach
• The anti machine
• The CS education dilemma
• Conclusionshttp://www.uni-kl.de
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 16
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
31
The 2nd
paradigm: ready to go
the urgently need
datastream-based
machine paradigm:
pepared almost
30 years ago …
the urgently need
datastream-based
machine paradigm:
pepared almost
30 years ago …
xxx
xxx
xxx
|
||
x xx
x
xx
x x
x
- -
-xxxx
xx
xxx
---
---
---
---
xxx
xxx
xxx
|
|
|
|
|
|
|
|
|
|
|
| data streams
… with systolic arrays… with systolic arrays
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
32
DPA
xxx
xxx
xxx
|
||
x x
x
x
x
x
x x
x
- -
-
input data streams
xx
x
x
x
x
xx
x
--
-
-
-
-
-
-
-
-
-
-
xxx
xxx
xxx
|
|
|
|
|
|
|
|
|
|
|
|output data streams
time
port #
time
time
port #time
port #
Flowware defines: ... which data item at which time at which port
Data streams
(flowware)
(pipe network)
systolic array research: throughout the 80ies:
Mathematicians‘ hobby
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 17
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
33
Data streams
The clearest and common elementary definition !
datastream ≠ dataflow
… be careful: a lot more definitions are around, derived from local perspectives
(the old “dataflow machine” is indeterministic! )
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
34
Early Systolic Synthesis
Mathematicians like to wax rhapsodic about the elegance, beauty and depth of their proofs.
John Horgan: „The Endof Science Revisited“
Mathematicians liked thebeauty and elegance of Systolic Arrays.
Due to a lack of depth in reasoning, their efforts yielded poor synthesis algorithms.
R. H.
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 18
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
35
Algebraic Synthesis Method
algebraic tunnel view:
array[history] mapping
uniform only
shape resources
systolic [1979]
algebraiclinear only
uniform only
mathematicians‟ paradigm trap
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
36
array[history] mapping
uniform only suitable for what
applicationsshape resources
systolic [1979]
algebraiclinear only
uniform only
strictly regular data dependencies
only
Algebraic synthesis methodes: usable
only for a small minority of applications!
Algebraic: what applications?
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 19
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
37
Pipe Networks
array[history] mapping
uniform only Suitable for what
applicationsshape resources
systolic [1979]
algebraiclinear only
uniform only
strictly regular data dependencies
only
rDPA: super-systolic
[1995]
simulated annealing no restrictions at all
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
38
Who generates the Data Streams?
xxx
xxx
xxx
|
||
x x
x
x
x
x
x x
x
- -
-
xx
x
x
x
x
xx
x
--
-
-
-
-
-
-
-
-
-
-
xxx
xxx
xxx
|
|
|
|
|
|
|
|
|
|
|
|
Without
a sequencer
it is not
a machine !
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 20
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
39
Mathematics: Winner of the
“Not My Job” Award
„it„s not our job“
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
40
But there are Asymmetries
von Neumann paradigm Antimachine paradigm
Software EngineeringSoftware Engineering
Configware EngineeringConfigware Engineering
ASM
CPU
location of the
sequencer
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 21
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
41
The counterpart of the
von Neumann machine
xxx
xxx
xxx
|
||
x x
x
x
x
x
x x
x
- -
-
xx
x
x
x
x
xx
x
--
-
-
-
-
-
-
-
-
-
-
xxx
xxx
xxx
|
|
|
|
|
|
|
|
|
|
|
|
(r)DPA
ASM
ASM
ASM
ASM
ASM
ASM
AS
M
AS
M
AS
M
AS
M
AS
M
AS
M
datacounter
GAG RAM
ASM: Auto-Sequencing
Memory
data counters instead of a program counter
data counters: located at memory(not at data path)
coarse-grained
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
42
Data meeting the Processing Unit
byConfigware
placement of the execution locality ...
… pipe network generated by configware compilation
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 22
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
43
Toward hetero manycore chips
Each standard core should have a mode bit to select:•Running in CPU mode, or,•Running in DPU mode*
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
also CPU clusters routable
CPUCPU
DPUDPU
*) program sequencer switched off
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPUDPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPUDPUDPU
DPUDPUDPUDPUDPUDPUDPUDPU
•4 (or, 6) nearest neighbor connect
additional backplane interconnect fabrics
for legacy software and supervisory tasks
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
44
The twin paradigm approach
CPUCPU
CPUCPU
CPUCPU
CPUCPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
rDPUrDPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
heterogenousmanycore system
Software/ Configware
Co-compilation
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
DPUDPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
CPUCPU
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 23
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
45
>> Outline <<
• Inefficient Microprocessors
• The von Neumann Syndrome
• We need the twin paradigm approach
• The CS education dilemma
• Conclusions
http://www.uni-kl.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
46
The CS education dilemma
On the way between hardware mind set and software mind set we find a major roadblock.
Removing this brick wall is very simple by using very old straight-forward wisdom.
This is due to a reductionist tunnel view, a paradigm trap, partly driven by arrogance.
Educators and curriculum task forces caused this brick wall by decades of ignorance.
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 24
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
47
Software to Configware Migration
SNN filter on supersystolic array
rDPU not used used for routing only operator and routing port location markerLegend: backbus connect
rout thru only
not usedbackbus connect
example: brick wall in the brain
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
48
Brick wall in the Brain
After this talk* a VIP jumps up from the floor: „But you can„t implement decisions!“
This statement has been highly embarrassing since it came from a top level IT R&D manager, of a large electronics/computer industry group
*) RAW workshop, late 90ies at Orlando, Florida
We immediately see the brick wall in his brain
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 25
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
49
„But you can„t implement decisions!“
S = R + (if C then A else B endif);
=1
+
ABR C
section of a very large pipe network:
C. G. Bell et al: IEEE Trans-C21/5, May 1972
W. A. Clark: 1967 SJCC, AFIPS Conf. Proc.
decision box turns
into a multiplexer*
Software to
Configware
Migration:
“That’s so simple! why did it take 30
years to find out?”
in the year 1971**:
**) the hardware description
languages community
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
50
>> Outline <<
• Inefficient Microprocessors
• The von Neumann Syndrome
• We need the twin paradigm approach
• The CS education dilemma
• Conclusions
http://www.uni-kl.de
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 26
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
51
[Kornecki (& R.H.)]
CS curricula must represent today’s computing reality by focusing on parallel, reconfigurable, and distributed computing, not only introducing the loop in CS I
We need to change the Computer Architecture courses for introducing parallel, reconfigurable, and distributed computing in early undergraduate CS courses without involving complicated problems and in-depth analysis.
What matters most is that, from the start, we expect students to consider the world of computing as filled with hetero multiprocessor machines.
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
52
We need a dual paradigm approach
structural procedural
instruction-stream-based
conducted by
a program counter
data-stream-
based
conducted
by data counters
CPUCPU
progra
m
counter
DPUrDPUrDPU
DPUDPU(hardwired)(hardwired)
ASMASMdata
counter
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 27
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
53
RC and parallelism
Nothing is more powerful than an idea whose time has come.
Rien n'est plus puissant qu'une idée dont le temps est venu.
Victor Hugo1802-1885
The time has come for parallelism and RC to be introduced in undergraduate CS and ECE courses
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
54
thank you for your patience
24 April 2008
The 3rd Workshop on Reconfigurable Computing Education (RC education), April 10, 2008, Montpellier, France 28
Reiner Hartenstein, TU Kaiserslautern, Germanyhttp://hartenstein.de
© 2008, [email protected] http://hartenstein.de
TU Kaiserslautern
55
END