UIC Thesis Montone

52
Time-driven reconfiguration-aware floorplacer BY Alessio Montone [email protected] Thesis committee: S. Dutt (chair), A. Khokhar, M.D. Santambrogio UIC Thesis Defense: 05/08/2008

Transcript of UIC Thesis Montone

Page 1: UIC Thesis Montone

Time-driven reconfiguration-aware

floorplacer

BY

Alessio Montone

[email protected]

Thesis committee:

S. Dutt (chair), A. Khokhar, M.D. Santambrogio

UIC Thesis Defense: 05/08/2008

Page 2: UIC Thesis Montone

2

Rationale and InnovationRationale and Innovation

Problem statementGiven a reconfigurable architecture, find an on-chip position for each functional unit

Innovative contribution: taking into accountTarget Device HeterogeneityTarget Device reconfiguation capabilities Inter-FU Communication

Page 3: UIC Thesis Montone

3

AimsAims

Considering the area assignment problem tailored for reconfigurable architectures, providea formalization of the problem, andan approach (in 3 algorithms) for solving

Page 4: UIC Thesis Montone

4

OutlineOutline

IntroductionFloorplacementThe Proposed ApproachExperimental ResultsComparison with the state of the artConclusions and Future WorksQuestions

Page 5: UIC Thesis Montone

INTRODUCTIONINTRODUCTION

5

Page 6: UIC Thesis Montone

Reconfigurable Architectures - IReconfigurable Architectures - I

On FPGAsReconfigurable DevicesHeterogeneousReconfiguration Limits

Different types of Reconfigurable Architectures:

TotalPartial (Static)Partial (Dynamic)

6

Page 7: UIC Thesis Montone

Reconfigurable Architectures - IReconfigurable Architectures - I

Total

7

Page 8: UIC Thesis Montone

Reconfigurable Architectures - IReconfigurable Architectures - I

Total

Partial (Static)

8

Page 9: UIC Thesis Montone

Reconfigurable Architectures - IIReconfigurable Architectures - II

Partial Dynamic

9

Page 10: UIC Thesis Montone

Area Assignment ProblemArea Assignment Problem

Let consider a Reconfigurable ArchitectureGiven a scheduled task graph (TG) of the application

Node: Reconfigurable Functional Unit (RFU) [*], A netlist obtained after post synthesis and technology mapping (i.e., before placement and routing)

Aim: find an area assignment for each RFU

10

[*] K. Bazargan, R. Kastner, M.S.: 3-d floorplanning: Simulated annealing and greedy placement methods for reconfigurable computing systems. IEEE Rapid Systems Prototyping (1999)

Page 11: UIC Thesis Montone

Related Works - IRelated Works - I

[*] introduced the concept of 3D floorplanning for reconfigurable systems

SA in order to solve HW/SW codesign problemFor each task choose between

HW implementationSW implementation

LimitsNo device limits consideredNo communicationinfrastructure

11

[*] K. Bazargan, R. Kastner, M.S.: 3-d floorplanning: Simulated annealing and greedy placement methods for reconfigurable computing systems. IEEE Rapid Systems Prototyping (1999)

Page 12: UIC Thesis Montone

Related Works - IIRelated Works - II

[*] is the state of art in 3D floorplanningSimulated Annealing over Transitive Closure GraphTakes into account device reconfiguration limits

LimitsNo heterogeneity consideredHigh overhead communicationinfrastructure solution [**]

12

[*] Ping-Hung Yuh, Chia-Lin Yang, Yao-Wen Chang, Hsin-Lung Chen: Temporal Floorplanning Using 3D-subTCG, Design Automation Conference, 2004

[**] S. P. Fekete, E. Kohler, and J. Teich: Optimal FPGA Module Placement with Temporal Precedence Constraints, Proc. DATE, 2001.

Page 13: UIC Thesis Montone

FLOORPLACEMENTFLOORPLACEMENT

13

Page 14: UIC Thesis Montone

Floorplanning vs. PlacementFloorplanning vs. PlacementCharacteristic Floorplanning Placement

# items <100 >10.000

Items (for FPGAs) IP-Core Slice, CLB

Aim Find a position for each item

obj. function depends

mainly on Area mainly on Wirelength

Constraints Items can be positioned everywhere

There is a set of possible positions

14

PlacementFloor plan

Page 15: UIC Thesis Montone

Floorplacement - IFloorplacement - I

Hierarchical Approach (Floorplanning + Partitioning)

15

S. N. Adya, I. L. Markov, Fixed-outline Floorplanning: Enabling Hierarchical Design, IEEE Transaction on VLSI System, 2002

S. N. Adya, S. Chaturvedi, J. A. Roy, David A. Papa, I. L. Markov: Unification of Partitioning, Placement and Floorplanning , IEEE Intl. Conf. on CAD, 2004

Page 16: UIC Thesis Montone

Floorplacement - IIFloorplacement - II

Reconfigurable Functional Unit (RFU)A netlist obtained after post synthesis and technology mapping (i.e., before placement and routing)

Reconfigurable Region (RR)

16

Page 17: UIC Thesis Montone

Floorplacement - IIIFloorplacement - III

Resource Aware (i.e., not all positions are feasible)

Device heterogeneity

Device Reconfigurationcapabilities

17

Page 18: UIC Thesis Montone

THE PROPOSED THE PROPOSED APPROACHAPPROACH

18

Page 19: UIC Thesis Montone

Proposed Problem DefinitionProposed Problem Definition

19

AimDefine RRsFor each task find

find a RRA position inside RR

Objective FunctionMin. Fragmentation

ConstraintsCommunication issuesDevice limits

Page 20: UIC Thesis Montone

Target Devices: Xilinx Virtex 4 - 5Target architecture based on EAPR design flow

Target Architecture and DevicesTarget Architecture and Devices

20

Page 21: UIC Thesis Montone

InputInput

21

Page 22: UIC Thesis Montone

Proposed Approach: overviewProposed Approach: overview

22

Page 23: UIC Thesis Montone

11stst Algorithm: Partitioning into Algorithm: Partitioning into RRRR

Aim: identify the RRs and associate each RFU to one RRHow: partitioning the TG minimizing resource requirement variance of the RRs (moving and swapping nodes)

23

Resource of type t required by RFU n, at static photo p

Page 24: UIC Thesis Montone

22ndnd Algorithm:TFiRR - I Algorithm:TFiRR - I

24

Temporal Floorplacement inside RR (TFiRR)Aim: for each RR find a set of feasible width-height pairsHow: floorplacing RFUs inside corresponding RR Assumption: RFUs’ height = height of the RR they belong to

Pseudo Code:

Page 25: UIC Thesis Montone

22ndnd Algorithm:TFiRR - II Algorithm:TFiRR - II

25

Let consider an iteration:

Page 26: UIC Thesis Montone

22ndnd Algorithm:TFiRR - III Algorithm:TFiRR - III

26

Let consider an iteration:

Page 27: UIC Thesis Montone

22ndnd Algorithm:TFiRR - IV Algorithm:TFiRR - IV

27

Let consider an iteration:

Page 28: UIC Thesis Montone

22ndnd Algorithm:TFiRR - V Algorithm:TFiRR - V

28

Let consider an iteration:

Page 29: UIC Thesis Montone

22ndnd Algorithm:TFiRR - VI Algorithm:TFiRR - VI

29

Page 30: UIC Thesis Montone

22ndnd Algorithm: TFiRR – Example Algorithm: TFiRR – Example

30

Page 31: UIC Thesis Montone

33rdrd Algorithm: RR floorplacement - I Algorithm: RR floorplacement - I

Simulated AnnealingObjective Function

Data Structure4 Constraint Lists (one per row)

31

Page 32: UIC Thesis Montone

33rdrd Algorithm: RR floorplacement - II Algorithm: RR floorplacement - II

Simulated Annealing: movesSwap two RRsMove one RRsSpan over one more rowUn-Span over one less row

After each move packing is performed(i.e., the floorplacement is compressed)

32

Page 33: UIC Thesis Montone

33rdrd Algorithm: RR floorplacement – Example - I Algorithm: RR floorplacement – Example - I

33

Page 34: UIC Thesis Montone

33rdrd Algorithm: RR floorplacement – Example - II Algorithm: RR floorplacement – Example - II

34

Page 35: UIC Thesis Montone

ImplementationImplementation

Three simulated annealers written in C++ STL

35

Page 36: UIC Thesis Montone

Output ExamplesOutput Examples

TFiRR

RR Floorplacement

36

Page 37: UIC Thesis Montone

EXPERIMENTAL RESULTSEXPERIMENTAL RESULTS

37

Page 38: UIC Thesis Montone

Identification of the number of RRs Identification of the number of RRs

38

Page 39: UIC Thesis Montone

Partitioning’s impact on TFiRRPartitioning’s impact on TFiRR

TFiRR on Partitioned

TG

TFiRR on TG

Execution Time 125ms 114ms 4m54s

Width (normalized)

1.00 1.19 1.04

39

Increasing the number of RFUs decreases the possibility to pick up the right one

Partitioning is a precondition of the 3rd algorithm in order to better exploit FPGA’s area (2D Floorplacement)

Page 40: UIC Thesis Montone

Tests performed directly floorplacing RFUs Execution time about 100 ms (100K iterations)

Floorplacement – Success Floorplacement – Success RateRate

40

Page 41: UIC Thesis Montone

Floorplacement – Aspect RatioFloorplacement – Aspect Ratio

41

Tests performed directly floorplacing RFUs

Page 42: UIC Thesis Montone

COMPARISON WITH THE COMPARISON WITH THE STATE OF THE ARTSTATE OF THE ART

42

Page 43: UIC Thesis Montone

State of the artState of the art

43

Authors Comm. Infrastructure

ResourceAware

Reconfiguration Aware

Device LimitsAware

Bazargan et al. No No Yes No

Yuh et al. Limited, w/ High Overhead

No Yes Yes

Singhal et al. No No Yes No

Feng et al. No Yes No No

Page 44: UIC Thesis Montone

NotesNotes

The comparsion is performed with respect to the description given by Yuh et al in [*]

Yuh’s approach does not support– Multiple Resources– Existence of a Static side

In order perform the comparison– the case study has been chosen in order to avoid

multiple resource limitation– Yuh’s approach has been extended to support a

static side

44

[*] Ping-Hung Yuh, Chia-Lin Yang, Yao-Wen Chang, Hsin-Lung Chen: Temporal Floorplanning Using 3D-subTCG, Design Automation Conference, 2004

Page 45: UIC Thesis Montone

The Case StudyThe Case Study

A Reconfigurable Architecture (for Biomedical Purpose) on XC5VLX30T 1. Collecting data from sensor2. Elaborating them3. Sending to a host computer thorough the net

45

Page 46: UIC Thesis Montone

The Proposed SolutionThe Proposed Solution

It is a reconfigurable architecture with 2 static photos

46

Page 47: UIC Thesis Montone

Area assignments comparisonArea assignments comparison

47

Proposedsolution

Yuh’s solution

Page 48: UIC Thesis Montone

Communication performances comparisonCommunication performances comparison

Considering 100 M samples, 32 bit each, at 75 MHz.

The entire data are transferred by– The Proposed Approach in 2.6 seconds– Yuh’s approach in 416.0 seconds

48

Page 49: UIC Thesis Montone

Conclusions - IConclusions - I

An algorithm for the identification of area constraint for reconfigurable architectures has been introduced

Novelties: taking into account– Target device heterogeneity– Target device reconfiguration capabilities– Communication issues

49

Page 50: UIC Thesis Montone

Conclusions - IIConclusions - II

Results have been published– A. Montone, M.D. Santambrogio, D. Sciuto,

A Design Workflow for the Identification of Area Constraints in Dynamic Reconfigurable Systems, IEEE International Symposium on Electronic Design, Test and Applications (DELTA), 2008

– A. Montone, M.D. Santambrogio, Area Constraint Evaluation for FPGAs, The Syndicated Q1-2008, A technical newsletter for FPGA, ASIC Verification and DSP Designers, Synplicity Incorporation

Under revision:– A Reconfiguration-aware Floorplacer for FPGAs,

IEEE Field Programmable Logic (FPL), 2008

50

Page 51: UIC Thesis Montone

Future WorksFuture Works

Take into consideration IOBs and inter modules communications

Partitioning considering clock regions

51

Page 52: UIC Thesis Montone

QuestionsQuestions

52