A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

17
A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    218
  • download

    2

Transcript of A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Page 1: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

A Massively Parallel Architecture for Bioinformatics

Presented by Md Jamiul Jahid

Page 2: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Introduction

• Bioinformatics algorithms are demanding in scientific computing

• In general most of the bioinformatics algorithms are fairly simple

• Dealing with huge amount of data• The size of DNA sequence database doubles

every year

Page 3: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Introduction

• A typical DNA contains 3.4 billion base pairs• Maximum algorithms use only simple

operations with input data like – Arithmetic operation– String matching– String comparison

Page 4: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Introduction

• Standard CPUs are designed for providing a good instruction mix for almost all commonly used algorithm

• For a target class of algorithm they are not effective

• Results– High runtime– Energy– Money

Page 5: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Contribution

• Present a massively parallel architecture • Using low cost FPGA(Field Programmable Gate

Array)• They called it COPACOBANA 5000– Meaning Cost-Optimized Parallel Code Braker ANd

Analyzer

Page 6: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

COPACOBANA 1000• This machine is for cryptanalysis: fast code

breaking• 120 low cost FPGAs• 20 subunits• Each has Xilinx Spartan -3 XC3S1000 FPGAs

Page 7: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

COPACOBANA 1000

• Assumptions– Programs are

parallelizable– Demand of data

transfer is low– All node needed

very little local memory which can be served from on-chip RAM of FPGAs

Page 8: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

COPACOBANA 5000

• Bus Concepts– Point to point connection two neighboring FPGA-

cards– Point to point connection contain 8 pairs of wire– Each 250MHz, total 2Gbit/s

Page 9: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.
Page 10: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

COPACOBANA 5000

• Controller– Root entity of control is running on a remote host

computer– Connected to COPACOBANA5000 by LAN– Two scenario• Data on remote host• Data on COPACOBANA5000

Page 11: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

COPACOBANA 5000

• FPGA-Card– Xilinx Spartan-3 5000 is used– Contains 8 FPGAs– All FPGAs are globally clocked

Page 12: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Performance Estimation

• Between– PC– COPACOBANA1000– COPACOBANA5000

Page 13: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Performance Estimation

Page 14: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Conclusion

• In this paper a new hardware for running bioinformatics algorithm is proposed

• The hardware are– Cheap– Low power consumption– Efficient

Page 15: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Questions

?

Page 16: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Thank You

Page 17: A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid.

Reference• Gerd Pfeiffer, Stefan Baumgart, Jan Schröder, and Manfred Schimmler,

A Massively Parallel Architecture for Bioinformatics, 9th International Conference on Computational Science (ICCS 2009).