
Generic Programming for High Performance Numerical Linear Algebra

Jeremy Siek and Andrew Lumsdaine

Department of Computer Science and Engineering

University of Notre Dame

Overview

1. Introduction & Motivation

2. Generic Programming

3. Generic Algorithms for Linear Algebra

4. The MTL Algorithms

5. The MTL Components

6. High Performance

7. Conclusion

Introduction & Motivation

• Scientific software can benefit from software engineering methodologies

– Development

– Maintenance

• Perpetual interest in using languages such as C++ in scientific computing

• Common perception: abstraction is the enemy of performance

Introduction & Motivation

• C++ can be used effectively in scientific computing (with concomitant software engineering benefits)

• Generic programming has some particular benefits in this domain

• Follow the theme of the STL

• Always keep high performance in mind

Combinatorial Explosion

• Four precision types

• Several dense storage types

• A multitude of sparse storage types

• Row and column oriented matrices

• Scaling and striding

Combinatorial Explosion

• Unnecessary artifact of certain programming languages

• Algorithm expression also includes data type information

• Not necessary in certain other languages (most notably,

Generic Programming

• Algorithms can be expressed independently of data storage formats

• Define standard interfaces for data storage components

• Iterators are the interface between containers and algorithms

• E.g., the Standard Template Library

• High performance linear algebra is amenable to generic programming

Iterator Bridge Between Algorithms and Containers

[Diagram: Container, Iterator, Algorithm]

Example of Generic Programming

template <class InIter, class T>
T accumulate(InIter first, InIter last, T init) {
  // Fold each element of [first, last) into init
  while (first != last)
    init = init + *first++;
  return init;
}

// how it is used:
vector<double> x(10, 1.0);
double sum = accumulate(x.begin(), x.end(), 0.0);
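The point of the example is that one algorithm serves every container that supplies iterators. A minimal sketch of that idea follows; the names `accumulate_sketch` and `sum_three_containers` are hypothetical, chosen here only to avoid clashing with `std::accumulate`.

```cpp
#include <cassert>
#include <list>
#include <vector>

// Same algorithm as on the slide, renamed (assumption) to avoid std::accumulate.
template <class InIter, class T>
T accumulate_sketch(InIter first, InIter last, T init) {
    while (first != last)
        init = init + *first++;
    return init;
}

// Hypothetical helper: runs the one algorithm over three storage types.
inline double sum_three_containers() {
    std::vector<double> v(10, 1.0);            // contiguous storage
    std::list<int> l;                          // linked storage
    l.push_back(1); l.push_back(2); l.push_back(3);
    const double raw[4] = {0.5, 0.5, 0.5, 0.5}; // plain array, pointers as iterators

    double s = accumulate_sketch(v.begin(), v.end(), 0.0);  // 10.0
    s += accumulate_sketch(l.begin(), l.end(), 0);          // 6
    s += accumulate_sketch(raw, raw + 4, 0.0);              // 2.0
    assert(s == 18.0);
    return s;
}
```

The algorithm is written once; the element type and the traversal mechanism are both supplied by the iterator arguments.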

Generic Algorithms for Linear Algebra

• Extend the generic style of programming to the domain of linear algebra

• A matrix can be abstractly thought of as a container of containers

• Use iterators and 2-dimensional iterators to traverse the matrix

• A large class of matrix types can be implemented with this interface
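The "container of containers" view can be sketched with standard containers alone. The code below is an illustration under that assumption, not MTL code: the outer iterator stands in for a 2-dimensional iterator and the inner iterator for a 1-D iterator, and `sum_all` is a hypothetical name.

```cpp
#include <cassert>
#include <list>
#include <vector>

// Sums every element of a container of containers using only iterators.
// Matrix2D is any type whose elements are themselves iterable containers.
template <class Matrix2D, class T>
T sum_all(const Matrix2D& A, T init) {
    typename Matrix2D::const_iterator i;                  // plays the 2-D iterator role
    for (i = A.begin(); i != A.end(); ++i) {
        typename Matrix2D::value_type::const_iterator j;  // 1-D iterator within a row
        for (j = i->begin(); j != i->end(); ++j)
            init = init + *j;
    }
    return init;
}

// Hypothetical usage: the same algorithm on two different row representations.
inline double sum_all_example() {
    std::vector<std::vector<double> > dense(2, std::vector<double>(3, 1.0)); // 2x3 of ones
    std::vector<std::list<double> > ragged(2);   // rows of different lengths
    ragged[0].push_back(4.0);
    ragged[1].push_back(1.0);
    ragged[1].push_back(2.0);
    assert(sum_all(dense, 0.0) == 6.0);
    return sum_all(dense, 0.0) + sum_all(ragged, 0.0);  // 6.0 + 7.0
}
```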

The MTL Generic Algorithms

• Encompasses BLAS functionality

• A single algorithm typically used for all matrix and numeric types

• Index-less algorithms

• Sparse and dense algorithms unified

• Transpose, scaling, and striding handled by adapters, not the algorithm

Index-less Algorithms

• Iterate from begin() to end() of a vector

• Iterate from begin_rows() to end_rows() (or columns) of a 2-D container

• This sidesteps traditional annoyances such as the difference between Fortran (from 1) and C (from 0) indexing

Unifying Sparse and Dense

• Iterators hide the difference in traversal

• The index() method hides the difference in indexing

• An example from a matrix-vector multiply:

for (j = i->begin(); not_at(j, i->end()); ++j)
  tmp += *j * x[j.index()];
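How an `index()` method can hide the storage difference may be easier to see with two toy iterators. The classes `dense_iter` and `sparse_iter` and the function `row_dot` below are hypothetical stand-ins written for this sketch; they are not the MTL iterator classes.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Dense row iterator: the index is simply the position in the row.
class dense_iter {
    const double* base_;
    std::size_t pos_;
public:
    dense_iter(const double* base, std::size_t pos) : base_(base), pos_(pos) {}
    double operator*() const { return base_[pos_]; }
    dense_iter& operator++() { ++pos_; return *this; }
    bool operator!=(const dense_iter& o) const { return pos_ != o.pos_; }
    std::size_t index() const { return pos_; }
};

// Sparse row iterator over (column, value) pairs: index() reads the stored column.
class sparse_iter {
    const std::pair<std::size_t, double>* p_;
public:
    explicit sparse_iter(const std::pair<std::size_t, double>* p) : p_(p) {}
    double operator*() const { return p_->second; }
    sparse_iter& operator++() { ++p_; return *this; }
    bool operator!=(const sparse_iter& o) const { return p_ != o.p_; }
    std::size_t index() const { return p_->first; }
};

// One inner loop for both cases, mirroring the loop on the slide.
template <class RowIter>
double row_dot(RowIter j, RowIter end, const std::vector<double>& x) {
    double tmp = 0.0;
    for (; j != end; ++j)
        tmp += *j * x[j.index()];
    return tmp;
}

// The row (5, 0, 0, 2) stored densely and sparsely gives the same product.
inline bool sparse_dense_agree() {
    std::vector<double> x;
    x.push_back(1.0); x.push_back(2.0); x.push_back(3.0); x.push_back(4.0);
    const double dense_row[4] = {5.0, 0.0, 0.0, 2.0};
    const std::pair<std::size_t, double> sparse_row[2] = {
        std::make_pair(std::size_t(0), 5.0), std::make_pair(std::size_t(3), 2.0)};
    double d = row_dot(dense_iter(dense_row, 0), dense_iter(dense_row, 4), x);
    double s = row_dot(sparse_iter(sparse_row), sparse_iter(sparse_row + 2), x);
    assert(d == s);
    return d == 13.0 && s == 13.0;
}
```

The sparse traversal visits two elements and the dense one visits four, yet the loop body is identical.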

A Generic Matrix-Vector Multiply

template <class Matrix, class IterX, class IterY>
void matvec::mult(Matrix A, IterX x, IterY y) {
  typename Matrix::row_2Diterator i;        // iterates over the rows
  typename Matrix::RowVector::iterator j;   // iterates within a row
  for (i = A.begin_rows(); not_at(i, A.end_rows()); ++i) {
    typename Matrix::PR tmp = y[i.index()];
    for (j = i->begin(); not_at(j, i->end()); ++j)
      tmp += *j * x[j.index()];
    y[i.index()] = tmp;
  }
}

Transpose, Scaling, and Striding

• Matrix and vector adapters

• An adapter wraps an object and modifies its behavior

// y <- A' * alpha x
matvec::mult(trans(A),
             scale(x, alpha),
             stride(y, incy));
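The adapter idea can be sketched independently of MTL. In the code below, `scaled_view`, `strided_view`, and `add_into` are hypothetical names invented for illustration; the sketch only assumes that an adapter exposes the same element-access interface as the object it wraps.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical scaling adapter: reads x[i] as alpha * x[i] without copying x.
template <class Vec>
class scaled_view {
    const Vec& v_;
    double alpha_;
public:
    scaled_view(const Vec& v, double alpha) : v_(v), alpha_(alpha) {}
    double operator[](std::size_t i) const { return alpha_ * v_[i]; }
    std::size_t size() const { return v_.size(); }
};

// Hypothetical striding adapter: element i maps to v[i * inc].
template <class Vec>
class strided_view {
    Vec& v_;
    std::size_t inc_;
public:
    strided_view(Vec& v, std::size_t inc) : v_(v), inc_(inc) {}
    double& operator[](std::size_t i) { return v_[i * inc_]; }
    std::size_t size() const { return (v_.size() + inc_ - 1) / inc_; }
};

// A generic algorithm that knows nothing about scaling or striding: y <- y + x.
template <class X, class Y>
void add_into(const X& x, Y& y) {
    for (std::size_t i = 0; i < x.size(); ++i)
        y[i] += x[i];
}

inline std::vector<double> adapter_example() {
    std::vector<double> x(2, 1.0);   // x = (1, 1)
    std::vector<double> y(4, 0.0);   // y = (0, 0, 0, 0)
    scaled_view<std::vector<double> > sx(x, 3.0);
    strided_view<std::vector<double> > sy(y, 2);
    assert(sy.size() == x.size());   // both views expose two elements
    add_into(sx, sy);                // adds 3.0 to y[0] and y[2]
    return y;
}
```

Because the scaling and striding live in the wrappers, `add_into` needs no extra parameters such as `alpha` or `incy`.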

The MTL Components

• Iterators

• 1-D Containers

• 2-D Containers

• Orientation Adapter: Row and Column

• Shape Adapter: Banded, Triangle, Symmetric

triangle<row<array2D<dense1D<double> > >, lower>

The MTL Iterators

• For sparse vectors

• For dense vectors

• Strided iterator adapter

• Scaled iterator adapter

• Block iterator

The MTL 1-D Containers

• dense1D: similar to STL's vector class

• sparse1D: index-value pairs

• compressed1D: separate index and value arrays

• scaled1D: adapter class

• strided: adapter class
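The two sparse layouts named in the list differ only in how indices and values are grouped in memory. The sketch below illustrates that difference with standard containers; `pair_vector`, `split_vector`, and `to_split` are hypothetical names, and the actual MTL classes may be organized differently.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Layout in the style of sparse1D: one array of (index, value) pairs.
typedef std::vector<std::pair<std::size_t, double> > pair_vector;

// Layout in the style of compressed1D: parallel index and value arrays.
struct split_vector {
    std::vector<std::size_t> indices;
    std::vector<double> values;
};

// Converts the pair layout to the split layout, preserving element order.
inline split_vector to_split(const pair_vector& p) {
    split_vector s;
    for (std::size_t k = 0; k < p.size(); ++k) {
        s.indices.push_back(p[k].first);
        s.values.push_back(p[k].second);
    }
    assert(s.indices.size() == s.values.size());
    return s;
}

// Hypothetical usage: a vector with nonzeros at positions 2 and 7.
inline split_vector split_example() {
    pair_vector p;
    p.push_back(std::make_pair(std::size_t(2), 0.5));
    p.push_back(std::make_pair(std::size_t(7), 1.5));
    return to_split(p);
}
```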

The MTL 2-D Containers

• array2D: composes 1-D containers into a matrix

• dense2D: contiguous dense matrix

• compressed2D: contiguous sparse matrix

• scaled2D: adapter class

The MTL Orientation Classes

• Row Orientation

– Maps row to major

– Maps column to minor

• Column Orientation

– Maps column to major

– Maps row to minor

• All 2-D methods and typedefs are mapped
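The row/column to major/minor mapping can be sketched with a storage class that knows only "major" and "minor" and two thin wrappers that translate. The classes `dense_storage`, `row_oriented`, and `column_oriented` are hypothetical and cover element access only, whereas the slide says MTL maps all 2-D methods and typedefs.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Orientation-neutral storage: knows only "major" and "minor" dimensions.
class dense_storage {
    std::size_t nminor_;
    std::vector<double> data_;
public:
    dense_storage(std::size_t nmajor, std::size_t nminor)
        : nminor_(nminor), data_(nmajor * nminor, 0.0) {}
    double& at(std::size_t major, std::size_t minor) {
        assert(minor < nminor_);
        return data_[major * nminor_ + minor];
    }
    const std::vector<double>& raw() const { return data_; }
};

// Row orientation: row -> major, column -> minor.
class row_oriented {
    dense_storage s_;
public:
    row_oriented(std::size_t nrows, std::size_t ncols) : s_(nrows, ncols) {}
    double& operator()(std::size_t r, std::size_t c) { return s_.at(r, c); }
    const std::vector<double>& raw() const { return s_.raw(); }
};

// Column orientation: column -> major, row -> minor.
class column_oriented {
    dense_storage s_;
public:
    column_oriented(std::size_t nrows, std::size_t ncols) : s_(ncols, nrows) {}
    double& operator()(std::size_t r, std::size_t c) { return s_.at(c, r); }
    const std::vector<double>& raw() const { return s_.raw(); }
};

// Fills a 2x3 matrix with A(r, c) = 10*r + c through the (row, column)
// interface and returns the underlying memory layout.
template <class Matrix>
std::vector<double> fill_2x3() {
    Matrix A(2, 3);
    for (std::size_t r = 0; r < 2; ++r)
        for (std::size_t c = 0; c < 3; ++c)
            A(r, c) = 10.0 * r + c;
    return A.raw();
}
```

With `row_oriented` the memory holds 0, 1, 2, 10, 11, 12; with `column_oriented` it holds 0, 10, 1, 11, 2, 12. The calling code is the same in both cases.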

The MTL Shape Adapters

• banded, triangle

• symmetric, hermitian

High Performance with C++

• Explicit unrolling and blocking in C++ using the BLAIS

• Use a good optimizing compiler to remove layers of abstraction: lightweight object optimization and inlining

• Follow a set of coding guidelines to ensure the above optimizations can be made, and double check the intermediate C code

• Don't interfere with backend compiler unrolling and scheduling
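Explicit unrolling in C++ is commonly done with template recursion on a compile-time size. The sketch below shows that general technique for a fixed-length dot product; `unrolled_dot` is a hypothetical name and this is not the BLAIS interface.

```cpp
#include <cassert>
#include <cstddef>

// Compile-time unrolled dot product over N elements. The recursion is
// resolved during template instantiation, so an inlining compiler can
// emit straight-line multiply-adds with no loop counter.
template <std::size_t N>
struct unrolled_dot {
    static double apply(const double* x, const double* y) {
        return x[0] * y[0] + unrolled_dot<N - 1>::apply(x + 1, y + 1);
    }
};

// Base case terminates the recursion.
template <>
struct unrolled_dot<0> {
    static double apply(const double*, const double*) { return 0.0; }
};

// Hypothetical usage with a length fixed at compile time.
inline double unrolled_dot_example() {
    const double x[4] = {1.0, 2.0, 3.0, 4.0};
    const double y[4] = {4.0, 3.0, 2.0, 1.0};
    double r = unrolled_dot<4>::apply(x, y);   // 4 + 6 + 6 + 4
    assert(r == 20.0);
    return r;
}
```

Whether the calls are actually flattened depends on the compiler's inlining, which is why the slide stresses checking the generated code.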

[Figure: Dense Matrix-Matrix Performance (UltraSPARC 170E). x-axis: Matrix Size. Series: MTL, Sun Perf Lib, ATLAS, Fortran BLAS, TNT.]

[Figure: Dense Matrix-Matrix Performance (RS6000 590). x-axis: Matrix Size. Series: MTL, ESSL.]

[Figure: Dense Matrix-Vector Performance (UltraSPARC 170E). Series: MTL, Fortran BLAS, Sun Perf Lib, TNT.]

[Figure: Sparse Matrix-Vector Performance (UltraSPARC 170E). x-axis: Average non-zeroes per row. Series: MTL, SPARSKIT, NIST, TNT.]

Conclusion

• High performance linear algebra

• Comprehensive (sparse, dense, etc.) and orthogonal

• Only 25,000 lines of code

• By comparison, 150,000 lines for the Fortran BLAS
