TAU Performance System tau.uoregon

21
TAU Performance System http://tau.uoregon.edu [email protected]

description

TAU Performance System http://tau.uoregon.edu. [email protected]. TAU Parallel Performance System. http://tau.uoregon.edu/ Multi-level performance instrumentation Multi-language automatic source instrumentation Flexible and configurable performance measurement - PowerPoint PPT Presentation

Transcript of TAU Performance System tau.uoregon

Page 1: TAU Performance System tau.uoregon

TAU Performance Systemhttp://tau.uoregon.edu

[email protected]

Page 2: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 2

TAU Parallel Performance System

http://tau.uoregon.edu/ Multi-level performance instrumentation

Multi-language automatic source instrumentation Flexible and configurable performance measurement Widely-ported parallel performance profiling system

Computer system architectures and operating systems Different programming languages and compilers

Support for multiple parallel programming paradigms Multi-threading, message passing, mixed-mode, hybrid

Page 3: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 3

TAU Port to IBM BG/P Supports automatic instrumentation at:

Source level (PDT, tau_instrumentor; KOJAK, opari) MPI

Flexible and configurable performance measurement Support for profiling and tracing Support for PAPI counters on BG/P Uses bgxlC_r, bgxlc_r, bgxlf90_r as compilers

To configure TAU: ./installtau -arch=bgp -mpi -pdt=<dir> -pdt_c++=xlC -papi=<dir> ./tau_validate --html --build bgp >& results.html

Parallel Profile Analysis: Paraprof profile browser PerfDMF profile database Perfexplorer cross-experiment data analysis toolkit

Page 4: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 4

Using TAU on IBM BGP (surveyor.alcf.anl.gov) Choose measurement configuration

% ls /soft/apps/tau/tau_latest/bgp/lib/Makefile.*Makefile.tau-mpi-pdtMakefile.tau-mpi-pdt-traceMakefile.tau-callpath-mpi-pdtMakefile.tau-callpath-mpi-compensate-pdtMakefile.tau-depthlimit-mpi-pdtMakefile.tau-mpi-compensate-pdtMakefile.tau-multiplecounters-mpi-papi-pdtMakefile.tau-multiplecounters-mpi-papi-pdt-traceMakefile.tau-multiplecounters-papi-pdtMakefile.tau-multiplecounters-pthread-papi-pdtMakefile.tau-pdtMakefile.tau-phase-multiplecounters-mpi-compensate-papi-pdtMakefile.tau-phase-multiplecounters-mpi-papi-pdtMakefile.tau-pthread-pdt …

% setenv TAU_MAKEFILE /soft/apps/tau/tau-2.17/bgp/lib/Makefile.tau-mpi-pdt

% set path=(/soft/apps/tau/tau-2.17/ppc64/bin $path) # Front-end binaries

Replace mpixlf90_r with tau_f90.sh and compile your application Use tau_cxx.sh and tau_cc.sh for C++ and C compilers respectively

Page 5: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 5

Using TAU on IBM BGP (surveyor.alcf.anl.gov) Choose measurement configuration

% ls /soft/apps/tau/tau_latest/bgp/lib/Makefile.*Makefile.tau-mpi-pdtMakefile.tau-mpi-pdt-traceMakefile.tau-callpath-mpi-pdtMakefile.tau-callpath-mpi-compensate-pdtMakefile.tau-depthlimit-mpi-pdtMakefile.tau-mpi-compensate-pdtMakefile.tau-multiplecounters-mpi-papi-pdtMakefile.tau-multiplecounters-mpi-papi-pdt-traceMakefile.tau-multiplecounters-papi-pdtMakefile.tau-multiplecounters-pthread-papi-pdtMakefile.tau-pdtMakefile.tau-phase-multiplecounters-mpi-compensate-papi-pdtMakefile.tau-phase-multiplecounters-mpi-papi-pdtMakefile.tau-pthread-pdt …

% setenv TAU_MAKEFILE /soft/apps/tau/tau-2.17/bgp/lib/Makefile.tau-mpi-pdt% set path=(/soft/apps/tau/tau-2.17/ppc64/bin $path) # Front-end binaries

Replace mpixlf90_r with tau_f90.sh and compile your application Use tau_cxx.sh and tau_cc.sh for C++ and C compilers respectively

Visualize performance data with paraprof, pprof, vampir, jumpshot

Page 6: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 6

TAU’s ParaProf 3D Profile Browser: Matmult

Page 7: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 7

Profiling FLASH3 on IBM BG/P

Page 8: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 8

Sedov 2D Auto

• Initial test run did not include a load balanced problem• Small problem: too little work for 1024 processor• Proof of concept to validate porting of tools

Page 9: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 9

PerfExplorer: Cross Experiment Analysis

Page 10: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 10

TAU PerfExplorer: Runtime Breakdown

MPI_Barrier

IO_OUTPUT

Page 11: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 11

Relative Efficiency

Page 12: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 12

Relative Speedup for One Event

Page 13: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 13

TAU’s PerfExplorer: IBM BG/P

Page 14: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 14

TAU Portal

TAU portal supports the FLASH regression testing Allows groups to share profiling data in a secure way Allows users to launch TAU performance tools (paraprof,

perfexplorer) Nightly regression testcases uploaded to the database

automatically SVN checkout each night TAU: http://www.cs.uoregon.edu/research/tau TAU Portal: http://tau.nic.uoregon.edu

Page 15: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 15

Portal: Nightly Performance Regression Testing

Page 16: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 16

TAU Portal: Launch ParaProf/PerfExplorer

Page 17: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 17

PerfExplorer: Regression Testing

Page 18: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 18

PerfExplorer: Limiting Events (> 3% ), Oct 2007

Page 19: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 19

PerfExplorer: Exclusive Time for Events (2007)

Page 20: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 20

ParaProf: 3D Visualization

Page 21: TAU Performance System tau.uoregon

TAU Performance SystemIBM Blue Gene Consortium 21

Support Acknowledgements

Department of Energy (DOE)

Office of Science LLNL, LANL, ASC Argonne National Laboratory University of Chicago

Department of Defense NSF