ReplayEngine 2.0 and TotalView 8.9.1

Rogue Wave Software has simultaneously released ReplayEngine 2.0 and TotalView 8.9.1. This suite of products simplifies debugging and memory analysis, especially for applications that are data-intensive, multi-threaded, or distributed across a network or cluster. With this release, ReplayEngine supports native InfiniBand on both Mellanox and QLogic hardware, opening up its use on large-scale HPC clusters, and TotalView expands its CUDA support to include SDKs 3.1 and 3.2, with CUDA 4.0 support in progress.

ReplayEngine 2.0, the reverse debugging add-on to TotalView, now supports debugging applications that run over high-speed InfiniBand networks, making ReplayEngine available to developers working with large clusters. ReplayEngine supports native transport mechanisms with Mellanox and QLogic InfiniBand hardware on MVAPICH 1.2, MVAPICH2 1.5 and 1.6, Open MPI 1.4.2, and Intel MPI 4.0.

‘Now, developers creating complex parallel applications for deployment on high performance clusters have the option of using reverse debugging on these systems,’ said Chris Gottbrath, principal product manager at Rogue Wave Software. ‘With ReplayEngine developers can allow their program to run until the point of failure, then step backward through the program execution, making hard-to-reproduce bugs easier to find.’

TotalView 8.9.1 lets developers troubleshoot applications built with versions 3.1 and 3.2 of the CUDA Toolkit. This release features support for CUDA function calls on the stack (in addition to inlined functions), host pinned memory regions, and CUDA contexts. It handles exceptions in CUDA code, displays variables in GPU hardware registers, and adds CLI (command line interface) commands for CUDA functionality. In addition to CUDA support, TotalView 8.9.1 includes expanded platform support and enhancements to the multi-dimensional array display, parallel backtrace features, and TVScript.

