
Sandia Labs supercomputer the fastest Arm-based TOP500 system

Astra, the world’s fastest Arm-based supercomputer according to the TOP500 list, has achieved a speed of 1.529 petaflops, placing it 203rd on a ranking of top computers announced at the SC18 conference in Dallas.

The supercomputer is also ranked 36th on the High-Performance Conjugate Gradients benchmark, co-developed by Sandia and the University of Tennessee Knoxville, with a performance of 66.942 teraflops. (One thousand teraflops equals 1 petaflop.)
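To put the two benchmark figures on the same scale, the conversion above can be worked through in a few lines. This is only an illustrative sketch using the numbers quoted in this article; the variable names are made up for the example, and the resulting share (about 4.4 percent) is a simple ratio, not an official metric.

```python
# Figures quoted in the article; variable names are illustrative only.
TERAFLOPS_PER_PETAFLOP = 1_000

top500_petaflops = 1.529   # Astra's TOP500 result
hpcg_teraflops = 66.942    # Astra's HPCG result

# Convert the TOP500 figure to teraflops so both results share a unit.
top500_teraflops = top500_petaflops * TERAFLOPS_PER_PETAFLOP

# Share of the TOP500 figure that Astra sustains on HPCG.
hpcg_fraction = hpcg_teraflops / top500_teraflops

print(f"TOP500 result: {top500_teraflops:,.0f} teraflops")
print(f"HPCG as a share of the TOP500 result: {hpcg_fraction:.1%}")
```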

The conjugate gradients benchmark uses computational and data access patterns that match the simulation codes used by the National Nuclear Security Administration (NNSA) more closely than the Linpack benchmark behind the TOP500 ranking does.

'These preliminary results demonstrate that Arm-based processors are competitive for high-performance computing. They also position Astra as the world leader in this architecture category,' said Sandia computer architect James Laros, Astra project lead. 'We expect to improve on these benchmark results and demonstrate the applicability of this architecture for NNSA’s mission codes at supercomputer scale.'

Astra is one of the first supercomputers to use processors based on Arm technology. The machine’s success means the supercomputing industry may have found a new potential supplier of supercomputer processors, since Arm designs are available for licensing.

Arm processors previously had been used exclusively for low-power mobile computers, including cell phones and tablets. A single Astra node is roughly one hundred times faster than a modern Arm-based cell phone, and Astra has 2,592 nodes.

Less than a month after hardware delivery and system installation, Astra reached its first goal of running programs concurrently on thousands of nodes.

The next steps include transferring mission codes to Astra from existing architectures used to support the NNSA mission. While this step can be challenging for Astra’s new architecture and compilers, the real effort will likely involve a continuous cycle of performance analysis, optimization and scalability studies, which evaluate performance on larger and larger node counts to achieve the best possible performance on this architecture.

'We expect that the additional memory bandwidth provided by this node architecture will lead to additional performance on our mission codes, which are traditionally memory bandwidth limited,' said Laros. 'We ultimately need to answer the question: is this architecture viable to support our mission needs?'

The Astra supercomputer is the first deployment of Sandia’s larger Vanguard program. Vanguard is tasked with evaluating the viability of emerging high-performance computing technologies in support of the NNSA’s mission to maintain and enhance the safety, security and effectiveness of the U.S. nuclear stockpile.

Astra was built and integrated by Hewlett Packard Enterprise and comprises 5,184 Cavium ThunderX2 central processing units, each with 28 processing cores based on the Armv8 64-bit core architecture. 'While being the fastest in the world is not the goal of Astra or the Vanguard program in general,' said Laros, 'Astra is indeed the fastest Arm-based supercomputer today.'
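The system totals implied by these figures can be checked with some quick arithmetic. The sketch below uses only numbers quoted in this article (2,592 nodes, 5,184 CPUs, 28 cores per CPU, 1.529 petaflops); the variable names are illustrative, and the per-node figure is a plain average of the TOP500 result, not a measured single-node benchmark.

```python
# Figures quoted in the article; variable names are illustrative only.
nodes = 2_592
cpus = 5_184
cores_per_cpu = 28
top500_petaflops = 1.529

# Each node holds the same number of CPUs (integer division is exact here).
cpus_per_node = cpus // nodes

# Total Arm cores across the whole machine.
total_cores = cpus * cores_per_cpu

# Average share of the TOP500 result per node, in teraflops.
teraflops_per_node = top500_petaflops * 1_000 / nodes

print(f"CPUs per node: {cpus_per_node}")
print(f"Total cores: {total_cores:,}")
print(f"Average TOP500 teraflops per node: {teraflops_per_node:.2f}")
```

On these numbers, Astra has two CPUs per node and 145,152 cores in total, averaging roughly 0.59 teraflops per node on the TOP500 run.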
