GRC and Nvidia develop immersion-cooled supercomputer
GRC (Green Revolution Cooling), a provider of single-phase immersion cooling for data centres, has announced its joint project with Nvidia to help power a GPU-intensive computing subsystem for the Texas Advanced Computing Center’s (TACC) Frontera Supercomputer, the world’s largest academic supercomputer.
'GRC is proud of its long history with TACC and we’re delighted to have been able to collaborate once again with Nvidia to help power the next generation of academic research,' said Peter Poulin, CEO of GRC. 'We look forward to strengthening our partnerships with both Nvidia and TACC to continue to provide support for this important academic endeavour.'
First announced in 2018 and built earlier this year, the new supercomputer will enable the nation’s academic researchers to make important discoveries in all fields of science, from astrophysics to zoology, and further establishes The University of Texas at Austin’s leadership in advanced computing. The GRC and Nvidia subsystem is optimised for single-precision performance for TACCs multi-user environment.
'GRC and Nvidia have developed a system that maximizes performance and efficiency,' said Dan Stanzione, executive director, TACC at the University of Texas-Austin. 'GRC’s immersion cooling will enable us to operate in a high-intensity computing environment without having to worry about performance, reliability or thermal issues.'
GRC worked closely with TACC and Nvidia to design and build custom servers that leverage the cooling capacity of GRC’s ICEraQ solution, enabling TACC to maximise GPU density. Nvidia’s enterprise-class Tensor Core GPUs provide the computing power behind the system’s accelerated computing nodes. GRC’s ICEraQ system was chosen to support the GPU-intensive workloads due to its simplicity, virtually limitless cooling capacity, and energy efficiency.
Given the high density of the Nvidia GPUs, motherboard and other components from Supermicro, and Mellanox InfiniBand cards for high-speed networking, using air-based cooling would have been complex and expensive. All available liquid-cooling options were considered and GRC’s liquid-immersion cooling was chosen due to its ability to increase GPU density without having to plumb water lines to every individual heat source (GPUs and CPUs) within each server. By immersing complete servers in GRC’s ElectroSafe dielectric coolant, GRC’s ICEraQ delivers a consistent thermal environment for every server component, while dramatically reducing the required cooling energy.
'We’re excited to collaborate with GRC and TACC to support the fifth fastest supercomputer in the world,' said Fred Allman, Director of Worldwide Supercomputing at Nvidia. 'Nvidia and GRC have a long history of partnering, starting with the TSUBAME KFC system which was No. 1 on the Green500 ranking in 2013 and 2014.'
TACC first partnered with GRC in 2009 and validated the cost-effectiveness of its liquid-immersion cooling solution in both energy and operations. During the initial engagement, TACC found that it only took about 300 watts to cool 10 kilowatts of power, yielding a 1.03 PUE even in the middle of a Texas summer.