HPE to Build Discovery Exascale Supercomputer and Lux AI Cluster for Oak Ridge National Lab

HPE has been selected to build two new systems for the U.S. Department of Energy’s Oak Ridge National Laboratory: Discovery, the next-generation exascale supercomputer following ORNL’s Frontier, and “Lux,” a new AI cluster that will support advanced AI and machine learning on a cloud-like platform.

Discovery will be based on the new HPE Cray Supercomputing GX5000, HPE’s next-generation supercomputing platform for leadership-class systems, which uses a unified AI and high-performance computing (HPC) architecture to streamline operations site-wide and across distributed clusters. It will be augmented by the new DAOS-based HPE Cray Supercomputing Storage Systems K3000, a storage option for the HPE Cray Supercomputing GX5000. Discovery will deliver new capabilities for AI, HPC and quantum computing and is expected to increase the productivity of select applications tenfold, enabling scientists to accelerate breakthroughs in areas such as precision medicine, cancer research, nuclear energy and aerospace.

“When we built Frontier for Oak Ridge National Laboratory and ushered in exascale, we achieved the pinnacle in supercomputing history and a triumph for the U.S.,” said Antonio Neri, President and CEO at HPE. “We are proud to build on that leadership innovation and strong public-private partnership with the U.S. Department of Energy, ORNL and AMD, to build Discovery and Lux, accelerating the next era of scientific discovery and AI innovation.”

Lux will be a dedicated AI system based on the direct liquid-cooled HPE ProLiant Compute XD685 and feature AMD Instinct MI355X GPUs, AMD EPYC CPUs and AMD Pensando networking. Designed to bolster access to AI resources, Lux will provide researchers across the U.S. with cloud-like access to a sovereign AI factory specifically resourced for training and inference.

Discovery will build on the exascale computing capabilities first developed for the HPE-built Frontier supercomputer at ORNL. As a result, it is expected to open new lines of research across scientific fields while advancing the lab’s mission of innovation and security.

“We are excited for Discovery and Lux to expand the science that researchers are able to do at Oak Ridge,” said Bronson Messer, Director of Science for the Oak Ridge Leadership Computing Facility. “Discovery will set the stage for a new level of converged HPC, AI and quantum computing capabilities, providing additional insight in connection with other systems, while Lux greatly expands researcher access to dedicated AI resources. As a result, we expect both systems will contribute to a paradigm shift in our productivity, reaching unparalleled gains in various, critical areas of scientific research and leadership.”

“For more than a decade, AMD and HPE have partnered to push the limits of high-performance computing, delivering solutions that enable discoveries and change the world,” said Dr. Lisa Su, chair and CEO, AMD. “Together with Oak Ridge National Laboratory, we are advancing the next generation of AI systems with Discovery and Lux—empowering researchers to accelerate innovation and strengthen America’s leadership in science and technology.”

Inside Discovery: the next-generation exascale supercomputer

Discovery’s scientific advances will rest on the HPE Cray Supercomputing GX5000, unveiled today. Building on 50 years of supercomputing innovation dating back to the Cray-1, announced in 1975, HPE has designed its next-generation supercomputing infrastructure for the converged AI and HPC era.

The HPE Cray Supercomputing GX5000 is purpose-built for exascale and features state-of-the-art end-to-end capabilities across CPUs, GPUs, accelerators, networking, software, storage and liquid cooling. By leveraging the new architecture, Discovery will deliver:

  • Greater performance with optimized space – The new platform is purpose-built to scale to exascale performance with greater density than the previous generation, using 25 percent less data center space per rack.
  • High-performance interconnect with HPE Slingshot – The next-generation HPE Slingshot gives Discovery a modern, high-performance interconnect that delivers high bandwidth and low latency for HPC, machine learning and analytics applications.
  • Industry-first HPC DAOS storage performance – Augmented by the new HPE Cray Supercomputing Storage Systems K3000, Discovery will have 300 percent more input/output operations per second (IOPS) per storage rack compared to Frontier to enable AI applications to run with higher productivity. As the industry’s first factory-built storage system with embedded Distributed Asynchronous Object Storage (DAOS) open source software, the HPE Cray Supercomputing Storage Systems K3000 is a cost-effective, all-flash storage system that complements the Lustre file system-based HPE Cray Supercomputing Storage Systems E2000, which will also be featured in Discovery.
  • Next-generation, liquid-cooled and accelerated compute – Discovery will feature next-generation AMD EPYC processors, codenamed “Venice,” with AMD Instinct MI430X GPUs, which offer advanced performance and accuracy for modeling, simulation and AI projects. Leveraging HPE’s 50 years of liquid cooling innovation, Discovery’s compute infrastructure will be fully liquid-cooled to optimize energy efficiency and cost-effectiveness in supercomputing environments.

HPE delivers end-to-end solutions and services to customers, backed by its AI and HPC expertise. HPE supercomputing services help enhance outcomes through a unified approach to managing an organization’s infrastructure and applications, with a focus on core business needs and continuous innovation.
