Lenovo, Intel and LRZ announce the phase two of the LRZ’s SuperMUC-NG supercomputer

Lenovo Infrastructure Solutions Group, Intel and the Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences and Humanities, announce Phase Two of the LRZ’s SuperMUC-NG supercomputer. The system will deliver high performance integrated solutions to the LRZ user community, utilizing artificial intelligence to implement advanced simulations, modelling, and data analysis that will accelerate research to help solve humanity’s greatest challenges.

With funding from the Free State of Bavaria and the German Federal Ministries of Education and Research, Phase Two sees the expansion of SuperMUC-NG, part of the Gauss Centre for Supercomputing (GCS), so as to continue as one of the fastest, most energy-efficient supercomputers in the world.

AI enabling Integrated Solutions

Since SuperMUC-NG Phase One was launched, practitioners have used the supercomputer not only for traditional simulation and modelling, but also to automate image and pattern recognition in planet observations, climate data from satellites, medical visuals and health records, and data demographics. Given the successful utilisation of SuperMUC-NG in these projects, the demand for high performance data analytics, machine learning and fast memory performance has further increased.

To meet these demands and ensure researchers are supported, SuperMUC-NG will now be enhanced with next-generation Intel Xeon Scalable processors (codenamed Sapphire Rapids) and Intel’s upcoming HPC GPUs based on the Xe HPC architecture, codenamed “Ponte Vecchio”.

Phase Two will also use distributed asynchronous object storage (DAOS), leveraging 3rd Gen Intel Xeon Scalable processors (codename “Ice Lake”) integrated into Lenovo’s ThinkSystem SR630 V2 platform. DAOS provides 1 petabyte of data storage, and will enable fast throughput of large data volumes, while the system architecture can deliver highly scalable, compute and data-intensive workloads and artificial intelligence applications. Overall, the SuperMUC-NG Phase Two compute nodes will deliver four times higher performance per Watt (High Performance Linpack) than Phase One.

Ensuring a sustainable approach

The enhancements made in Phase Two will ensure SuperMUC-NG is now capable of performing additional tasks in a way that’s as energy-efficient as possible. The key to this is the integration of 240 Intel compute nodes into Lenovo’s ThinkSystem SD650 leveraging Neptune warm water cooling and connected to the DAOS storage system via a high-speed network. Lenovo’s innovative Neptune direct water-cooling technology removes approximately 90% of the heat from the compute system, reducing overall energy consumption, significantly increasing overall efficiency and ultimately allowing the processors to perform at their peak.

In addition, the components for SuperMUC-NG Phase Two will be manufactured within Europe, in Lenovo’s new dedicated manufacturing facility in Hungary, to help further improve the eco-footprint of the project’s supply chain.

Phase Two kick off

LRZ will receive the DAOS storage system in the last quarter of 2021, and the compute system will follow in the 2nd quarter of 2022. The LRZ team are preparing their user community for Phase Two’s enhancements by offering support and consultations for adapting and optimizing codes and AI algorithms, and giving researchers access to GPU systems specialized in AI applications. The LRZ training program also offers a wide variety of machine and deep learning courses, educating users in how they can adapt existing algorithms or develop and train their own.

SuperMUC-NG is currently listed 15th in the Top 500 supercomputers in the world.