NEMO Power Maintenance

Published: 22 Feb 2022 by HPC Team Freiburg

Inspection of the data center power supply.

Date: 03.03.2022, 6:00 - 18:00 CET

The entire NEMO cluster will be shut down. No computing is possible during the maintenance, so please submit your jobs in time. Note that a job can only start if its requested walltime fits into the time remaining before the maintenance begins. Jobs that cannot start beforehand will remain queued and will be processed after the maintenance. However, if problems occur, the queue may have to be emptied. The operating system of the cluster will be updated during the maintenance.
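As a rough illustration of the walltime rule, the following sketch checks whether a job started at a given time would finish before the maintenance begins. It is not the scheduler's actual logic. The function names, the `HH:MM:SS` / `DD:HH:MM:SS` walltime format, and the choice to treat a job that ends exactly at 6:00 as fitting are assumptions made for this example.

```python
from datetime import datetime, timedelta, timezone

# CET is UTC+1; daylight saving time is not in effect on 3 March.
CET = timezone(timedelta(hours=1))

# Start of the maintenance window as announced above.
MAINTENANCE_START = datetime(2022, 3, 3, 6, 0, tzinfo=CET)


def parse_walltime(text):
    """Parse 'HH:MM:SS' or 'DD:HH:MM:SS' into a timedelta (assumed format)."""
    parts = [int(p) for p in text.split(":")]
    if len(parts) == 3:
        parts = [0] + parts  # no day field given
    if len(parts) != 4:
        raise ValueError(f"unsupported walltime format: {text!r}")
    days, hours, minutes, seconds = parts
    return timedelta(days=days, hours=hours, minutes=minutes, seconds=seconds)


def fits_before_maintenance(start, walltime, maintenance_start=MAINTENANCE_START):
    """Return True if a job starting at `start` ends no later than the maintenance start."""
    return start + walltime <= maintenance_start


if __name__ == "__main__":
    start = datetime(2022, 3, 2, 12, 0, tzinfo=CET)
    for requested in ("12:00:00", "18:00:00", "1:00:00:00"):
        ok = fits_before_maintenance(start, parse_walltime(requested))
        print(requested, "can start" if ok else "must wait until after maintenance")
```

In practice this means that requesting a shorter walltime as the maintenance date approaches increases the chance that a job still runs beforehand.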

Update 03.03.2022:

The maintenance of the uninterruptible power supply was completed successfully. In addition, firmware updates were applied to all Ethernet switches. On the NEMO worker nodes, the OS was updated to the latest version (March security update, CentOS 7.9, Rev. 17).
