
HPCC users, The Slurm update has completed and the queues have been resumed. For those of you who already have an active session on the cluster, you should do the following: module reload slurm If you encounter any problems using the queuing system, let me know. Also, dgx2-1 is back online, and I’ll try to have dgx2-2 and dgx2-5 back online by tomorrow morning. Rob From: Yelle, Robert Brian <robert.yelle@oregonstate.edu> Date: Wednesday, May 12, 2021 at 3:37 PM To: cluster-users@engr.orst.edu <cluster-users@engr.orst.edu> Subject: CoE HPC News, May 12 Edition: Urgent updates HPCC users, Three of the DGX2 systems (dgx2-1, dgx2-2 and dgx2-5) are currently offline for urgent firmware updates. They should be available again later this week. Also, the Slurm scheduler will be offline briefly tomorrow morning (May 13) between 7 and 9am for an urgent update. The cluster will remain online and submit nodes will still be accessible, but Slurm commands issued may fail and no new jobs will be started during this time. Currently running jobs should not be impacted by this update. Rob Yelle HPC Manager