Cluster users,
A couple of the KEC chillers have been fixed, and cooling has been restored to the datacenter. The cluster is up and currently running at limited capacity, but it should be back to full capacity by tomorrow. If you have any questions or problems getting on, let me know.
Rob Yelle
From: Yelle, Robert Brian <robert.yelle@oregonstate.edu>
Date: Sunday, May 15, 2022 at 10:36 AM
To: cluster-users@engr.orst.edu <cluster-users@engr.orst.edu>
Cc: 'staff@engr.orst.edu' <staff@engr.orst.edu>
Subject: CoE HPC Cluster down
HPCC users,
The HPC cluster has been brought down due to a failure in the KEC cooling system. Facilities is working on the problem. The cluster will not be brought back up until the problem has been resolved and cooling has been restored to the datacenter. It is not yet known when that will happen, but I’ve been informed that tomorrow is the earliest the cluster can come back up. I will keep you all updated and let you know when the cluster is back online. You may also check the cluster status on the website below:
https://it.engineering.oregonstate.edu/hpc/hpc-cluster-status-and-news
Rob Yelle
HPC Manager