CoE HPC News, June 29 2023: AC maintenance

HPCC users, Maintenance was performed on the KEC chillers earlier today. The KEC server room temperature is still quite warm since the maintenance, but is expected to cool down slowly over time. The cluster resources are currently offline until the server room temperature returns to normal. Jobs currently running in the cluster will continue, but in the event that the temperature rises, I may need to perform an emergency shutdown of parts or most of the cluster, which may result in terminating those running jobs. I will post updates of the cluster status in the link below: https://it.engineering.oregonstate.edu/hpc/hpc-cluster-status-and-news<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fit.engineering.oregonstate.edu%2Fhpc%2Fhpc-cluster-status-and-news&data=05%7C01%7Ccluster-users%40engr.orst.edu%7C389823e9b8df4a2e0b0708db78ff707e%7Cce6d05e13c5e4d6287a84c4a2713c113%7C0%7C0%7C638236810718518726%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=9ghrVbmsTXujVDSRi477F3pEW6ijG1e%2FIcNzLAUa2po%3D&reserved=0> Rob Yelle HPC Manager
participants (1)
-
Yelle, Robert Brian