HPCC users,

 

The cluster is back online, but at limited capacity.  Additional resources will become available throughout the week as maintenance progresses.

 

 

Submit node ssh warnings

 

Please note that the server host keys have changed on two of the submit nodes (submit-a and submit-b), so new ssh connections may result in a security warning like "Remote Host Identification has changed" or "Host Key verification failed".  It is okay to continue the connection.  If you are using macOS or Linux and are having trouble connecting, try the following:

ssh-keygen -R submit-a.hpc.engr.oregonstate.edu

ssh-keygen -R submit-b.hpc.engr.oregonstate.edu

ssh-keygen -R submit.hpc.engr.oregonstate.edu

then try to connect via ssh again.
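
If you prefer to run the three commands above in one step, here is a minimal sketch of a shell function that loops over the same host names.  The function name remove_stale_submit_keys and its optional file argument are illustrative only and are not something provided on the cluster; it assumes your host keys are stored in the usual ~/.ssh/known_hosts file unless you pass a different path.

```shell
# Illustrative helper (not provided by the cluster): removes the recorded
# host keys for the submit nodes from a known_hosts file.
# Usage: remove_stale_submit_keys [path-to-known_hosts]
remove_stale_submit_keys() {
    # Assumption: keys live in the default per-user file unless a path is given.
    known_hosts_file="${1:-$HOME/.ssh/known_hosts}"

    for host in submit-a submit-b submit; do
        # -R deletes every key stored for the named host;
        # -f selects which known_hosts file to edit.
        ssh-keygen -R "$host.hpc.engr.oregonstate.edu" -f "$known_hosts_file"
    done
}
```

After defining the function, run remove_stale_submit_keys with no arguments and then connect via ssh as usual.  ssh-keygen saves the previous contents of the file alongside it with a .old suffix, so the change can be undone if needed.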

 

New CUDA versions

 

CUDA versions 11.5 and 11.6 have been installed, and NVIDIA drivers have been upgraded on most GPU nodes to support these latest CUDA versions.  However, drivers for CUDA 11.5+ have not yet been released for the DGX systems.  The default CUDA module has been changed to 11.4 to reflect the maximum CUDA version supported by the DGX systems.

 

For more cluster news and status updates, check out the link below:

 

https://it.engineering.oregonstate.edu/hpc/hpc-cluster-status-and-news

 

Cheers,

 

Rob Yelle

HPC Manager