Cluster users,
Survey
If you haven’t had a chance yet, please take a few minutes to complete the survey below, your feedback is important.
https://oregonstate.qualtrics.com/jfe/form/SV_290Wnkkv7IFqSW2
Cluster status
Most of the cluster has been upgraded and is back online. The following hosts are still offline until further notice:
submit-a (please use submit-b or submit-c instead)
dgx2-[3,4]
dgxs-[1-3]
cn-h-[5-8]
The HPC portal is back to a “mostly working” state, but some interactive applications do not work well on the portal at present. The portal is still being worked on. In the meantime, if your desired application
does not work well, try using the Advanced COEHPC Desktop and launch the application from there.
Submit node hostkeys
When you access the upgraded nodes, you may be met with the following message:
"host key for submit-b.hpc.eng.oregonstate.edu has changed and you have requested strict checking. Host key verification failed."
Or something similar. To address this, please remove your old host keys as follows, e.g.:
ssh-keygen -R submit-b.hpc.engr.oregonstate.edu
Repeat this for all submit nodes, and for any other HPC hosts that you need ssh access for. After that, try connecting again and accept the new host keys and you should be set.
New HPC storage
All HPC share data has been migrated to our new DDN storage appliance, still located on /nfs/hpc/share, and all upgraded nodes are now using this storage.
This directory is available on the HPC cluster only and is not visible on the Flips (access.engr.oregonstate.edu)! I will mount the old share on /mnt/share on the submit nodes for a week or so to give people a chance to check the current HPC shares
against the old one, and to copy files or directories that may be missing.
If anyone has any questions or problems, let me know. For up-to-date status on the cluster, check out the link below:
https://it.engineering.oregonstate.edu/hpc/hpc-cluster-status-and-news
Rob Yelle
HPC Manager