Stampede2 Maintenance 9 April 2019

Posted by David Littrell on Apr 3, 2019 5:48:49 PM

Updated on Apr 9, 2019 9:49:33 PM

Stampede2 is back in production.

Updated on Apr 9, 2019 7:17:03 PM

The Stampede2 maintenance has been extended while we continue to resolve the remaining hardware issues on a storage server in our cluster. We will announce via user news when service is restored and Stampede2 is back in production.

Original Posting

Stampede2 will be unavailable from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 9 April 2019. We need to perform hardware repairs on a storage server after encountering problems today.

If you submit a job before the maintenance and the time you request exceeds the time remaining until the maintenance begins, your job will not start until the maintenance is over. While the job waits, the squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available"), and the showq utility will list the job as "BLOCKED" and report its status as "WaitNod" ("Waiting for Nodes"). Note that the hours leading up to the maintenance are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be short.
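
The rule above can be sketched as a small calculation. This is only an illustration, not the scheduler's actual logic: the function names are hypothetical, all times are assumed to be Central Time, and the check assumes the job could start immediately at the given time, which ignores any queue wait.

```python
from datetime import datetime, timedelta

# Start of the maintenance window from this announcement:
# 8 a.m. (CT) on Tuesday, 9 April 2019. Naive datetime, assumed Central Time.
MAINTENANCE_START = datetime(2019, 4, 9, 8, 0)


def fits_before_maintenance(start, walltime, maintenance_start=MAINTENANCE_START):
    """Return True if a job starting at `start` with the requested
    `walltime` would finish no later than the maintenance start.

    Hypothetical helper: a job whose requested time exceeds the time
    remaining is held until after the maintenance.
    """
    return start + walltime <= maintenance_start


def max_walltime(start, maintenance_start=MAINTENANCE_START):
    """Return the longest time request that still fits before the
    maintenance, or zero if the maintenance has already begun."""
    remaining = maintenance_start - start
    return max(remaining, timedelta(0))


if __name__ == "__main__":
    evening_before = datetime(2019, 4, 8, 21, 0)  # 9 p.m. the night before
    print(fits_before_maintenance(evening_before, timedelta(hours=2)))
    print(fits_before_maintenance(evening_before, timedelta(hours=12)))
    print(max_walltime(evening_before))
```

In this sketch, a 2-hour request made at 9 p.m. on 8 April fits in the 11 hours remaining, while a 12-hour request does not and would wait until the maintenance is over.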