Frontera /scratch1 filesystem Thursday 30 April 2020

Posted by Sergio Leal on Apr 30, 2020 9:00:20 AM

Updated on Apr 30, 2020 12:57:53 PM

The offline storage target for the /scratch1 filesystem has been restored and activated on all logins and compute nodes. The Frontera queues have been reopened and the system back in normal operation again.

Original Posting

Frontera's /scratch1 filesystem currently has one of its Lustre storage targets offline due to errors on it and we are working with the vendor to restore it, but it might take some time.   For now, the compute queues have been closed and the target has been deactivated on the login nodes to prevent hangs when trying to use the filesystem, but users will get errors if they attempt to access files residing on the offline target.  If the repair will take more than a few hours, we will deactivate the target on the compute nodes and re-open the queues, however, jobs will fail if they try to access a file on that target.  New files can be created on the other storage targets for /scratch1 without any errors.  

 An update to this announcement will be provided once we have more details and/or time estimates for the repair.