Lonestar5 User News

Emergency Network Maintenance Wednesday June 9th 2021

Posted by Mark Brueschke on Jun 8, 2021 3:55:50 PM

Emergency network maintenance will be carried out between midnight and 2:00 AM (CDT).

During this window, users may experience issues connecting into and out of TACC systems, especially login nodes. Jobs scheduled to run within clusters should be unaffected.

Lonestar5 Status

Posted by Mark Brueschke on May 3, 2021 11:15:39 AM

Updated on May 14, 2021 3:45:11 PM

Users of Lonestar5 are now encouraged to migrate ALL data. Users can move data from /work to /work2 and should also back up any important data from /home and /scratch.


Users will not be able to run jobs at this time.

Original Posting

Lonestar5 has hit a substantial failure mode that has affected the core infrastructure. Admins will continue to work to restore service.  The queues will remain unavailable until we are able to restore the compute nodes.  At this time, there is no ETA on the availability of LS5 for compute nodes.
 
In the interim, we've restored filesystem access and have opened the login nodes to allow users to move data between filesystems and use Stockyard (/work2 during the migration period) to shift data between TACC resources. We will also shift the timing of the Stockyard migration dates to give users extra time to migrate due to the outage of LS5. Please note that /scratch does not have a quota but /work2 does.

Please submit any questions you may have via the TACC User Portal.

https://portal.tacc.utexas.edu/tacc-consulting

WORK File System Migration and Upgrade

Posted by Tim Cockerill on Apr 8, 2021 11:59:41 AM

Updated on May 10, 2021 11:36:45 AM

The deadline dates have been extended again; /work will now become read-only as of 18 May 2021. All other dates have been extended as well, and have been updated in the detailed timeline in the /work Migration and Transition Guide.

Updated on Apr 20, 2021 10:13:31 AM

The deadline dates have been extended by 2 weeks; /work will now become read-only as of 4 May 2021. All other dates have been extended as well, and have been updated in the detailed timeline in the /work Migration and Transition Guide.


Original Posting

ACTION REQUIRED

We are pleased to announce the availability of a new file system (/work2) on new upgraded hardware. Users must migrate any current data they wish to keep from /work to /work2. Users should complete the migration before 17 May 2021, when /work2 becomes the only work file system mounted on compute resources. As of 19 April 2021, /work will become read-only, and thus all file writes will need to go to /work2. Note that /work is NOT backed up, so you MUST migrate your data to /work2 if you wish to keep it. A more detailed timeline and technical details on how to most efficiently accomplish migration can be found in the /work Migration and Transition Guide.

Prior to migrating data, users are encouraged to review their data and identify anything that can be deleted. The Ranch Archive Facility is also available for data that no longer needs to be in /work but must be retained long term.
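
As one way to carry out the review suggested above, the following is a minimal Python sketch that reports the largest immediate subdirectories of a directory, so that candidates for deletion or archiving are easier to spot before copying anything. It is an illustration only, not a TACC-provided tool: the function names tree_size_bytes and largest_subdirs are hypothetical, and the /work Migration and Transition Guide remains the authoritative source for how to migrate.

```python
import os
from pathlib import Path


def tree_size_bytes(root):
    """Sum the sizes of regular files under root (symlinks are skipped)."""
    total = 0
    # os.walk does not descend into symlinked directories by default.
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                if not os.path.islink(path):
                    total += os.path.getsize(path)
            except OSError:
                # The file vanished or is unreadable; skip it.
                continue
    return total


def largest_subdirs(root, limit=10):
    """Return (name, bytes) pairs for the largest immediate subdirectories."""
    sizes = []
    for entry in Path(root).iterdir():
        if entry.is_dir() and not entry.is_symlink():
            sizes.append((entry.name, tree_size_bytes(entry)))
    # Largest first, so the best candidates for cleanup appear at the top.
    sizes.sort(key=lambda item: item[1], reverse=True)
    return sizes[:limit]
```

Calling largest_subdirs on your own directory under /work (the exact path is an assumption here and depends on your account) returns a short list that can be printed or compared against the space available to you on /work2. Walking a very large tree can take a while and puts load on the file system, so it is best run once rather than repeatedly.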

Lonestar5 Degraded, Tuesday 27 April 2021

Posted by Mark Brueschke on Apr 27, 2021 2:15:44 PM

Lonestar5 is down due to an unscheduled network event. Administrators are working on restoring services as quickly as possible.

Please submit any questions you may have via the TACC User Portal.
https://portal.tacc.utexas.edu/tacc-consulting

Updated on Apr 28, 2021 9:32:32 AM

Lonestar5 remains offline due to the issues that began yesterday. Administrators are working with vendors to help resolve this.

Please submit any questions you may have via the TACC User Portal.

Original Posting

Lonestar5 is down due to an unscheduled network event. Administrators are working on restoring services as quickly as possible.

Please submit any questions you may have via the TACC User Portal.
https://portal.tacc.utexas.edu/tacc-consulting

Lonestar5 Degraded, Thursday 4 March 2021 18:57 CST / CDT

Posted by Alex Ferrier on Mar 4, 2021 7:03:38 PM

Lonestar5 has had a hardware failure that's impacting some core functionality. Administrators are working on restoring services as quickly as possible.


Please submit any questions you may have via the TACC User Portal.

Updated on Mar 4, 2021 9:09:23 PM

LS5 is now back in production and queues have been reopened.

Updated on Mar 4, 2021 7:06:57 PM

Lonestar5 is now in unscheduled maintenance. Administrators are working to restore the system as soon as possible.

Original Posting

Lonestar5 has had a hardware failure that's impacting some core functionality. Administrators are working on restoring services as quickly as possible.


Please submit any questions you may have via the TACC User Portal.

Lonestar5 Maintenance Tuesday 23 February 2021

Posted by Matthew Edeker on Feb 8, 2021 10:17:31 AM

Lonestar5 will not be available from 8:30 AM to 5:00 PM (CST) on Tuesday, 23 February 2021. Hardware maintenance will be performed during this time.


Please submit any questions you may have via the TACC User Portal. 


Updated on Feb 23, 2021 4:49:33 PM

LS5 is back in production and the queues are open.

Original Posting

Lonestar5 will not be available from 8:30 AM to 5:00 PM (CST) on Tuesday, 23 February 2021. Hardware maintenance will be performed during this time.


Please submit any questions you may have via the TACC User Portal. 


Queues down on all Large Scale Systems Monday 15 February 2021

Posted by Matthew Edeker on Feb 15, 2021 11:00:35 AM

Updated on Feb 19, 2021 3:32:48 PM

All TACC large scale production systems have resumed full operations after powering down and load shedding to help with the Texas power grid emergency.

Original Posting

Because of the state of emergency in the City of Austin caused by extreme weather, and its impact on power in Austin, TACC is closing all queues on all Large Scale Systems. This includes Frontera, Stampede2, Lonestar5, Longhorn, and all other TACC resources. Queues will re-open as soon as possible when the emergency subsides.

Please submit any questions you may have via the TACC User Portal. 

Stockyard Status Wednesday 27 January 2021

Posted by Matthew Edeker on Jan 27, 2021 11:03:51 AM

Updated on Jan 27, 2021 11:55:27 AM

Stockyard recovery process has been completed and queues have been opened. 

Original Posting

We're noticing performance issues with Stockyard and have closed queues on Frontera, Lonestar5, Stampede2, and Maverick2 while attempting to recover. Running jobs not using Stockyard (/work) will not be impacted. 


Please submit any questions you may have via the TACC User Portal.

Matlab unavailable on all TACC clusters (Frontera/Stampede2/Lonestar5/Maverick2) on Thurs 28 January 2021 due to license maintenance.

Posted by Alex Ferrier on Jan 26, 2021 6:02:52 PM

Matlab may be unavailable on all TACC clusters (Frontera/Stampede2/Lonestar5/Maverick2) on Thursday, January 28 from 5:00 PM - 6:00 PM CST. The license service may be intermittently unavailable during this maintenance as licenses are updated for the 2021 calendar year.


Please submit any questions you may have via the TACC User Portal.

https://portal.tacc.utexas.edu/tacc-consulting

Emergency Network Maintenance Friday, 22 January 2021

Posted by Mark Brueschke on Jan 22, 2021 1:43:58 PM

Emergency firewall maintenance is being carried out. The process should be brief, with all services restored quickly.


Please submit any questions you may have via the TACC User Portal.
https://portal.tacc.utexas.edu/tacc-consulting

Updated on Jan 22, 2021 2:53:54 PM

Firewall maintenance is complete as of 2:50 PM.

Original Posting

Emergency firewall maintenance is being carried out. The process should be brief, with all services restored quickly.


Please submit any questions you may have via the TACC User Portal.
https://portal.tacc.utexas.edu/tacc-consulting

Upcoming TACC Training Short Courses - February and March 2021

Posted by Jason Allison on Jan 21, 2021 9:21:44 AM

We are pleased to announce the following training courses being held via webcast during the months of February and March 2021:

 - Introduction to Linux - February 10th, 2021, 1pm - 4pm CT
 - C++ for C Programmers - February 18th, 2021, 9am - 3pm CT
 - MPL Object-Oriented Interface to MPI - February 25th, 2021, 9am - 12pm CT
 - Introduction to Machine Learning at TACC - March 5th, 2021, 9am - 3pm CT
 - Introduction to Deep Learning at TACC - March 12th, 2021, 9am - 3pm CT

Registration for these events closes at 12pm CT the day prior. To register and for more information please visit:

Please email jasona@tacc.utexas.edu if you have any questions.

TACC $WORK File System (Stockyard) is Offline - Thurs 14 Jan 2021

Posted by Alex Ferrier on Jan 14, 2021 9:13:50 PM

Stockyard ($WORK) has experienced a failure and is unavailable right now. Admins are working to resolve the issue.


Please submit any questions you may have via the TACC User Portal.

https://portal.tacc.utexas.edu/tacc-consulting


Updated on Jan 14, 2021 9:57:45 PM

The issue with the Stockyard filesystem has been resolved and queues have been opened.


Please submit any questions you may have via the TACC User Portal.

Original Posting

Stockyard ($WORK) has experienced a failure and is unavailable right now. Admins are working to resolve the issue.


Please submit any questions you may have via the TACC User Portal.

https://portal.tacc.utexas.edu/tacc-consulting


Lonestar5 Status Thursday 7 January 2021

Posted by Matthew Edeker on Jan 7, 2021 1:29:24 PM

Lonestar5 is currently experiencing a scheduler issue and admins are working to resolve the issue.

Updated on Jan 7, 2021 4:10:17 PM

LS5 is back in production and queues have been re-opened.

Please submit any questions you may have via the TACC User Portal.

Original Posting

Lonestar5 is currently experiencing a scheduler issue and admins are working to resolve the issue.

TACC Outage December 31 2020

Posted by Alex Ferrier on Dec 31, 2020 9:58:40 PM

Updated on Jan 1, 2021 6:24:06 PM

The Lonestar5 issue related to the power event has been resolved and queues have been opened. Please contact us via a ticket if you see any further problems on the system. Thank you, TACC LSS team

Updated on Jan 1, 2021 12:14:25 PM

Frontera, Stampede2 and other services are back online as of 12 noon (CST) on Friday, January 1, 2021. Lonestar5 remains offline at this time as we work to restore it.

Original Posting

TACC's datacenter has had a power event that's impacting multiple core systems. Queues on all of the Large Scale Systems will remain down while assessing the impact and recovering. Admins are working to restore services as soon as possible.

Please submit any questions you may have via the TACC User Portal.
https://portal.tacc.utexas.edu/tacc-consulting

-TACC Team

Network Performance Issue

Posted by Garland Whiteside on Dec 22, 2020 1:55:41 AM

TACC staff are investigating a network performance issue which is affecting the login process to TACC systems. Staff are working to resolve the problem.

Updated on Dec 22, 2020 2:22:41 AM

TACC staff have concluded work on the network performance issue. The issue has been resolved.


Thanks, 
TACC Staff.

Original Posting

TACC staff are investigating a network performance issue which is affecting the login process to TACC systems. Staff are working to resolve the problem.