User News

Lonestar5 Status 25 April 2018

Posted by Sergio Leal on Apr 25, 2018 11:58:56 AM

LoneStar5 has taken an unexpected power-down due to temperature and water problems.  Administrators are working on resolving the problem.

Updated on Apr 25, 2018 1:36:20 PM

LoneStar5 queues have been reopened. 

Original Posting

LoneStar5 has taken an unexpected power-down due to temperature and water problems.  Administrators are working on resolving the problem.

LoneStar 5 Maintenance 8 May 2018

Posted by Mitchell Collins-Bailey on Apr 23, 2018 10:40:49 PM

LoneStar5 will not be available from 8 a.m. to 17:00 p.m. (CT) on Tuesday, 8 May 2018. Scheduler maintenance will be performed during this time

LoneStar5 will not be available from 8 a.m. to 17:00 p.m. (CT) on Tuesday, 8 May 2018. Scheduler maintenance will be performed during this time

Wrangler File System

Posted by Mitchell Collins-Bailey on Apr 23, 2018 7:04:14 PM

Wrangler has filesystem issues with /data and administrators are working on it. It'll be resolved as soon as possible.

Updated on Apr 23, 2018 8:45:02 PM

/data is returned on Wrangler and therefore the queues are re-opened. Most client transactions should've resumed after the filesystem returned so most should only have paused. If jobs have failed due to this issue please submit a ticket

Original Posting

Wrangler has filesystem issues with /data and administrators are working on it. It'll be resolved as soon as possible.

Stampede2 Log in issues 16 april 2018

Posted by Mitchell Collins-Bailey on Apr 16, 2018 10:09:17 PM

Stampede 2 is currently experiencing log in issues.  This is currently being addressed.  This should not impact any jobs currently being run or in queue.  Notifications will be sent when solution is found.

Updated on Apr 17, 2018 2:56:13 AM

Stampede2 home filesystem is back in full production.

Original Posting

Stampede 2 is currently experiencing log in issues.  This is currently being addressed.  This should not impact any jobs currently being run or in queue.  Notifications will be sent when solution is found.

Stampede2 Extended Maintenance 23 April 2018

Posted by Matthew Edeker on Apr 10, 2018 4:18:40 PM

Stampede2 will be unavailable for a five-day period from Monday, 23 April 2018 at 8 am CDT to Friday, 27 April 2018 at 7:30 pm CDT. During this extended downtime, TACC staff will conduct system maintenance, apply file system updates, and run tests that exercise the entire system. Please submit any...

Stampede2 will be unavailable for a five-day period from Monday, 23 April 2018 at 8 am CDT to Friday, 27 April 2018 at 7:30 pm CDT. During this extended downtime, TACC staff will conduct system maintenance, apply file system updates, and run tests that exercise the entire system.

Please submit any questions you have via the TACC User Portal.

Thank you.

Lonestar Maintenance, 3 April 2018

Posted by David Littrell on Mar 22, 2018 9:35:16 AM

LoneStar5 will not be available from 8 a.m. to 17:00 p.m. (CT) on Tuesday, 3 April 2018. System maintenance installing Cray released patches and field notices will be performed during this time.

Updated on Apr 3, 2018 6:26:12 PM

LS5 has been returned to production and the queues are open. We apologize for the delay.

Updated on Apr 3, 2018 5:27:44 PM

LS5 maintenance needs to be extended. LS5 will be returned to production as soon as possible.

Original Posting

LoneStar5 will not be available from 8 a.m. to 17:00 p.m. (CT) on Tuesday, 3 April 2018. System maintenance installing Cray released patches and field notices will be performed during this time.

Stampede2 Status 31 March, 2018

Posted by Matthew Edeker on Mar 31, 2018 9:39:51 AM

Stampede2 is currently inaccessible. TACC staff are working to resolve the issue as quickly as possible.

Updated on Mar 31, 2018 10:25:27 AM

Stampede2 is back in full production.

Original Posting

Stampede2 is currently inaccessible. TACC staff are working to resolve the issue as quickly as possible.

Corral Issue 30 March 2018

Posted by Mitchell Collins-Bailey on Mar 30, 2018 6:58:13 PM

There is a known issue with some users trying to make files on Corral.  "no space left of device" error.  We apologize for the inconvenience and are working to correct the problem.


-TACC Ops

Updated on Mar 30, 2018 7:22:58 PM

Corral issues pertaining to this announcement should be resolved.  We apologize for any inconveniences.  


-TACC Ops

Original Posting

There is a known issue with some users trying to make files on Corral.  "no space left of device" error.  We apologize for the inconvenience and are working to correct the problem.


-TACC Ops

Wrangler Maintenance 13 March 2018

Posted by Mitchell Collins-Bailey on Mar 9, 2018 5:07:28 PM

Wrangler will be taking an emergency maintenance on Tuesday the 13th to recover from a hardware failure that requires filesystems to be unmounted to resolve. We apologize for the short notice.

Updated on Mar 27, 2018 4:41:16 PM

After extensive hardware issues and then filesystem recovery attempts Wrangler now has /gpfs/flash available cluster-wide. We've ended up with a number of files in the lost+found location after the fsck repair programs finally completed cleanly. /gpfs/flash is the scratch filesystem in the Wrangler cluster so we expect files to be replicated elsewhere but if there were any files that are of very high value that are missing there's a chance that they can be restored. Please submit a consulting ticket and the Wrangler admins will determine if they can be restored. Please note that the lost+found recovery location is the combination of all files in no particular order or structure and requires substantial sorting to track down individual entries.

Updated on Mar 13, 2018 5:03:14 PM

Wrangler will be returning to production shortly however we are not able to restore the gpfs/flash filesystem at this point. This is a hardware related issue that is currently being worked on. Please note that jobs that we can identify as definitely using gpfs/flash have been suspended and will be automatically resumed when the filesystem is available.

Original Posting

Wrangler will be taking an emergency maintenance on Tuesday the 13th to recover from a hardware failure that requires filesystems to be unmounted to resolve. We apologize for the short notice.

LoneStar5 Status, 25 March 2018

Posted by Matthew Edeker on Mar 25, 2018 8:31:32 AM

Early this morning LS5 had a failure on the service nodes that serve out critical filesystems. Administrators are working on the problem now and will report when full functionality is restored.

Updated on Mar 25, 2018 10:22:11 AM

The filesystem issues on LoneStar5 have been resolved but we are still seeing the consequences on a few nodes. These will be recovered in due course and the queues have been reopened.

Original Posting

Early this morning LS5 had a failure on the service nodes that serve out critical filesystems. Administrators are working on the problem now and will report when full functionality is restored.

Training: OpenMP Training Events - April 12th and 13th, 2018

Posted by Jason Allison on Mar 22, 2018 12:19:22 PM

We are pleased to announce the following OpenMP training events are being offered to both in-person and webcast participants April 12th and 13th, 2018. The courses include hands-on exercises on TACC systems. Local participants are strongly encouraged to attend in person. Instructors will be...

We are pleased to announce the following OpenMP training events are being offered to both in-person and webcast participants April 12th and 13th, 2018. The courses include hands-on exercises on TACC systems. Local participants are strongly encouraged to attend in person. Instructors will be available after class to consult on individual projects with in person participants.

4/12/18 9am-12:30pm CT - Introduction to OpenMP
This course will introduce participants to the OpenMP threading model, and describe the basic constructs necessary to parallelize loops on multi-core architectures. Topics include the fork/join threading model, using OpenMP directives, and loop parallelization. The fundamentals of hybrid computing (MPI & OpenMP) will be explained and illustrated.

4/13/18 9am-12:30pm CT - Advanced OpenMP
This course will provide an introduction to OpenMP optimization techniques for multi-core and vectorized architectures. Topics will include OpenMP SIMD directives, configuring OpenMP thread affinity, tasking, and task dependences.

To register and for more information please visit: https://learn.tacc.utexas.edu/

If you have any questions please contact me at jasona@tacc.utexas.edu

Stampede2 Maintenance 20 March 2018

Posted by Matthew Edeker on Mar 12, 2018 11:14:37 AM

Stampede2 will be unavailable from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 20 March 2018. System maintenance will be performed during this time.   If you submit a job before the maintenance, and the time you request exceeds the time remaining until the maintenance begins, your job will run when the...

Updated on Mar 21, 2018 1:59:37 AM

Stampede2 is back in production after the system maintenance.  


Thanks,
TACC Administration

Updated on Mar 20, 2018 7:35:10 PM

Today's filesystem work is taking longer than expected and the Stampede2 maintenance will be extended.

Original Posting

Stampede2 will be unavailable from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 20 March 2018. System maintenance will be performed during this time.

 

If you submit a job before the maintenance, and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available"). The showq utility will list the job as "BLOCKED" and report its status as "WaitNod" ("Waiting for Nodes"). Note that the hours leading up to the maintenance are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be short.

Corral Status 16 March, 2018

Posted by Jacob Getz on Mar 16, 2018 12:24:25 PM

Corral has had an unexpected disk error that caused the file system to unmount causing a brief interruption shortly before noon (local time). The file system is back up now, but users can report any errors they encounter to the TACC User Portal at the URL below. https://portal.tacc.utexas.edu/...

Corral has had an unexpected disk error that caused the file system to unmount causing a brief interruption shortly before noon (local time). The file system is back up now, but users can report any errors they encounter to the TACC User Portal at the URL below.

https://portal.tacc.utexas.edu/

Thank you,
TACC Team

Stampede2 Status, March 8, 2018

Posted by David Littrell on Mar 8, 2018 3:22:00 PM

One of the /home1 Lustre targets is currently unavailable.  TACC Staff are working to resolve the issue as quickly as possible. Please stay tuned to TACC User News for further updates.

Updated on Mar 8, 2018 4:09:48 PM

/home1 is now back in full production.

Original Posting

One of the /home1 Lustre targets is currently unavailable.  TACC Staff are working to resolve the issue as quickly as possible. Please stay tuned to TACC User News for further updates.

Wrangler Maintenance 6 March 2018

Posted by Matthew Edeker on Feb 19, 2018 9:05:34 AM

Wrangler will not be available from 7 a.m. to 5:00 p.m. (CT) on Tuesday, 6 March 2018. System maintenance will be performed during this time.


-TACC Team

Updated on Mar 6, 2018 7:59:45 PM

System updates to Wrangler have been completed and jobs can now be submitted. 

Updated on Mar 6, 2018 4:47:21 PM

System updates for Wrangler have run longer than expected. At this point, there is no ETA for availability. 

Original Posting

Wrangler will not be available from 7 a.m. to 5:00 p.m. (CT) on Tuesday, 6 March 2018. System maintenance will be performed during this time.


-TACC Team

TACC Maintenance 11 March, 2018

Posted by Mitchell Collins-Bailey on Feb 26, 2018 5:32:29 PM

Access to all TACC systems will be unavailable from 9:00 AM CDT  until 2:00 PM CDT on March 11, 2018 to allow for upgrades to the TACC core network hardware. Jobs will continue to run, but users will have no access to TACC services and systems until the upgrade is complete.

Access to all TACC systems will be unavailable from 9:00 AM CDT  until 2:00 PM CDT on March 11, 2018 to allow for upgrades to the TACC core network hardware. Jobs will continue to run, but users will have no access to TACC services and systems until the upgrade is complete.

TACC Maintenance 11 March, 2018

Posted by Mitchell Collins-Bailey on Feb 26, 2018 5:28:56 PM

Access to all TACC systems will be unavailable from 9:00 AM CDT  until 2:00 PM CDT on March 11, 2018 to allow for upgrades to the TACC core network hardware. Jobs will continue to run, but users will have no access to TACC services and systems until the upgrade is complete.

Access to all TACC systems will be unavailable from 9:00 AM CDT  until 2:00 PM CDT on March 11, 2018 to allow for upgrades to the TACC core network hardware. Jobs will continue to run, but users will have no access to TACC services and systems until the upgrade is complete.

Stampede2 will be unavailable from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 20 February 2018

Posted by Mitchell Collins-Bailey on Feb 12, 2018 4:29:35 PM

Stampede2 will be unavailable from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 20 February 2018. System maintenance will be performed during this time.   If you submit a job before the maintenance, and the time you request exceeds the time remaining until the maintenance begins, your job will run when the...

Stampede2 will be unavailable from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 20 February 2018. System maintenance will be performed during this time.

 

If you submit a job before the maintenance, and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available"). The showq utility will list the job as "BLOCKED" and report its status as "WaitNod" ("Waiting for Nodes"). Note that the hours leading up to the maintenance are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be short.

Wrangler Maintenance 30 January 2018

Posted by Jacob Getz on Jan 12, 2018 1:28:23 PM

Wrangler will be undergoing system maintenance for patching/updates and unavailable for users on 1/30/18 from 0800-1700 (CST).

Updated on Jan 30, 2018 12:41:29 PM

Wrangler's maintenance is complete and it is back in production.

Original Posting

Wrangler will be undergoing system maintenance for patching/updates and unavailable for users on 1/30/18 from 0800-1700 (CST).

February 2018 TACC Training Events

Posted by Jason Allison on Jan 24, 2018 2:56:51 PM

We are pleased to announce the following training events are being offered to both in-person and webcast participants for February 2018. Local participants are strongly encouraged to attend in person. 2/5/18 8:30am-12:30pm CT - C++ for C programmers 2/15/18 8am-12pm CT - Introduction to Manycore...

We are pleased to announce the following training events are being offered to both in-person and webcast participants for February 2018. Local participants are strongly encouraged to attend in person.

2/5/18 8:30am-12:30pm CT - C++ for C programmers
2/15/18 8am-12pm CT - Introduction to Manycore Programming
2/16/18 8am-12pm CT - Advanced Manycore Programming

To register and for more information please visit: https://learn.tacc.utexas.edu/


If you have any questions please contact our training staff at training@tacc.utexas.edu

Stampede 2 Maintenance 16 January 2018

Posted by Jacob Getz on Jan 9, 2018 11:24:40 AM

Stampede2 will be unavailable from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 16 January 2018. System maintenance will be performed during this time.   If you submit a job before the maintenance, and the time you request exceeds the time remaining until the maintenance begins, your job will run when the...

Updated on Jan 23, 2018 10:32:44 PM

The Stampede2 maintenance is complete and the system is back in production. During the maintenance we upgraded the login nodes. This should have little to no adverse impact, but you may need to take minor actions to account for the new login nodes: e.g. reschedule cron jobs or update known hosts on your client.

Updated on Jan 23, 2018 7:27:31 PM

Stampede2 maintenance has been extended.  We will provide further updates as they become available.

Updated on Jan 22, 2018 11:04:59 AM

As a reminder, Stampede2 will undergo scheduled system maintenance on Tuesday, 23 Jan 2018 between 8:00AM and 7:30PM CST. This is the maintenance originally scheduled for 16 Jan 2018 that we delayed due to inclement weather. The system will be unavailable during this window.

 

Planned activities include upgrading the login nodes. This should have little to no adverse impact, but after the maintenance you may need to take minor actions to account for the new login nodes: e.g. rescheduling cron jobs or updating known hosts on your client.

 

If you submit a job before the maintenance, and your job cannot finish before the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available"). The showq utility will list the job as "BLOCKED" and report its status as "WaitNod" ("Waiting for Nodes"). Note that the hours leading up to the maintenance are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be short.

Updated on Jan 15, 2018 9:51:38 PM

The University of Texas will be closed due to inclement weather tomorrow, January 16, 2018. For this reason, the Stampede2 maintenance scheduled for this date will be rescheduled for Tuesday, January 23, 2018 from 8am to 7:30pm (CST).

Original Posting

Stampede2 will be unavailable from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 16 January 2018. System maintenance will be performed during this time.

 

If you submit a job before the maintenance, and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available"). The showq utility will list the job as "BLOCKED" and report its status as "WaitNod" ("Waiting for Nodes"). Note that the hours leading up to the maintenance are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be short.

Ranch Status 18 December 2017

Posted by Jacob Getz on Jan 17, 2018 4:11:44 PM

At 07:00 January 18th 2018 the Ranch environment will be taken down for 4 hours in order to address a hardware failure.

Appropriate notice will be provided when Ranch returns to production.


-TACC Team

Updated on Jan 18, 2018 9:25:11 AM

Ranch is back in Production as of 09:15 1/18/2018.


-TACC Team

Original Posting

At 07:00 January 18th 2018 the Ranch environment will be taken down for 4 hours in order to address a hardware failure.

Appropriate notice will be provided when Ranch returns to production.


-TACC Team

Lonestar5 Maintenance 23 January 2018

Posted by Jacob Getz on Jan 8, 2018 11:19:21 AM

LoneStar5 will not be available from 8 a.m. to 5:00 p.m. (CT) on Tuesday, 23 January 2018. System maintenance will be performed during this time.

-TACC Team

LoneStar5 will not be available from 8 a.m. to 5:00 p.m. (CT) on Tuesday, 23 January 2018. System maintenance will be performed during this time.

-TACC Team

TACC Winter Break Schedule

Posted by Chris Hempel on Dec 21, 2017 6:42:54 AM

TACC personnel will observe the University of Texas at Austin winter break from 5 p.m. (CST) on Thursday, 21 December 2017, and will resume normal business hours on Tuesday, 2 January 2018. A staff member will be on site to monitor the status of all TACC resources. TACC support staff will monitor...

TACC personnel will observe the University of Texas at Austin winter break from 5 p.m. (CST) on Thursday, 21 December 2017, and will resume normal business hours on Tuesday, 2 January 2018. A staff member will be on site to monitor the status of all TACC resources. TACC support staff will monitor the consulting system throughout the break and address critical system issues. The staff will address other issues beginning Tuesday, 2 January 2018.


Please submit any questions you may have via the TACC Consulting System.
https://portal.tacc.utexas.edu/tacc-consulting

Stampede2 Status, December 19, 2017

Posted by Jacob Getz on Dec 11, 2017 2:55:43 PM

Stampede2 will be unavailable 19 Dec 2017 between 8:00AM and 7:30PM CST for maintenance.


-TACC Team

Updated on Dec 19, 2017 7:29:33 PM

Stampede2 is now back in production. Thank you.

Updated on Dec 18, 2017 2:58:04 PM

Reminder: Stampede2 will be unavailable December 19th 2017 between 8:00AM and 7:30PM CST for maintenance.

At the end of this maintenance, the new Skylake (SKX) nodes will enter production service alongside the existing Knights Landing (KNL) nodes. SKX jobs that run after the maintenance will incur normal accounting charges. See the Stampede2 User Guide  for more information. (https://portal.tacc.utexas.edu/user-guides/stampede2)

If you submit a job before the maintenance, and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available"). The showq utility will list the job as "BLOCKED" and report its status as "WaitNod" ("Waiting for Nodes"). Note that the hours leading up to the maintenance are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be short.

Original Posting

Stampede2 will be unavailable 19 Dec 2017 between 8:00AM and 7:30PM CST for maintenance.


-TACC Team

Wrangler Status 18 December 2017

Posted by Sergio Leal on Dec 18, 2017 12:15:37 PM

Wrangler system is currently unavailable due to a software error.  Administrators are currently working on resolving the issue.  There is no current ETA and we will provide further updates as they become available.  Thanks.

Updated on Dec 18, 2017 6:34:25 PM

Wrangler has been returned to production at this time. Thank you.

Original Posting

Wrangler system is currently unavailable due to a software error.  Administrators are currently working on resolving the issue.  There is no current ETA and we will provide further updates as they become available.  Thanks.

Training: Introduction to OpenMP using the Interactive Parallelization Tool (IPT)

Posted by Jason Allison on Dec 1, 2017 11:58:48 AM

December 14th, 2017 9am-1pm CT Texas Advanced Computing Center ACB 1.104 J.J. Pickle Research Campus 10100 Burnet Rd. Austin, TX 78758 OpenMP is one of the most popular paradigms to exploit the now ubiquitous manycore and multi-core processors. In this beginner-level training session, we will...

December 14th, 2017 9am-1pm CT
Texas Advanced Computing Center
ACB 1.104
J.J. Pickle Research Campus
10100 Burnet Rd. Austin, TX 78758

OpenMP is one of the most popular paradigms to exploit the now ubiquitous manycore and multi-core processors. In this beginner-level training session, we will provide an overview of the basic concepts of OpenMP. We will introduce the trainees to the Interactive Parallelization Tool (IPT) that is designed for parallelizing serial C/C++ programs semi-automatically. The participants in the training session will be introduced to OpenMP and will learn to use IPT for parallelizing their C/C++ applications.

Prerequisites: Experience working in a Linux environment, and familiarity with C/C++/Fortran or any other programming language.

We are offering the training to both in-person and webcast participants. Local participants are strongly encouraged to attend in person.

To attend the training in person, please contact me via email at jasona@tacc.utexas.edu.

To attend via webcast, please enroll for the training at:
https://learn.tacc.utexas.edu/mod/chat/view.php?id=30

You will need to sign in with your TACC User Portal account and password to enroll.

Maverick: New queues to support long gpu runs

Posted by Chris Hempel on Nov 27, 2017 4:34:48 PM

Two new queues have been configured on Maverick to accommodate GPU jobs that require more runtime than allowed in the gpu queue. These two queues are configured as follows: gpu-long - up to 72 hours runtime, one node per job (i.e. sbatch -N 1 and/or -n 20 or less) - maximum of 8 jobs allowed in...

Updated on Dec 1, 2017 8:59:34 AM

The runtime limit on the gpu queue has been increased to 24 hours.

Original Posting

Two new queues have been configured on Maverick to accommodate GPU jobs that require more runtime than allowed in the gpu queue. These two queues are configured as follows:

gpu-long
- up to 72 hours runtime, one node per job (i.e. sbatch -N 1 and/or -n 20 or less)
- maximum of 8 jobs allowed in queue per user

gpu-verylong
- up to 120 hours runtime, one node per job
- maximum of 3 jobs allowed in queue per user

These queues are available immediately for use and do not require special permission to access. The gpu queue remains with a 12-hour runtime limit.

Please submit any questions you have via the TACC Consulting System.

https://portal.tacc.utexas.edu/tacc-consulting

Ranch status 11/14/2017

Posted by David Littrell on Nov 14, 2017 10:53:26 AM

At 10:30 November 14th the Ranch environment will be taken down for 2 hours in order to address a hardware error. Notice will be provided when Ranch returns to production.

Updated on Nov 25, 2017 9:52:34 AM

The Ranch environment is up and available as of 09:09 CST 25 of November, 2017

Updated on Nov 16, 2017 11:31:26 AM

Administrators continue to work with the vendor to resolve a filesystem issue on Ranch and an update to user news will be posted once the problem has been resolved.


Updated on Nov 14, 2017 1:08:57 PM

The Ranch emergency downtime is being extended.
ETA back into production is still being determined at this time.
Appropriate notice will be provided when Ranch returns to production.

-TACC Team

Original Posting

At 10:30 November 14th the Ranch environment will be taken down for 2 hours in order to address a hardware error. Notice will be provided when Ranch returns to production.

Lonestar5 Maintenance 21 November 2017

Posted by Matthew Edeker on Nov 6, 2017 11:05:12 AM

LoneStar5 will not be available from 8 a.m. to 17:00 p.m. (CT) on Tuesday, 21 November 2017. Maintenance on the Slurm scheduler and Cray Development Toolkit will be performed during this time.



Updated on Nov 21, 2017 11:23:02 PM

LoneStar5 has been returned to production and the queues are open. Any queued slurm jobs will need to be resubmitted. Thank you for your patience.

Updated on Nov 21, 2017 6:01:37 PM

The Lonestar5 maintenance needs to be extended. At the moment we don't have a scheduled time to return to production.

Original Posting

LoneStar5 will not be available from 8 a.m. to 17:00 p.m. (CT) on Tuesday, 21 November 2017. Maintenance on the Slurm scheduler and Cray Development Toolkit will be performed during this time.



Maverick Maintenance 21 November 2017

Posted by Matthew Edeker on Nov 6, 2017 11:26:03 AM

Maverick will not be available from 8 a.m. to 17:00 p.m. (CT) on Tuesday, 21 November 2017. Maintenance on the Slurm scheduler will be performed during this time.

Updated on Nov 21, 2017 3:02:42 PM

Maverick is back in production. 

Original Posting

Maverick will not be available from 8 a.m. to 17:00 p.m. (CT) on Tuesday, 21 November 2017. Maintenance on the Slurm scheduler will be performed during this time.

Stampede 2 status, November 28, 2017

Posted by David Littrell on Nov 15, 2017 12:09:26 PM

Stampede2 will be unavailable 28 Nov 2017 between 8:00AM and 7:30PM CST for maintenance.   If you submit a job and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable"...

Stampede2 will be unavailable 28 Nov 2017 between 8:00AM and 7:30PM CST for maintenance.
 
If you submit a job and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available"). The showq utility will list the job as "BLOCKED" and report its status as "WaitNod" ("Waiting for Nodes"). Note that the hours leading up to the maintenance are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be short.

Ranch Maintenance 21 November 2017

Posted by David Littrell on Nov 9, 2017 9:50:00 AM

At 08:00 on Tuesday, November 21st, the Ranch environment will be brought down for normal system maintenance. Due to the 1.3 billion files currently in Ranch, this maintenance activity should take between 24 and 36 hours. We expect to bring Ranch back into production no sooner than 20:00 Wednesday,...

At 08:00 on Tuesday, November 21st, the Ranch environment will be brought down for normal system maintenance. Due to the 1.3 billion files currently in Ranch, this maintenance activity should take between 24 and 36 hours. We expect to bring Ranch back into production no sooner than 20:00 Wednesday, November 22nd.

Users should take note that it is possible that this downtime could extend overnight into the Thanksgiving holiday. Appropriate notice will be given should this maintenance event run longer than its expected 36 hours.