User News

TACC Training – Parallel Computing on Stampede

Posted by Jason Allison on Feb 27, 2015 12:27:09 PM

TACC Training - Parallel Computing on Stampede - March 12 and 13, 2015 8:30 a.m. to 5 p.m. CT, both days ?J.J. Pickle Research Campus? ROC 1.900 ?10100 Burnet Rd  ?Austin, TX The Stampede supercomputer at the Texas Advanced Computing Center went into production in January 2013 and is the first...

TACC Training - Parallel Computing on Stampede - March 12 and 13, 2015

8:30 a.m. to 5 p.m. CT, both days
?J.J. Pickle Research Campus?
ROC 1.900
?10100 Burnet Rd 
?Austin, TX

The Stampede supercomputer at the Texas Advanced Computing Center went into production in January 2013 and is the first system to deploy at scale the Intel Xeon Phi CoProcessor.  Stampede provides nearly 10 petaflops of peak performance, and is the new flagship system of the US National Science Foundation's XSEDE Cyberinfrastructure.  Stampede provides more than 100,000 cores and 2PF of Intel Xeon E5 "Sandy Bridge" processors, and an additional 7+ PF of Intel Xeon Phi CoProcessors.
 
In this tutorial, we will introduce the Stampede architecture, and cover how to achieve performance using both the conventional processors as well as the coprocessors.

Topics will include:
Stampede architecture overview?
The Stampede user environment, including the batch system, compiler environment, application modules, etc.?
MPI and OpenMP parallel programming?
Hands-on exercises with Stampede.?
Basic optimization and vector tuning on Stampede  for Sandy Bridge and Xeon Phi Coprocessors (MICs)?
Hybrid computing?
Intel Xeon Phi Coprocessor (MIC) overview?
Programming models for Sandy Bridge - MIC computing: native, symmetric and offload.?

You are welcome to attend this course in-person or via webcast.  Registration closes on March 6, 2015 at 5:00pm CST. 

For more information and to access the link to register for this course please visit the the following:

https://portal.tacc.utexas.edu/-/parallel-computing-on-stampede-march-12-and-13-2015

TACC System Maintenance - February 15, 2015

Posted by Jason Allison on Feb 13, 2015 9:26:42 AM

A system maintenance will be performed starting on 8:00am CST on Sunday and should be completed by 10:00am CST.  This may cause intermittent downtime for the following services: TACC Website, TACC Portal, TACC Consulting, XSEDE Portal, Rodeo, iPlant, and TAIR.  TACC HPC resources should not be...

Updated on Feb 15, 2015 8:51:21 AM

Maintenance has been completed. 

Original Posting

A system maintenance will be performed starting on 8:00am CST on Sunday and should be completed by 10:00am CST. 


This may cause intermittent downtime for the following services: TACC Website, TACC Portal, TACC Consulting, XSEDE Portal, Rodeo, iPlant, and TAIR. 

TACC HPC resources should not be impacted. Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Network Maintenance 10 February 2015

Posted by Jesse Snead on Feb 6, 2015 2:37:01 PM

Network access to TACC will be intermittent from 6 a.m. to 7 a.m. (CT) on Tuesday, 10 February 2015. An upgrade will be performed during this time. During the network maintenance, jobs may continue to run.  Please submit any questions that you may have via the TACC Consulting System....

Network access to TACC will be intermittent from 6 a.m. to 7 a.m. (CT) on Tuesday, 10 February 2015. An upgrade will be performed during this time. During the network maintenance, jobs may continue to run. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Ranch IP Change and Maintenance 10 February 2015

Posted by Jesse Snead on Feb 5, 2015 4:50:31 PM

Ranch will not be available from 9 a.m. to 2:00 p.m. (CT) on Tuesday, 10 February 2014. System maintenance will be performed during this time.  Please note that the IP addresses for login1, login2, gridftp1 and gridftp2 will change: login1.ranch.tacc.utexas.edu 129.114.62.97 ...

Ranch will not be available from 9 a.m. to 2:00 p.m. (CT) on Tuesday, 10 February 2014. System maintenance will be performed during this time. 


Please note that the IP addresses for login1, login2, gridftp1 and gridftp2 will change:

login1.ranch.tacc.utexas.edu 129.114.62.97 
login2.ranch.tacc.utexas.edu 129.114.62.98 
gridftp1.ranch.tacc.utexas.edu 129.114.62.107 
gridftp2.ranch.tacc.utexas.edu 129.114.62.108 

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Stampede Maintenance 10 February 2015

Posted by Jesse Snead on Feb 3, 2015 2:17:34 PM

Stampede will not be available from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 10 February 2015. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Stampede will not be available from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 10 February 2015. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Emergency outage of login and gridftp services at 5:00PM today for Stampede, Lonestar, Maverick, Rustler

Posted by Sergio Leal on Jan 28, 2015 3:30:53 PM

Due to the recently announced vulnerability in glibc, all TACC Linux production systems have been updated with the security fix released today. However, all production systems will need to have their login and gridftp servers rebooted at 5:00PM Central time today to ensure that all affected...

Due to the recently announced vulnerability in glibc, all TACC Linux production systems have been updated with the security fix released today. However, all production systems will need to have their login and gridftp servers rebooted at 5:00PM Central time today to ensure that all affected services have been restarted. Running and queued jobs will not be impacted.

More information on the vulnerability is available here, https://access.redhat.com/articles/1332213.

TACC Training – Data Analysis with R in HPC

Posted by Jason Allison on Jan 28, 2015 2:22:52 PM

February 20, 2015 8:00am - 5:00pm CST Texas Advanced Computing Center ROC 1.900 J.J. Pickle Research Campus 10100 Burnet Rd. Austin, TX 78758   You are welcome to attend this course in-person or via webcast.  Registration closes on February 13, 2015 at 5:00pm CST.    This workshop will introduce...

February 20, 2015
8:00am - 5:00pm CST

Texas Advanced Computing Center
ROC 1.900
J.J. Pickle Research Campus
10100 Burnet Rd.
Austin, TX 78758
 
You are welcome to attend this course in-person or via webcast.  Registration closes on February 13, 2015 at 5:00pm CST.
  
This workshop will introduce participants to data intensive computing on Stampede. Prior experience with R is necessary in order to benefit from the material provided in the afternoon.

Agenda:

1. Introduction to TACC resources for data analysis (lecture)

2. R GUI/Rstudio and getting started with R on Stampede (lecture/lab)

3. Basic R features and examples. (lecture/demos)

Lunch Break.  

4. Running R in Parallel (lecture/lab)

5. Hands on with R on Stampede (lab)

6. Knowing your R job, profiling and tuning (lecture/lab)

To register for this course please visit the the following link on the TACC Training Course Calendar.
https://portal.tacc.utexas.edu/training/register-for-data-analysis-2.20.2015

TACC Training – Programming the Xeon Phi

Posted by Jason Allison on Jan 26, 2015 1:18:30 PM

February 6, 2015 8:00am - 5:00pm CDT Texas Advanced Computing Center ROC 1.900 J.J. Pickle Research Campus 10100 Burnet Rd. Austin, TX 78758   You are welcome to attend this course in-person or via webcast.  Registration closes on February 2, 2015 at 5:00pm CST.   Experienced C/C++ and Fortran...

February 6, 2015
8:00am - 5:00pm CDT

Texas Advanced Computing Center
ROC 1.900
J.J. Pickle Research Campus
10100 Burnet Rd.
Austin, TX 78758
 
You are welcome to attend this course in-person or via webcast.  Registration closes on February 2, 2015 at 5:00pm CST.  

Experienced C/C++ and Fortran programmers will be introduced to techniques essential for utilizing the Intel Xeon Phi co-processor efficiently. Multiple lectures and hands-on exercises will be used to acquaint attendees with the Xeon Phi platform and to explore the different execution modes as well as parallelization and optimization through example testing and reports. All exercises will be executed on the Stampede system at the Texas Advanced Computing Center (TACC). Stampede features more than 2PF of performance using 100,000 Intel Xeon E5 cores and an additional 7+ PF of performance from more than 6,400 Xeon Phi.

We have a limited number of laptops available for labs, first come, first served.  If you choose to use your own laptop, please make sure that you have an SSH client. 

To register for this course please visit the the following link on the TACC Training Course Calendar.

https://portal.tacc.utexas.edu/training/register-for-xeon-phi-2-6-15

Stampede Scratch File System Outage

Posted by Garland Whiteside on Jan 18, 2015 7:46:45 AM

One of the storage targets on the /scratch filesystem for Stampede encountered an error at 5:45AM and had to be taken offline to be checked.   Users trying to access files on the scratch-OST0094 will encounter an error similar to "Cannot send after transport endpoint shutdown".    TACC Staff are...

Updated on Jan 22, 2015 10:35:29 AM

TACC staff has completed recovery operations on the section of /scratch that went offline this past Sunday. We have recovered more than 99.9% of the files and have restored all recovered files as of 8:00AM on 1/22/2015. We encourage users to check the status of their files in $SCRATCH as few hundred files could have been affected by this filesystem problem.
If you receive the message "Cannot allocate memory" when trying to access a file, this means that a portion of the file was on the offline portion of /scratch and could not be recovered. You should remove any such file using "rm -f filename".
To generate a file containing information on all of your files that were offline, execute the command:
lfs getstripe -r -O scratch-OST0094_UUID $SCRATCH | sed '/^$/d' > ost_info.out
To extract the list of files that might have been affected, execute:
grep scratch ost_info.out
 
and list those files with ls -l to see if any generate a “Cannot allocate memory” message.  Users may also consult this page for instructions from a previous outage, but replace OST0010 with OST0094: https://portal.tacc.utexas.edu/user-news/stampede-scratch-file-system-outage

Original Posting

One of the storage targets on the /scratch filesystem for Stampede encountered an error at 5:45AM and had to be taken offline to be checked.   Users trying to access files on the scratch-OST0094 will encounter an error similar to "Cannot send after transport endpoint shutdown".    TACC Staff are working on the storage target and will post an update to user news when more information is available.

Rodeo Outage

Posted by Jason Allison on Jan 20, 2015 11:17:33 AM

Since Friday 1/16/2015, Rodeo has been experiencing intermittent network connectivity issues causing access issues for some users.  Administrators are aware of the issue and are working to resolve it. Please submit any questions that you may have via the TACC Consulting System....

Updated on Jan 21, 2015 1:19:30 PM

Network issues with Rodeo have been resolved. The system is back in production. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Original Posting

Since Friday 1/16/2015, Rodeo has been experiencing intermittent network connectivity issues causing access issues for some users.  Administrators are aware of the issue and are working to resolve it.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Ranch maintenance 1/13/2015

Posted by William Littrell on Jan 6, 2015 11:34:13 PM

Ranch will be down from 9 am to 5 pm due to equipment relocation. 

Updated on Jan 14, 2015 6:36:07 PM

Hardware issues with Ranch have been resolved. Ranch is now in production. 

Updated on Jan 13, 2015 7:37:23 PM

Ranch maintenance has been extended until 5 pm CST 1/14/2015. 

Original Posting

Ranch will be down from 9 am to 5 pm due to equipment relocation. 

Stampede Scratch Filesystem Outage

Posted by Jason Allison on Jan 14, 2015 2:23:52 PM

The /scratch filesystem was partially unavailable from 1:24 to 2:00 (CST) today and users may have encountered errors while trying to access files in the filesystem during that time. Please submit any questions that you may have via the TACC Consulting System....

The /scratch filesystem was partially unavailable from 1:24 to 2:00 (CST) today and users may have encountered errors while trying to access files in the filesystem during that time.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Stampede Scratch Filesystem Outage

Posted by David Cooper on Jan 9, 2015 5:05:27 PM

The /scratch filesystem was partially unavailable from 4:00PM until 4:30PM (CST) today and users may have encountered errors while trying to access files in the filesystem during that time. Please submit any questions that you may have via the TACC Consulting System....

The /scratch filesystem was partially unavailable from 4:00PM until 4:30PM (CST) today and users may have encountered errors while trying to access files in the filesystem during that time.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Maverick Maintenance 13 January 2015

Posted by Jesse Snead on Jan 7, 2015 1:41:55 PM

Maverick will not be available from 9 a.m. to 5 p.m. (CT) on Tuesday, 13 January 2015. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Maverick will not be available from 9 a.m. to 5 p.m. (CT) on Tuesday, 13 January 2015. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Stampede Maintenance 13 January 2015

Posted by Jesse Snead on Jan 7, 2015 1:38:18 PM

Stampede will not be available from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 13 January 2015. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Stampede will not be available from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 13 January 2015. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

TACC Training – HPC Python 1.23.2015 9am-12pm

Posted by Jason Allison on Jan 5, 2015 9:28:53 AM

TACC Training – HPC Python January 23, 2015 9:00am - 12:00pm CDT Texas Advanced Computing Center ROC 1.900 J.J. Pickle Research Campus 10100 Burnet Rd. Austin, TX 78758   You are welcome to attend this course in-person or via webcast.  Registration closes on January 19, 2015 at 5:00pm CDT. This...

TACC Training – HPC Python
January 23, 2015
9:00am - 12:00pm CDT
Texas Advanced Computing Center
ROC 1.900
J.J. Pickle Research Campus
10100 Burnet Rd.
Austin, TX 78758
 

You are welcome to attend this course in-person or via webcast.  Registration closes on January 19, 2015 at 5:00pm CDT.

This class provides intermediate users with an overview of intermediate and advanced techniques for using Python on HPC environments. The lecture will emphasize well known approaches for improving the performance of their Python codes. The lecture will include an overview of when Python can be used on HPC, an overview of numpy, matplotlib, SciPy and Cython, as well as an introduction to the most common functionality of mpi4py.
 
Previous knowledge of Python and MPI is required since the course will not go into specific details of Python syntax. Some knowledge of MPI is required as the lecture will not go into specific details regarding MPI. Some familiarity with C/C++ is also recommended for Cython.

We have a limited number of laptops available for labs, first come, first served.  If you choose to use your own laptop, please make sure that you have an SSH client. Windows users will also need to have an X client installed (X-win32, Xming, Cygwin/X).

The labs will be available for remote users.  However, we will not be able to assist remote users with problems during the lab.  

To register for this course please visit the the following link on the TACC Training Course Calendar.

https://portal.tacc.utexas.edu/training/register-for-hpc-python-1-23-2015


Thank you,

Jason Allison 
TACC User Services

Lonestar compute nodes down due to emergency

Posted by David Cooper on Dec 30, 2014 9:04:35 AM

There is currently a campus wide chilled water leak which will affect Lonestar. The water system has been shut off for emergency repairs, so we had to power down all of the compute nodes on the system.   The system will be returned to service once the chilled water has been restored. Please direct...

Updated on Dec 30, 2014 3:08:18 PM

As of 3PM, chilled water has been restored to the building housing Lonestar, so the system will be back in production momentarily.  



Original Posting

There is currently a campus wide chilled water leak which will affect Lonestar. The water system has been shut off for emergency repairs, so we had to power down all of the compute nodes on the system.   The system will be returned to service once the chilled water has been restored.

Please direct any questions to the TACC consulting ticket system. 

Stampede Scratch File System Outage

Posted by Garland Whiteside on Dec 19, 2014 1:25:57 AM

The /scratch filesystem has encountered an error with one of the storage targets and it has been deactivated for now while the error is investigated.   Users may encounter an I/O error if a file is accessed that resides on the offline storage target.    The TACC administrators are working on the...

Updated on Dec 24, 2014 6:28:50 PM

TACC staff has completed recovery operations on scratch-OST0049, the Object Storage Target (OST) in the /scratch file system that has been offline since last Friday. We have recovered more than 99.9% of the files that were on this OST, and have restored all recovered files as of noon on 12/24/2014. We encourage users to check the status of their files in $SCRATCH as several hundred files could have been impacted by this filesystem problem..

Managers of community, portal, or gateway accounts may find that they have issues with ownership and permissions on some of their files in $SCRATCH.

If you receive the message "Cannot allocate memory" when trying to access a file, this means that a portion of the file was on the offline OST and could not be recovered. You should remove any such file using "rm -f filename".

To generate a file containing information on all of your files that were located on the offline OST, execute the command:

lfs getstripe -r -O scratch-OST0049_UUID $SCRATCH | sed '/^$/d' > ost_info.out

To extract from this file a list of affected files, execute:

grep scratch ost_info.out

Original Posting

The /scratch filesystem has encountered an error with one of the storage targets and it has been deactivated for now while the error is investigated.   Users may encounter an I/O error if a file is accessed that resides on the offline storage target.   

The TACC administrators are working on the issue and the TACC team will post updates as they become available.

Lonestar Maintenance 6 January 2015

Posted by Jesse Snead on Dec 23, 2014 8:15:35 AM

Lonestar will not be available from 8 a.m. to 5:00 p.m. (CT) on Tuesday, 6 January 2015. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Lonestar will not be available from 8 a.m. to 5:00 p.m. (CT) on Tuesday, 6 January 2015. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Ranch Maintenance 30 November 2014

Posted by Jesse Snead on Dec 23, 2014 8:12:25 AM

Ranch will not be available from 10 a.m. to 2:00 p.m. (CT) on Tuesday, 30 December 2014. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Ranch will not be available from 10 a.m. to 2:00 p.m. (CT) on Tuesday, 30 December 2014. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

TACC Winter Break Schedule

Posted by Jesse Snead on Dec 18, 2014 9:56:30 AM

TACC personnel will observe the University of Texas at Austin winter break from 5 p.m. (CT) on Friday, 19 December 2014, and will resume normal business hours on Monday, 5 January 2015. A staff member will be on site to monitor the status of all TACC resources in an effort to ensure continuous...

TACC personnel will observe the University of Texas at Austin winter break from 5 p.m. (CT) on Friday, 19 December 2014, and will resume normal business hours on Monday, 5 January 2015.


A staff member will be on site to monitor the status of all TACC resources in an effort to ensure continuous availability. 

TACC support staff will monitor the consulting system throughout the holiday period and answer critical user questions. Non-critical user questions will be addressed beginning Monday, 5 January 2014.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Ranch Status 15 December 2014

Posted by Jesse Snead on Dec 15, 2014 11:16:16 AM

Ranch has encountered an error with one of the filesystems that requires a reboot of the file server to clear. This problem may have caused "hangs" to logins since yesterday evening for users that were assigned to the filesystem. Users may not be able to access files on the system while the file...

Updated on Dec 15, 2014 8:06:27 PM

The login node has been put offline to continue a filesystem check.  Further update tomorrow morning

Updated on Dec 15, 2014 5:28:07 PM

The filesystem error on Ranch was resolved earlier today and TACC Staff have been monitoring the filesystem this afternoon.  The filesystem problem has not recurred so users should not experience hangs on login now.  


Please submit a ticket if you do encounter any problems with accessing Ranch.

Original Posting

Ranch has encountered an error with one of the filesystems that requires a reboot of the file server to clear. This problem may have caused "hangs" to logins since yesterday evening for users that were assigned to the filesystem. Users may not be able to access files on the system while the file server is being rebooted, an update to this announcement will be made once normal operation has resumed.


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Corral Maintenance 16 December 2014

Posted by Jesse Snead on Dec 15, 2014 11:05:54 AM

Corral will not be available from 10 a.m. to 12:00 p.m. (CT) on Tuesday, 16 December 2014. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Corral will not be available from 10 a.m. to 12:00 p.m. (CT) on Tuesday, 16 December 2014. System maintenance will be performed during this time. 


Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Corral Group Quotas 6 January 2015

Posted by Jesse Snead on Dec 9, 2014 3:10:23 PM

On Tuesday, January 6, 2015 default project group quotas of 1TB will go into effect for all data on Corral. Although quotas are currently assigned for all allocated projects on Corral, in the past, no quotas have been assigned for un-allocated projects and no usage reporting is done for project...

On Tuesday, January 6, 2015 default project group quotas of 1TB will go into effect for all data on Corral. Although quotas are currently assigned for all allocated projects on Corral, in the past, no quotas have been assigned for un-allocated projects and no usage reporting is done for project groups without Corral allocations. This means that, under certain circumstances, users may have significant amounts of data on Corral not currently subject to reporting and quota mechanisms. For users whose data is all owned by the correct project group, this change will have no effect. However, users with significant amounts of data stored under a project group without a Corral allocation may experience problems after the quota implementation. 


Users can check project group IDs by logging into the TACC user portal ( https://portal.tacc.utexas.edu/projects-and-allocations ), clicking on “Projects and Allocations” under the Allocations tab, then clicking on any project detail button. At the top of the resulting page will be a line labeled “Unix Group”. The value listed there should match the group ID you see when you list the contents of your Corral directories with “ls -l” or in the file details of any graphical file transfer tool such as Cyberduck. You may also check your Corral allocation usage in the TACC user portal and see if the value there matches your expectations based on the amount of data you have stored.

If you have significant amounts of data with an incorrect project group and need help correcting this or if you have any questions regarding this announcement, please contact TACC support via the TACC User Portal. 

https://portal.tacc.utexas.edu/consulting

Corral File Systems

Posted by Sergio Leal on Nov 27, 2014 1:21:19 PM

The corral filesystems is currently offline to resolve a networking hardware issue.  TACC Staff are currently working to resolve this and will provide further updates through user news. The /corral-repl and /corral-tacc filesystems have been unmounted from the Lonestar, Stampede and Maverick login...

The corral filesystems is currently offline to resolve a networking hardware issue.  TACC Staff are currently working to resolve this and will provide further updates through user news.


The /corral-repl and /corral-tacc filesystems have been unmounted from the Lonestar, Stampede and Maverick login nodes

Stampede/Stockyard $WORK File System

Posted by Sergio Leal on Nov 21, 2014 10:16:55 PM

Due to a hardware error on an OSS server, Stockyard $WORK file system suffered a partial (affecting processes accessing particular subset of OST’s) interruption in service between 8:12PM - 8:59PM.  Due to a potentially severe file system corruption on a single OST, the target (gsfs-OST00cd) is...

Due to a hardware error on an OSS server, Stockyard $WORK file system suffered a partial (affecting processes accessing particular subset of OST’s) interruption in service between 8:12PM - 8:59PM.  Due to a potentially severe file system corruption on a single OST, the target (gsfs-OST00cd) is currently off-line and will not be back in service until repair is done.

This could take days and we may disable the target on clients to prevent hang.

TACC Webinar - Introduction to Wrangler 12.4.2014 9:00am-10:30am

Posted by Jason Allison on Nov 19, 2014 3:39:44 PM

TACC Webinar - Introduction to Wrangler December 4, 2014 9:00am - 10:30am In this webinar, we will touch on the hardware components that make up Wrangler, and how they will be allocated to users. We will explain how the Hadoop and Database features will be supported on the system, as well as cover...

TACC Webinar - Introduction to Wrangler
December 4, 2014
9:00am - 10:30am

In this webinar, we will touch on the hardware components that make up Wrangler, and how they will be allocated to users. We will explain how the Hadoop and Database features will be supported on the system, as well as cover the tools available for more traditional workflows that can run on current HPC systems, but which may run significantly better in the Wrangler environment. Finally we will talk about the ways Wrangler will help users manage their data collections.

Significant time will be allotted for Q&A.  Please click the following link to register.

https://portal.tacc.utexas.edu/training/register-for-introduction-to-wranger-dec-4


Thank you,

Jason Allison
TACC User Services

TACC Training – Fortran 90/95 Programming 12.3.2014 1pm-5pm

Posted by Jason Allison on Nov 19, 2014 12:51:13 PM

Fortran 90/95 Programming December 3, 2014 1:00pm - 5:00pm   You are welcome to attend this course in-person on UT Austin's Main Campus in building POB, room 2.402.  Registration closes on December 1, 2014. Fortran programming skills are essential for researchers in areas of High Performance...

Fortran 90/95 Programming
December 3, 2014
1:00pm - 5:00pm
 
You are welcome to attend this course in-person on UT Austin's Main Campus in building POB, room 2.402.  Registration closes on December 1, 2014.

Fortran programming skills are essential for researchers in areas of High Performance Computing. These skills can be directly applied to resources at the Texas Advanced Computing Center and in any Unix environment. The class is intended for the intermediate user wishing to gain more experience in Fortran 90/95 programming. Basic knowledge of Fortran or another mid/high-level programming language is a prerequisite.

Please see the following link for more information:

https://portal.tacc.utexas.edu/-/fortran-90-95-programming


Thank you,

Jason Allison
TACC User Services

Stampede /SCRATCH Filesystem Status

Posted by David Cooper on Nov 15, 2014 1:50:26 PM

The /scratch filesystem was unavailable from 1:05PM until 1:35PM (CST) today and users may have encountered errors while trying to access files in the filesystem during that time.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/


The /scratch filesystem was unavailable from 1:05PM until 1:35PM (CST) today and users may have encountered errors while trying to access files in the filesystem during that time.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/


Ranch Unavailable

Posted by Garland Whiteside on Nov 7, 2014 2:57:47 AM

Ranch is currently unavailable.  The TACC administrators are working on the issue.


  

Updated on Nov 15, 2014 5:53:28 AM

Ranch is back in production.

Original Posting

Ranch is currently unavailable.  The TACC administrators are working on the issue.


  

New Xeon Phi Applications on Stampede

Posted by Jesse Snead on Nov 8, 2014 12:02:16 AM

TACC is pleased to announce the availability of additional Xeon Phi-enabled applications and libraries on Stampede. The Stampede User Guide has been updated with a list of these applications and libraries, descriptions of which Stampede modules to load, and links to documentation regarding their...

TACC is pleased to announce the availability of additional Xeon Phi-enabled applications and libraries on Stampede. The Stampede User Guide has been updated with a list of these applications and libraries, descriptions of which Stampede modules to load, and links to documentation regarding their use.


https://portal.tacc.utexas.edu/user-guides/stampede#software
   
  • Computational Chemistry
    • LAMMPS: molecular dynamics
      • module name: lammps/09Oct14
      • documentation: http://lammps.sandia.gov and module help lammps/09Oct14
      • execution model: offload; reported 1.8-5x speedups with 12 core host
    • NAMD: molecular dynamics
      • module name: namd/2014_07_31-mic
      • documentation: module help namd/2014_07_31-mic
      • execution model: offload; reported 2x speedups
  • Weather forecasting
    • WRF: weather research and forecasting
      • module name: wrf/3.6
      • documentation: http://www2.mmm.ucar.edu/wrf/users and module help wrf/3.6
      • execution model: symmetric mode (host and MIC-side MPI)
  • Productivity Languages and LIbraries with Automatic Offload to MKL for Dense Linear Algebra
    • MATLAB: high-level numerical computing language and environment
      • module name: matlab
      • documentation: https://portal.tacc.utexas.edu/software/MATLAB
      • other comments: “Bring Your Own License” usage model
    • Python: interactive object-oriented language and tools
      • module name: python/2.7.6
      • documentation: https://portal.tacc.utexas.edu/tutorials/automatic-offload
    • R: high-level statistical computing language
      • module name: Rstats/3.0.3
      • documentation: https://portal.tacc.utexas.edu/tutorials/automatic-offload
  • Other Libraries and Tools
    • NetCDF3: file i/o
      • module name: netcdf/3.6.3
      • documentation: http://unidata.ucar.edu and module help netcdf/3.6.3
      • execution model: host and MIC-side libraries available
    • NetCDF4: file i/o
      • module name: netcdf/4.3.2
      • documentation: http://unidata.ucar.edu and module help netcdf/4.3.2
      • execution model: host and MIC-side libraries available
    • HDF5: file i/o
      • module name: hdf5/1.8.13
      • documentation: http://hdfgroup.org and module help hdf5/1.8.13
      • execution model: host and MIC-side libraries available
    • NetCDF4: parallel i/o
      • module name: parallel-netcf/4.3.2
      • documentation: http://unidata.ucar.edu and module help parallel-netcf/4.3.2
      • execution model: host and MIC-side libraries available
    • HDF5: parallel i/o
      • module name: phdf5/1.8.13
      • documentation: http://hdfgroup.org and module help phdf5/1.8.13
      • execution model: host and MIC-side libraries available
    • DDT: debugger
      • module name: ddt
      • documentation: https://portal.tacc.utexas.edu/software/ddt and Allinea and Intel online resources
    • VTune: profiling
      • module name: vtune
      • documentation: http://software.intel.com/en-us/intel-vtune-amplifier-xe
    • PAPI: profiling
      • module name: papi/5.3.0
      • documentation: module help papi/5.3.0

We anticipate additional Xeon Phi-enabled community codes to become generally available over the next several weeks, including QCD and electronic structure codes. If you are interested in other applications, please note that “developer” versions are often available several months before the general public release.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Lonestar /scratch filesystem outage

Posted by David Cooper on Nov 6, 2014 12:03:43 AM

There is a filesystem outage on Lonestar that has resulted in /scratch being down from around 23:00 (CST). Administrators are aware of the issue and are working to resolve it.

There is a filesystem outage on Lonestar that has resulted in /scratch being down from around 23:00 (CST). Administrators are aware of the issue and are working to resolve it.

TACC Training – Optimization and Debugging 9:00 am to 12:00 pm CT

Posted by Jason Allison on Nov 4, 2014 3:29:12 PM

TACC Training – Optimization and Debugging November 13th, 2014  9:00 am to 12:00 pm CT  Texas Advanced Computing Center  ROC 1.900 J.J. Pickle Research Campus  10100 Burnet Rd. Austin, TX 78758  The course covers profiling and debugging technologies available at TACC for performance optimization...

TACC Training – Optimization and Debugging

November 13th, 2014 

9:00 am to 12:00 pm CT 

Texas Advanced Computing Center 

ROC 1.900 J.J. Pickle Research Campus 

10100 Burnet Rd. Austin, TX 78758 

The course covers profiling and debugging technologies available at TACC for performance optimization and debugging of serial and parallel codes. The course contains instructions for beginners and advanced users, along with a lab with demos and examples.

You are welcome to attend this training class in-person or via webcast. Registration closes at 5:00 pm. CT, November 11, 2014.

To register for this course you will need an active TACC User Account.  

Please visit the the following link to get more information on the course and to access the registration page:

https://portal.tacc.utexas.edu/-/optimization-and-debugging

Stampede Status 31October 2014

Posted by Sergio Leal on Oct 31, 2014 5:55:43 PM

A portion of Stampede was affected by a power issue when a power distribution unit failed.  This impacted $HOME filesystem briefly and $WORK is currently unavailable.  TACC staff are working to get $WORK back available as soon as possible.


--TACC Operations

Updated on Oct 31, 2014 9:09:03 PM

As of 7:30CST today, the filesystems have been brought back online and the queues open again.

Original Posting

A portion of Stampede was affected by a power issue when a power distribution unit failed.  This impacted $HOME filesystem briefly and $WORK is currently unavailable.  TACC staff are working to get $WORK back available as soon as possible.


--TACC Operations

Stampede Scratch File System Outage

Posted by Garland Whiteside on Oct 9, 2014 12:16:55 AM

One of the storage targets for the /scratch filesystem is currently offline and undergoing a filesystem check after experiencing an error earlier this evening.  Users may encounter hangs or errors trying to access files or directories that reside on the storage target.   To prevent jobs from...

Updated on Oct 29, 2014 12:50:08 PM

TACC staff has completed recovery operations on scratch-OST0010, the Object Storage Target (OST) in the /scratch file system that has been offline for the past few weeks. We have recovered about 90% of the files that were on this OST, and have restored all recovered files as of the system maintenance on October 28, 2014. We encourage users to check the status of their files in $SCRATCH. 


Managers of community, portal, or gateway accounts may find that they have issues with ownership and permissions on some of their files in $SCRATCH. 

If you receive the message "Cannot allocate memory" when trying to access a file, this means that a portion of the file was on the offline OST and could not be recovered. You should remove any such file using "rm -f filename". 

To determine which of your files were located on the offline OST, consult this page for instructions: https://portal.tacc.utexas.edu/user-news/stampede-scratch-file-system-outage

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Updated on Oct 14, 2014 11:05:46 AM

One storage target for the Stampede /scratch filesystem remains offline.   During this period, /scratch continues in full production, but a small fraction of files (<1%) that existed on /scratch at the time of the outage may be unavailable.   Efforts continue to recover these files, but the scan an check process is taking much longer than anticipated.

In the interim, for users with files on the /scratch filesystem, if you encounter an error reporting “Cannot send after transport endpoint shutdown”, then this file likely resides on the offline storage target.   Any attempt to access a file in this state will result in an error.   Users can confirm the file’s location using the command “lfs getstripe <filename>”, and comparing if the obdidx is 16, which corresponds to the offline target.   Here is an example:

stampede(10)$ cat restart.h5

cat: restart.h5: Cannot send after transport endpoint shutdown

 

stampede(11)$ lfs getstripe restart.h5

restart.h5

lmm_stripe_count:   2

lmm_stripe_size:    1048576

lmm_pattern:        1

lmm_layout_gen:     0

lmm_stripe_offset:  16


obdidx           objid           objid           group

    16        55335676      0x34c5afc                0

   146        54838324      0x344c434                0

If the file needs to be removed, then the –f option will be required to force removal of the file, e.g. rm –f restart.h5.   An error will occur if the –f option is not used.

We will continue to run the scan process on the offline storage target and hopefully recover the data on the offline storage target.  An update to this user news will be sent once more information is available.    As a reminder, the /home and /work filesystems are unaffected.

Updated on Oct 9, 2014 1:32:25 PM

One storage target on the Stampede /scratch filesystem remains offline while filesystem checks on it continue to run. The storage target has been deactivated on all login and compute nodes to prevent hangs when files on that target are accessed. The queues are being opened back up to allow jobs to run again, however, any job that tries to access a file that resides on the offline storage target will result in an I/O error. 


Further updates will be sent once more information is available.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Original Posting

One of the storage targets for the /scratch filesystem is currently

offline and undergoing a filesystem check after experiencing an error

earlier this evening.  Users may encounter hangs or errors trying to

access files or directories that reside on the storage target.   To

prevent jobs from failing, the production queues have been closed,

however the development queues are still available for use.   An update

to this announcement will be posted once the scan and checks of the

object storage target device have completed.

TACC Webinar - Introduction to Wrangler

Posted by Jason Allison on Oct 24, 2014 3:28:30 PM

November 11, 2014  2:00 - 3:30 pm CDT   Texas Advanced Computing Center ROC 1.900  J.J. Pickle Research Campus   10100 Burnet Rd.  Austin, TX 78758  In this webinar, we will touch on the hardware components that make up Wrangler, and how they will be allocated to users. We will explain how the...

November 11, 2014 

2:00 - 3:30 pm CDT 
 Texas Advanced Computing Center ROC 1.900 
J.J. Pickle Research Campus 
 10100 Burnet Rd. 
Austin, TX 78758 

In this webinar, we will touch on the hardware components that make up Wrangler, and how they will be allocated to users. We will explain how the Hadoop and Database features will be supported on the system, as well as cover the tools available for more traditional workflows that can run on current HPC systems, but which may run significantly better in the Wrangler environment. Finally we will talk about the ways Wrangler will help users manage their data collections. 

Please register for the webinar here: 
https://www.tacc.utexas.edu/user-services/training/wrangler-intro

Stampede User Environment Notice

Posted by Jason Allison on Sep 23, 2014 2:08:03 PM

On Tuesday, October 28, 2014, TACC will be ending support for the "~/.profile_user" bash/csh environment configuration script on Stampede to provide improved support for the system. IF you are using "~/.profile_user" to source your user environment on Stampede, we recommend you run the following...

Updated on Oct 23, 2014 2:13:33 PM

As a reminder, on October 28, 2014; TACC will be ending support for ~/.profile_user on Stampede. 


In addition to this upcoming change, TACC will be ending support for ~/.cshrc_user and ~/.login_user on December 9, 2014.

If you use these files in your environment on Stampede, see the following for resolution information: /usr/local/startup_scripts/transition.txt 

Please submit any questions you may have via the consulting section of the TACC User Portal. 

https://portal.tacc.utexas.edu/group/tup/tacc-consulting      

Original Posting

On Tuesday, October 28, 2014, TACC will be ending support for the "~/.profile_user" bash/csh environment configuration script on Stampede to provide improved support for the system.

IF you are using "~/.profile_user" to source your user environment on Stampede, we recommend you run the following command:

$ /usr/local/startup_scripts/install_default_scripts

This safely moves your current "~/.profile", "~/.profile_user" and "~/.bashrc" files into a new subdirectory, "~/.orig" and installs the newest dot files into your home directory.  You may then merge the contents of "~.orig/.profile_user" into the newly created "~/.bashrc" file.   Your new ~/.bashrc contains comments and instructions that will help you accomplish this transition.

Please submit any questions you may have via the consulting section of the TACC User Portal.

https://portal.tacc.utexas.edu/group/tup/tacc-consulting 

Stampede Job Scheduler Notification

Posted by Jason Allison on Oct 17, 2014 3:42:07 PM

On Tuesday, October 21, from 9am CDT to approximately 6pm, Stampede will be running some full-system jobs. As the scheduler begins to drain the system ahead of the time reserved for these runs, the reported system load will be lower than normal.  Short jobs will receive higher priority if they are...

On Tuesday, October 21, from 9am CDT to approximately 6pm, Stampede will be running some full-system jobs. As the scheduler begins to drain the system ahead of the time reserved for these runs, the reported system load will be lower than normal. 


Short jobs will receive higher priority if they are able to complete in the time remaining before the full-system runs; longer jobs will remain in the queue until the full-system jobs are complete. The reservation does not affect the largemem queue or access to the login nodes. 

Please submit any questions you may have via the consulting section of the TACC User Portal. https://portal.tacc.utexas.edu/group/tup/tacc-consulting

TACC Training – Python HPC

Posted by Jason Allison on Oct 14, 2014 12:44:59 PM

October 30th, 2014  9:00 am to 12:00 pm CDT  Texas Advanced Computing Center  ROC 1.900 J.J. Pickle Research Campus  10100 Burnet Rd. Austin, TX 78758  You are welcome to attend this training class in-person or via webcast. Registration closes at 5 p.m. CDT, October 21, 2014. This class provides...

October 30th, 2014 

9:00 am to 12:00 pm CDT 

Texas Advanced Computing Center 

ROC 1.900 J.J. Pickle Research Campus 

10100 Burnet Rd. Austin, TX 78758 

You are welcome to attend this training class in-person or via webcast. Registration closes at 5 p.m. CDT, October 21, 2014.

This class provides intermediate users with and overview of intermediate and advanced techniques for using Python on HPC environments. The lecture will emphasize well known approaches for improving the performance of their Python codes. The lecture will include an overview of when Python can be used on HPC, an overview of numpy, matplotlib, SciPy and Cython, as well as an introduction to the most common functionality of mpi4py.

Previous knowledge of Python is required since the course will not go into specific details of Python syntax. Some knowledge of MPI is required as the lecture will not go into specific details regarding MPI. Some familiarity with C/C++ is also recommended for Cython.

We have a limited number of laptops available for labs, first come, first served. If you choose to use your own laptop, please make sure that you have an SSH client. 

The labs will be available for remote users. However, we will not be able to assist remote users with problems during the lab. 

To register for this course please visit the the following link:

https://www.tacc.utexas.edu/user-services/training/hpc-python

Network Outage

Posted by Garland Whiteside on Sep 30, 2014 7:20:51 AM

There was a TACC network outage between 6am and 7am.  All resources are back on-line.

There was a TACC network outage between 6am and 7am.  All resources are back on-line.

Stampede and Maverick $WORK Filesystem Status

Posted by Jason Allison on Sep 26, 2014 4:04:33 PM

Users may not have been able to access the /work filesystem on Stampede between 2:30PM and 3:30PM (CDT) today.

Administrators have restored access at this time.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

Users may not have been able to access the /work filesystem on Stampede between 2:30PM and 3:30PM (CDT) today.

Administrators have restored access at this time.

Please submit any questions that you may have via the TACC Consulting System. http://portal.tacc.utexas.edu/consulting/

TACC Training – Linux/Unix Basics for HPC

Posted by Jason Allison on Sep 25, 2014 10:14:18 AM

October 9th, 2014  9:00 am to 12:00 pm CDT  Texas Advanced Computing Center  ROC 1.900 J.J. Pickle Research Campus  10100 Burnet Rd. Austin, TX 78758  You are welcome to attend this training class in-person or via webcast. Registration closes at 5 p.m. CDT, October 2, 2014. This foundational class...

October 9th, 2014 
9:00 am to 12:00 pm CDT 
Texas Advanced Computing Center 
ROC 1.900 J.J. Pickle Research Campus 
10100 Burnet Rd. Austin, TX 78758 

You are welcome to attend this training class in-person or via webcast. Registration closes at 5 p.m. CDT, October 2, 2014.

This foundational class provides beginner and intermediate users with basic Linux/Unix command line environment experience. The lecture will emphasize common strategies used for interacting with clusters and HPC resources. A student guide with suggested supplemental reading and a Linux/Unix command line environment practice lab, as well as a Command Reference Guide (with interactive system and batch commands) are included. No prerequisite. 

We have a limited number of laptops available for labs, first come, first served. If you choose to use your own laptop, please make sure that you have an SSH client. 

The labs will be available for remote users. However, we will not be able to assist remote users with problems during the lab. 

To register for this course please visit the the following link:
https://www.tacc.utexas.edu/user-services/training/linux-unix-basics-for-hpc

Scheduled Maintenance 9/20/14

Posted by Sergio Leal on Sep 20, 2014 12:32:07 PM

As of 12:10PM today, network upgrade has been completed.  Access to TACC resources and services have been restored.


--TACC Consulting

As of 12:10PM today, network upgrade has been completed.  Access to TACC resources and services have been restored.


--TACC Consulting

TACC Network Scheduled Maintenance 9.20.2014

Posted by Jason Allison on Sep 8, 2014 11:16:59 AM

Access to all TACC resources and services will be intermittently unavailable between 9am - 5pm (CT) on Saturday, Sept 20th. TACC staff will be performing an upgrade to the networking infrastructure. During this time, jobs will continue to run and the XSEDE/TACC websites and user portals will be...

Access to all TACC resources and services will be intermittently unavailable between 9am - 5pm (CT) on Saturday, Sept 20th. TACC staff will be performing an upgrade to the networking infrastructure. During this time, jobs will continue to run and the XSEDE/TACC websites and user portals will be fully operational during this time.

Please submit any questions you may have via the consulting section of the TACC User Portal.
https://portal.tacc.utexas.edu/group/tup/tacc-consulting