User News

Stampede and Maverick Status 4/18

Posted by Jesse Snead on Apr 18, 2014 9:59:48 AM

TACC staff have resolved an issue with the Stockyard Lustre Parallel file system that began around 5:00am Friday 18 April 2014.  Users may have experienced intermittent slow performance and I/O errors from the /work file system on Maverick and Stampede during that time frame. Please submit any...

TACC staff have resolved an issue with the Stockyard Lustre Parallel file system that began around 5:00am Friday 18 April 2014.  Users may have experienced intermittent slow performance and I/O errors from the /work file system on Maverick and Stampede during that time frame.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/

TACC Training - Parallel Computing on Stampede

Posted by Bob Garza on Apr 17, 2014 2:54:37 PM

TACC Training - Parallel Computing on Stampede May 1-2, 2014 (Thursday and Friday) 8:30 a.m. to 5 p.m. CT ROC 1.900 J.J. Pickle Research Campus 10100 Burnet Rd. Austin, TX 78758 You are welcome to attend this training class in-person or via webcast. Registration closes at 5 p.m. CDT, April 25, 2014...


TACC Training - Parallel Computing on Stampede

May 1-2, 2014 (Thursday and Friday)
8:30 a.m. to 5 p.m. CT
ROC 1.900

J.J. Pickle Research Campus
10100 Burnet Rd.
Austin, TX 78758

You are welcome to attend this training class in-person or via webcast.

Registration closes at 5 p.m. CDT, April 25, 2014

Registration

The Stampede supercomputer at the Texas Advanced Computing Center went into production in January 2013 and is the first system to deploy at scale the Intel Xeon Phi CoProcessor.  Stampede provides nearly 10 petaflops of peak performance, and is the new flagship system of the US National Science Foundation's XSEDE Cyberinfrastructure.  Stampede provides more than 100,000 cores and 2PF of Intel Xeon E5 'Sandy Bridge' processors, and an additional 7+ PF of Intel Xeon Phi CoProcessors.

In this tutorial, we will introduce the Stampede architecture, and cover how to achieve performance using both the conventional processors as well as the coprocessors.

We have a limited number of laptops available for labs, first come, first served.  If you choose to use your own laptop, please make sure that you have an SSH client.

Topics will include:

  • Stampede architecture overview
  • The Stampede user environment, including the batch system, compiler environment, application modules, etc.
  • MPI and OpenMP parallel programming
  • Hands-on exercises with Stampede.
  • Basic optimization and vector tuning on Stampede  for Sandy Bridge and Xeon Phi Coprocessors (MICs)
  • Hybrid computing
  • Intel Xeon Phi Coprocessor (MIC) overview
  • Programming models for Sandy Bridge - MIC computing: native, symmetric and offload.

The labs will be available for remote users.  However, we will not be able to assist remote users with problems during the lab.  Remote users are invited to submit their questions via email.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/




Intel® Xeon Phi™ Coprocessor Developer Training Event

Posted by Bob Garza on Apr 17, 2014 2:26:24 PM

Intel® Xeon Phi™ Coprocessor Developer Training Event When: Tuesday, April 22, 2014, 8:30 AM - 4:00 PM Where: Texas Advanced Computing Center, J.J. Pickle Research Campus, ROC Building 196, 10100 Burnet Road Austin, TX 78758 This one-day training will provide software developers the foundation...

Intel® Xeon Phi™ Coprocessor Developer Training Event

When: Tuesday, April 22, 2014, 8:30 AM - 4:00 PM
Where: Texas Advanced Computing Center, J.J. Pickle Research Campus, ROC Building 196, 10100 Burnet Road Austin, TX 78758


This one-day training will provide software developers the foundation needed for modernizing their code to take advantage of parallel architectures found in both the Intel® Xeon® processor and the Intel® Xeon Phi™ coprocessor.
The session will cover:

  • An overview of parallel programming frameworks and optimization guidelines for multi-core CPUs (Intel® Xeon®) and many-core coprocessors (Intel® Xeon Phi™):
  • Discussions about three layers of parallelism: SIMD, Threads, Cluster environment
  • Tips for quick porting/development of HPC software applications
  • Real-life examples of code and optimization techniques
  • Hardware solution and corresponding software implementations, APIs, and framework

Click here to RSVP and learn more.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/

Ranch System Maintenance 4/22

Posted by Bob Garza on Apr 16, 2014 5:19:56 PM


Ranch will not be available from 9 a.m. to 3 p.m. (CT) on Tuesday, 22 April 2014.

System maintenance will be performed during this time.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/


Ranch will not be available from 9 a.m. to 3 p.m. (CT) on Tuesday, 22 April 2014.

System maintenance will be performed during this time.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/

TACC User Portal Account Password Reset

Posted by Bob Garza on Apr 16, 2014 4:47:07 PM

Recently there have been a number of high profile IT security related incidents that have been widely reported in the media, culminating with the 'Heartbleed' vulnerability discovered on April 7, 2014.  If exploited, this vulnerability may result in secure internet communications being compromised....


Recently there have been a number of high profile IT security related incidents that have been widely reported in the media, culminating with the 'Heartbleed' vulnerability discovered on April 7, 2014.  If exploited, this vulnerability may result in secure internet communications being compromised.

The TACC security and operations teams quickly responded to assess the possible effects of Heartbleed, as with all new systems. Any TACC systems that were found to be vulnerable have been patched and new certificates have been issued.

In order to eliminate the potential for exposure of any TACC authentication credentials, users who may have been susceptible to this vulnerability as determined by the TACC security team will be contacted and instructed to reset their TACC User Portal passwords as a precautionary measure.  If an account password has not been reset by April 30, 2014, the account will be deactivated until the account password reset process is completed.

You may also submit any questions you may have via the TACC User Portal: https://portal.tacc.utexas.edu/consulting.

TACC Training: Introduction to Maverick 4/25

Posted by Bob Garza on Apr 16, 2014 10:07:21 AM

TACC Training: Introduction to Maverick 4/25 April 25, 2014 (Friday) 8:30 am - 5:00pm (CT) ROC 1.900 You may attend this class in-person or via webcast. In this one-day class, users will receive instructions on the use of remote visualization software to visualize data sets generated on the new...


TACC Training: Introduction to Maverick 4/25

April 25, 2014 (Friday)
8:30 am - 5:00pm (CT)
ROC 1.900

You may attend this class in-person or via webcast.

In this one-day class, users will receive instructions on the use of remote visualization software to visualize data sets generated on the new system Maverick. A review of the scientific visualization process will precede an overview of the visualization software available to Maverick users, including the parallel visualization software VisIt and Paraview. In addition users will be introduced to Python and R for data analysis. Labs will provide students with the opportunity to prepare data sets to be visualized using these applications.

Agenda

08:30 - 09:30 Introduction to Scientific Visualization
09:30 - 10:00 Introduction to Maverick
10:00 - 10:15 Break
10:15 - 11:00 Introduction to Python
11:00 - 12:00 Introduction to R
12:00 - 13:00 Lunch
13:00 - 14:00 Lab - ParaView
14:00 - 15:00 Lab - VisIt
15:00 - 16:00 Parallel Vis
16:00 - 17:00 Lab - Remote & Collaborative Visualization

Registration: https://www.tacc.utexas.edu/user-services/training

Please submit any questions you may have via the TACC Consulting System.
https://portal.tacc.utexas.edu/consulting

Lonestar System Maintenance 4/22

Posted by Bob Garza on Apr 16, 2014 9:58:18 AM


Lonestar will not be available from 8 a.m. to 5:30 p.m. (CT) on Tuesday, 22 April 2014.

System maintenance will be performed during this time.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/


Lonestar will not be available from 8 a.m. to 5:30 p.m. (CT) on Tuesday, 22 April 2014.

System maintenance will be performed during this time.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/

2014 TACC Summer Supercomputing Institute

Posted by Bob Garza on Apr 11, 2014 5:22:57 PM

2014 TACC Summer Supercomputing Institute Monday, June 16 - Friday, June 20, 2014 Apply Now Audience This week-long workshop is appropriate for all levels of researchers, faculty, staff, and graduate students, from new users of advanced computing technologies, to those who have research projects...


2014 TACC Summer Supercomputing Institute

Monday, June 16 - Friday, June 20, 2014

Apply Now

Audience

This week-long workshop is appropriate for all levels of researchers, faculty, staff, and graduate students, from new users of advanced computing technologies, to those who have research projects requiring powerful computing, visualization, storage, or software. We encourage participation from Minority Serving Institutions, Hispanic Serving Institutions, and Historically Black Colleges and Universities.

  • Researchers across disciplines: Mathematics, Engineering, Physics, Astronomy, Astrophysics, Cosmology, Geology & Geophysics, Computer Sciences, Biosciences, Nanosciences, Data Analytics
  • Graduate and undergraduate students
  • Current TACC & XSEDE users
  • Industrial affiliates

Overview

The Institute will provide researchers with an intensive introduction to using TACC's computing resources. Senior TACC staff will deliver presentations and lead interactive lab sessions focused on using TACC's advanced computing resources and technology.

  • Stampede: Dell PowerEdge C8220 Cluster with Intel Xeon Phi coprocessors
  • Lonestar: Dell Linux Cluster
  • Maverick: HP/NVIDIA Interactive Visualization and Data Analytics System
  • Ranch: Petascale archival facility

On January 7, 2013 TACC deployed a new compute cluster, Stampede. Funded by the National Science Foundation, this new cluster provides the community with access to 2 PFlops of Intel based microprocessor power and 8 PFlops of Intel MIC (Many Integrated Core) architecture technology. During the Institute students will receive a description of the system and TACC staff will present a session on how to use the new MIC architecture.

Lectures and Labs: Senior TACC staff will deliver presentations and lead interactive laboratory sessions:

  • Obtaining access to TACC resources and services
  • Reviewing the hardware and software available on TACC resources
  • Developing parallel programs with OpenMP and MPI
  • Using visualization and data analysis software and systems
  • GPGPU programming
  • Using the Intel Xeon Phi coprocessor (MIC)

Applications Seminars: Leading computational researchers will discuss their work, including examples of how they are utilizing TACC's resources.

Consulting: During the Institute, TACC staff will be available to assist participants in applying the techniques and technologies covered in the Institute to their own applications.

Applying to Attend the Institute

Applications to attend the Institute must be submitted by Friday, May 2, 2014. Applicants will receive notification of the status of their application by Friday, May 9, 2014.

Apply Now

Fee for Accepted Applicants

A fee of $150 is due by Friday, May 30, 2014, for accepted applicants. The fee covers lunch, snacks, and course materials.

Refund Policy

Accepted applicants needing to cancel must do so by Friday, June 6, 2014, in order to receive a refund.

Accommodations

Institute attendees traveling to the training are responsible for arranging and paying for their own travel, daily expenses, and hotel accommodations. Accepted applicants will receive a list of suggested hotels.

For more information about the TACC Summer Supercomputing Institute, please contact John Lockman, TACC Training Coordinator: jlockman@tacc.utexas.edu or 512-471-4097.



Stampede System Maintenance 4/15

Posted by Bob Garza on Apr 7, 2014 5:51:01 PM


Stampede will not be available from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 15 April 2014.

System maintenance will be performed during this time.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/

Updated on Apr 16, 2014 7:04:15 AM

The Stampede maintenance period has ended and the system is back in full production.

Updated on Apr 14, 2014 4:44:19 PM

The Stampede system maintenance period for 4/15/2014 will be extended until noon CDT on Wednesday, 4/16/2014, in order to facilitate a number of full-system scale test runs. Following the completion of maintenance activities the login nodes will be reopened for user access, data transfer, compilation, and job submission (jobs will queue but not run). Once the test runs are complete, the system will be returned  to full production and queued jobs will be allowed to run.

Original Posting


Stampede will not be available from 8 a.m. to 7:30 p.m. (CT) on Tuesday, 15 April 2014.

System maintenance will be performed during this time.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/

Corral System Maintenance 4/8

Posted by Bob Garza on Apr 3, 2014 1:08:38 PM

The Corral iRODS service only, will be down from 9 A.M. - 2 P.M. (CT) on Tuesday, 8 April 2014, to perform software upgrades. Normal Corral file system access, web services, and databases will all be unaffected by this maintenance. Please submit any questions that you may have via the TACC...


The Corral iRODS service only, will be down from 9 A.M. - 2 P.M. (CT) on Tuesday, 8 April 2014, to perform software upgrades.

Normal Corral file system access, web services, and databases will all be unaffected by this maintenance.

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/

Ranch Status 3/28

Posted by Bob Garza on Mar 28, 2014 11:26:37 AM

There will be a reboot of the Ranch master node at 11:30 a.m. (CT) on Friday, 28 March 2014, to investigate some software issues. This posting will be updated when Ranch is back in full production. Please submit any questions you may have via the TACC Consulting System....

Updated on Mar 28, 2014 1:12:13 PM


Ranch has returned to full production.

Original Posting


There will be a reboot of the Ranch master node at 11:30 a.m. (CT) on Friday, 28 March 2014, to investigate some software issues.

This posting will be updated when Ranch is back in full production.

Please submit any questions you may have via the TACC Consulting System.
https://portal.tacc.utexas.edu/consulting



Step Up to the MIC with R, Python, and MATLAB: Intel Xeon Phi Automatic Offload on Stampede

Posted by Bob Garza on Mar 21, 2014 6:21:38 PM

Those whose computational science depends on R, Python, or MATLAB may be pleased to know that three popular software packages on Stampede support Automatic Offloading (AO) to the Intel Xeon Phi Many Integrated Core (MIC) coprocessors. These packages offload (distribute work between host and MIC) by...


Those whose computational science depends on R, Python, or MATLAB may be pleased to know that three popular software packages on Stampede support Automatic Offloading (AO) to the Intel Xeon Phi Many Integrated Core (MIC) coprocessors. These packages offload (distribute work between host and MIC) by calling the Intel Math Kernel Library (MKL) to accomplish AO-enabled, computationally intensive functions like matrix-matrix multiplications.
 
Our newest Rstats module, built with Intel 14 (the intel/14.0.1.106 module) and mvapich2/2.2b, includes version 3.0.3 of the popular R statistical software. This module supports automatic offload, scalable distributed computing with RMPI, and a number of other new tools and packages. The new python/2.7.6 module, also built with Intel 14, bundles dozens of computational packages, including NumPy and SciPy, all of which are AO ready. Finally, Stampede's bring-your-own-license MATLAB module is easy to configure for MKL-support and AO.
 
If you are calling AO-enabled MKL functions from R or Python, you need only set the environment variable MKL_MIC_ENABLE to the value 1, then launch your code: MKL will decide if your computations are demanding enough to involve the MIC in the computation. Other environment variables give you greater control over the calculation (e.g. setting threads on all devices, or specifying the division of work between host and coprocessors). To enable AO for MATLAB, set BLAS_VERSION to the value $TACC_MKL_LIB/libmkl_rt.so in addition to setting MKL_MIC_ENABLE. In all cases, you can monitor your offloads by setting OFFLOAD_REPORT to the value 2.
 
See http://software.intel.com/en-us/articles/intel-mkl-on-the-intel-xeon-phi-coprocessors  for more information.

A list of currently AO-enabled functions and size thresholds is at http://software.intel.com/en-us/articles/intel-mkl-automatic-offload-enabled-functions-for-intel-xeon-phi-coprocessors.

If you need some help, feel free to submit a ticket through the TACC User Portal at http://portal.tacc.utexas.edu/consulting/

Maverick: New Visualization and Data Analytics Cluster

Posted by Chris Hempel on Mar 10, 2014 1:31:40 PM

The Texas Advanced Computing Center at The University of Texas at Austin is pleased to introduce Maverick, an HP/NVIDIA interactive, remote visualization and data analytics cluster, to the US national open science community.  Maverick combines capabilities for interactive advanced visualization and...

The Texas Advanced Computing Center at The University of Texas at Austin is pleased to introduce Maverick, an HP/NVIDIA interactive, remote visualization and data analytics cluster, to the US national open science community.  Maverick combines capabilities for interactive advanced visualization and large-scale data analytics.  Maverick replaces the TACC Longhorn visualization cluster (https://www.tacc.utexas.edu/resources/visualization).

Maverick is configured with 132 of the new NVIDIA Tesla K40 GPUs.  Each node contains 1 / 4 TB memory.  All nodes are connected to the new DataDirect Networks based 20PB shared work directory file system, Stockyard, with a Mellanox FDR InfiniBand interconnect.  Maverick's software stack includes TACC-developed remote visualization software;  visualization software such as Paraview and VisIT; and Data Analytics software including MATLAB, IDL and Parallel R.

Normal batch queues will enable users to run simulations up to 4 hours for interactive jobs and 12 hours for GPGPU. Jobs requiring run times and more cores than allowed by the normal queues will be run in a special queue after the approval of TACC staff. 

Longhorn users may start migrating their data to Maverick immediately.  Please see the Longhorn to Maverick migration instructions here: https://www.tacc.utexas.edu/user-services/user-guides/longhorn-to-maverick-migration.

Please refer to the Maverick User Guide for more details on the system configuration,
capabilities, and information regarding efficient use of the system.

https://www.tacc.utexas.edu/user-services/user-guides/maverick-user-guide

Principal investigators may apply for allocation on Maverick through the TACC User Portal.

https://portal.tacc.utexas.edu/allocations

Please submit any questions via the TACC User Portal.

https://portal.tacc.utexas.edu/consulting

Corral is Unavailable

Posted by Garland Whiteside on Mar 9, 2014 10:26:23 PM

Corral remains offline.  The TACC administrators will update its status tomorrow.

Updated on Mar 12, 2014 1:44:09 PM



The Corral file systems are now available again on the production resources at TACC.

TACC staff are still working to get the data collections back up and available.


Original Posting

Corral remains offline.  The TACC administrators will update its status tomorrow.

Ranch Outage

Posted by Garland Whiteside on Mar 8, 2014 8:31:30 PM

A power outage has resulted in Ranch services being unavailable.  The Ranch administrators are currently working through the issue.  Ranch services will be restored as soon as possible.

Updated on Mar 9, 2014 11:51:57 AM

As of 11:20AM CST today, Ranch is back online from the power outage yesterday.

Original Posting

A power outage has resulted in Ranch services being unavailable.  The Ranch administrators are currently working through the issue.  Ranch services will be restored as soon as possible.

Lonestar Outage

Posted by Garland Whiteside on Mar 8, 2014 7:53:38 PM

A power outage has resulted in Lonestar services being unavailable.  The Lonestar administrators are currently working through the issue.  Lonestar services will be restored as soon as possible.


Updated on Mar 9, 2014 10:23:43 PM

TACC administrators have brought Lonestar on-line and services are available to users. 

Original Posting

A power outage has resulted in Lonestar services being unavailable.  The Lonestar administrators are currently working through the issue.  Lonestar services will be restored as soon as possible.


Stampede Outage

Posted by Garland Whiteside on Mar 8, 2014 6:08:35 PM

A  brief power interruption due to a thunderstorm has resulted in most of the Stampede compute nodes powering off.  However, remote access through logins is still available.  TACC will close the Stampede queues until after the administrators power up and retest all of the compute nodes. Corral was...

Updated on Mar 8, 2014 11:38:58 PM

The TACC administrators have reopened the Stampede queues.  All Stampede services should be available.  Users should submit a ticket to TACC if they encounter any problems.

 Corral remains unavailable while further testing and verification are being completed.

 

Original Posting

A  brief power interruption due to a thunderstorm has resulted in most of the Stampede compute nodes powering off.  However, remote access through logins is still available.  TACC will close the Stampede queues until after the administrators power up and retest all of the compute nodes.

Corral was also impacted by the thunderstorm.  Corral services are also temporarily suspended until further review by the administrators.

Lonestar and Ranch Outages

Posted by Garland Whiteside on Mar 2, 2014 11:39:30 PM

Lonestar and Ranch are experiencing an outage due to a loss of power at the chilling station.  Both will be unavailable pending further notice.


Updated on Mar 4, 2014 9:08:07 PM

Lonestar returned to production at 7:30 p.m. CT on Tuesday, March 4.

Original Posting

Lonestar and Ranch are experiencing an outage due to a loss of power at the chilling station.  Both will be unavailable pending further notice.


TACC User Questionnaire

Posted by Bob Garza on Jan 14, 2014 4:02:07 PM

Dear Colleague,   In an effort to continually improve TACC's resources and services for our users, please take this short questionnaire so that we may understand your perception of TACC's mission and brand. https://www.surveymonkey.com/s/TACCuserbranding2014   The results are anonymous and for...


Dear Colleague,
 
In an effort to continually improve TACC's resources and services for our users, please take this short questionnaire so that we may understand your perception of TACC's mission and brand.

https://www.surveymonkey.com/s/TACCuserbranding2014
 
The results are anonymous and for internal use only. The information gathered will be used as qualitative research in TACC's effort to improve and define TACC's overall brand as a national advanced computing center.
 
The deadline for responses is Wednesday, January 22. The survey should take about 10 minutes.
 
Thank you for participating. We appreciate your feedback.

Stampede File System

Posted by Garland Whiteside on Dec 21, 2013 6:42:46 PM

Stampede is currently experiencing a problem with the 'Scratch' file system and is unavailable until the issue is resolved.  Further updates will be forthcoming.

Thanks,

TACC


Updated on Dec 22, 2013 9:51:12 AM

The problem with the /scratch filesystem was resolved at about 02:45, 12/22/13 and normal queue operation resumed at that time.   Users may have experienced I/O errors when trying to access files that were on the affected /scratch storage server from about 2:00PM yesterday until the filesystem check and recovery on the server completed at 2:45 this morning.

Thanks,

TACC

Original Posting

Stampede is currently experiencing a problem with the 'Scratch' file system and is unavailable until the issue is resolved.  Further updates will be forthcoming.

Thanks,

TACC


Stampede access issues this morning have been resolved

Posted by Jason Allison on Nov 15, 2013 7:39:49 AM

Stampede users may have had trouble accessing files from /scratch from approximately 4:00 until 6:30 this morning.  This issue has been resolved.  

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/

Stampede users may have had trouble accessing files from /scratch from approximately 4:00 until 6:30 this morning.  This issue has been resolved.  

Please submit any questions that you may have via the TACC Consulting System.
http://portal.tacc.utexas.edu/consulting/