NCCS Message of the Day


This message is updated throughout the day as needed. To make sure you are viewing the latest version, reload this page in your web browser.


 
NCCS User Services Group
E-mail: support@nccs.nasa.gov
Phone/voice-mail: (301) 286-9120
Web-site: https://www.nccs.nasa.gov

===== GENERAL INFORMATION (as of 17:55 Thu Jul 10, 2014)
============================================================================
Discover Users: When opening a ticket to report a crashed Discover job,
please include the job ID if possible.  Including it greatly improves our
ability to diagnose the problem.  Thank you.
----------------------------------------------------------------------------
The MATLAB licenses on Discover are in high demand.  If you are not
actively using MATLAB on the dali nodes, please exit your session so that
other users may use the local licenses.

============================================================================
===== SCHEDULED SYSTEM UNAVAILABILITY (as of 17:55 Thu Jul 10, 2014)
============================================================================
Date  Day Duration  System               Reason (Status)
----- --- --------- -------------------- -----------------------------------
There is currently no scheduled system unavailability.
============================================================================

============================================================================
===== UNSCHEDULED SYSTEM UNAVAILABILITY (as of 17:55 Thu Jul 10, 2014)
============================================================================
Date  Day Duration  System               Reason (Status)
----- --- --------- -------------------- -----------------------------------
07/08 TUE 2100-                          Chilled Water Outage
07/09         -1140 Jibb                 Returned to Service.
----- --- --------- -------------------- -----------------------------------
07/08 TUE 2100-                          Chilled Water Outage
07/09         -1220 DataPortal           Returned to Service.
----- --- --------- -------------------- -----------------------------------
07/08 TUE 2100-                          Chilled Water Outage (See Note A.)
07/09         -1440 Discover             Returned to Service.
----- --- --------- -------------------- -----------------------------------
07/08 TUE 2100-                          Chilled Water Outage (See Note A.)
07/09         --TBD Archive/Dirac

Note A:
-------
Discover:
Numerous Discover nodes remain offline due to hardware failures. We
continue to work with our vendor to address them and expect to bring
additional nodes back online tomorrow.

As a reminder, while the Archive system is down, Archive mounts on
Discover will remain unavailable.

Archive/Dirac:
We have replaced 90% of the failed Archive disk drives. In addition to
replacing the remaining failed disks, we will run stress tests
overnight. We will begin filesystem checkout tomorrow morning. Barring
further problems, we hope to return the Archive to service later in the
day tomorrow.
============================================================================

============================================================================
===== OTHER OUTAGES (as of 17:55 Thu Jul 10, 2014)
============================================================================
Date  Day Duration  System               Reason (Status)
----- --- --------- -------------------- -----------------------------------
There are currently no other outages.
============================================================================

 

Curator: Mason Chang
NCCS User Services: 301.286.9120
Authorizing NASA Official: Dan Duffy, High-Performance Computing Lead, GSFC Code 606.2