Prism A100 GPUs

Issue to be resolved

Need access to faster GPU nodes with more RAM.

Solution

Submit your jobs to the “dgx” partition on Prism by specifying “-p dgx”. This gives you access to up to 8 NVIDIA A100 GPUs on a node with the following hardware:

  • 8x NVIDIA A100 GPUs with 40 GB of VRAM and NVLink
  • Dual AMD EPYC Rome 7742 CPUs; 64 cores each at 2.25 GHz
  • 1 TB RAM
  • Dual 25 Gb Ethernet network interfaces
  • Dual 100 Gb HDR100 InfiniBand high-speed network interfaces
  • 14 TB of RAID-protected NVMe drives, mounted as /lscratch

All Prism users have access to the dgx partition. Note that Prism also has 22 nodes with NVIDIA V100 GPUs (4 GPUs each), which will usually be more readily available. If you are using the A100s for smaller jobs, you may be asked to move to the V100s when a larger job needs the A100 resources.

For example, to request an interactive session with two A100 GPUs:

salloc -G2 -p dgx
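
The same request can also be made as a batch job. The script below is a minimal sketch, not an official NCCS template: the job name and the one-hour time limit are hypothetical placeholders, and it assumes Slurm on Prism exports CUDA_VISIBLE_DEVICES for GPU jobs, which is the usual behavior but is not confirmed here. It requests two GPUs with the long forms of the options used above and then prints which GPUs the job can see.

```shell
#!/bin/bash
#SBATCH --partition=dgx        # long form of "-p dgx"
#SBATCH --gpus=2               # long form of "-G2"
#SBATCH --time=01:00:00        # assumed time limit; adjust to your job
#SBATCH --job-name=a100-test   # hypothetical job name

# Print the GPU indices assigned to this job (or "none" if unset),
# then list GPU names and memory if nvidia-smi is available.
report_gpus() {
    echo "Visible GPUs: ${CUDA_VISIBLE_DEVICES:-none}"
    if command -v nvidia-smi >/dev/null 2>&1; then
        nvidia-smi --query-gpu=name,memory.total --format=csv,noheader
    fi
}

report_gpus
```

Save the script under a name of your choice (for example, a hypothetical a100-test.sh) and submit it with sbatch. Replace the report_gpus call with your own workload once the GPU listing looks as expected.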

For more information: https://www.nccs.nasa.gov/systems/ADAPT/Prism

