Discover Job Wait Times

Issue to be resolved

Long waits for small jobs to start on Discover.

Solution

Check the job summary report in your job’s standard output/error file (written by default to slurm-.out in the submission directory) and compare the wallclock time the job actually took with the wallclock time you requested. If the actual time is substantially shorter than the requested time, lower your request so that it is closer to the actual time. A tighter request helps the Slurm scheduler fit your jobs onto available nodes sooner, which reduces how long they wait to start.

Example:
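As a rough illustration of the comparison described above, the Python sketch below takes the actual wallclock time from a job summary and proposes a tighter request. The helper names (parse_hms, format_hms, suggest_time_limit), the 25% safety margin, and the rounding to 5-minute steps are all assumptions made for this example, not Discover or Slurm recommendations. It also assumes the time is written as HH:MM:SS or D-HH:MM:SS.

```python
import math


def parse_hms(text):
    """Convert 'HH:MM:SS' or 'D-HH:MM:SS' into a number of seconds."""
    days = 0
    if "-" in text:
        day_part, text = text.split("-", 1)
        days = int(day_part)
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


def format_hms(total_seconds):
    """Format a number of seconds as 'HH:MM:SS' (hours may exceed 24)."""
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


def suggest_time_limit(actual, margin=0.25, round_to_minutes=5):
    """Pad the actual runtime by `margin`, then round up to the next step.

    `margin` and `round_to_minutes` are illustrative defaults (assumptions);
    choose values that match how much your job's runtime varies.
    """
    padded_seconds = parse_hms(actual) * (1 + margin)
    step = round_to_minutes * 60
    return format_hms(math.ceil(padded_seconds / step) * step)


if __name__ == "__main__":
    # Hypothetical job: requested 04:00:00 but actually ran for 00:42:10.
    print(suggest_time_limit("00:42:10"))  # prints 00:55:00
```

The suggested value can then be used as the job's time limit, for example with the sbatch option --time=00:55:00. Keep some margin so that normal run-to-run variation does not cause the job to hit its limit before it finishes.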

Review additional information on Slurm best practices.

