This shows you the differences between two versions of the page.
— |
queues [2020/09/21 22:01] (current) root created |
||
---|---|---|---|
Line 1: | Line 1: | ||
+ | ===== Queueing System ===== | ||
+ | All jobs on AHPCC clusters which require a significant amount of CPU or memory should be submitted through the queueing system. | ||
+ | * A //**batch job**// - a specific command is executed on the node(s) assigned to the job without the need for user interaction. | ||
+ | * An // | ||
+ | |||
+ | A //**compute node**// is an individual computer which can be used to execute jobs. Compute nodes are grouped into // | ||
+ | |||
+ | * type of cpu and number of cores on each node | ||
+ | * number of nodes assigned | ||
+ | * the maximum number of nodes allowed to be used by a single job | ||
+ | * amount of memory | ||
+ | * walltime - the maximum amount of execution time for a single job | ||
+ | |||
+ | === Node to Queue Assignment === | ||
+ | All compute nodes are divided into groups called partitions. | ||
+ | |||
+ | < | ||
+ | tres-l1: | ||
+ | Maximum jobs size in number of nodes for immediate start per queue: | ||
+ | |||
+ | q30m32c: | ||
+ | q06h32c: | ||
+ | q72h32c: | ||
+ | qcDouglas: | ||
+ | qcABI: | ||
+ | | ||
+ | qtraining: | ||
+ | tres-l1: | ||
+ | </ | ||
+ | |||
+ | The output of the script above shows that a job requesting up to 26 nodes in the queue // | ||
+ | |||
+ | [[queues|Queues]] - summary of public queues | ||
+ | |||
+ | [[batch|Batch Jobs]] | ||
+ | |||
+ | [[interactive|Interactive Jobs]] | ||
+ | |||
+ | [[condo queues|Condo Queues]] | ||
+ | |||
+ | [[walltime extensions|Job Walltime Extensions]] |