Basic torque/slurm commands are:
torque | slurm | use |
---|---|---|
qsub | sbatch | submit <job file> |
qsub -I | srun | submit interactive job |
qstat | squeue | list all queued jobs |
qstat -u rfeynman | squeue -u rfeynman | list queued jobs for user rfeynman |
qdel | scancel | cancel <job#> |
shownodes -l -n;qstat -q | sinfo | node status;list of queues |
A basic slurm script looks like:
#!/bin/bash #SBATCH --job-name=mpi #SBATCH --output=zzz.slurm #SBATCH --partition comp06 #SBATCH --nodes=2 #SBATCH --tasks-per-node=32 #SBATCH --time=6:00:00 cd $SLURM_SUBMIT_DIR module purge module load intel/18.0.1 impi/18.0.1 mkl/18.0.1 mpirun -np $SLURM_NTASKS -machinefile /scratch/${SLURM_JOB_ID}/machinefile_${SLURM_JOB_ID} ./mympiexe -inputfile MA4um.mph -outputfile MA4um-output.mph
and a little more complicated with file moving in and out of a scratch area:
pinnacle-l1:$ cat script.slurm #!/bin/bash #SBATCH --job-name=espresso #SBATCH --output=zzz.slurm #SBATCH --nodes=4 #SBATCH --tasks-per-node=32 $SBATCH --time=00:00:10 #SBATCH --partition comp06 module purge module load intel/14.0.3 mkl/14.0.3 fftw/3.3.6 impi/5.1.2 cd $SLURM_SUBMIT_DIR cp *.in *UPF /scratch/$SLURM_JOB_ID cd /scratch/$SLURM_JOB_ID mpirun -ppn 16 -hostfile /scratch/${SLURM_JOB_ID}/machinefile_${SLURM_JOB_ID} -genv OMP_NUM_THREADS 4 -genv MKL_NUM_THREADS 4 /share/apps/espresso/qe-6.1-intel-mkl-impi/bin/pw.x -npools 1 <ausurf.in mv ausurf.log *mix* *wfc* *igk* $SLURM_SUBMIT_DIR/ pinnacle-l1:$