While the command line (CLI) remains the primary method for interacting with the Slurm scheduler, we recognize that squeue and sinfo output can sometimes be difficult to parse quickly. We use a ...
While the tech folks obsess over the latest Llama checkpoints, a much grittier battle is being fought in the basements of data centers. As AI models scale to trillions of parameters, the clusters ...
Supported SLURM releases (24 and 25) introduced the ability to return just the job state information much more quickly. Nextflow should use this capability if it's available in order to reduce the RPC ...
Update, 2024-09-25 09:17: The upgrade has been completed. The queue system (Slurm) on Fox will be upgraded on Wednesday, September 25 at 09:00. The upgrade is expected to last only five to ten minutes ...
The only way to connect to our clusters is by secure shell (ssh), e.g. from a Linux/UNIX system: ssh -l your_username carya.rcdc.uh.edu ssh -l your_username sabine ...
[2024-01-29 10:00: update] The upgrade has now started. The queue system on Fox will be upgraded on Monday (January 29) at 10:00. During the upgrade, running jobs will be suspended, and slurm commands ...
2017-06-20 12:37:47[WARNING]fleet.slurmlib._call_to_dict(): squeue returned an error code '1' Command list: ['squeue', '--format=%i %j %P %V', '-p', 'kive-slow,kive ...
Like most things these days, modern atmospheric science is all about big data. Whether it's an instrument flying in an aircraft taking sets of images several times a second and producing three ...