Sherlock changelog

Final hours announced for the June 2023 SRCF downtime

by Kilian Cavalotti, Technical Lead & Architect, HPC
Maintenance
Announce
As previously announced, the Stanford Research Computing Facility (SRCF), where Sherlock is hosted, will be powered off during the last week of June, in order to safely bring up power to the new SRCF2 datacenter. Sherlock will not be

A new tool to help optimize job resource requirements

by Kilian Cavalotti, Technical Lead & Architect, HPC
It’s not always easy to determine the right amount of resources to request for a computing job. Making sure that the application will have enough resources to run properly, but avoiding over-requests that would make the jobs spend too much
Documentation
Scheduler
Improvement

SRCF is expanding

by Kilian Cavalotti, Technical Lead & Architect, HPC
Maintenance
In order to bring up a new building that will increase data center capacity, a full SRCF power shutdown is planned for late June 2023. It’s expected to last about a week, and Sherlock will be unavailable during that time.

More free compute on Sherlock!

by Kilian Cavalotti, Technical Lead & Architect, HPC
Announce
Hardware
Improvement
We’re thrilled to announce that the free and generally available normal partition on Sherlock is getting an upgrade! With the addition of 24 brand new SH3_CBASE.1 compute nodes, each featuring one AMD EPYC 7543 Milan 32-core CPU and 256 GB

A new interactive step in Slurm

by Kilian Cavalotti, Technical Lead & Architect, HPC
A new version of the sh_dev tool has been released, that leverages a recently-added Slurm feature. Slurm 20.11 introduced a new“interactive step”, designed to be used with salloc to automatically launch a terminal on an allocated compute
Improvement
Scheduler

Your Sherlock prompt just got a little smarter

by Kilian Cavalotti, Technical Lead & Architect, HPC
Have you ever felt confused when running things on Sherlock and wondered if your current shell was part of a job? And if so, which one? Well, maybe you noticed it already, but we’ve deployed a small improvement to the Sherlock shell prompt
Improvement

Sherlock facts

by Kilian Cavalotti
Data
Improvement
Ever wondered how many compute nodes is Sherlock made of? Or how many users are using it? Or how many Infiniband cables link it all together? Well, wonder no more: head to the Sherlock facts page and see for yourself! > hint: there are...

Secure TensorBoard sessions with Sherlock OnDemand

by Kilian Cavalotti
Software
Improvement
If you're into machine learning (and who isn't these days?), you probably know all about TensorBoard already. If you don't, TensorBoard is TensorFlow's visualization toolkit. It provides the visualization and tooling needed for machine...

More (and easier!) GPU scheduling options

by Kilian Cavalotti
New
Scheduler
Improvement
GPU scheduling is now easier and more powerful on Sherlock, with the addition of new job submission options especially targeted at GPU workloads. The most visible change is that you can now use the --gpus option when submitting jobs...