Sherlock changelog

3.3 PFlops: Sherlock hits expansion milestone

by Kilian Cavalotti, Technical Lead & Architect, High Performance Computing
Hardware
Event
Sherlock is a traditional High-Performance Computing cluster in many aspects. But unlike most of similarly-sized clusters where hardware is purchased all at once, and refreshed every few years, it is in constant evolution. Almost like a

Tracking NFS problems down to the SFP level

by Kilian Cavalotti
Blog
Data
Hardware
When NFS problems turn out to be... not NFS problems at all.

Job #1

by Kilian Cavalotti
If you've been submitting jobs on Sherlock over the last couple days, you probably noticed something different about your your job ids... They lost a couple digits! If you submitted a job last week, its job id was likely in the 67,000...
Event
Scheduler

Sherlock is hard at work against COVID-19

by Kilian Cavalotti
About a month ago, we announced that we were dedicating a portion of Sherlock's computing resources to research projects around COVID-19. Since then, more than 15 PIs and research groups have reached out to share their projects, and...
Blog

Adventures in storage

by Kilian Cavalotti
_This is part of our blog series about behind-the-scenes things we do on a regular basis on Sherlock, to keep it up and running in the best possible conditions for our users. Now that Sherlock's old storage system has been retired, we...
Blog
Hardware
Data

More (and easier!) GPU scheduling options

by Kilian Cavalotti
New
Scheduler
Improvement
GPU scheduling is now easier and more powerful on Sherlock, with the addition of new job submission options especially targeted at GPU workloads. The most visible change is that you can now use the --gpus option when submitting jobs...

Next scheduled maintenance: Oct. 16

by Kilian Cavalotti
In order to prepare future improvements of the parallel /scratch file system, as well as performing some required work on the scheduler, Sherlock will not be available during the following times: Wednesday, October 16th, 2019 - from 8...
Maintenance

🎉 Job #50,000,000!

by Kilian Cavalotti
Event
We just wanted to share that Sherlock recently ran job #50,000,000! 🎈🎉 This is a significant milestone since Sherlock, in its current form, started running its first job in January 2017. Fifty million jobs in less than 3 years is no...

A better view at Sherlock's resources

by Kilian Cavalotti
Scheduler
Improvement
New
How many jobs are running? What partitions do I have access to? How many CPUs can I use? Where should I submit my jobs? Any of those sound familiar? We know it's not always easy to navigate the native scheduler tools, their syntax, and...