urn:noticeable:projects:bYyIewUV308AvkMztxixSherlock changelogwww.sherlock.stanford.edu2023-05-12T22:32:58.168ZCopyright © SherlockNoticeablehttps://storage.noticeable.io/projects/bYyIewUV308AvkMztxix/newspages/GtmOI32wuOUPBTrHaeki/01h55ta3gs1vmdhtqqtjmk7m4z-header-logo.pnghttps://storage.noticeable.io/projects/bYyIewUV308AvkMztxix/newspages/GtmOI32wuOUPBTrHaeki/01h55ta3gs1vmdhtqqtjmk7m4z-header-logo.png#8c1515urn:noticeable:publications:yYBxYUSUYLiw2D6qzR0S2023-05-12T22:30:44.259Z2023-05-12T22:32:58.168ZFinal hours announced for the June 2023 SRCF downtimeAs previously announced, the Stanford Research Computing Facility (SRCF), where Sherlock is hosted, will be powered off during the last week of June, in order to safely bring up power to the new SRCF2 datacenter. Sherlock will not be<p>As <a href="https://news.sherlock.stanford.edu/publications/srcf-is-expanding?utm_source=noticeable&amp;utm_campaign=sherlock.final-hours-announced-for-the-june-2023-srcf-downtime&amp;utm_content=publication+link&amp;utm_id=bYyIewUV308AvkMztxix.GtmOI32wuOUPBTrHaeki.yYBxYUSUYLiw2D6qzR0S&amp;utm_medium=newspage" target="_blank" title="SRCF is expanding">previously announced</a>, the Stanford Research Computing Facility (SRCF), where Sherlock is hosted, will be powered off during the last week of June, in order to safely bring up power to the new SRCF2 datacenter.</p><blockquote><p><strong>Sherlock will not be available for login, to submit jobs or to access files</strong> from <strong>Saturday June 24th, 2023 at 00:00 PST</strong> to <strong>Monday July 3rd, 2023 at 18:00 PST.</strong></p></blockquote><p>Jobs will stop running and access to login nodes will be closed at 00:00 PST on Saturday, June 24th, to allow sufficient time for shutdown and pre-downtime maintenance tasks on the cluster, before the power actually goes out. If everything goes according to plan, and barring issues or delays with power availability, access will be restored on Monday, July 3rd at 18:00 PST.</p><p>We will use this opportunity to perform necessary maintenance operations on Sherlock that can’t be done while jobs are running, which will avoid having to schedule a whole separate downtime. Sherlock will go offline in advance of the actual electrical shutdown to ensure that all equipment is properly powered off and minimize the risks of disruption and failures when power is restored.<br><br>A reservation will be set in the scheduler for the duration of the downtime: if you submit a job on Sherlock and the time you request exceeds the time remaining until the start of the downtime, your job will be queued until the maintenance is over, and the <code>squeue</code> command will report a status of <code>ReqNodeNotAvailable</code> (“Required Node Not Available”).</p><p><em>The hours leading up to a downtime are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be shorter than usual.<br><br></em>As previously mentioned, in anticipation of this week-long downtime, we encourage all users to plan their work accordingly, and ensure that they have contingency plans in place for their computing and data accessibility needs during that time. <strong>If you have important data that you need to be able to access while Sherlock is down, we strongly recommend that you start transferring your data to off-site storage systems ahead of time, to avoid last-minute complications.</strong> Similarly, if you have deadlines around the time of the shutdown that require computation results, make sure to anticipate those and submit your jobs to the scheduler as early as possible.<br><br>We understand that this shutdown will have a significant impact for users who rely on Sherlock for their computing and data processing needs, and we appreciate your cooperation and understanding as we work to improve our Research Computing infrastructure.<br><br>For help transferring data, any questions or concerns, please do not hesitate to reach out to <a href="mailto:[email protected]" rel="noopener nofollow" target="_blank">[email protected]</a>.</p>Kilian Cavalotti[email protected]urn:noticeable:publications:sfVys1ZofGziZcYKUEhR2023-02-24T02:00:00Z2023-03-01T18:49:02.984ZSRCF is expandingIn order to bring up a new building that will increase data center capacity, a full SRCF power shutdown is planned for late June 2023. It’s expected to last about a week, and Sherlock will be unavailable during that time.<p>The <a href="SRCF" rel="noopener nofollow" target="_blank" title="https://srcc.stanford.edu/facilities">Stanford Research Computing Facility</a> (SRCF), where Sherlock is hosted, has been a highly effective data center since its opening in January of 2014, and demand has grown so much that we’re expanding it! Another identical building (SRCF2) is under construction at SLAC, which will increase our data center capacity when it opens this summer.</p><p>In order to bring power to the new building, the entire existing SRCF data center will need to be shut down. The 12kV electrical infrastructure is so pervasive that for the new building to be connected safely, everything needs to be powered off, including the backup generators. It unfortunately means that all servers and equipment will need to be shut down for this event, including Sherlock.<br><br><strong>The full building power shutdown is planned for late June 2023, it’s expected to last for about a week, and Sherlock will be unavailable during that time.</strong></p><p>During the power outage, Sherlock will be entirely powered down, meaning that it will not allow login or data transfer, the <a href="https://www.sherlock.stanford.edu/docs/user-guide/ondemand/?utm_source=noticeable&amp;utm_campaign=sherlock.srcf-is-expanding&amp;utm_content=publication+link&amp;utm_id=bYyIewUV308AvkMztxix.GtmOI32wuOUPBTrHaeki.sfVys1ZofGziZcYKUEhR&amp;utm_medium=newspage" rel="noopener nofollow" target="_blank" title="Sherlock OnDemand">Sherlock OnDemand</a> interface will be down, jobs will not run, and data will not be accessible (including <code>$HOME</code>, <code>$SCRATCH</code> and <code>$OAK</code>). We expect all services to resume normally once power is back up, and jobs that were in queue before the downtime should resume being scheduled normally.<br><br><a href="https://itcommunity.stanford.edu/news/stanford-research-computing-facility-planned-shutdown-june-26-july-3-2023?utm_source=noticeable&amp;utm_campaign=sherlock.srcf-is-expanding&amp;utm_content=publication+link&amp;utm_id=bYyIewUV308AvkMztxix.GtmOI32wuOUPBTrHaeki.sfVys1ZofGziZcYKUEhR&amp;utm_medium=newspage" rel="noopener nofollow" target="_blank" title="Stanford Research Computing Facility Planned Shutdown June 26-July 3, 2023">The power outage is currently scheduled for the last week of June 2023</a>. Specific dates and times have not been finalized yet, but we will share more detailed information as the shutdown date gets closer.<br><br>In anticipation of this week-long downtime, we encourage all users to plan their work accordingly, and ensure that they have contingency plans in place for their computing and data accessibility needs during that time. If you have important data that you need to be able to access while Sherlock is down, we strongly recommend that you start transferring your data to off-site storage systems ahead of time, to avoid last-minute complications. Similarly, if you have deadlines around the time of the shutdown that require computation results, make sure to anticipate those and submit your jobs to the scheduler as early as possible.<br><br>We understand that this shutdown will have a significant impact for users who rely on Sherlock for their computing and data processing needs, and we appreciate your cooperation and understanding as we work to improve our Research Computing infrastructure.<br><br>For help in transferring data, any questions or concerns, please do not hesitate to reach out to <a href="mailto:[email protected]" rel="noopener nofollow" target="_blank">[email protected]</a>.<br></p>Kilian Cavalotti[email protected]urn:noticeable:publications:bhmmVOc9ZyiaN7E2dytq2019-10-14T21:23:00.001Z2019-10-14T21:25:08.706ZNext scheduled maintenance: Oct. 16In order to prepare future improvements of the parallel /scratch file system, as well as performing some required work on the scheduler, Sherlock will not be available during the following times: Wednesday, October 16th, 2019 - from 8...<p>In order to prepare future improvements of the parallel <code>/scratch</code> file system, as well as performing some required work on the scheduler, Sherlock will not be available during the following times:</p> <p><strong>Wednesday, October 16th, 2019 - from 8:00am to 6:00pm</strong></p> <p>Access to Sherlock will be unavailable, logins will be disabled and jobs won’t run during that period.</p> <p><em>Note: if you submit a job on Sherlock before the downtime and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The hours leading up to the maintenance are often a good time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be shorter than usual.</em></p> <p><strong>Oak storage will stay online during Sherlock’s maintenance, including all its gateways.</strong> If you need to access files on Oak during that time, you can use Globus, the shared DTN (eg. SSHFS), or other private NFS/SMB Oak gateways if you have any.</p> Kilian Cavalotti[email protected]urn:noticeable:publications:qipWwBYB76bf4efLtNM22019-01-23T00:17:00.001Z2019-01-23T01:39:38.909ZNext scheduled maintenance: Feb. 5Sherlock will not be available on Tuesday, February 5th, 2019 from 8:00am to 6:00pm. System maintenance and upgrades will be performed during this time. Access to Sherlock will be unavailable, logins will be disabled and jobs won't run...<p><strong>Sherlock will not be available on Tuesday, February 5th, 2019 from 8:00am to 6:00pm.</strong></p> <p>System maintenance and upgrades will be performed during this time. Access to Sherlock will be unavailable, logins will be disabled and jobs won’t run.</p> <p>If you submit a job on Sherlock and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over.</p> <p><em>Note that the hours leading up to the maintenance are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be shorter than usual.</em></p> Kilian Cavalotti[email protected]urn:noticeable:publications:dfMDzpXcvlXWMOgMCiYX2018-11-14T19:33:00.001Z2018-11-14T19:43:55.930ZNext scheduled maintenance: Nov. 28Sherlock will not be available on Wednesday, November 28th, 2018 from 8:00am to 6:00pm. System maintenance and upgrades will be performed during this time. Access to Sherlock will be unavailable, logins will be disabled and jobs won't...<p><strong>Sherlock will not be available on Wednesday, November 28th, 2018 from 8:00am to 6:00pm.</strong></p> <p>System maintenance and upgrades will be performed during this time. Access to Sherlock will be unavailable, logins will be disabled and jobs won’t run.</p> <p>If you submit a job on Sherlock and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over.<br> The squeue command will report a pending reason of <code>ReqNodeNotAvailable</code> (“Required Node Not Available”) for those jobs.</p> <p><em>Note that the hours leading up to the maintenance are an excellent time<br> to submit shorter, smaller jobs that can complete before the maintenance<br> begins: as the queues drain there will be many nodes available, and your<br> wait time may be shorter than usual.</em></p> Kilian Cavalotti[email protected]urn:noticeable:publications:oi33sJZ3gicHnUEF5rP42018-08-13T15:40:00.001Z2018-09-09T20:46:50.863ZHours announced for the SRCF shutdownAs previously announced, the Stanford Research Computing Facility (SRCF) will undergo major power maintenance over the Labor Day week-end (Sep. 1-3, 2018). Because it's hosted in that datacenter, Sherlock will not be available for login...<p>As <a href="https://news.sherlock.stanford.edu/posts/srcf-reboot?utm_source=noticeable&amp;utm_campaign=sherlock.hours-announced-for-the-srcf-shutdown&amp;utm_content=publication+link&amp;utm_id=bYyIewUV308AvkMztxix.GtmOI32wuOUPBTrHaeki.oi33sJZ3gicHnUEF5rP4&amp;utm_medium=newspage" target="_blank">previously announced</a>, the Stanford Research Computing Facility (SRCF) will undergo major power maintenance over the Labor Day week-end (Sep. 1-3, 2018).</p> <p>Because it’s hosted in that datacenter, <strong>Sherlock will not be available for login, to submit jobs or to access files</strong> from <strong>Friday, Aug. 31ˢᵗ at noon</strong> to <strong>Tuesday, Sep. 4ᵗʰ at 9am.</strong></p> <p>Jobs will stop running and access to login nodes will be closed at noon on Friday, Aug 31st, to allow sufficient time for shutdown and pre-downtime maintenance tasks on the cluster, before the power actually goes out. If everything goes according to plan, access will be restored on Tuesday, Sep. 4 at 9am.</p> <p>A reservation will be set in the scheduler for the duration of the downtime: if you submit a job on Sherlock and the time you request exceeds the time remaining until the start of the downtime, your job will be queued until the maintenance is over, and the <code>squeue</code> command will report a status of <code>ReqNodeNotAvailable</code> (“Required Node Not Available”).</p> <p><em>Note that the hours leading up to a downtime are an excellent time to submit shorter, smaller jobs that can complete before the maintenance begins: as the queues drain there will be many nodes available, and your wait time may be shorter than usual.</em></p> Kilian Cavalotti[email protected]urn:noticeable:publications:jlUepfq10DrNFAJB59yl2018-05-31T23:57:00.001Z2018-09-09T20:46:50.830ZSRCF RebootA shutdown of the Stanford Research Computing Facility (SRCF) is planned for Labor Day weekend, Saturday through Monday, September 1-3, due to the need for power system maintenance. We will bring Sherlock down on Friday evening, August...<p>A shutdown of the Stanford Research Computing Facility (SRCF) is planned for Labor Day weekend, <strong>Saturday through Monday, September 1-3</strong>, due to the need for power system maintenance.</p> <p><em>We will bring Sherlock down on Friday evening, August 31, and expect to bring it back up on Monday evening, September 3, 2018.</em></p> <p>During that time, jobs will not run, file systems will not be available, and login nodes will be offline. All network connectivity to Sherlock will be interrupted.</p> <p><strong>Sherlock will not be available for login, to submit jobs or to access files.</strong></p> <p>We will place a reservation in the scheduler, so jobs won't be allowed to start if they don't have a chance to end before the downtime.</p> <p>We know that this is a disruptive event, but power maintenance is a necessary operation, and all servers and equipment at the SRCF will need to be shut down, including the generators, when the power goes off. The Labor Day weekend was chosen for this event because it has consistently lower power use, implying lowest load at the SRCF.</p> <p>Please see the <a href="https://srcc.stanford.edu/events/srcf-reboot-planned-3-day-electrical-outage?utm_source=noticeable&amp;utm_campaign=sherlock.srcf-reboot&amp;utm_content=publication+link&amp;utm_id=bYyIewUV308AvkMztxix.GtmOI32wuOUPBTrHaeki.jlUepfq10DrNFAJB59yl&amp;utm_medium=newspage" target="_blank" rel="noopener">SRCF reboot page</a> for additional details, and don't hesitate to <a href="[email protected]" target="_blank" rel="noopener">reach out</a> if you have any question.</p> Kilian Cavalotti[email protected]urn:noticeable:publications:WsAnXQ37Et5HuUVLWcWU2018-01-18T00:18:02Z2018-09-09T20:46:50.791ZNext scheduled maintenance: Jan. 31Sherlock 2.0 will not be available on Wednesday, January 31st, 2018 from 8:00am to 2:00pm System maintenance and upgrades will be performed during this time. Access to Sherlock 2.0 will be unavailable, logins will be disabled and jobs...<p>Sherlock 2.0 will not be available on Wednesday, January 31st, 2018 from 8:00am to 2:00pm</p> <p>System maintenance and upgrades will be performed during this time. Access to Sherlock 2.0 will be unavailable, logins will be disabled and jobs won't run during that period.</p> <p><strong>Sherlock 1.0 will remain available during that time.</strong></p> <p>If you submit a job on Sherlock 2.0 and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available").</p> Sherlock Team[email protected]urn:noticeable:publications:nnRuGklaNSQI8vGIX96r2017-11-28T19:30:34Z2018-09-09T20:46:50.863ZNext scheduled maintenance: Dec. 12Sherlock 2.0 will not be available on Tuesday, December 12th, 2017 from 8:00am to 2:00pm System maintenance and upgrades will be performed during this time. Access to Sherlock 2.0 will be unavailable, logins will be disabled and jobs...<p><strong>Sherlock 2.0 will not be available on Tuesday, December 12th, 2017 from 8:00am to 2:00pm</strong></p> <p>System maintenance and upgrades will be performed during this time. Access to Sherlock 2.0 will be unavailable, logins will be disabled and jobs won't run during that period. <strong>Sherlock 1.0 will remain available during that time.</strong></p> <p>If you submit a job on Sherlock 2.0 and the time you request exceeds the time remaining until the maintenance begins, your job will run when the maintenance is over. The squeue command will report "ReqNodeNotAvailable" ("Required Node Not Available").</p> Sherlock Team[email protected]urn:noticeable:publications:Ixec5REXXcnhqaxGjrhe2017-09-06T23:59:08Z2018-09-09T20:46:50.761ZNext scheduled maintenance: Sept. 20Sherlock will not be available on Wednesday, September 20 2017 between 9:00 am and 2:00 pm. System maintenance and upgrades will be performed during this time. Access to Sherlock will be unavailable, logins will be disabled and jobs won...<p><strong>Sherlock will not be available on Wednesday, September 20 2017 between 9:00 am and 2:00 pm.</strong></p> <p>System maintenance and upgrades will be performed during this time. Access to Sherlock will be unavailable, logins will be disabled and jobs won't run during that period.</p> <p>We placed a maintenance reservation in the scheduler, so no new job will be allowed to start if it doesn't have a chance to end before the downtime.</p> Sherlock Team[email protected]