During managed service incidents, this page documents the incident response playbooks the Core Services team can use when issues arise in the Sourcegraph Managed Services Platform (MSP) fleet and shared platform.

<aside> 💡 For more per-managed-service-service/MSP-operator-oriented guidance, refer to the Managed Services infrastructure pages instead.

</aside>



Basics

Declaring an incident

If a MSP service outage occurs, you should declare an Incidents , which more or less means using the /incident command to create an incident. Assess the impact of the outage and configure the incident as appropriate:

Infrastructure access

Quick links and brief summary below - for more details refer to the more generalized guidance.

Service-specific guidance is generated in Managed Services infrastructure pages.

Changing infrastructure

Custom terraform

Custom Terraform (*.tf) can be added to relevant environment workspaces in the managed-services repository to quickly provision and manage custom infrastructure using Terraform Cloud during an incident, without needing to make significant changes to sg msp to introduce a new resource.

CLI-apply mode

In peacetime, all service workspaces are left in "VCS mode", where the remote managed-services repository is used when running Terraform plan and apply in Terraform Cloud. Changes to the repository automatically triggers a plan as part of repository CI, and merging to main automatically deploys the workspaces.