This page includes incident response playbooks the Core Services team can use when issues arise in the Sourcegraph Managed Services Platform (MSP) fleet and shared platform.

<aside> 💡 For more per-MSP-service/MSP-operator-oriented guidance, refer to the Managed Services infrastructure pages instead.

</aside>



Basics

Declaring an incident

If a MSP service outage occurs, you should declare an Incidents , which more or less means using the /incident command to create an incident. Assess the impact of the outage and configure the incident as appropriate:

Infrastructure access

Quick links and brief summary below - for more details refer to the more generalized guidance.

Service-specific guidance is generated in Managed Services infrastructure pages.

Changing infrastructure

Custom terraform

Custom Terraform (*.tf) can be added to relevant environment workspaces in the managed-services repository to quickly provision and manage custom infrastructure using Terraform Cloud during an incident, without needing to make significant changes to sg msp to introduce a new resource.

CLI-apply mode

In peacetime, all service workspaces are left in "VCS mode", where the remote managed-services repository is used when running Terraform plan and apply in Terraform Cloud. Changes to the repository automatically triggers a plan as part of repository CI, and merging to main automatically deploys the workspaces.