On-call Alerts, schedules, escalations, and more

New

We were incredibly excited to finally share our new On-call product with you last week ๐ŸŽ‰

Weโ€™ve been shipping loads of great work in our existing Response and Status Page products, but weโ€™ve also been hard at work behind the scenes building On-call, which until now we haven't been able to talk about.

It's great to finally be able to show you what we're building going forward. In the meantime, we thought you might enjoy a few highlights as well as a long list of smaller improvements we made in the run-up to launch below.

With the release of On-call, we now not only provide a product for effective incident coordination, but we can also ensure your team gets proactively notified via phone, email, SMS, push notification and Slack when there is an issue, too ๐Ÿ””.

Our On-call solution is built from four main components:

  • Alerts: Connect tools like Datadog and Sentry to notify us if something goes wrong
  • Schedules: Configure who should be on-call and when
  • Escalation Paths: Define who to contact be contacted when something goes wrong
  • Notifications: Notify your team via SMS, phone call, email, Slack and ourย mobile app โœจ

Alerts

โ„น๏ธWe launched Alerts to all our customers back in January of this year

Alerts allows organizations to automatically create incidents when something fires from monitoring tools such as Datadog or Sentry. Auto-creating incidents removes unnecessary friction from the incident response process (like having to manually create incidents from alerts).

As part of the On-call release, we have added a new section to your Alert routes which allows you to specify how you want to escalate these alerts.

For example, if you would like to notify the relevant team via their escalation path and if you would like to set a delay on escalating grouped alerts (thereby giving you a chance to mark alerts as related in the incident channel before your phone starts ringing).

Schedules

Schedules determine when individuals from a team or group will be on-call. Within your schedule creation, you will have the ability to configure:

  • When your handover will take place (i.e. every Monday at 9am)
  • When the schedule will be active (i.e. all day, only during business hours)
  • Which individuals will be on-call, and if you want more than one person on-call simultaneously
  • Additional rotas, which can support models such as:
    • Follow-the-sun (i.e. spread support across timezones)
    • Shadowing (i.e. onboarding new on-call individuals)

Additionally, weโ€™ve built out support for overrides, making it easier for your teams to handle on-call coverage if someone needs to pop out to the gym or head to a doctorโ€™s appointment.

๐Ÿ‘€ Make sure to check out /inc cover me in Slack, which allows you to request coverage from other team members on the same schedule.

Escalations

Escalation paths tell us who you would like to reach out to and in what order. You can configure:

  • Who to notify (either individuals or schedules) and in what order
  • How long to try notifying before moving to the next stage
  • How many times to try notifying if escalations go unacknowledged

Note, if you do not have a schedule set up, you can create a schedule inline from the escalation path creation flow or follow instructions below in the On-call > Schedules section of the dashboard.

๐Ÿ‘€ Donโ€™t forget to check out /inc escalate to manually escalate during an incident. You can configure the form and utilize the catalog to make it easier for individuals to find the right person to escalate to.

Notifications

On-call supports notifications via Slack, SMS, phone call, email and push notification via our mobile app.

If you are an on-call, youโ€™ll be able to:

  • Determine where youโ€™d like to receive notifications when something goes wrong and about upcoming shift notifications
  • Create rules for receiving notifications (ie. notify me immediately via push notification)
  • Test your notifications to make sure things are working before starting your shift

๐Ÿ‘€ We will tell you if you set your notification rules in a way that violates an escalation policy you are on, so you can adjust accordingly.

Mobile app

The mobile app for On-call will be the easiest way of engaging with On-call from setting up your on-call preferences to quickly acknowledging issues.

In the mobile app you can:

  • Set up critical alerts to ensure push notifications can bypass silent and โ€œdo not disturbโ€ modes
  • Add contact card syncing to allow for SMS and phone calls to also bypass silent and โ€œdo not disturbโ€ modes
  • Configure your mobile app to light or dark mode

๐Ÿ‘€ One of the perks of having The Chainsmokers as investors is that when we were trying to figure out what our alert noises could be, they were only too happy to help. Thanks, team! ๐ŸŽต

If you are interested in learning more about On-call, please reach out to hello@incident.io. And, donโ€™t forget to check out our On-call help docs to help you get started!

๐Ÿš€ What else weโ€™ve shipped

Weโ€™ve been building On-call behind closed doors for a little while, which meant we havenโ€™t been able to publicly shout about some of the bits and bobs weโ€™ve shipped and fixed along the way.

There are far too many things to list out so weโ€™ve cherry picked some highlights from just the past month as we approached launch day!

๐Ÿ†• You can sync your shifts or the whole schedule to your calendar! (e.g. Google Calendar)

๐Ÿ†• We now support paging using incident.io On-call via Workflows

๐Ÿ†• The "User acknowledges an escalation" workflow now triggers for incidents escalated with incident.io On-call (already possible with existing providers)

๐Ÿ†• You can now add more than one notification rule per notification method (ie. Notify me immediately via SMS and notify me after 5 minutes via SMS again)

๐Ÿ†• You can now be notified with a delay up to 10 minutes

๐Ÿ†• When you have multiple consecutive alerts pending on an incident, we'll now roll them up into a single summary message instead of overwhelming the incident channel

๐Ÿ†• You can create test incidents from alert routes, to help you test your configuration more easily

๐Ÿ†• Within Preferences, you can now remove mobile app installations as a notification channel (ie. when changing or upgrading phones)

๐Ÿ†• When using an expression for your alert attributes, you can now specify a fallback value

๐Ÿ†• In incidents, you can now escalate to a user, in addition to escalation paths, from the dashboard

๐Ÿ†• You can receive upcoming shift email notifications

๐Ÿ†• Add ability to useย startsWithย in Javascript expressions for alerts

๐Ÿ†• Inspect your alerts from the alert detail page

๐Ÿ†• Empty alert attributes will now show in the alert inspect drawer

๐Ÿ†• Non-paging notifications will be sent as non-critical on the app

๐Ÿ†• Newย help centerย doc for bypassing focus/silent modes

๐Ÿ†• We will backfill your alerts if you configure a new attribute

๐Ÿ†• You can duplicate any existing escalation path

๐Ÿ†• Added support for sending SMS from our shortcode numbers

๐Ÿ’… Introduced visual consistency to escalations list view

๐Ÿ’… Added schedule name to upcoming schedule notification emails

๐Ÿ’… Improved user experience around adding phone number notification methods

๐Ÿ’… Donโ€™t auto-acknowledge alert escalations when you mark an alert as unrelated, in case you forget about it

๐Ÿ’… Links to schedules and escalation paths in upcoming shift messages are now clickable

๐Ÿ’… Improved styling and information on email on-call notifications

๐Ÿ’… Alert routes/sources no longer show as clickable to users without permissions

๐Ÿ’… Improve layout of sections in alerts details page

๐Ÿ’… Escalation paths are now listed alphabetically in the dashboard

๐Ÿ’… Show a warning when we truncate overrides

๐Ÿ’… Added more sensible alert notification defaults

๐Ÿ’… Allow ability to delete expressions from alert routes and sources

๐Ÿ’… On-call received a new icon in the sidebar

๐Ÿ’… Improved shift notification text in user preferences

๐Ÿ’… On-call QR codes point to the public app store applications

๐Ÿ’… Improved language for grouping alerts on the alert route

๐Ÿ’… Schedule overrides start will default to now if the shift has already started

๐Ÿ’… Fix styling of the On-call widget in the dashboard sidebar

๐Ÿ’… Improved responsiveness of the escalations page

๐Ÿ’… You can now disable all shift change notifications

๐Ÿ’… Improved the accuracy of the override preview

๐Ÿ’… You can now see the name of the mobile devices you've installed the app to, and how long ago they were used

๐Ÿ’… Added the ability to manage expressions within alert source and alert route forms

๐Ÿ’… Clarified messaging when we don't escalate because nobody is on-call

๐Ÿ’… Removed some confusing colour palettes from schedules

๐Ÿ’… When editing a schedule, we'll now show a confirmation dialog before navigating away if you have unsaved changes

๐Ÿ’… When declining an incident with pending alerts through the dashboard, you can now select all alerts instead of having to click through one by one to determine next steps

๐Ÿ’… We've improved the formatting of alert messages in Slack to be easier to scan

๐Ÿ’… The phone and SMS notification rules now display the phone number of that rule

๐Ÿ’… Notification rules and shift change notification rules are now both ordered by time, with delayed notifications shown further down in the list

๐Ÿ› Repeating escalations on escalation policies work

๐Ÿ› Fixed a bug where users were unable to manually escalate from the dashboard

๐Ÿ› Incidents will always get created from an alert, even if we canโ€™t set custom fields

๐Ÿ› Protect custom fields from being deleted if they are used in an alert route

๐Ÿ› Blocked access via the API to native manual escalations from private incidents

๐Ÿ› Fixed bug where related incidents to alerts were not showing when active or while in the post-incident phase

๐Ÿ› Correctly show users when they donโ€™t have permissions to create an alert source

๐Ÿ› Fixed shift length duration preview being incorrect when switching timeline size in override modal

๐Ÿ› Fixed bug where errored attributes were not showing in the alert inspect drawer

๐Ÿ› Fixed a bug in schedule/alert route deletion protection

๐Ÿ› Fixed a bug where the override creation modal would preselect the incorrect layer

๐Ÿ› Fixed a bug where upcoming shift emails would display the incorrect timezone

๐Ÿ› Fixed a bug that was causing alert filters to behave unpredictably

๐Ÿ› Fixed a bug that was causing schedules in non-UTC timezones to render entries incorrectly escalations

๐Ÿ› Fixed a bug around alert transition states

๐Ÿ› Fixed a bug when displaying on-call users from multiple rotas

๐Ÿ› Fixed a bug where we were displaying โ€œmissing variableโ€ in templated text expressions

๐Ÿ› Ensure we always display the latest action in the escalations list view

๐Ÿ› Smoother experience when editing attribute expressions

๐Ÿ› Fixed bugs with overrides on schedules with multiple rotations

๐Ÿ› Override previews now show up at the correct times on the timeline

๐Ÿ› We now correctly show the warning that an override is outside working hours when it actually is outside working hours

๐Ÿ› The notification for users going on-call for the first time is now only sent once

๐Ÿ› The "next changeover" time is now shown correctly for schedules where the schedule currently has no one on-call

๐Ÿ› Content no longer overflows between the panels when editing an alert source

๐Ÿ› /inc cover me now works in the same places that /inc cover does

๐Ÿ› Include overrides when calculating the next handover

๐Ÿ› Fixed auto-incrementing of acknowledgement options via SMS

๐Ÿ› Fixed on-call notification emails that donโ€™t have associated incidents

๐Ÿ› Fixed title character escaping in Slack escalation messages

๐Ÿ› Always ingest alerts, even if we canโ€™t evaluate an attribute

๐Ÿ› Fixed a bug where the wrong escalation path level was deleted while editing

๐Ÿ› Correctly show pending escalations in the timeline view again

๐Ÿ› Fixed a bug where not picking up a phone page would acknowledge it

๐Ÿ› We will correctly display if youโ€™re on-call indefinitely

๐Ÿ› We correctly escape Slack special characters in messages for alerts

๐Ÿ› Fixed a bug where the templated text editor wasn't always correctly displaying expression variables

๐Ÿ› Fixed a bug where expressions weren't correctly being created in alert routing

๐Ÿ› Fixed a bug where we were displaying empty schedule entries

Share on

Move fast when you break things