The latest news from incident.io HQ

We鈥檙e building the best way for your whole organization to respond, review and learn from incidents. This is where we talk about how and why.

Article

The ultimate guide to on-call schedules

Learn how to create effective on-call schedules that balance operational efficiency with employee well-being. Explore best practices, scheduling patterns, and tools to minimize downtime and prevent burnout.

Chris EvansPicture of Chris Evans

Chris Evans

15 min read
Article

What does SLO stand for? A complete guide to Service Level Objectives (SLOs)

In the land of tech acronyms, SLOs might feel like just another buzzword, but they鈥檙e crucial to a successful service strategy. This post breaks down the essentials: what SLOs are, why they matter, and how they can keep you sane (and your users happy).

Kate Bernacchi-SassPicture of Kate Bernacchi-Sass

Kate Bernacchi-Sass

10 min read
Data

Data quality testing

Our data observability workflow uses data quality testing to ensure data meets accuracy, consistency, and reliability standards, enabling confident, data-driven decisions. See how we built it, the common challenges we encountered, and the solutions.

Lambert Le ManhPicture of Lambert Le Manh

Lambert Le Manh

6 min read
Article

A new era for Catalog

Over the past year, we鈥檝e been working incredibly hard behind the scenes to make Catalog even more powerful and usable with new features, integrations, and more.

Charlie KingstonPicture of Charlie Kingston

Charlie Kingston

13 min read
Engineering

Building On-call: Our observability strategy

Our customers count on us to sound the alarm when their systems go sideways鈥攕o keeping our on-call service up and running isn鈥檛 just important; it鈥檚 non-negotiable. To nail the reliability our customers need, we lean on some serious observability (or as the cool kids say, o11y) to keep things running smoothly.

Martha LambertPicture of Martha Lambert

Martha Lambert

21 min read
Article

Introducing: incident.io for Microsoft Teams

The wait is finally over. Introducing: incident.io for Microsoft Teams 馃敟

Ed DeanPicture of Ed Dean

Ed Dean

5 min read
Engineering

Building On-call: Continually testing with smoke tests

Launching On-call meant we had to make our system rock-solid from the get-go. Our solution? Smoke tests to let us continually test product health and make sure we're comfortable making changes at pace.

Rory MalcolmPicture of Rory Malcolm

Rory Malcolm

11 min read
Article

Introducing SEV0

Welcome to our first-ever conference, taking place this September.

Stephen WhitworthPicture of Stephen Whitworth

Stephen Whitworth

3 min read
Data

Data stack 2024

It's been nearly 2 years since our last update on our data stack鈥攁nd we have a lot to share! Read about improvements to our local dev setup, why we switched key platforms, and some other cool things. 馃憖

Jack ColseyPicture of Jack Colsey

Jack Colsey

11 min read

Stay in the loop: subscribe to our RSS feed.

Move fast when you break things