The latest news from incident.io HQ

We’re building the best way for your whole organization to respond, review and learn from incidents. This is where we talk about how and why.

Data

Data quality testing

Our data observability workflow uses data quality testing to ensure data meets accuracy, consistency, and reliability standards, enabling confident, data-driven decisions. See how we built it, the common challenges we encountered, and the solutions.

Lambert Le ManhPicture of Lambert Le Manh

Lambert Le Manh

6 min read
Article

A new era for Catalog

Over the past year, we’ve been working incredibly hard behind the scenes to make Catalog even more powerful and usable with new features, integrations, and more.

Charlie KingstonPicture of Charlie Kingston

Charlie Kingston

13 min read
Engineering

Building On-call: Our observability strategy

Our customers count on us to sound the alarm when their systems go sideways—so keeping our on-call service up and running isn’t just important; it’s non-negotiable. To nail the reliability our customers need, we lean on some serious observability (or as the cool kids say, o11y) to keep things running smoothly.

Martha LambertPicture of Martha Lambert

Martha Lambert

21 min read
Article

Introducing: incident.io for Microsoft Teams

The wait is finally over. Introducing: incident.io for Microsoft Teams 🔥

Ed DeanPicture of Ed Dean

Ed Dean

5 min read
Engineering

Building On-call: Continually testing with smoke tests

Launching On-call meant we had to make our system rock-solid from the get-go. Our solution? Smoke tests to let us continually test product health and make sure we're comfortable making changes at pace.

Rory MalcolmPicture of Rory Malcolm

Rory Malcolm

11 min read
Article

Introducing SEV0

Welcome to our first-ever conference, taking place this September.

Stephen WhitworthPicture of Stephen Whitworth

Stephen Whitworth

3 min read
Data

Data stack 2024

It's been nearly 2 years since our last update on our data stack—and we have a lot to share! Read about improvements to our local dev setup, why we switched key platforms, and some other cool things. 👀

Jack ColseyPicture of Jack Colsey

Jack Colsey

11 min read
Article

Where does the time go after you resolve an incident?

We were curious: where does the time go after an incident is resolved? To find out, we analyzed the post-incident process of 13,000 incidents and 14,000 follow-ups action items.

Eryn CarmanPicture of Eryn Carman

Eryn Carman

8 min read
Data

How our data team handles incidents

Data incidents are just like any other type of incident, and having a well defined data incident management process in place makes it a lot less stressful when things inevitably break. Here's how our team does it.

Navo DasPicture of Navo Das

Navo Das

7 min read

Stay in the loop: subscribe to our RSS feed.

Move fast when you break things