The latest news from incident.io HQ

We’re building the best way for your whole organization to respond, review and learn from incidents. This is where we talk about how and why.

Article

Why engineering teams are moving from PagerDuty to incident.io On-Call

For years, teams have had to settle for legacy on-call tools that feel more like a burden than a solution. We recently hosted a webinar on migrating from PagerDuty to incident.io On-call.

Stephen Whitworth

7 min read

Engineering

How we interview engineers in 2025

We've recently grown to 80 people across London, San Francisco, and New York, and naturally, our interview process has evolved. We thought it was time for an update on our engineer interview process to keep things transparent and accessible to candidates.

Chris Class

6 min read

Article

Automated incident response: Why it matters and where it’s headed

For years, incident response has been a mostly manual process: someone gets paged, scrambles to investigate, loops in the right people, and after some firefighting, hopefully resolves the issue before too many customers notice. But as modern systems become more complex and interconnected, the old ways don’t scale. That’s where Automated Incident Response (AIR) comes in.

Tom Wentworth

8 min read

Engineering

Debugging deadlocks in Postgres

Deadlocks are a natural hurdle in backend development, but with a bit of digging and careful design they can be identified and resolved.

Louis Heath

7 min read

Article

Overhauling PagerDuty’s data model: a better way to route alerts

PagerDuty has long been the go-to solution for reliable on-call management, but its aging data model and lack of innovation have become a challenge. In this post we explore how incident.io On-call offers a better, more flexible approach to alert routing and provide practical advice on how to migrate smoothly from PagerDuty.

Chris Evans

12 min read

Data

How data habits help build a data culture

Building a data-driven culture in a company is hard, but we've made it possible across incident.io with some tried-and-tested strategies.

Navo Das

7 min read

Article

The Incident Maturity Model

Incidents are inevitable—how you handle them matters. The Incident Maturity Model shows how to level up from basic response to company-wide resilience, with actionable steps backed by real data. Where does your team stand?

Stephen Whitworth

13 min read

Article

The flight plan that brought UK airspace to its knees

On August 28, 2023, a software bug in the UK air traffic control system caused six hours of chaos, reducing air traffic capacity and forcing manual operations. It's a great story of failure, resilience and communications in complex systems.

Chris Evans

25 min read

Article

AWS re:Invent: The handy guide for the massive conference

AWS re:Invent is packed with 3,000+ sessions for developers, covering everything from scaling apps to generative AI. In this guide, we break down the sessions you shouldn't skip. If you're headed to Vegas, don't forget to stop by and say hi!

incident.io

11 min read