How we leverage our Product Responder role to push our pace of development
At incident.io, Product Responder function plays a pivotal role in our ability to maintain a steady pace of development. Here, I'll highlight what the role is responsible for and explain how it makes us a better team.
incident.io
How our engineering team uses Polish Parties to maintain quality at pace
In a fast-moving company, quality cannot be delegated to a few individuals—it has to be a shared responsibility. One tool that helps us maintain our quality of work is Polish Parties. Here's how we run these crucial feedback sessions.
Leo Sjöberg
What is an SRE? Understanding the responsibilities of this crucial function
Site reliability engineers are responsible for quite a bit, but one thing is clear—their role is critical. In this article, we break down everything you need to know about SREs and what they focus on.
incident.io
How we achieved pixel-perfect polish during our Status Pages launch
When we launched Status Pages, we wanted to challenge industry norms and push our design polish to new levels. As an engineering team, here's how we worked with our design team to make this happen.
Dimitra Zuccarelli
Barcelona 2023 Company Offsite Recap
Last month, the team gathered for our second company offsite in sunny, oceanside Barcelona. Here's how it went.
Luis Gonzalez
Better security for your app's secrets
What comes after your default, out-of-box application secret solution? How do you add security to Heroku's environment variables, or go beyond putting secrets directly into Kubernetes? We've used GCP Secret Manager to improve our app secret handling, and this post shows how you can do the same.
Lawrence Jones
Effective incident escalations
In the ever-evolving digital landscape, every organization must confront its fair share of incidents. Regardless of the sector or size, one common thread weaves through them all: the need for effective incident management. A crucial part of this management is incident escalation.
Chris Evans
Driving successful change: Understanding DORA's Change Failure Rate metric
By using DORA's change failure rate metric, organizations can highlight inefficiencies in deployment processes and prevent pesky incidents from repeating.
Luis Gonzalez
Service level indicators: 6 key metrics for effective incident management
In this article, I'll highlight six important SLI metrics that can help drive better incident management processes.
incident.io
Stay in the loop: subscribe to our RSS feed.