Today, I spent an hour listening to an incident post-mortem. By design, these sessions are blameless; the goal is to extract learnings. However, these learnings often stay localized within a single team or organization.
In the pursuit of 0 OOSLA (Out Of SLA) and putting out daily on-call fires, teams often struggle to see the value in looking beyond their own span of control. In large organizations, the friction of cross-org alignment only adds to the complexity.
But the real power of post-mortem action items is unlocked when we scale them to the company level. Is it easy? Certainly not. But it can be fulfilling to know your “local” fix potentially prevented dozens of “global” incidents before they even started. Leveraging agentic products to find similar gaps or vulnerabilities is a great opportunity to scale this as well. If you find yourself in a similar situation, speak up and question it next time.
