This standard ensures systems are instrumented to expose meaningful data about their behaviour, enabling teams to detect issues early, understand impact, and make informed decisions.
Aligned to our "Data-Driven Decision-Making" and "Engineering Excellence First" policies, this standard supports proactive monitoring, faster recovery, and better user outcomes. Without it, teams operate in the dark, increasing risk and reducing system trust.
Meeting this standard improves delivery flow, reduces risk, increases system resilience, and aligns systems more closely with business needs. Over time, teams will see reduced rework, faster time to value, and stronger system integrity.
Level 1 – Initial: Visibility into system behaviour is limited and reactive.
Level 2 – Managed: Some metrics and logs are available but lack standardisation.
Level 3 – Defined: Systems include consistent observability patterns and telemetry.
Level 4 – Quantitatively Managed: Teams use observability data to predict and prevent issues.
Level 5 – Optimising: Observability is proactive, embedded, and used to continuously improve resilience and performance.
Teams instrument their systems to make operational and business metrics visible and actionable by default.
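As an illustration of instrumenting operational and business metrics by default, the following is a minimal sketch using only the Python standard library. The `Metrics` class, the `place_order` function, and all metric names are hypothetical and are not prescribed by this standard; in practice a team would more likely use an established telemetry client than an in-process registry like this one.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class Metrics:
    """Tiny in-process registry; a hypothetical stand-in for a real telemetry client."""

    def __init__(self):
        self.counters = defaultdict(int)
        self.timings = defaultdict(list)

    def incr(self, name, value=1):
        # Add `value` to the named counter.
        self.counters[name] += value

    @contextmanager
    def timed(self, name):
        # Record the wall-clock duration of the wrapped block, even if it raises.
        start = time.perf_counter()
        try:
            yield
        finally:
            self.timings[name].append(time.perf_counter() - start)

    def snapshot(self):
        # Return a plain-dict view suitable for logging or export.
        return {
            "counters": dict(self.counters),
            "timings": {
                name: {"count": len(samples), "max_s": max(samples)}
                for name, samples in self.timings.items()
            },
        }


metrics = Metrics()


def place_order(order_total_cents):
    """Hypothetical business operation that is instrumented by default."""
    with metrics.timed("place_order.duration"):
        metrics.incr("place_order.requests")  # operational metric
        if order_total_cents <= 0:
            metrics.incr("place_order.errors")  # operational metric
            raise ValueError("order_total_cents must be positive")
        metrics.incr("orders.revenue_cents", order_total_cents)  # business metric
        return "accepted"
```

The request, error, and duration metrics describe how the system is behaving, while the revenue counter ties that behaviour to a business outcome. Recording both in the same code path is one way to make them visible without extra effort from the caller.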