Site Reliability Engineering Lead

We are seeking an experienced Site Reliability Engineering leader to join a high-growth SaaS organisation in a hybrid role that combines technical leadership with hands-on engineering. This is a key position for someone passionate about reliability, resilience, and running production systems at scale.
The successful candidate will lead and mentor the SRE team, set the technical direction for reliability engineering, and take end-to-end ownership of production systems. They will be accountable for availability, performance, and incident response, while working closely with Product and Engineering to define SLIs and establish meaningful SLOs that balance stability with delivery pace. They will champion a blameless culture, embedding robust incident management processes and driving continuous, systemic improvement.
Key skills and experience:
Proven experience as a Lead or Senior SRE with a strong software engineering background
Strong programming ability in PHP and Java or .NET
Experience defining SLIs, setting SLOs, and using error budgets to guide decision-making
Demonstrated ownership of production systems with full accountability for uptime and resilience
Hands-on experience building and running incident management processes, including blameless postmortems
Strong knowledge of observability and monitoring tools (e.g. Prometheus, Grafana, Datadog)
Solid Linux sy...
Location:
London
Salary:
£90,000 - £110,000
Job Type:
Full-time
Category:
Engineering