Senior Site Reliability Engineer (SRE)

New Today

Senior Site Reliability Engineer (SRE) Remote 12-month contract (high chance of extension) Job Description Join a global pioneer in the video game industry and own the reliability of high-traffic, revenue-critical platforms used by millions worldwide. As a Senior SRE, you'll shape the architecture, improve platform-wide resiliency, and ensure services stay performant, scalable, and secure. This isn't just about maintaining a single system, you'll influence reliability across multiple services, driving improvements that touch the entire ecosystem. Key Responsibilities Lead incident response and troubleshooting for production systems, resolving high-severity issues and driving post-incident improvements. Influence architecture to improve platform-wide reliability, resiliency, and operational efficiency, ensuring services remain available under heavy load. Drive containerisation best practices and manage Kubernetes-based workloads at scale. Build and maintain event-driven architectures that scale globally while ensuring fault-tolerance and high availability. Automate infrastructure provisioning, deployment, and monitoring using Infrastructure as Code (Terraform, CloudFormation, Ansible, CDK). Collaborate with engineering, product, and security teams to define SLOs, SLIs, and error budgets across servic...
Location:
London
Salary:
not provided
Job Type:
FullTime
Category:
Engineering

We found some similar jobs based on your search