Senior Site Reliability Engineer (SRE)
New Yesterday
Remote
12-month contract (high chance of extension)
Job Description
Join a global pioneer in the video game industry and own the reliability of high-traffic, revenue-critical platforms used by millions worldwide. As a Senior SRE, you'll shape the architecture, improve platform-wide resiliency, and ensure services stay performant, scalable, and secure. This isn't just about maintaining a single system, you'll influence reliability across multiple services, driving improvements that touch the entire ecosystem.
Key Responsibilities
- Lead incident response and troubleshooting for production systems, resolving high-severity issues and driving post-incident improvements.
- Influence architecture to improve platform-wide reliability, resiliency, and operational efficiency, ensuring services remain available under heavy load.
- Drive containerisation best practices and manage Kubernetes-based workloads at scale.
- Build and maintain event-driven architectures that scale globally while ensuring fault-tolerance and high availability.
- Automate infrastructure provisioning, deployment, and monitoring using Infrastructure as Code (Terraform, CloudFormation, Ansible, CDK).
- Collaborate with engineering, product, and security teams to define SLOs, SLIs, and error budgets across services.
- Provide mentorship, advocate SRE best practices, and ensure teams are empowered to deliver resilient, reliable systems.
Experience / Must-Have Skills
- Extensive experience in AWS and AWS-managed services (EC2, Lambda, S3, VPC, CloudWatch, CloudTrail, IAM, EKS, Service Catalog, multi-account environments).
- Strong Kubernetes / container orchestration experience, including EKS, OpenShift, Docker, and service mesh.
- Deep understanding of networking fundamentals: DNS, VPCs, routing, load balancing, TCP/IP, firewall policies.
- Proven track record in incident response and troubleshooting at scale.
- Hands-on experience with infrastructure automation and CI/CD pipelines.
- Experience designing event-driven architectures and resilient systems.
- High level of autonomy, able to influence platform-wide decisions and architect for reliability across services.
- Ability and desire to mentor junior staff
- Bonus: experience in gaming, interactive entertainment, or other high-traffic, global-scale platforms.
If you are interested in this role, please feel free to submit your CV.
- Location:
- London
- Job Type:
- FullTime
We found some similar jobs based on your search
-
New Yesterday
Senior Site Reliability Engineer (SRE)
-
London
Remote 12-month contract (high chance of extension) Job Description Join a global pioneer in the video game industry and own the reliability of high-traffic, revenue-critical platforms used by millions worldwide. As a Senior SRE, you'll shape the ...
More Details -
-
4 Days Old
Senior Software Engineer, Site Reliability Engineering, Core SRE
-
Greater London, England, United Kingdom
Senior Software Engineer, Site Reliability Engineering, Core SRE corporate_fare Google place London, UK Apply Qualifications Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 5 years of experience with softw...
More Details -
-
15 Days Old
Senior Site Reliability Engineer (SRE)
-
London
- IT
Senior Site Reliability Engineer (SRE) Remote 12-month contract (high chance of extension) Job Description Join a global pioneer in the video game industry and own the reliability of high-traffic, revenue-critical platforms used by millions worldwi...
More Details -
-
19 Days Old
Senior Software Engineer, Site Reliability Engineering, Core SRE
-
Greater London, England, United Kingdom
Senior Software Engineer, Site Reliability Engineering, Core SRE corporate_fare Google place London, UK Apply Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 5 years of experience with software development...
More Details -
-
21 Days Old
Senior Software Engineer, Site Reliability Engineering, Core SRE
-
Greater London, England, United Kingdom
Minimum qualifications Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 5 years of experience with software development in one or more programming languages. 3 years of experience in designing, analyzing, a...
More Details -
-
21 Days Old
Senior Software Engineer, Site Reliability Engineering, Core SRE
-
United Kingdom
Minimum qualifications: Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 5 years of experience with software development in one or more programming languages. 3 years of experience in designing, analyzing, ...
More Details -