Site Reliability Engineer II (SRE II)
New Today
**Job Description:******Strategic Imperative:****The **Site Reliability Engineer II (SREII)** plays a key role in our cloud and data infrastructure, ensuring our products are delivered on a stable, scalable, and secure foundation. By owning AWS/Terraform environments, CI/CD pipelines, and MySQL performance, this role directly impacts release velocity, site reliability, and overall customer experience. Through thoughtful automation and scripting, the SREII reduces operational toil, increases consistency, and frees engineering teams to focus on feature delivery. Tight collaboration with cross-functional teams, paired with strong monitoring, incident response, and documentation, creates predictable, repeatable operations as we grow. By embedding security best practices and continuously evaluating new tools and approaches, this role helps the organization operate more efficiently while de-risking our infrastructure over time.****Prodege:****A cutting-edge marketing and consumer insights platform, Prodege has charted a course of innovation in the evolving technology landscape by helping leading brands, marketers, and agencies uncover the answers to their business questions, acquire new customers, increase revenue, and drive brand loyalty & product adoption. Bolstered by a major investment by Great Hill Partners in Q4 2021 and strategic acquisitions of Pollfish, BitBurst & AdGate Media in 2022, Prodege looks forward to more growth and innovation to empower our partners to gather meaningful, rich insights and better market to their target audiences.As an organization, we go the extra mile to “Create Rewarding Moments” every day for our partners, consumers, and team. Come join us today!****Primary Objectives:***** ## ****Infrastructure Management:***** ## ****Automation & Scripting:***** ## ****Database Management:***** ## ****CI/CD Integration:***** ## ****Monitoring & Optimization:***** ## ****Incident Management:***** ## ****Collaboration:***** ## ****Documentation:***** ## ****Security:***** ## ****Continuous Improvement:********Qualifications** -** *To perform this job successfully, an individual must be able to perform each job duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.*****Detailed Job Duties:***** Infrastructure Management: Utilize Terraform to define and provision AWS infrastructure. Configure and maintain AWS services (e.g., EC2, S3, RDS, Lambda, VPC).* Automation & Scripting: Develop and manage automation scripts and tools using Bash, Python, and PHP to streamline operations and enhance efficiency.* Database Management: Support and manage MySQL databases, including performance tuning, backups, and recovery.* CI/CD Integration: Implement and manage continuous integration and continuous deployment (CI/CD) pipelines using Jenkins.* Monitoring & Optimization: Monitor system performance, availability, and resource usage. Implement optimizations to enhance system efficiency and reliability.* Incident Management: Troubleshoot and resolve infrastructure issues, outages, and performance problems swiftly and effectively.* Collaboration: Work with cross-functional teams to support application deployments and address infrastructure needs.* Documentation: Create and maintain comprehensive documentation for infrastructure configurations, processes, and procedures.* Security: Ensure that all infrastructure and operations adhere to security best practices and compliance standards.* Continuous Improvement: Evaluate and adopt new technologies and practices to improve infrastructure performance and operational efficiency.Success in this will include consistently delivering a stable, secure, and scalable AWS infrastructure that supports our products without surprise outages or performance bottlenecks. Deployments run smoothly through well-maintained CI/CD pipelines and automation, with minimal manual intervention and short lead times for changes. Systems are actively monitored, with incidents investigated quickly, root causes documented, and meaningful preventative fixes implemented. Cross-functional teams feel supported because infrastructure needs are anticipated, clearly communicated, and backed by up-to-date documentation. Over time, you’re recognized for reducing operational toil, improving system performance and cost efficiency, and thoughtfully introducing new tools and practices that elevate how we run production. **The MUST Haves:** (ex: e*ducation, experience, skills, certifications, licenses required)** ## Bachelor’s degree (or equivalent) in Computer Science, Software Engineering, Information Technology, or a related discipline; or equivalent professional experience gained in a similar infrastructure/DevOps engineering role.* ## Four or more (4+) years of experience in IT operations or a related role with a strong focus on Terraform and AWS.* ## Proficiency in Terraform for infrastructure as code (IaC) management.* ## Hands-on experience with AWS services (e.g., EC2, S3, RDS, Lambda).* ## Experience with scripting languages including Bash and Python/PHP.* ## Experience managing MySQL and RDS databases.* ## Knowledge of Jenkins for CI/CD pipeline management.* ## Strong analytical and troubleshooting skills with the ability to resolve complex infrastructure issues.* ## Excellent verbal and written communication skills, with the ability to convey technical information clearly to both technical and non-technical stakeholders.**The Nice to Haves:**(ex: *preferred additional skills, education, experience, certifications, licenses*) * **AWS knowledge required, experience with Google Cloud Platform (GCP) is a plus*** **Knowledge across multiple cloud providers.*** **Certification in public cloud disciplines will be an advantage.*** **Hands-on experience with Docker and Kubernetes.*** **Usage of AI knowledge in deployment and uptime automation.**
#J-18808-Ljbffr
- Location:
- United Kingdom
- Job Type:
- FullTime