SRE Sr Leader - REMOTE

Simple Software Solutions Group · Jacksonville, Florida, US

Job description

SRE Sr Leader Remote: 6 Months

We are seeking an SRE Senior Leader to drive system uptime, performance, and scalability by blending software engineering with operational expertise. They lead teams to define SLIs/SLOs, automate infrastructure (IaC), manage incidents, and conduct post-mortems. Key roles include mentoring engineers, setting reliability strategies, and optimizing cloud costs.

Core Responsibilities:
- Leadership & Mentoring: Lead a team of SREs, manage sprint planning, and foster career growth.
- System Reliability & Strategy: Own the uptime, performance, and capacity planning of production systems.
- Automation & Tools: Reduce manual work (toil) by building automation, managing infrastructure as code (Terraform, Kubernetes), and enhancing observability.
- Incident Management: Drive root cause analysis (RCA), lead incident responses, and implement post-mortem action items.
- SLI/SLO Management: Define Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to balance velocity and reliability.

Required Skills and Qualifications:
- Technical Expertise: Proficiency in coding/scripting (e.g., Python, Go) and familiarity with CI/CD tools.
- Infrastructure Skills:...