Site Reliability Engineer (SRE) – II
Huntington National Bank · Columbus, Ohio, US
Job description
Description Summary:
As a Site Reliability Engineer (SRE) Level II, you will play a key role in maintaining the availability, scalability, and performance of critical infrastructure and services. You will be responsible for building and automating solutions that enhance system reliability and support continuous delivery. In this role, you will handle more complex operational tasks and incidents, provide mentorship to junior SREs, and collaborate with development teams to ensure systems are designed for reliability from the ground up.

Incident Management:
- complex incidents, and ensure service uptime.
- Lead troubleshooting efforts for high-impact production issues, providing detailed root cause analysis (RCA) and preventative measures.
- Participate in on-call rotations, acting as an escalation point for Level 1 SREs during major incidents.

Automation & Infrastructure as Code (IaC):
- Develop and maintain automation scripts and infrastructure using tools like Terraform, Ansible, or CloudFormation.
- Implement automation solutions to eliminate manual tasks and improve system reliability, scalability, and performance.

Performance & Scalability:
- Analyze system performance and rec...