JobMesh

Senior Site Reliability Engineer

Oracle · Nashville, Tennessee, US

OCI Incident Response is the first line of defense in maintaining the high availability of Oracle’s cloud. We minimize customer-impacting events by making th...

Job description

OCI Incident Response is the first line of defense in maintaining the high availability of Oracle’s cloud. We minimize customer-impacting events by making them shorter, less frequent, and less impactful through large-scale incident management. We are at the forefront of reducing event duration by leveraging our operational experience, knowledge of best practices, and ability to develop tools that automate incident management. Description: We are looking for a Senior Site Reliability Engineer to join our OCI team. This role is part of a globally distributed team responsible for detecting, triaging, and mitigating OCI service-impacting events as quickly as possible. You will be part of one of these regional teams and will be responsible for minimizing the downtime of OCI services. You will achieve this by delivering excellent major incident management and operating systems with high scalability, performance, and security that help prevent incidents from occurring. Oracle’s Cloud is state-of-the-art and constantly evolving. When issues arise, your team will respond within minutes to ensure customer impact is minimized. This role will expose you to the inner workings of OCI’s systems a...