Senior II Site Reliability Engineer
Akamai Technologies · PL
Do you want to shape reliability practices for a new AI inference platform? Are you a senior technical leader who drives solutions across teams? Join the Aka...
Job description
Do you want to shape reliability practices for a new AI inference platform? Are you a senior technical leader who drives solutions across teams? Join the Akamai Inference Cloud Team The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design, implement, deploy and operate AI platforms that enable customers to run inference models and developers to create AI applications. Partner with the best: In this role, you'll lead reliability workstreams for Akamai's serverless inference platform, desig SRE tooling and automation, and drive technical decisions. Opportunities exist to mentor other SREs, influence architecture decisions with product engineering teams, and shape SRE practices for AI inference workloads and GPU infrastructure at scale. As a Senior II SRE, you will be responsible for: - Taking responsibility for observability strategy, designing telemetry, dashboards, alerts, defining SLO/SLI frameworks, and implementing improvements when targets are missed. - Building production-grade automation and tooling that reduces operational toil, improves incident response, and sets patterns that other SREs adopt - Owning incident management integration for infere...