Senior Site Reliability Engineer
SoundHound AI · CA
The Opportunity This is a high-ownership role with direct influence over infrastructure decisions. The team has a clear roadmap focused on improving reliabil...
Job description
The Opportunity This is a high-ownership role with direct influence over infrastructure decisions. The team has a clear roadmap focused on improving reliability, security posture, and operational maturity. The Senior Site Reliability Engineer helps build first-class infrastructure to deliver our best-in-class technology to the world. The infrastructure is large and complex, running in the cloud and on Kubernetes, so there's no shortage of interesting problems. What You'll Do: - Build software and systems for cloud infrastructure management and automation (Terraform, Ansible, Oracle Cloud, GCP) - Participate in developing frameworks for application deployment, customization, and upgrades (Kubernetes, ArgoCD, Vault, Jenkins) - Ensure application and infrastructure security complies with ISO 27001 / SOX / PCI - Improve observability, implement and measure key metrics, and define and enforce SLOs/SLAs (Prometheus, Grafana, ELK) - Collaborate with engineering, quality engineering, and product management to architect and build highly available, reliable, and secure systems What You'll Bring: - 8 years of experience working with cloud services at scale in a high-volume customer-facing env...