JobMesh

Site Reliability Engineer ID53670

AgileEngine · Madrid, ES

AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among...

Job description

AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. WHY JOIN US: If you're looking for a place to grow, make an impact, and work with people who care, we'd love to meet you! ABOUT THE ROLE: We are looking for a: SRE Operations Engineer: to maintain reliability across a cloud-based SaaS platform. You’ll handle live incidents, improve observability, and reduce toil through automation using Kubernetes, Terraform, Grafana, and AWS. Hands-on, execution-focused, with real ownership across CI/CD pipelines, GitOps workflows, and on-call rotations. WHAT YOU WILL DO: - Monitor and support production and staging environments to ensure availability, performance, and stability; - Respond to incidents, perform triage and root cause analysis, and contribute to remediation efforts; - Participate in on-call rotations with defined SLAs; - Handle operational requests from internal teams; - Maintain and improve monitoring, alerting, dashboards, logs, and metri...