JobMesh

Site Reliability Engineer ID60188

AgileEngine · Cluj-Napoca, Cluj County, RO

Job description

AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.

WHY JOIN US:
If you're looking for a place to grow, make an impact, and work with people who care, we'd love to meet you!

ABOUT THE ROLE:
We are looking for an SRE Operations Engineer to keep production and staging environments running reliably across a cloud-based SaaS platform. You’ll respond to live incidents, reduce operational toil through automation, and improve observability using Kubernetes, Terraform, Grafana, and AWS. This is a hands-on role with real ownership across CI/CD pipelines, GitOps workflows, and on-call rotations.

WHAT YOU WILL DO:
- Monitor and support production and staging environments in real time, ensuring high availability, performance, and stability;
- Respond to incidents, perform triage and root cause analysis, and contribute to post-incident reviews and remediation efforts;
- Participate in an on-call rotation with defined SLAs;
- Handle ad-hoc and unplanned operat...