JobMesh

Principal SRE (AI Enablement Platform)-2

ABC Fitness · PL

- Join ABC Fitness, the leading technology provider for the fitness industry! What You’ll Do • Architect and evolve core platform capabilities for reliabilit...

Job description

Join ABC Fitness, the leading technology provider for the fitness industry! What You’ll Do: - Architect and evolve core platform capabilities for reliability, including execution environments, CI/CD systems, and validation pipelines that support high-throughput, machine-assisted change. - Design and implement fast, ephemeral, and strictly isolated execution environments where generated work can be built, tested, and safely discarded at scale. - Transform CI/CD into a validation system by embedding automated verification (tests, integration harnesses, canarying, rollback signals) into promotion decisions. - Build production-like validation environments that allow realistic system behavior testing without impacting live systems. - Establish deep observability patterns for autonomous workflows, including tracing what ran, what failed, why, and what it cost across agents, tools, and orchestration layers. - Define and implement guardrails-as-code, including access controls, policy enforcement, cost protections, and auditability for platform usage. - Design for reliability from day one, including scalability, fault tolerance, performance optimization, and operational resilience. - Lead t...