Senior Applications Support Specialist
Ensono · GB
Key Responsibilities Incident & Problem Management - Lead major incident (MI) bridges and restore service with minimum business impact. - Handle all L3 escal...
Job description
Key Responsibilities: Incident & Problem Management Reliability Engineering Change, Release & Risk Automation, Monitoring & Observability Leadership & Mentorship - Lead major incident (MI) bridges and restore service with minimum business impact. - Handle all L3 escalations , perform deep diagnostics across Java, JVM, middleware, OS, and infra. - Own technical RCAs , drive long‑term and systemic remediation. - Identify recurring failure patterns and risks. - Apply SRE principles : SLIs/SLOs, error budgets, resilience patterns. - Tune JVM parameters , analyze thread/heap dumps, and improve performance. - Influence application architecture for fault tolerance, scalability, and recoverability . - Validate DR readiness , failover behavior, and resilience testing outcomes. - Provide technical approval and risk assessment for high-risk changes. - Enforce operational readiness for new apps and major releases. - Ensure changes meet audit, compliance, and regulatory expectations . - Build advanced automation using Shell/Python/PowerShell . - Develop frameworks for health validation , automated recovery, and compliance checks. - Define observability standards; optimize alerts and improve MTT...