JobMesh

Site Reliability Engineer

Barclays · Canary Wharf, England, GB

Job Description Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability,...

Job description

Job Description Purpose of the role: To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities: - Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. - Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. - Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. - Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. - Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure smooth and efficient operations. - Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology c...