Site Reliability Engineer
Barclays · Prague, CZ
Job Description Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability,...
Job description
Job Description Purpose of the role: To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities: - Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. - Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. - Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. - Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. - Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure smooth and efficient operations. - Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology c...