Site Reliability Engineer - London, United Kingdom - Experis LTD
Description
Responsibilities:
_
- Manage and monitor AWS infrastructure, particularly Lambda functions, to ensure the availability and reliability of services._
- Develop and maintain infrastructure automation and configuration management tools to support a rapidly changing environment._
- Collaborate with software development teams to ensure new code and services meet reliability and scalability requirements._
- Design and implement monitoring and alerting systems to proactively identify and mitigate issues, preferably experience with Prometheus, Grafana and Dynatrace_
- Troubleshoot and resolve production incidents and outages and conduct postmortem analysis to prevent future incidents._
- Develop and maintain disaster recovery plans for critical systems and services._
- Stay current with AWS and SRE best practices and proactively make recommendations to improve our infrastructure and processes._
- Requirements:_
- Bachelor's degree in computer science or a related field or equivalent work experience_
- 3+ years of experience working with AWS, particularly Lambda functions_
- Strong understanding of SRE principles and experience implementing them in a production environment_
- Experience with infrastructure automation and configuration management tools such as Terraform and CloudFormation_
- Strong programming and scripting skills in languages such as Python, Java, and Bash_
- Experience with monitoring and alerting tools such as CloudWatch, Prometheus and Dynatrace_
- Excellent problemsolving and troubleshooting skills._
- Strong communication and collaboration skills, with the ability to work effectively in a team environment._
- Preferred qualifications:_
- AWS certifications such as AWS Certified Solutions Architect or AWS Certified DevOps Engineer_
- Experience with containerisation technologies such as Docker and Kubernetes_
- Familiarity with agile development methodologies and practices_
- This role is a critical position within our organisation, and we are looking for someone who is passionate about building and maintaining highly reliable and scalable systems. If you have a strong background in AWS, particularly Lambda functions, and adeep understanding of SRE principles, we would love to hear from you._
More jobs from Experis LTD
-
Senior Cyber Security Incident Responder
Birmingham, United Kingdom - 3 days ago
-
Ion Platform Specialist Hsbcjp00043988
London, United Kingdom - 3 weeks ago
-
Data Governance and Quality Manager Work From Home
Staffordshire, United Kingdom - 3 weeks ago
-
Business Analyst
Coventry, United Kingdom - 3 weeks ago
-
Information and Records Officer
Manchester, United Kingdom - 2 weeks ago
-
Asst. Director for Technical Delivery
Birmingham, United Kingdom - 3 weeks ago