Site Reliability Engineer - London, United Kingdom - Experis LTD

Experis LTD
Experis LTD
Verified Company
London, United Kingdom

3 weeks ago

Tom O´Connor

Posted by:

Tom O´Connor

beBee Recruiter


Description

Responsibilities:
_

  • Manage and monitor AWS infrastructure, particularly Lambda functions, to ensure the availability and reliability of services._
  • Develop and maintain infrastructure automation and configuration management tools to support a rapidly changing environment._
  • Collaborate with software development teams to ensure new code and services meet reliability and scalability requirements._
  • Design and implement monitoring and alerting systems to proactively identify and mitigate issues, preferably experience with Prometheus, Grafana and Dynatrace_
  • Troubleshoot and resolve production incidents and outages and conduct postmortem analysis to prevent future incidents._
  • Develop and maintain disaster recovery plans for critical systems and services._
  • Stay current with AWS and SRE best practices and proactively make recommendations to improve our infrastructure and processes._
  • Requirements:_
  • Bachelor's degree in computer science or a related field or equivalent work experience_
  • 3+ years of experience working with AWS, particularly Lambda functions_
  • Strong understanding of SRE principles and experience implementing them in a production environment_
  • Experience with infrastructure automation and configuration management tools such as Terraform and CloudFormation_
  • Strong programming and scripting skills in languages such as Python, Java, and Bash_
  • Experience with monitoring and alerting tools such as CloudWatch, Prometheus and Dynatrace_
  • Excellent problemsolving and troubleshooting skills._
  • Strong communication and collaboration skills, with the ability to work effectively in a team environment._
  • Preferred qualifications:_
  • AWS certifications such as AWS Certified Solutions Architect or AWS Certified DevOps Engineer_
  • Experience with containerisation technologies such as Docker and Kubernetes_
  • Familiarity with agile development methodologies and practices_
  • This role is a critical position within our organisation, and we are looking for someone who is passionate about building and maintaining highly reliable and scalable systems. If you have a strong background in AWS, particularly Lambda functions, and adeep understanding of SRE principles, we would love to hear from you._

More jobs from Experis LTD