No more applications are being accepted for this job

Site Reliability Engineer - London, United Kingdom - DVF Recruitment

DVF Recruitment London, United Kingdom

2 weeks ago

Description

Job DetailsDVF Recruitmenthttps:
//www.dvfrecruitment.comJob DescriptionWe are seeking a Site Reliability Engineer to join our SRE team based in Reigate. The ideal candidate will have excellent communication skills, experience working with multiple stakeholders, and a track record in Azure and Observability platforms.

You will be joining at an exciting time of transformation as we work on improving the delivery of value for customers and the business.

You will be working in the Site Reliability and Response team, whose responsibility is to deliver and manage business critical services that are used 247 by our clients and colleagues around the world.

This role is open to flexible and hybrid working arrangements, with presence in the Reigate office up to two days per week.

Responsibilities:
Implement and maintain Observability platforms such as DatadogProactive monitoring of production and other environments to ensure stability, availability, security and integrityCollaborate with cross-functional teams to ensure the reliability, availability, and performance of our client-facing servicesEngage with business stakeholders to gather requirements, address concerns, and provide updates on projects and system statusContribute to the design, build and operational management of the servicesLead incident response, troubleshooting, and root cause analysis to mitigate and prevent future issuesDesign and implement automation and processes to improve the efficiency and effectiveness of the teams and other support functionQualificationsThe essential skills/experience for this position are:

Solid experience in Site Reliability Engineering or a similar role such as DevOps
Experience of running 24x7 services in a public cloud, ideally Azure
Deep understanding of cloud infrastructure and services, including best practices for monitoring, scaling, and security
Experience with observability platforms such as Data dog or similar tools
Strong interpersonal skills, with the ability to work effectively with many stakeholders
Solid verbal and written communication skills, and the ability to present technical information clearly and concisely
Previous experience working with external clients is needed
Experience with conducting Post-mortems or Post Incident Reviews
Confidence in making decisions and taking ownership of projects

Experience with Azure DevOps pipelines and scripting languages, such as Python or PowerShellOther highly desirable, but not essential skills are:

Azure certifications, such as Azure Administrator, Azure Developer, or Azure DevOps Engineer
Familiarity with Infrastructure as Code (IaC) tools like Pulumi, Terraform, ARM Templates, or Azure Bicep
Knowledge of containerization and orchestration technologies, such as Docker and Kubernetes
Previous experience working with Configuration as Code technologies such as Puppet or Ansible Familiar with high volume Web APIsPermanent

Site Reliability Engineer - London, United Kingdom - DVF Recruitment

Description

for Recruiters

Information