- Collaborate with development, operations, and product teams to define, review, and implement reliability standards and best practices.
- Design, implement, and maintain highly available and scalable architectures for our applications and infrastructure.
- Develop and enhance automated tools and frameworks to optimize system monitoring, deployment, and recovery.
- Troubleshoot and resolve complex issues throughout the entire software stack, including networking, databases, and distributed systems.
- Conduct performance analysis and capacity planning to ensure system scalability and resource optimization.
- Take a proactive approach to continuously improving reliability.
- Participate in incident response, root cause analysis, and postmortem activities to identify and rectify system failures.
- Collaborate with cross-functional teams to implement and improve CI/CD pipelines, ensuring reliable and efficient software releases.
- Stay up-to-date with emerging technologies and industry trends, actively contributing to ongoing system improvements.
- Participate in oncall rotation.
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
- Proven experience deploying and managing large-scale distributed systems successfully.
- Understanding of SRE concepts (error budgets, SLIs/SLOs, blameless postmortems)
- Proficiency in programming languages such as Python, C++, or Go
- Familiarity with monitoring and observability tools.
- Excellent problem-solving skills and ability to troubleshoot complex issues efficiently.
- Strong organizational and communication skills, with the ability to collaborate effectively in a cross-functional team environment.
- Experience with artificial intelligence (AI) and machine learning (ML) technologies and frameworks.
- Familiarity with security best practices and experience implementing security measures in a production environment.
- Experience with modern infrastructure technologies and tools, including cloud platforms (AWS, Azure, GCP), containers (Docker, Kubernetes), and orchestration (Ansible, Chef, Puppet).
- Solid understanding of networking protocols and technologies (TCP/IP, DNS, load balancing).
- Demonstrated experience with infrastructure as code (IaC) and automation tools (e.g., Terraform, GitHub Actions).
-
Reliability Engineer
Found in: SonicJobs Direct Apply UK - 1 day ago
Arla Foods Plc London, United Kingdom Full timeReliability Engineer - Oakthorpe Dairy (N13 6BU)Are you an FMCG Engineer looking to make the move into a Continuous Improvement? Are you a strong collaborator, and able to network with stakeholders at all levels? Are you looking for a role to challenge the status quo and drive im ...
-
Reliability Engineer
Found in: SonicJobs Direct Apply UK - 3 days ago
Synergi Search & Select Limited Hayes, United Kingdom Full timeJob Title - Reliability Engineer · Rate - up to £45,000 · Shift - Monday - Friday · FMCG/Manufacturing · Synergi are recruiting for a Reliability Engineer to join one of the leading Food Manufacturers within their sector. This is the chance to join a company whose products can b ...
-
Reliability Engineer
Found in: Jooble UK O C2 - 16 hours ago
Enable Soft, Inc. United KingdomAnnual reliability and maintainability conference. Includes a message from the chair, on-line registration, and a program brochure. · A division of Sandia National Laboratories. Includes information on the center as a whole, the training offered, ways to work with the center and ...
-
Reliability Engineer
Found in: Jooble UK O C2 - 6 days ago
Maintech Recruitment England, United KingdomWe are supporting a growing business who provide reliability services into leading manufactures across the UK and are looking for a Reliability Engineer to join the team. This is a great opportunity if you are a Maintenance Engineer looking for a change in direction or if you are ...
-
Reliability Engineer
Found in: Jooble UK O C2 - 5 days ago
Elysia United KingdomReliability Engineer - Jaguar TCS Racing · ACCELERATE YOUR CAREER · Fortescue WAE exists to accelerate the advantage and impact of our clients. We do it through innovative engineering and technology that solves complex problems and brings a step-change in weight, speed, and eff ...
-
Reliability Engineer
Found in: Talent UK C2 - 1 day ago
Balfour Beatty London, United Kingdom PermanentAbout the role · Amazing infrastructure isnt the only thing that gets built here. Incredible careers do too. Join our Rail UK team as a Reliability Engineer and you can build something to be proud of. · Role Purpose: We are seeking a highly skilled and experienced Reliability E ...
-
Reliability Engineer
Found in: Jooble UK O C2 - 7 minutes ago
Princes United KingdomThe Princes Group has over 7,000 employees with offices and production sites in the UK, Netherlands, Italy, Poland, France and Mauritius. Princes manufactures 350 different food and drink products responsibly sourced and enjoyed by consumers every day. None of this would be possi ...
-
Site Reliability Engineer
Found in: Ziprecruiter UK C2 - 1 day ago
WaferWire Cloud Technologies Greater London, United KingdomJob Description · We are seeking a highly motivated and experienced Site Reliability Engineer to join our growing team. You will be responsible for ensuring the reliability, performance, and scalability of our production systems. You will play a critical role in ensuring our syst ...
-
Lead Reliability Engineer
Found in: Ziprecruiter UK C2 - 2 days ago
EVolt Recruitment London, United KingdomJob Description · We're seeking a Lead Reliability Engineer to spearhead the enhancement of reliability and testing capabilities within an EV Charging OEM Specialist. where you will collaborate closely with the other heads of departments, and other key stakeholders to develop a c ...
-
Site Reliability Engineer
Found in: Appcast UK C C2 - 3 days ago
Humankind Global Recruitment Greater London, United KingdomSite Reliability Engineer · London (Hybrid 2 days a week on site · Permanent · £75,000 - £85,000 p/a · The Background · We are partnered with an innovative IT consultancy based in London but with a global presence who are leading advisors in their industry by creating lasting val ...
-
Site Reliability Engineer
Found in: Appcast UK GBP C2 - 2 days ago
DVF Recruitment London, United KingdomJob DetailsDVF Recruitmenthttps://www.dvfrecruitment.comJob DescriptionWe are seeking a Site Reliability Engineer to join our SRE team based in Reigate. The ideal candidate will have excellent communication skills, experience working with multiple stakeholders, and a track record ...
-
Site Reliability Engineer
Found in: Appcast Linkedin GBL C2 - 6 days ago
RedRock Consulting London, United KingdomSite Reliability Engineer (Linux/K8S/AWS) - Leading SaaS / ERP provider · Excellent opportunity to join a leading SaaS provider, that are expanding operations due to growth and forecasted digital change. · My client is looking for skilled Engineers' with a background supporting, ...
-
Site Reliability Engineer
Found in: Ziprecruiter UK C2 - 1 day ago
Humankind Global Recruitment Greater London, United KingdomJob Description · Site Reliability Engineer · London (Hybrid 2 days a week on site · Permanent · The Background · We are partnered with an innovative IT consultancy based in London but with a global presence who are leading advisors in their industry by creating lasting value for ...
-
Site Reliability Engineer
Found in: Appcast UK C C2 - 18 hours ago
Tata Consultancy Services Greater London, United KingdomRole: Application Support Engineer/Site Reliability Engineer · Job Type: Permanent · Location: London, United Kingdom · Ready to leverage your knowledge in application support? · Are you looking for an exciting opportunity to learn a top-tier level of understanding in supporting ...
-
Site Reliability Engineer
Found in: Ziprecruiter UK C2 - 2 days ago
Vallum Associates London, United KingdomJob Description · Job Title: Site Reliability Engineer · Location: London (Hybrid) · Duration: Contractual role · One of our Banking clients is looking for a Tech Site Reliability Engineer, with proven working experience in the Banking industry, working with FX/FI/FXOM Trading s ...
-
Lead Reliability Engineer
Found in: Jooble UK O C2 - 2 days ago
EVolt Recruitment London, United KingdomWe're seeking a Lead Reliability Engineer to spearhead the enhancement of reliability and testing capabilities within an EV Charging OEM Specialist, where you will collaborate closely with the other heads of departments and other key stakeholders to develop a comprehensive strate ...
-
Site Reliability Engineer
Found in: Ziprecruiter UK C2 - 2 days ago
trgtment London, United KingdomJob Description · Love the idea of responsibility? Love working with startups? Ready for a real challenge? · Yes to all three? Great. · I'm working with a large company in searching for a Site Reliability Engineer. The role would sit atop of a project which aims to launch 100 sta ...
-
Site Reliability Engineer
Found in: Appcast UK C C2 - 1 day ago
Huntress Talent London, United KingdomResponsibilities: · Contract Site Reliability Engineer · Term- 1-2 year contract · Hybrid - · London, UK · • Schedule and monitor real time trading systems · Devops Methodologies · • Monitor open and close of markets as it relates to the system · • Reduce the number of trading h ...
-
Principal Reliability Engineer
Found in: Jooble UK O C2 - 3 days ago
BP p.l.c. United KingdomTravel required Up to 25% travel should be expected with this role · Job category Engineering Group · Relocation available This role is not eligible for relocation · At bp, we're reimagining energy for people and our planet. We're leading the way in reducing carbon emissions a ...
-
Site Reliability Engineer
Found in: Appcast UK C C2 - 5 days ago
Mondrian Alpha London, United KingdomMy client, a renowned hedge fund with a global presence, is in search of a seasoned Site Reliability Engineer to join their London team. · As part of this team, you'll play a pivotal role in maintaining the technology infrastructure that drives the fund's operations, directly co ...
Site Reliability Engineer - London, United Kingdom - FactSet
Description
Responsibilities
:Requirements:
Desirable Qualifications:
Join our team and contribute to creating and maintaining a highly reliable and performant infrastructure that supports our growing platform. Help shape the future of our systems architecture while working in a collaborative and innovative environment. Experience in AI/ML is considered desirable but not mandatory, so if you have the skills or interest in these areas, we encourage you to apply.