Senior Site Reliability Engineer - London, United Kingdom - Christopher Ali


    3 weeks ago

    Job Description

    Are you a senior Site Reliability Engineer interested in making a difference? Would you like to work for a tech-for-good company?

    As a Site Reliability Engineer (SRE), you will join an incredible team whose mission is to ensure the performance, security, reliability and availability of core services and platforms. Taking the lead on projects across the entire breadth of the tech stack, you will be responsible for the visibility and monitoring of systems, for building tooling and automation to reduce toil, and for responding to incidents as part of a 24/7 SRE on-call team.

    The Senior SRE will be a hands-on individual contributor, working on key projects and helping to build a first-class SRE function.

    Responsibilities:

    • Implementing and maintaining monitoring solutions/metric-driven alerting, logging and tracing
    • Hands-on work across numerous technical projects
    • Periodic 24x7 paid on-call duties
    • Pair programming and occasionally running training sessions for the team
    • Writing well-defined tickets and keeping them up to date
    • Eligibility to obtain SC clearance
    • Building and managing systems, infrastructure and applications using infrastructure as code and automation (Terraform, Ansible, K8s, Helm, Go)

    Skills and Experience:

    • Strong background in system automation using configuration management systems such as Ansible, Chef or Puppet
    • Strong background in SRE/DevOps or Linux System Administration
    • Experience with creation of automation using APIs
    • Solid understanding of containerisation and container orchestration using tools such as Kubernetes
    • Automation testing experience in an Agile software environment
    • Familiarity with some or all of: network management and optimisation, PostgreSQL database management and optimisation, and common security frameworks (CIS, NIST, OWASP)
    • Familiarity with public cloud services such as AWS, GCP and Azure
    • Familiarity with co-located physical infrastructure (currently hybrid)
    • Understanding of Continuous Integration (CI) and Continuous Deployment (CD)
    • Technical writing and reviewing technical designs
    • Understanding of Agile practices
    • Understanding of one or more of the following languages: Ruby, Go, Java, Bash/Shell
    • Strong experience with issue tracking software such as Jira

    Tech Stack:

    Applications are written in Ruby (with Rails) or Java. Client-side web apps are written in React, Clojure, Java and Go.

    Platform:

    • Multiple Kubernetes clusters for container orchestration
    • Apache Kafka and Redis for event messaging
    • Postgres for data storage
    • OpenStack Swift for object storage
    • Juniper & Cisco networking devices

    The role comes with a competitive salary of £75,000 to £90,000 per annum, plus a bonus scheme, a pension scheme, 26 days' holiday in addition to bank holidays, and a host of other benefits.