Staff Site Reliability Engineer - Manchester, United Kingdom - Matillion

    Matillion
    Matillion Manchester, United Kingdom

    1 month ago

    Default job background
    Description

    It covers everything from the build, provisioning and maintenance of our cloud Infrastructure as well as reliability, capability management, observability, monitoring and metrics of our SaaS platform.

    Reporting into the Director of SRE and Observability, you will utilise your experience across all pillars of Site Reliability Engineering to drive best practice aimed at enhancing our ability to build truly reliable, observable and performative infrastructure for all our core services.

    Your experience building modern, multi-cloud platforms will play a pivotal role as we continue to modernise our stack and implement a wide range of new tools around logging, monitoring, metrics and alerting.

    We value in-person collaboration here at Matillion, therefore this role can either follow our hybrid work structure where employees work 2 days a week in the Manchester office, or can be remote depending on location.


    Kubernetes, AWS, ArgoCD, Terraform, DataDog, Prometheus, Golang/Python.

    Leading the design of major software components, systems, and features to improve the availability, scalability, latency, and efficiency of Matillion's SaaS services
    # Drive the design, implementation and management for expanding observability infrastructure, keeping up to date with new tools and technologies and be a recognised member of the broader Observability community
    # Providing guidance and mentorship to other team members on managing end-to-end availability and performance of critical services, design techniques and coding standards to cultivate innovation and collaboration across the business
    # Balancing competing priorities as you manage a range of individual projects, deadlines, and deliverables

    A passion for everything performance, observability, availability, scalability, and security with experience owning and delivering projects using Agile methodologies.

    Have previous experience of large scale web operations in a public cloud environment. Be competent in Ruby, Go, Java, Python or an equivalent programming language. Prometheus, Grafana, Elasticsearch, Logstash, Kibana, OpenTelemetry, Micrometer, New Relic, Data Dog. At Matillion, we are committed to providing competitive salaries in line with market standards.

    Our estimated compensation range for this position is £76,000 - £114,000 but the final salary will be based on your relevant skills, experience and qualifications demonstrated in the hiring process.

    Matillion has fostered a culture that is collaborative, fast-paced, ambitious, and transparent, and an environment where people genuinely care about their colleagues and communities.


    We operate a truly flexible and hybrid working culture that promotes work-life balance, and are proud to be able to offer the following benefits:

    ~ 30 days holiday + bank holidays
    ~5 days paid volunteering leave
    ~ Health insurance
    ~ Life Insurance
    ~ Pension
    ~ Access to mental health support
    ~ Career development with access to a Udemy account, Blinkist and much more


    Thousands of enterprises including Cisco, DocuSign, Pacific Life, Slack, and TUI trust Matillion technology to load, transform, sync, and orchestrate their data for a wide range of use cases from insights and operational analytics, to data science, machine learning, and AI.

    With over $300M raised from top Silicon Valley investors, we are on a mission to power the data productivity of our customers and the world.


    We celebrate diversity and we are committed to creating an inclusive environment for all of our team.

    Matillion does not discriminate on the basis of race, colour, religion, age, sex, national origin, disability status, genetics, sexual orientation, gender identity or expression, or any other characteristic protected by law.


    #