Site Reliability Engineer - Nuneaton, United Kingdom - Holland and Barrett

    Holland and Barrett
    Holland and Barrett Nuneaton, United Kingdom

    3 weeks ago

    Default job background
    Full time
    Description
    About the role:

    *** THIS ROLE IS BASED FULLY REMOTE IN THE EU REGION AND IS A 2 YEAR FTC ***

    We're on a mission to make health and wellness a way of life for everyone and technology is at the heart of our future to become a leading omnichannel retailer. We're building some great products, and we're investing in the rapidly advancing technology that is helping our customers meet their health and wellness goals. We're delivering more speed in the retail experience, greater convenience in service and delivery, and increasing personalization in our brand and product propositions, both in-store and online.

    Are you ready to revolutionize the world of Site Reliability Engineering (SRE) and make a significant impact on the retail industry? Holland & Barrett, a global leader in health and wellness, is on the lookout for a dynamic and experienced SRE to join our innovative team.

    As the Site Reliability Engineer, you will play a pivotal role in continuously improving H&B's Software Lifecycle and ensure the infrastructure we deliver are in a continuous state of improvement, embracing automation, and testing. We operate within a cloud-native environment and have a mature DevOps Chapter that you will be working closely with to achieve reliability to DevOps velocity.

    Your Responsibilities
    • The mission of our Site Reliability Engineering team is to ensure the reliable, efficient, and scalable operation of software systems or services.
    • SRE combines software engineering and operations principles to create a discipline focused on building and maintaining highly reliable and resilient systems.
    • Our goal is to make our systems better working across system operations, cloud infrastructure, pipeline engineering, software development, and performance testing.
    • You jump at the opportunity to collaborate across a variety of Technology teams to support the adoption of capacity and performance metrics embedded in our CICD pipelines.
    • You have the drive and curiosity to trial new software and create innovative ways to improve reliability while enabling engineers to deliver at pace.
    • You have the tenacity to conduct root cause analysis for underlying issues and incidents that affect system performance and availability, identifying and resolving errors to minimize re-occurrence.
    How you can make an impact as an SRE at H&B

    If you like problem-solving and working in a fast-paced, agile environment you will flourish at H&B. Experience with using as many of the following tools and technology:

    • Monitoring tools and instrumentation, Datadog, or similar observability platforms
    • AWS expertise; familiarity with core services
    • Software development or strong scripting experience including but not limited to Golang, Python, or Bash.
    • PagerDuty, Slack, and related tooling integrations
    • Good understanding of traditional operations areas such as Linux, storage, networking
    • Good familiarity with Docker and Kubernetes
    • Continuous delivery, build pipelines, artifact repositories, zero-downtime deployment.
    • Infrastructure as Code, particularly using Terraform.
    • Experimentation strategies, A/B testing, and Canary releases
    • Proving resilience and scalability using load and stress testing
    • CDNs and strong knowledge of web delivery protocols
    • Experience in incident management and post-mortems
    Required Skills and Experience:
    • Proven experience as a Site Reliability Engineer, with a passion for building and managing highly available, scalable, and resilient systems.
    • Understanding of SRE methodologies, tools, and technologies, coupled with a track record of solving complex challenges.
    • Adept at investigating and resolving problems efficiently using a systematic approach to troubleshoot and identifying root causes of issues analyzing data, logs, and metrics.
    • Communication skills to work closely with cross-functional teams with the ability to effectively convey technical information and explain complex concepts.
    • Knowledge of various technical areas, such as systems architecture, networking, operating systems, programming languages, databases, and cloud technologies.
    • Skilled in automation and programming to develop tools, scripts, and infrastructure as code solutions.
    What's in it for you:

    Empowerment and Growth: We believe in your limitless potential. As a member of H&B Tech, you'll have access to career development programs, mentorship opportunities, and resources to fuel your professional growth.

    Diversity at the Core: We recognize that diversity breeds innovation and creativity. You'll be part of a team that values and celebrates diverse perspectives and individuality, ensuring you thrive in an inclusive culture.

    Innovation: Your expertise will be instrumental in pioneering cutting-edge solutions that revolutionize our industry. You'll have the chance to work on high-impact projects that make a real difference.

    Collaboration: Join a close-knit team that works together, uplifts, and supports one another. Your ideas will be celebrated, and your voice heard at every step.

    Holland & Barrett is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.