    SRE / Site Reliability Engineer - United Kingdom - Travelodge Hotels Limited

    Job Description
    Travelodge's mission is to be the UK's favourite hotel for value.

    With more than a million visits every week to our website and more than eighteen million customers a year, the use of technology is critical to both our customer offer and our low-cost operations.

    The mission within IT is to ensure innovative technology drives the business forward, through the development of the company's customer-facing and internal technology systems.

    The job in a nutshell

    As a Site Reliability Engineer in our IT Digital and Data Operations team, you will be passionate about maintaining and improving software that solves problems.

    The role will form a bridge between development and operations by applying a software engineering mindset to system administration topics.

    Your time will be split between operations duties and enhancing systems, software, monitoring and processes that help increase site reliability, availability and performance.

    Working closely with internal IT teams, business stakeholders and 3rd party suppliers, your primary responsibility will be to ensure system performance is optimised, with an eye toward pushing our capabilities forward by innovating to continually improve our technical environments.

    What you'll be doing

    • Being an advocate for DevOps methodologies and ways of working with the ability to apply them to existing and new integrations across our applications within our web stack.
    • Collaborating with developers at the design stage, to ensure services released to production are fit for purpose and deliver near zero defects.
    • Providing architectural governance from an operations perspective from the time of planning of changes and releases including creating documentation and evaluating architectural decisions.
    • Administering the production and pre-production environments, including CMSs, using monitoring tools, application functionality and availability checks.
    • Using deployment technologies such as Jenkins proficiently.
    • Troubleshooting and administering the Linux OS (preferably RHEL) and providing log analysis.
    • Building software and systems to manage platform infrastructure and applications and resolve vulnerabilities.
    • Improving reliability, quality, and time-to-market of our suite of software applications.
    • Measuring and optimising system performance, with an eye toward pushing our capabilities forward, getting ahead of growth and capacity needs, and innovating to continually improve.
    • Providing 2nd / 3rd level operational and engineering support for multiple software applications and systems.
    • Gathering and analysing metrics from both operating systems and applications to assist in performance tuning and fault finding.
    • Partnering with the digital development teams and Product Owners to improve services through rigorous testing and release procedures.
    • Proactively deploying automation for regularly repeated tasks and identifying new automation opportunities.
    • Actively engaging with future digital roadmaps, to innovate and ensure no legacy tech debt.
    • Supporting the continuous improvement of internal IT processes and ways of working.
    • Working on day-to-day incidents with the digital operations team, championing the resolution of tickets with pace and tenacity, and using preventative maintenance and proactive techniques to actively drive down incident tickets.
    • Running major incident calls and assisting with the resolution of major incident issues within the platforms and following through to root cause analysis and remediation.
    • Staying up-to-date with the IT industry methodologies and emerging trends.
    • Leading on establishing and implementing shifts in culture to support adoption of new processes and ways of working across teams.
    • Ensuring technology and processes are running optimally and this is reflected in the availability of all systems and tools.
    • Reducing or even eliminating toil in order to maximise the time spent on engineering and innovation.
    • Providing direct support and acting as a second in command to the IT Senior Digital Platform Manager, covering absences and leave, to manage and support the Digital Operations teams.
    What we'll expect from you

    To succeed in this role, you will be a 'hands-on' Engineer with a proven track record of improving and maintaining enterprise scale ecommerce/digital systems and associated applications on prem and in the cloud, with experience of multiple digital implementations.

    You will have broad technical knowledge and be comfortable working with pace and agility to ensure the required outcomes are achieved.

    You must have a strong understanding of systems integration and application lifecycle management, but we are not expecting expert knowledge in absolutely every technology; it is important that you can articulate what you know well and recognise when further understanding is required - be an active self-starter who can gather information and make appropriate decisions in a timely and organised manner.

    This role will require you to participate in an on-call rota, and manage the team outside of hours with web releases and platform upgrades.

    The ability to work with a variety of teams and technologies is required.

    As our Digital SRE you will have a good understanding of IT operations, support and software engineering in order to be successful.

    Essential

    • Professional Qualifications or demonstrable training and practical experience that relate to the function of an SRE
    • Proficiency with Red Hat Linux distributions
    • Administration experience of at least one cloud Platform (AWS or Azure)
    • Proven experience of working with CI/CD pipelines
    • Setup and monitoring of end-to-end systems using enterprise monitoring and reporting tools, such as New Relic, Splunk, Pingdom and Zabbix
    • Experience with distributed storage technologies like NFS, HDFS, Ceph, S3 as well as dynamic resource management frameworks (Kubernetes)
    • A proactive approach to spotting and resolving problems, areas for improvement, and performance bottlenecks
    • Working knowledge of Bash and/or KSH, Nginx and PHP.
    • Knowledge of relational and non relational databases and experience implementing best practice approaches.
    • Experience with the following technologies: Akamai WAF, GitHub
    Desirable

    • Understanding of Python with one or more high level languages, such as Java, Ruby, and JavaScript
    • Understanding of Apache and/or Tomcat
    • BSc and/or MSc Computer Science/Business Computing or equivalent experience
    • Experience as an SRE within retail/hospitality or similar
    • Certifications in Cloud computing with either AWS or Azure
    Travelodge Traits
    At Travelodge, we believe that behaviours are just as important as the activities you carry out.

    The ones we look for in every colleague are:

    I care about people

    • I treat everyone in a way I would like to be treated
    • I am easy to work with
    • I have a can do attitude
    • I care about the impact my work has on others
    I pay attention to detail

    • I do the little things that make a difference to our customers
    • I work to brand standards
    • I treat Travelodge time, equipment and stock as if it were my own
    I drive for results

    • I hit targets in my role and work at the right pace
    • I take ownership of problems and try to fix them fast
    • I look for ways to avoid future problems
    • I look for ways to promote Travelodge
    What you can expect from us
    Culture
    At Travelodge, we are warm, straightforward and optimistic.

    We have a big footprint in the UK, but still a small company feel and you can expect quality and value to be built into everything we do.

    You'll have the support of a close network of colleagues and managers, and every day is different here. We want you to bring your personality to work and we love our diversity.

    Reward and recognition
    It's not just our customers we want to wake up with a smile on their face.

    As well as a competitive salary, being part of our hotel support centre means great holiday entitlements, pension contribution deals, being part of our bonus scheme, and a Thanks Card giving generous room and food discounts as well as friends and family rates.

    Career and development

    We want you to develop further with us at Travelodge and we'll provide you a development plan to help you reach your goals.

    You can expect to have a full induction and training relevant to your role. We advertise all our vacancies internally, so you'll have the opportunity to really develop your career with Travelodge.


  • Involved Solutions London, United Kingdom

    **Site Reliability Engineer - 12 Month Contract - SC Cleared** · **Rate**: Up to £750 per day · **Location**: Remote - 1 day per week in either London, Manchester or Bristol (whichever is closest to your home location) · **IR35**: Inside · **The role**: · Senior Site Reliability ...


  • eFinancialCareers London, United Kingdom

    **Compensation**: Market-Leading & Competitive · **Summary** · Their Trade Desk production team is looking to hire a Production Reliability Engineer who can oversee all aspects of the real-time trading platform. · The successful Production Reliability Engineer will have a rigorou ...


  • Lorien London, United Kingdom

    Site Reliability Engineer · Location: London (hybrid remote working) · **Salary**: Up to £100,000 + Very Generous Benefits Package · One of the fastest growing ecommerce organisations requires a Site Reliability Engineer to help be the glue between the company's Dev, QA and Produc ...


  • Arla Foods Plc London, United Kingdom Full time

    Reliability Engineer - Oakthorpe Dairy · Are you an FMCG Engineer with strong experience in Continuous Improvement and Kaizen projects? Are you a strong collaborator, and able to network with stakeholders at all levels? Are you looking for a role to challenge the status quo and d ...


  • eFinancialCareers London, United Kingdom

    Join us as a Site Reliability Engineer · - We'll look to you to provide technical support for relevant platforms, activities, and processes relating to areas of your specialist knowledge · - You'll assist with creating and implementing effective and efficient ITSM processes, whil ...


  • Evermore Global London, United Kingdom

    **Site Reliability Engineer / Linux / VMWARE/ Elastic Search /** · **Location: Central London / Hybrid** · **Salary: Circa £80,000 + Benefits** · **Permanent** · World leading online media company are seeking a suitable Site Reliability Engineer to join their expanding team in Lo ...


  • Explore Group London, United Kingdom

    **Lead Site reliability engineer - Fully remote - No sponsorship offered** · Role: Site Reliability engineer · Location: Fully remote · **Salary**: Up to £115,000 · **Responsibilities**: · - Design, build, and maintain scalable and highly available infrastructure on AWS · - Imple ...


  • Austin Werner Ltd London, United Kingdom

    Site Reliability Engineer - Global Media/Publishing business · We are seeking a Site Reliability Engineer for a globally leading Publishing business based in London. · My client has built their internal IT environment from ground up so is bespoke to the business with cutting edge ...


  • Nigel Frank International London, United Kingdom

    **Site Reliability Engineer/Team Manager - Hybrid - Up to £110,000.** · I am working with an insurance and technology consultancy who provide data-driven insight-led solutions to their customers to help them become more resilient and get the best possible performance for their bu ...


  • eFinancialCareers London, United Kingdom

    **Summary** · Not your usual type of investment manager, this innovative company looks beyond traditional finance and uses data science and technology to discover value in markets worldwide and develop sophisticated trading models. Scientists, technologists and academics continual ...


  • eFinancialCareers London, United Kingdom

    Data is at the heart of Bloomberg's technologies, which produce, distribute and protect some of the most critical and valuable data in global business. The Storage Engineering teams design and maintain the systems which store, process and protect that data. · This is not a traditi ...


  • eFinancialCareers London, United Kingdom

    **Summary** · Their Trade Desk production team is looking to hire a Production Reliability Engineer who can oversee all aspects of the real-time trading platform. · **Requirements**: · - 5+ years' relevant experience in IT ops, e.g. DevOps, Linux System Engineering, or Network En ...


  • Lorien London, United Kingdom

    This London-based company strives to create a world class digital hub for their clients. They are currently hiring for a Site Reliability Engineer with good experience maintaining AWS infrastructure. This position is fully remote, but the office is open if you would like to go in. ...


  • Experis LTD London, United Kingdom

    Responsibilities: · - Manage and monitor AWS infrastructure, particularly Lambda functions, to ensure the availability and reliability of services. · - Develop and maintain infrastructure automation and configuration management tools to support a rapidly changing environment. ...


  • eFinancialCareers London, United Kingdom

    Site Reliability Engineer Responsibilities: · - Own critical parts of our software development life-cycle such as build/deploy · - Facilitate individual development teams to build best-in-class cloud-native solutions · Site Reliability Engineer Requirements: · - Experience in an em ...


  • eFinancialCareers London, United Kingdom

    Join us as a Senior Site Reliability Engineer · - We'll look to you to establish and run a SRE function to help design, build, deliver and run highly reliable, scalable and secure software systems · - This is a great opportunity to hone your existing engineering skills and advanc ...


  • eFinancialCareers London, United Kingdom

    Join us as a Streaming Site Reliability Engineer · - This is an exciting opportunity to use your technical expertise and collaborate with our colleagues to build effortless, digital first customer experiences · - Working in our Data & Analytics Service function, you'll collaborat ...


  • NonStop Consulting Ltd London, United Kingdom

    Hi all, we are currently recruiting for Digital Site Reliability Engineer to join Government Department on a contract for 6 months, fully remote work. · Essentials skills: · - experience with Terraform, CI, CD; · - leading assessments; · - programming; · - eligibility for SC Clea ...


  • Amazon Talent Acquisition London, United Kingdom

    Amazon Operations sits at the heart of the Amazon customer experience. We look after everything from the moment a customer clicks buy, to the moment their item is delivered - from desktop to doorstep. · Across Europe we have more than 50 Fulfillment Centers, hundreds of Delivery ...