Jobs
>
London

    Manager, Network Reliability Engineering - United Kingdom - Nomadgao

    Nomadgao
    Default job background
    Description
    As a global leader in cybersecurity, our team changed the game.

    We're looking for people with limitless passion, a relentless focus on innovation and a fanatical commitment to the customer to join us in shaping the future of cybersecurity.

    Consistently recognized as a top workplace, CrowdStrike is committed to cultivating an inclusive, remote-first culture that offers people the autonomy and flexibility to balance the needs of work and life while taking their career to the next level.

    CrowdStrike is looking for a leader who has a solid track record of building and operating hyper-scale hybrid cloud networks.

    In this role, you will be an integral part of the production network leadership team. You will define metrics, develop tools, and create approaches that improve how we monitor and operate the network. You will need a balanced set of skills in networks and systems with a focus on scaling and performance.

    Set the direction for and improve the reliability and efficiency of the network

    ~Contribute to maintaining a high-performance, fault-tolerant, and scalable network

    ~Develop, track, and report on KPIs and metrics that measure network capacity, performance, and availability

    ~Build tools and monitoring systems that provide granular, real-time observability

    ~Develop automation to continuously assess and detect suboptimal network state and identify potential points of failure

    ~Review designs,and traffic patterns to continually assess network capacity and availability

    ~Work with other engineering groups to close the feedback loop on areas for improvement

    ~Lead resolution of network incidents, conduct internal post-mortems, perform root cause analysis, and ensure corrective actions are taken in a timely manner

    ~Diagnose and solve complex network and application problems, and recommend improvements

    ~7+ years deploying and managing network infrastructure

    ~Experience leading a sustaining engineering or SRE team

    ~7+ experience working with network protocols such as BGP, MPLS (TE, Auto-BW), VxLAN, eVPN, and CLOS Architectures

    ~Experience with building and maintaining network monitoring and graphing tools, as well as streaming telemetry

    ~Programming experience in Python, Perl, Go or other scripting language

    ~Experience with Cloud Providers such as AWS and GCP


    Bonus Points:
    ~Experience with network simulation and testing tools (NS-3, NetSim, Batfish, Ixia)

    ~Production level experience supporting large scale network infrastructure

    ~LI-Remote

    #Remote-first culture

    ~Market leader in compensation and equity awards with option to participate in ESPP in eligible countries

    ~Competitive vacation and flexible working arrangements

    ~Physical and mental wellness programs

    ~Paid parental leave, including adoption

    ~A variety of professional development and mentorship opportunities

    ~Offices with stocked kitchens when you need to fuel innovation and collaboration

    ~Birthday time-off in your local country

    ~We are committed to fostering a culture of belonging where everyone feels seen, heard, valued for who they are and empowered to succeed. By embracing the diversity of our people, we achieve our best work and fuel innovation - generating the best possible outcomes for our customers and the communities they serve.

    If you need reasonable accommodation to access the information provided on this website, please contact


  • Lorien London, United Kingdom

    Site Reliability Engineer · Location: London (hybrid remote working) · **Salary**: Up to £100,000 + Very Generous Benefits Package · One of the fastest growing ecommerce organisation requires a Site Reliability Engineer to help be the glue between the companies Dev, QA and Produc ...


  • Austin Werner Ltd London, United Kingdom

    Site Reliability Engineer - Global Media/Publishing business · We are seeking a Site Reliability Engineer for a globally leading Publishing business based in London. · My client has built their internal IT environment from ground up so is bespoke to the business with cutting edge ...


  • Explore Group London, United Kingdom

    **Lead Site reliability engineer - Fully remote - No sponsorship offered** · Role: Site Reliability engineer · Location: Fully remote · **Salary**: Up to £115,000 · **Responsibilities**: · - Design, build, and maintain scalable and highly available infrastructure on AWS · - Imple ...


  • Involved Solutions London, United Kingdom

    **Site Reliability Engineer - 12 Month Contract - SC Cleared** · **Rate**: Up to £750 per day · **Location**: Remote - 1 day per week in either London, Manchester or Bristol (whichever is closest to your home location) · **IR35**: Inside · **The role**: · Senior Site Reliability ...


  • Lorien London, United Kingdom

    Site Reliability Engineer · Location: London (hybrid remote working) · **Salary**: Up to £100,000 + Very Generous Benefits Package · One of the fastest growing ecommerce organisation requires a Site Reliability Engineer to help be the glue between the companies Dev, QA and Produc ...


  • eFinancialCareers London, United Kingdom

    **Summary** · Not your usual type of investment manager, this innovative company looks beyond traditional finance and uses data science and technology to discover value in markets worldwide and develop sophisticated trading models. Scientists, technologists and academicscontinual ...


  • eFinancialCareers London, United Kingdom

    **Compensation**: Market-Leading & Competitive · **Summary** · Their Trade Desk production team is looking to hire a Production Reliability Engineer who can oversee all aspects of the real-time trading platform. · The successful Production Reliability Engineer will have a rigorou ...


  • Nigel Frank International London, United Kingdom

    **Site Reliability Engineer/Team Manager - Hybrid - Up to £110,000.** · I am working with an insurance and technology consultancy who provide data-driven insight-let solutions to their customers to help them become more resilient and get the best possible performance for their bu ...


  • Arla Foods Plc London, United Kingdom Full time

    Reliability Engineer - Oakthorpe Dairy · Are you an FMCG Engineer with strong experience in Continuous Improvement and Kaizen projects? Are you a strong collaborator, and able to network with stakeholders at all levels? Are you looking for a role to challenge the status quo and d ...


  • eFinancialCareers London, United Kingdom

    Join us as a Site Reliability Engineer · - We'll look to you to provide technical support for relevant platforms, activities, and processes relating to areas of your specialist knowledge · - You'll assist with creating and implementing effective and efficient ITSM processes, whil ...


  • Evermore Global London, United Kingdom

    **Site Reliability Engineer / Linux / VMWARE/ Elastic Search /** · **Location: Central London / Hybrid** · **Salary: Circa £80,000 + Benefits** · **Permanent** · World leading online media company are seeking a suitable Site Reliability Engineer to join their expanding team in Lo ...


  • eFinancialCareers London, United Kingdom

    **Summary** · Their Trade Desk production team is looking to hire a Production Reliability Engineer who can oversee all aspects of the real-time trading platform. · **Requirements**: · - 5+ years' relevant experience in IT ops, e.g. DevOps, Linux System Engineering, or Network En ...


  • Lorien London, United Kingdom

    This London based company strive to create a world class digital hub for their clients. They are currently hiring for a Site Reliability Engineer with good experience maintaining AWS infrastructure. This position is fully remote, but the office is open ifyou would like to go in. ...


  • eFinancialCareers London, United Kingdom

    Data is at the heart of Bloomberg's technologies, which produce, distribute and protect some of the most critical and valuable data in global business. The Storage Engineering teams design and maintain the systems which store, process and protect thatdata. · This is not a traditi ...


  • Experis LTD London, United Kingdom

    Responsibilities:_ · - Manage and monitor AWS infrastructure, particularly Lambda functions, to ensure the availability and reliability of services._ · - Develop and maintain infrastructure automation and configuration management tools to support a rapidly changing environment._ ...


  • eFinancialCareers London, United Kingdom

    Site Reliabilty Engineer Responsibilities: · - Own critical parts of our software development life-cycle such as build/deploy · - Facilitate individual development teams to build best-in-class cloud-native solutions · Site Reliabilty Engineer Requirements: · - Experience in an em ...


  • eFinancialCareers London, United Kingdom

    Join us as a Senior Site Reliability Engineer · - We'll look to you to establish and run a SRE function to help design, build, deliver and run highly reliable, scalable and secure software systems · - This is a great opportunity to hone your existing engineering skills and advanc ...


  • Amazon Talent Acquisition London, United Kingdom

    Amazon Operations sits at the heart of the Amazon customer experience. We look after everything from the moment a customer clicks buy, to the moment their item is delivered - from desktop to doorstep. · Across Europe we have more than 50 Fulfillment Centers, hundreds of Delivery ...


  • Amazon Talent Acquisition London, United Kingdom

    Amazon Operations sits at the heart of the Amazon customer experience. We look after everything from the moment a customer clicks buy, to the moment their item is delivered - from desktop to doorstep. · Across Europe we have more than 50 Fulfillment Centers, hundreds of Delivery ...


  • NonStop Consulting Ltd London, United Kingdom

    Hi all, we are currently recruiting for Digital Site Reliability Engineer to join Government Department on a contract for 6 months, fully remote work. · Essentials skills: · - experience with Terraform, CI, CD; · - leading assessments; · - programming; · - eligibility for SC Clea ...