Operations Shift Engineer - London, United Kingdom - Netskrt Systems Inc.

    Netskrt Systems Inc.
    Netskrt Systems Inc. London, United Kingdom

    1 week ago

    Default job background
    Description
    Intelligent content collection, staging and distribution
    # Adaptive networking that leverages connectivity as and when available
    # An edge cache that allows users to access the content they want locally, using the apps and subscriptions that they already have

    We are a highly motivated team dedicated to delivering products and services that improve customer experience when accessing internet video at the edge of the network.

    Netskrt offers the opportunity to obtain hands-on experience with storage, networking, security, and cloud technologies.

    As an Operations Engineer, you are responsible for monitoring and maintaining the health of Netskrt's systems, investigating faults to resolution, and accepting new infrastructure and solutions as the system continues to grow and scale.

    You should be passionate not only about learning new technologies, but also about running systems and software in the real world.

    As an Operations Engineer, you are responsible for monitoring, supporting and maintaining system health.

    Your mission is to ensure that our service is highly available to end users by investigating and resolving issues generated by customers, event management monitoring solutions, or internal channels.

    The Live Operations team is at the heart of ongoing live monitoring and reporting; solving production problems; and building automation tools to monitor system health, execute production acceptance tests, and validate changes.

    You will work closely and collaborate with US and Canada based Service Reliability Engineering, Development, Customer Success and Infrastructure teams to ensure a holistic approach to troubleshooting and implement preventative measures to mitigate faults as swiftly as possible.

    In-depth troubleshooting of production issues to include occasionally joining live event bridges to troubleshoot with the Customer
    Develop and execute operational acceptance procedures for new edge solutions to ensure infrastructure is deployed to Production with zero service impact
    Escalation of unresolved issues and liaison with US and Canada based Operations & Engineering teams
    Continual evolution of managed services and operational procedures to improve and maintain quality standards and resolution times
    Deep dive analytics and metrics investigations as part of continual improvement initiatives to drive performance
    Degree in Computer Science or related technical field
    Minimum of 3-years' experience supporting, developing and deploying large scale software systems
    Solid experience in the use of Linux/Unix
    Deep understanding of internet and networking protocols (DNS, BGP)
    Experience with caching and CDN (content delivery network) technologies (Amazon, Limelight/Edgio, Akamai, Netflix, Fastly)
    Good understanding of video streaming protocols and technologies
    Experience with monitoring tools, e.g., Experience in system and server administration, large system deployments.
    Wide knowledge in networking, security, database and cloud systems
    Solid experience in use of fault tolerant approaches in a large-scale distributed environment and high-performance systems
    Configuration/container management (Kubernetes, Chef, Puppet, Mesos)
    Cloud computing and cloud technologies (AWS, OpenStack)
    #