    Technical Duty Officer, Network Operations - London, United Kingdom - Box

    Box
    Full time
    Description


    WHAT IS BOX?

    Box is the market leader for Cloud Content Management. Our mission is to power how the world works together. Box is partnering with enterprise organizations to accelerate their digital transformation by creating a single platform for secure content management, collaboration and workflow. We have an amazing opportunity to further establish ourselves as leaders in the space, and we need strong advocates to help us achieve that goal.

    By joining Box, you will have the unique opportunity to help capture a majority of this developing market and define what content management looks like for the digital enterprise. Today, Box powers over 97,000 businesses, including 70% of the Fortune 500 who trust Box to manage their content in the cloud.

    WHY BOX NEEDS YOU

    Box is looking for a dynamic Global Site Reliability Technical Duty Officer to help lead our Global Technical Operations and oversee the continuous health, availability, and reliability of our industry-leading platforms and SaaS offerings. It is the responsibility of the TDO team to lead 24x7 GTOC teams in preventing, monitoring, identifying, troubleshooting, mitigating, and resolving issues that affect the availability and quality of Box's platforms and services.

    This role is an integral shift-based leader and the single point of technical escalation within the GTOC organization, assuming accountability for overall production site health and the performance of core customer-facing journeys. You will help maintain total site awareness, detect metric and service deviations, provide the final level of change approval, and proactively identify potential issues, resolving them before they escalate to customer-impacting incidents.

    We are building a world-class Operations Center and need the best talent possible to get us there. That's where you come in.

    WHAT YOU'LL DO

    • Own and direct live-site Major Incident Management from detection, identification, escalation, mitigation, and recovery.
    • Triage, refine, and verify the Problem Statement, notify and coordinate the efforts of all appropriate SME resources, and lead cross-functional Incident Bridges to quickly identify and mitigate the problem and restore service. You'll be evaluated on how well you are able to reduce MTTD to MTTR.
    • Ensure accurate, valid and timely communication to key stakeholders and business entities.
    • Lead daily Incident and Change ticket reviews, coordinate and monitor change windows, and coordinate with Problem Management on TopOps Issues and action items.
    • Operate across organizational boundaries (Business, Dev, Ops, CS) to protect our customers, their data, and the availability of all Box services, from internal and external security threats, unanticipated volume surges, and significant performance issues.
    • Troubleshoot and identify critical problems in a SOA/API-based, global hybrid cloud, distributed edge architecture across multiple enterprise and public cloud regions.
    • Provide day to day technical expertise and experience to the organization to address issues in globally diverse, high velocity 24x7 environments - from policy and procedural decisions to key architectural and tooling insights to improve Box's Incident, Change, and Problem Management engineering capabilities.
    • Lead daily reviews of planned changes (CAB) in Jira; accountable for reviewing and minimizing change risk, ensuring adequate and appropriate change timing and duration, and complete rollout, validation, and rollback plans that are optimized to prevent site or service impact.
    • Ensure all customer-impacting Incident tickets are completely and correctly documented and augmented with appropriate metrics, timelines, actions taken, and actions still pending.
    • Contribute to and review Incident postmortems to ensure adequate documentation and appropriate prioritization of action items related to reducing MTTI, MTTM and MTTR.
    • Participate in Problem Management scrums and Postmortems to identify leading organizational and company-wide technical issues, threats, and trends that block the ability of the organization or teams to perform their roles and provide services optimally and reliably.
    • Lead projects to improve tools and processes related to overall site and service manageability, observability, and resiliency.
    • Coordinate regularly with Infosec, Customer Success, Platform and Dev leaders to continuously assess new security and customer on-boarding threats and known issues.
    • Continuously mentor and train Global NOC and system engineers.

    WHO YOU ARE

    • You have 5+ years of large-scale production/platform operations experience in large SaaS provider environments, preferably as a TDO/Major Incident Manager, SRE team leader, or Infrastructure (IaaS) or Platform (PaaS) Architecture SME in a Managed Service Provider environment.
    • Experience in bare metal, OpenStack, and K8s architectures supporting a large number of SOA/API-based services.
    • Exposure to Open Source Service Meshes, Proxies, Caching, Message Buses (Kafka, MQS), NoSQL (HBase, Hadoop), MySQL clusters, and Search environments (Solr, ES).
    • You should be competent in debugging global, distributed Web/API sites based on Linux systems (Ubuntu, RHEL, CentOS), BGP, iBGP, and IP Anycast networking in multi-vendor virtualized, Edge and hybrid public cloud architectures.
    • You are not expected to be an expert in all areas, but you should be familiar with common terminologies, processes, and architectures in Linux Open Source environments, and have a thorough understanding of Virtualization, Containers, and Kubernetes.
    • You are confident and comfortable communicating and interacting with everyone from individual contributors to C-level executives, across multiple countries, ethnicities, and backgrounds.
    • You have a rock solid command presence and are calm and collected in highly stressful situations, such as a major service outage.
    • You're driven to continuously learn new skills and technologies.
    • Bachelor's degree in Computer Science or Information Systems or equivalent technical field, or similar work experience in a large-scale 24/7 production environment supporting critical, real-time applications.
    • Flexibility to work different shifts and provide weekend coverage depending on need.

    Required Skills

    • Solid understanding of ITILv4 Service Lifecycle Management, Service Delivery KPIs, SLIs, SLOs, and Incident, Change, and Problem Management framework, terminology, tools (ServiceNow, Remedy, Jira Service Desk), and processes
    • Solid knowledge and understanding of security standards and best practices, such as: OWASP, W3C, ISO 27001, SOC1-2, PCI, and SOX
    • Ability to troubleshoot secured protocols such as: SSH, SSO, TLS, FTPS, WebDAV, HTTPS
    • Solid understanding and debugging skills in TCP/IP, BGP, IP Anycast, and distributed internal and external DNS
    • Two years working experience and knowledge with multi-regional public cloud providers
    • Experience with observability tools and distributed tracing in large-scale environments (Splunk, Datadog, Wavefront, Catchpoint, ThousandEyes, Sensu, SignalFx RUM, OpenTelemetry, SNMP)
    • Good understanding and experience with configuration management tools and CI/CD pipelines - Puppet, Ansible, Terraform, Artifactory
    • Excellent interpersonal and communication skills

    Desired Skills

    • Understanding of Agile methods and tools (Jira).
    • Experience with WAF, Bot Managers, and Content Delivery Networks (Cloudflare, Akamai)
    • Experience working in and transitioning into multi-regional hybrid cloud architectures (GCP preferred, AWS)
    • Understanding of Apache Zookeeper and Hadoop.
    • Experience with large production Scala, Java, Node, PHP environments helpful.
    • Experience working with various message bus technologies (Kafka, RabbitMQ, MQS)
    • Experience working with relational and non-relational databases and search engines (MySQL, Postgres, HBase, Elasticsearch, Solr)
    • Experience with caching apps (Squid, Redis, Memcache)
    • Experience with service mesh technologies in a hybrid-cloud environment (Zookeeper, SmartStack)


    BENEFITS

    The Box Benefits package includes pension, medical and dental coverage. We have a robust wellness program including 25 days of vacation (plus your birthday off) and subsidized gym membership. There is such a thing as a free lunch: our in-house chef prepares this daily, along with lots of snacks and drinks. Our EMEA HQ office is located in the impressive White Collar Factory on Old Street, with European offices in Paris and Munich.

    EQUAL OPPORTUNITY

    We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability, or any other protected ground of discrimination under applicable human rights legislation. Box strives to respect the dignity and independence of people with disabilities and is committed to giving them the same opportunity to succeed as all other employees. Accommodations are available throughout the application process and an employee's employment at Box.

    For details on how we protect your information when you apply, please see our Personnel Privacy Notice.





  • Morgan Stanley

    Network Operations

    3 weeks ago


    Morgan Stanley London, United Kingdom

    Network Operations - Corvil · Job Number: · 3232887 · POSTING DATE: Mar 13, 2023 · PRIMARY LOCATION: Europe, Middle East, Africa-United Kingdom-United Kingdom-London · EDUCATION LEVEL: Bachelor's Degree · JOB: Production Management and Operational Support · EMPLOYMENT TYPE: Full ...


  • ITP London, United Kingdom

    **The Opportunity** · BAI Communications are recruiting for two Network Operations Apprentices to join their team in London. · The Level 4 Network Engineer apprenticeship programme that we are offering will allow you to kick-start your career in an industry that is constantl ...


  • ITP London, United Kingdom

    **The Opportunity** · Our client is recruiting for a Network Operations Apprentice to join their team in London. · The Level 4 Network Engineer apprenticeship programme that we are offering will allow you to kick-start your career in an industry that is constantly evolving, d ...


  • MDE Consultants London, United Kingdom

    **Network Operations Support** · Location: London, SW1E · **Salary**: £25, ,000 pa · Hours: Full time · Experience: customer service / STEM background / people skills / project management skills · My client is on a mission to provide drivers with reliable and easy-to-use EV charg ...


  • Transport for London London, United Kingdom

    **Network Operations Coordinator** · **042913** · **Organisation** · - NETWORK MANAGEMENT CONTROL CENTRE · **Job** · - Administration · **Position Type** · - Full Time · **Location: Southwark, London** · **Salary: £33,800 (plus 23% non-pensionable shift allowance), plus benefits* ...


  • Transport for London London, United Kingdom Part time

    **Network Operations Coordinator (Part Time)** · **041479** · **Organisation** · - NETWORK MANAGEMENT CONTROL CENTRE · **Job** · - Administration · **Position Type** · - Part Time · **Location: Southwark, London** · **Salary: circa £16,900 per annum (plus 23% non-pensionable shif ...


  • The Green Recruitment Company London, United Kingdom

    Posted 19 June 2023 · Salary 30,000 · Location London · Job type Permanent · Discipline Renewable Energy & Infrastructure · Contact Name Noor Rizwan · The Green Recruitment Company is delighted to be partnered with one of the leading EV Charging infrastructure businesses in the UK ...


  • Transport for London London, United Kingdom

    **Organisation** **-** NETWORK MANAGEMENT CONTROL CENTRE · **Job** **-** Administration · **Position Type** **-** Full Time · **Location: Southwark, London (on-site)** · **Salary: £34,000 per annum, plus 23% non-pensionable shift allowance and benefits** · **Type: Permanent TfL c ...


  • Real Time Consultants Limited London, United Kingdom

    **Network Operations Engineer - London (Shift work with generous shift allowance) - Up to £68, % Annual Performance Bonus + Great Benefits** · A hugely exciting and innovative company that are looking to revolutionise the way technology and space interact are looking for a TAC En ...


  • Arc IT Recruitment London, United Kingdom

    **Security & Network Operations Lead** · **Remote - UK** · **£competitive plus 25% bonus plus benefits** · There is a wealth of opportunity to help mature and develop a security and network operations function within a fast paced and driven Information Security function and overa ...


  • Real Time Consultants Limited London, United Kingdom

    **Senior Network Operations Engineer (RF) - London - Shift Pattern - Up to £80k + Bonus & Great Benefits** · We are currently working with a global communications company who are revolutionising access to the internet by providing connectivity to those previously left behind. The ...


  • Box London, United Kingdom

    **WHAT IS BOX?** · Box is the market leader for Cloud Content Management. Our mission is to power how the world works together. Box is partnering with enterprise organizations to accelerate their digital transformation by creating a single platform for secure content management, ...


  • McArthurGlen London, United Kingdom

    **What you'll be doing...** · **Key Accountabilities** · - Supervise a team of analysts and external partners and act as a deputy for the SNOC Manager as and when required. · - Act as a point of escalation and mentor for junior members of team · - Monitor logging of events in the ...

  • GE Healthcare

    Network Operator

    2 weeks ago


    GE Healthcare Chalfont Saint Giles, United Kingdom

    **Job Description Summary**: As a Connectivity Trainee, you'll be responsible for facilitating remote access to our medical devices, allowing systems to be fully supported by the online engineers for monitoring, troubleshooting, and repairs on our life-changing medical equipment. ...

  • Hamilton Barnes Associates Limited

    Network Operations

    3 weeks ago


    Hamilton Barnes Associates Limited United Kingdom

    Join this team as a Contract Network Operations Engineer · Are you ready for an exciting contract opportunity with a leading IT Services provider? We are currently seeking a skilled Network Operations Engineer for an initial 12-month contract. · Join this IT Services provider to ...

  • Gigaclear

    Network Operations

    5 days ago


    Gigaclear England, United Kingdom

    Network Operations looks after the customer network and is responsible for maintaining both the physical and logical services to our customers. Reporting to the NOC Manager, we're a lean and efficient team who utilise peer reviewed process and supported tools and documentation to ...

  • Morgan Stanley

    Network Operations

    3 weeks ago


    Morgan Stanley London, United Kingdom Full time

    About Morgan Stanley · Morgan Stanley is a leading global financial services firm providing a wide range of investment banking, securities, investment management and wealth management services. As a market leader, the talent and passion of our people is critical to our success. T ...


  • Trust Payments Bromley, United Kingdom Full time

    Trust Payments have an exciting opportunity for a Network Operations Centre (NOC) Analyst to join the team. · Location: Bromley · Salary: Competitive + Shift Allowance · Job Type: Hybrid, 24/7 shifts including weekends (any antisocial hours can be worked from home) · Reporting to ...


  • 83zero Limited Woking, United Kingdom Full time

    As a Network Operations Engineer, you'll have fantastic opportunities to develop both yourself and our collective capabilities performing a mixture of RUN and Project activities with other likeminded Network Analysts · Network Operations Analyst · We have an exciting opportunity ...