IT Platform Team Lead - Edinburgh, United Kingdom - CALERO

CALERO
CALERO
Verified Company
Edinburgh, United Kingdom

2 weeks ago

Tom O´Connor

Posted by:

Tom O´Connor

beBee Recruiter


Description

Job Summary:


Calero is a global 24/7 business supporting 1,000's of clients using our developed platforms hosted in Azure, OCI and in country data centers.


The IT Platform Team Lead is responsible for the site reliability of these hosted platforms, implementing solutions to improve the stability and observability with a focus on areas which impact the customer's experience.


As a Team Lead you will be responsible for the development of the UK/US teams, running daily reviews, preventative maintenance, active monitoring, BAU activities and project work.

You will be required to work closely with other engineering teams participating in the DevOps lifecycle meetings to gain an understanding of the changes being deployed.

You will be required to lead investigations where there are issues impacting end user performance or availability of service, providing information from logs and analysis through your team to identify root cause and suggest a corrective course of action to first restore service then how to prevent further reoccurrence.


The role will also be hands on, where you will be expected to actively monitor the environment, analyze log data, review query performance, build and deploy services using terraform scripts and take on regular project work.


Duties and Responsibilities of the job:

  • Leading the IT Platform team and ensuring adequate cover to actively monitor and investigate issues during Calero's business hours.
  • Senior DBA (database administrator) where you will be required to; assess, investigate, maintain and configure the MS SQL Servers environment currently used to deliver the majority of hosted platform service today.
  • Report out performance issues and engage with SMEs (Subject Matter Experts) to review and determine the next course of action.
  • Manage and enhance Calero's Application Monitoring, working with engineering to develop alert thresholds and playbooks to run when triggered.
  • Identify and investigate single points of failure that could result in loss of service and work with Infrastructure and Engineering teams to mitigate.
  • Analyze current capacity and report out where thresholds are reached that would directly impact the hosted services.
  • Own, update and regularly review Calero's Service Catalogue for Platforms, this includes but not limited to; reoccurring issues and recovery, service data flows, network topology, logging and alerts.
  • Provide timely communication to the business where service outages occur and report on trending and recurring issues so these can be investigated by the relevant teams
  • Provide and manage an observability service so that platforms are actively monitored for performance and availability and teams using these services are aware how to raise concerns.
  • Develop performance baselines to measure clients against to understand areas which need investigated, or to alert development teams of potential issues as part of a regular report out at Development and Operation Meetings.
  • Present regular reporting to the RCA Board on recent issues, trends, improvements made, or areas which have degraded.

Education:


  • Certifications in Microsoft or other IT related exams preferred.

Personal strengths:


  • Patient and diplomatic when partnering with end users. Able to communicate concisely and competently, handling matters with the appropriate level of judgement and sensitivity
  • Confident leader and mentor, setting SMART tasks and delivering against project milestones
  • Ability to communicate effectively; build consensus, facilitate working sessions and negotiate solutions and alternatives
  • Demonstrated investigate and resolve mindset
  • Strong written and verbal communication skills in English
  • Technical, analysis and problemsolving mindset.

Experience and Knowledge:


  • Minimum of 5 years DBA using MS SQL Server, working with Enterprise and Standard edition, query analyzer, performance tuning, clustering, use and configuration of SQL Tools like Redgate, SolarWinds DPA.
  • Experience working with IIS, Linux and Windows operating systems
  • Minimum of 4 years using and developing monitoring tools such as; Azure Monitor, Application Insights, SolarWinds or similar cloud/Application Performance Monitoring.
  • Strong troubleshooting abilities for both systems and networking and interpreting/analyzing end user descriptions of problems.

Contact with Others

  • All levels of management, including C-Level, Business Unit leadership and Information Technology team members

Management Responsibility

  • Leadership role, which includes mentoring staff, triaging and planning their work tasks, chairing regular meetings and dealing with escalations.

Confidentiality:


  • This position will be exposed to extremely confidential information and discretion is paramount. A confidentiality, noncompete, and nonsolicitation agreement will be required.

BENEFITS

  • Private Healthcare for you and your immediate family
  • Gym Membership at

More jobs from CALERO