Jobs
>
London

    Site Reliability Engineer - London, United Kingdom - paddle

    Default job background
    Description

    What do we do?

    Paddle offers SaaS companies a completely different approach to their payment infrastructure. Instead of assembling and maintaining a complex stack of payments-related apps and services, we're a Merchant of Record for our customers. That means we take away 100% of the pain of payment fragmentation. It's faster, safer, cheaper, and, above all, way better.

    We're backed by investors including KKR, FTV Capital, Kindred, Notion, and 83North and serve over 3000 software sellers in 245 territories globally.

    The Role:

    As a Site Reliability Engineer, you'll be helping to drive our product and engineering department forward, ensuring reliability on different parts of the Paddle platform and helping our Engineers to work better and more efficiently.

    Paddle SRE team's role is "Everything SRE", with a focus on infrastructure, reliability standards, and practices. SRE team is part of the Platform function. By following this model:

    • It's easy to spot patterns and draw similarities between services and projects.
    • We act as a glue between disparate product teams, creating solutions out of distinct pieces of software.
    • Enable product engineers to use DevOps practices to maintain user-facing products without divergence in practice across the business.
    • Define production standards as code and work to smooth out any sharp edges to greatly simplify things for the product engineers running their services.

    You are empowered to use the right tech for the job. You'll have the freedom to input into what technology and tooling are used and educate the rest of your colleagues accordingly.

    As an SRE, we want you to be a driving force of improving and automating how our Product teams develop software at all stages of its lifecycle, which we strive to achieve with strong collaboration and communication with our engineers.

    Tech Stack

    • Go for our new services
    • PHP/Laravel and Python/Django for our classic system
    • Docker in production and local development
    • AWS ECS Fargate and AWS EKS for our runtime
    • AWS SQS for our asynchronous message queues
    • AWS Eventbridge for our event bus
    • Aurora MySQL and PostgreSQL for persistent data storage
    • Redis for key/value store
    • Terraform for resource management
    • Cloudflare for our firewall and DNS server

    What you'll do:

    • Be on the on-call rotation to respond to incidents
    • Handle production incidents, author blameless postmortems and enrich operational playbooks and runbooks
    • Create, maintain and test our system disaster recovery process/
    • Monitoring, alerting and SLO tracking.
    • Developing tools to maximise engineering efficiency such as automating the deployment infrastructure.
    • Be an advocate of the GitOps methodology
    • Collaborate and enable engineers to do their jobs more efficiently
    • Seek out processes that can be improved with automation and have internal Developer Experience as main driver.

    We'd love to hear from you:

    • Have AWS experience, we use ECS/Fargate, EKS, EC2, Aurora RDS, S3 and Eventibridge excessively
    • Knowledge of platform and ops concepts such as networking and Linux administration
    • Experience working with microservices and distributed systems at scale
    • Experience with monitoring tools: we use Opentelemetry, Grafana, ELK, Pingdom and PagerDuty.

    Everyone is welcome at Paddle

    At Paddle, we're committed to removing invisible barriers, both for our customers and within our own teams. We recognise and celebrate that every Paddler is unique and we welcome every individual perspective. As an inclusive employer, we don't care if, or where, you studied, what you look like or where you're from. We're more interested in your craft, curiosity, passion for learning and what you'll add to our culture. We encourage you to apply even if you don't match every part of the job ad, especially if you're part of an underrepresented group.

    Please let us know if there's anything we can do to better support you through the application process and in the workplace. We will do everything we can to support any accommodations needed. We're committed to building a diverse team where everyone feels safe to be their authentic self. Let's grow together.


    Why you'll love working at Paddle

    We are a diverse, growing group of Paddlers across the globe who pride ourselves on our transparent, collaborative and respectful culture.

    We live and breathe our values, which are:

    Exceptional Together

    Execute with impact

    Better than Yesterday

    We offer a full suite of benefits, including attractive salaries, stock options, retirement plans, private healthcare and well-being initiatives.

    We are a 'digital-first' company , which means you can work remotely, from one of our stylish hubs, or even a bit of both We offer all team members unlimited holidays and 4 months of paid family leave regardless of gender. We invest in learning and will help you with your personal development via constant exposure to new challenges, an annual learning fund, and regular internal and external training.

    #J-18808-Ljbffr


  • Explore Group London, United Kingdom

    **Lead Site reliability engineer - Fully remote - No sponsorship offered** · Role: Site Reliability engineer · Location: Fully remote · **Salary**: Up to £115,000 · **Responsibilities**: · - Design, build, and maintain scalable and highly available infrastructure on AWS · - Imple ...


  • Austin Werner Ltd London, United Kingdom

    Site Reliability Engineer - Global Media/Publishing business · We are seeking a Site Reliability Engineer for a globally leading Publishing business based in London. · My client has built their internal IT environment from ground up so is bespoke to the business with cutting edge ...


  • Lorien London, United Kingdom

    Site Reliability Engineer · Location: London (hybrid remote working) · **Salary**: Up to £100,000 + Very Generous Benefits Package · One of the fastest growing ecommerce organisation requires a Site Reliability Engineer to help be the glue between the companies Dev, QA and Produc ...


  • eFinancialCareers London, United Kingdom

    **Summary** · Not your usual type of investment manager, this innovative company looks beyond traditional finance and uses data science and technology to discover value in markets worldwide and develop sophisticated trading models. Scientists, technologists and academicscontinual ...


  • Nigel Frank International London, United Kingdom

    **Site Reliability Engineer/Team Manager - Hybrid - Up to £110,000.** · I am working with an insurance and technology consultancy who provide data-driven insight-let solutions to their customers to help them become more resilient and get the best possible performance for their bu ...


  • eFinancialCareers London, United Kingdom

    **Compensation**: Market-Leading & Competitive · **Summary** · Their Trade Desk production team is looking to hire a Production Reliability Engineer who can oversee all aspects of the real-time trading platform. · The successful Production Reliability Engineer will have a rigorou ...


  • Lorien London, United Kingdom

    This London based company strive to create a world class digital hub for their clients. They are currently hiring for a Site Reliability Engineer with good experience maintaining AWS infrastructure. This position is fully remote, but the office is open ifyou would like to go in. ...


  • eFinancialCareers London, United Kingdom

    **Summary** · Their Trade Desk production team is looking to hire a Production Reliability Engineer who can oversee all aspects of the real-time trading platform. · **Requirements**: · - 5+ years' relevant experience in IT ops, e.g. DevOps, Linux System Engineering, or Network En ...


  • eFinancialCareers London, United Kingdom

    Data is at the heart of Bloomberg's technologies, which produce, distribute and protect some of the most critical and valuable data in global business. The Storage Engineering teams design and maintain the systems which store, process and protect thatdata. · This is not a traditi ...


  • eFinancialCareers London, United Kingdom

    Join us as a Site Reliability Engineer · - We'll look to you to provide technical support for relevant platforms, activities, and processes relating to areas of your specialist knowledge · - You'll assist with creating and implementing effective and efficient ITSM processes, whil ...


  • Evermore Global London, United Kingdom

    **Site Reliability Engineer / Linux / VMWARE/ Elastic Search /** · **Location: Central London / Hybrid** · **Salary: Circa £80,000 + Benefits** · **Permanent** · World leading online media company are seeking a suitable Site Reliability Engineer to join their expanding team in Lo ...


  • Experis LTD London, United Kingdom

    Responsibilities:_ · - Manage and monitor AWS infrastructure, particularly Lambda functions, to ensure the availability and reliability of services._ · - Develop and maintain infrastructure automation and configuration management tools to support a rapidly changing environment._ ...


  • eFinancialCareers London, United Kingdom

    Site Reliabilty Engineer Responsibilities: · - Own critical parts of our software development life-cycle such as build/deploy · - Facilitate individual development teams to build best-in-class cloud-native solutions · Site Reliabilty Engineer Requirements: · - Experience in an em ...


  • eFinancialCareers London, United Kingdom

    Join us as a Senior Site Reliability Engineer · - We'll look to you to establish and run a SRE function to help design, build, deliver and run highly reliable, scalable and secure software systems · - This is a great opportunity to hone your existing engineering skills and advanc ...


  • Amazon Talent Acquisition London, United Kingdom

    Amazon Operations sits at the heart of the Amazon customer experience. We look after everything from the moment a customer clicks buy, to the moment their item is delivered - from desktop to doorstep. · Across Europe we have more than 50 Fulfillment Centers, hundreds of Delivery ...


  • Amazon Talent Acquisition London, United Kingdom

    Amazon Operations sits at the heart of the Amazon customer experience. We look after everything from the moment a customer clicks buy, to the moment their item is delivered - from desktop to doorstep. · Across Europe we have more than 50 Fulfillment Centers, hundreds of Delivery ...


  • eFinancialCareers London, United Kingdom

    Join us as a Streaming Site Reliability Engineer · - This is an exciting opportunity to use your technical expertise and collaborate with our colleagues to build effortless, digital first customer experiences · - Working in our Data & Analytics Service function, you'll collaborat ...


  • NonStop Consulting Ltd London, United Kingdom

    Hi all, we are currently recruiting for Digital Site Reliability Engineer to join Government Department on a contract for 6 months, fully remote work. · Essentials skills: · - experience with Terraform, CI, CD; · - leading assessments; · - programming; · - eligibility for SC Clea ...


  • eFinancialCareers London, United Kingdom

    We are responsible for life cycle management of the network architecture including planning, automation, implementation and monitoring. We work closely with other Engineering teams, the Chief Technology Office (CTO) and product managers across the enterprise.The NSRE team drives ...


  • NonStop Consulting Ltd London, United Kingdom

    This is an 6 months contract and mostly remote · *Due to the nature of the assignment details need to remain vague at this point, but the central requirements are: · Eligibility for getting SC Cleared · DevOps Engineering/ Cloud Engineering/ Infrastructure Engineering experience ...