AI Inference Engineer - London

Only for registered members London, United Kingdom

5 days ago

Job summary

We are looking for an AI Inference Engineer to join our growing team.

Responsibilities

  • Benchmark and address bottlenecks throughout our inference stack
  • Develop APIs for AI inference that will be used by both internal and external customers
  • Explore novel research and implement LLM inference optimizations

Qualifications

  • Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization)

Similar jobs

  • Work in company

    AI Inference Engineer

    Only for registered members

    We are looking for an AI inference engineer to join our growing team. · ...

    London, England

    5 days ago

  • Work in company

    AI Inference Engineer

    Only for registered members

    We are looking for an AI Inference engineer to join our growing team in London. Opportunity to work on large-scale deployment of machine learning models. · Develop APIs for AI inference that will be used by both internal and external customers · Benchmark and address bottlenecks ...

    London

    2 weeks ago

  • Work in company

    Staff Software Engineer, Inference

    Only for registered members

    Anthropic's mission is to create reliable, interpretable, and steerable AI systems. Our Inference team builds critical systems serving Claude to millions of users worldwide. · ...

    London, UK

    1 week ago

  • Work in company

    Staff Software Engineer, Inference

    Only for registered members

    Anthropic's mission is to create reliable AI systems that are safe and beneficial for users and society as a whole. · ...

    Greater London, England

    2 weeks ago

  • Work in company

    Staff Software Engineer, Inference

    Only for registered members

    We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request routing to fleet-wide orchestration across diverse AI accelerators. · ...

    London £325,000 - £390,000 (GBP) Full time

    2 weeks ago

  • Work in company

    Lead AI Inference Engineer

    Only for registered members

    Tether is pioneering a global financial revolution by building cutting-edge solutions that empower businesses to seamlessly integrate reserve-backed tokens across blockchains. · ...

    London

    1 month ago

  • Work in company

    Site Reliability Engineer, Inference Infrastructure

    Only for registered members

    We are looking for a Site Reliability Engineer to join the Model Serving team at Cohere. The team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy-to-use API endpoints. · ...

    London

    1 month ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    We are looking for Members of Technical Staff to join the Model Serving team at Cohere. · ...

    London, England

    1 month ago

  • Work in company

    Staff Software Engineer, Inference Infrastructure

    Only for registered members

    We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Cohere is a team of researchers, engineers, designers, and more, passionate about their craft. Each pe ...

    London

    1 month ago

  • Work in company

    Full-Stack Software Engineer, Inference

    Only for registered members

    We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...

    London

    1 month ago

  • Work in company

    Site Reliability Engineer, Inference Infrastructure

    Only for registered members

    We are looking for a Site Reliability Engineer to join the Model Serving team at Cohere. · We obsess over what we build and like to work hard and move fast to do what's best for our customers. · Cohere is a team of researchers, engineers, designers, and more who are passionate about ...

    London, England

    1 month ago

  • Work in company

    Full-Stack Software Engineer, Inference

    Only for registered members

    We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...

    London, England

    1 month ago

  • Work in company

    Senior AI Inference Engineer ( specialist) - 100% Remote

    Only for registered members

    You'll work on the C++ layer that powers local AI, porting and enhancing inference engines like ONNX and similar, to run efficiently on edge devices. · ...

    London

    1 month ago

  • Work in company

    Senior ML Infrastructure Engineer

    Only for registered members

    We are partnering with a robotics and AI company building the core software infrastructure that enables advanced AI models to operate reliably in real-world robotic systems. We need a Senior ML Infrastructure Engineer to design, build and optimise the training and inference platfo ...

    London

    1 month ago

  • Work in company

    Machine Learning Engineer

    Only for registered members

    Experienced ML Infrastructure Engineer to support the deployment, optimisation and scaling of advanced machine learning models in production environments. · ...

    London

    1 week ago

  • Work in company

    Machine Learning Engineer

    Only for registered members

    We're hiring one of the first technical engineers for a pioneering AI startup building a foundation model that fully automates development. · ...

    London

    1 week ago

  • Work in company

    AI Engineer

    Only for registered members

    We are seeking an AI Engineer to join our Global Analytics team in London. This role is focused on the end-to-end lifecycle of production-grade AI, from training and fine-tuning specialized models to architecting high-performance inference pipelines. · Model Training & Fine-Tunin ...

    London Full time

    2 weeks ago

  • Work in company

    AI Engineer

    Only for registered members

    We are seeking an AI Engineer to join our Global Analytics team in London. This role is focused on the end-to-end lifecycle of production-grade AI, from training and fine-tuning specialized models to architecting high-performance inference pipelines. · We require a candidate who v ...

    London Full time

    2 weeks ago

  • Work in company

    Engineering Manager

    Only for registered members

    We are committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives. Our vision is to create autonomy that propels the world forward. · ...

    London

    1 month ago

  • Work in company

    Software Engineer

    Only for registered members

    Meta is seeking a Software Engineer to join our team. The ideal candidate is someone with experience working on maximizing performance of AI models on GPUs or custom silicon. · The AI Applications Engineering team is dedicated to maximizing training and inference performance of G ...

    London Full time

    3 weeks ago