AI Inference Engineer - London

London, United Kingdom

5 days ago

Job summary

We are looking for an AI inference engineer to join our growing team.

Responsibilities

Benchmark and address bottlenecks throughout our inference stack
Develop APIs for AI inference that will be used by both internal and external customers
Explore novel research and implement LLM inference optimizations

Qualifications

Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization)

Similar jobs

AI Inference Engineer

We are looking for an AI inference engineer to join our growing team. · ...

London, England

5 days ago

AI Inference Engineer

We are looking for an AI Inference engineer to join our growing team in London. Opportunity to work on large-scale deployment of machine learning models. · Develop APIs for AI inference that will be used by both internal and external customers · Benchmark and address bottlenecks ...

London

2 weeks ago

Staff Software Engineer, Inference

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. Our Inference team builds critical systems serving Claude to millions of users worldwide. · ...

London, UK

1 week ago

Staff Software Engineer, Inference

Anthropic's mission is to create reliable AI systems that are safe and beneficial for users and society as a whole. · ...

Greater London, England

2 weeks ago

Staff Software Engineer, Inference

We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request routing to fleet-wide orchestration across diverse AI accelerators. · ...

London £325,000 - £390,000 (GBP) Full time

2 weeks ago

Lead AI Inference Engineer

Tether is pioneering a global financial revolution by building cutting-edge solutions that empower businesses to seamlessly integrate reserve-backed tokens across blockchains. · ...

London

1 month ago

Site Reliability Engineer, Inference Infrastructure

We are looking for a Site Reliability Engineer to join the Model Serving team at Cohere. The team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy-to-use API endpoints. · ...

London

1 month ago

Staff Software Engineer, Inference Infrastructure

We are looking for Members of Technical Staff to join the Model Serving team at Cohere. · ...

London, England

1 month ago

Staff Software Engineer, Inference Infrastructure

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · Cohere is a team of researchers, engineers, designers, and more, passionate about their craft. Each pe ...

London

1 month ago

Full-Stack Software Engineer, Inference

We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...

London

1 month ago

Site Reliability Engineer, Inference Infrastructure

We are looking for a Site Reliability Engineer to join the Model Serving team at Cohere. · We obsess over what we build and like to work hard and move fast to do what's best for our customers. · Cohere is a team of researchers, engineers, designers, and more who are passionate about ...

London, England

1 month ago

Full-Stack Software Engineer, Inference

We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...

London, England

1 month ago

Senior AI Inference Engineer ( specialist) - 100% Remote

You'll work on the C++ layer that powers local AI, porting and enhancing inference engines like ONNX and similar, to run efficiently on edge devices. · ...

London

1 month ago

Senior ML Infrastructure Engineer

We are partnering with a robotics and AI company building the core software infrastructure that enables advanced AI models to operate reliably in real-world robotic systems. We need a Senior ML Infrastructure Engineer to design, build and optimise the training and inference platfo ...

London

1 month ago

Machine Learning Engineer

Experienced ML Infrastructure Engineer to support the deployment, optimisation and scaling of advanced machine learning models in production environments. · ...

London

1 week ago

Machine Learning Engineer

We're hiring one of the first technical engineers for a pioneering AI startup building a foundation model that fully automates development. · ...

London

1 week ago

AI Engineer

We are seeking an AI Engineer to join our Global Analytics team in London. This role is focused on the end-to-end lifecycle of production-grade AI, from training and fine-tuning specialized models to architecting high-performance inference pipelines. · Model Training & Fine-Tunin ...

London Full time

2 weeks ago

AI Engineer

We are seeking an AI Engineer to join our Global Analytics team in London. This role is focused on the end-to-end lifecycle of production-grade AI from training and fine-tuning specialized models to architecting high-performance inference pipelines. · We require a candidate who v ...

London Full time

2 weeks ago

Engineering Manager

We are committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives. Our vision is to create autonomy that propels the world forward. · ...

London

1 month ago

Software Engineer

Meta is seeking a Software Engineer to join our team. The ideal candidate is someone with experience working on maximizing performance of AI models on GPUs or custom silicon. · The AI Applications Engineering team is dedicated to maximizing training and inference performance of G ...

London Full time

3 weeks ago
