AI Inference Engineer - London
5 days ago

Job summary
We are looking for an AI Inference engineer to join our growing team.+
Responsibilities
- Benchmark and address bottlenecks throughout our inference stack
- Develop APIs for AI inference that will be used by both internal and external customers
- Explore novel research and implement LLM inference optimizations
Qualifications
- Familiarity with common LLL architectures and inference optimization techniques (e.g. continuous batching, quantization etc.)
Job description
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.Get full accessAccess all high-level positions and get the job of your dreams.
Similar jobs
We are looking for an AI inference engineer to join our growing team. · ...
5 days ago
We are looking for an AI Inference engineer to join our growing team in London. Opportunity to work on large-scale deployment of machine learning models. · Develop APIs for AI inference that will be used by both internal and external customers · Benchmark and address bottlenecks ...
2 weeks ago
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. Our Inference team builds critical systems serving Claude to millions of users worldwide. · ...
1 week ago
About Anthropic's mission is to create reliable AI systems that are safe and beneficial for users and society as a whole. · ...
2 weeks ago
We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request routing to fleet-wide orchestration across diverse AI accelerators. · ...
2 weeks ago
Tether is pioneering a global financial revolution by building cutting-edge solutions that empower businesses to seamlessly integrate reserve-backed tokens across blockchains. · ...
1 month ago
We are looking for a Site Reliability Engineer to join the Model Serving team at Cohere.The team is responsible for developing deploying and operating the AI platform delivering Cohere's large language models through easy to use API endpoints. · ...
1 month ago
We are looking for Members of Technical Staff to join the Model Serving team at Cohere. · ...
1 month ago
We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation semantic search RAG agents. · Cohere is a team of researchers engineers designers more passionate about their craft Each pe ...
1 month ago
We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...
1 month ago
We are looking for a Site Reliability Engineer to join the Model Serving team at Cohere. · We obsess over what we build and like to work hard and move fast to do what's best for our customers. · Cohere is a team of researchers engineers designers and more who are passionate about ...
1 month ago
We are training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. · ...
1 month ago
You'll work on the C++ layer that powers local AI, porting and enhancing inference engines like ONNX and similar, to run efficiently on edge devices. · ...
1 month ago
We are partnering with a robotics and AI company building the core software infrastructure that enables advanced AI models to operate reliably in real-world robotic systems.We need a Senior ML Infrastructure Engineer to design, build and optimise the training and inference platfo ...
1 month ago
Experienced ML Infrastructure Engineer to support the deployment, optimisation and scaling of advanced machine learning models in production environments. · ...
1 week ago
We're hiring one of the first technical engineers for a pioneering AI startup building a foundation model that fully automates development. · ...
1 week ago
We are seeking an AI Engineer to join our Global Analytics team in London. This role is focused on the end-to-end lifecycle of production-grade AI, from training and fine-tuning specialized models to architecting high-performance inference pipelines. · Model Training & Fine-Tunin ...
2 weeks ago
We are seeking an AI Engineer to join our Global Analytics team in London. This role is focused on the end-to-end lifecycle of production-grade AI from training and fine-tuning specialized models to architecting high-performance inference pipelines. · We require a candidate who v ...
2 weeks ago
We are committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives. Our vision is to create autonomy that propels the world forward. · ...
1 month ago
Meta is seeking a Software Engineer to join our team. The ideal candidate is someone with experience working on maximizing performance of AI models on GPUs or custom silicon. · The AI Applications Engineering team is dedicated to maximizing training and inference performance of G ...
3 weeks ago