Research Scientist, Science of Post-Training and Reinforcement Learning - London, UK
1 day ago

Job description
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.
Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.
Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Access all high-level positions and get the job of your dreams.
Similar jobs
Reinforcement Learning · We are seeking experts in AI, Machine Learning, MLOps, Software Engineering and Data Science to join our client's team as they define their 2026 technical roadmap. · Achieve breakthroughs in high-DOF autonomous systems and embodied AI through collaborativ ...
2 days ago
This is a job for a Reinforcement Learning Engineer at a fast-growing AI infrastructure startup. The ideal candidate should possess deep expertise in PyTorch and other relevant skills. · Design and implement scalable RLOps platform architectureIntegrate diverse ML libraries and e ...
1 month ago
This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers. · You will lead the development of a first-of-its-kind RLOps platform, · designing scalable infrastructure for RL model training and LLM finetuning. · ...
1 month ago
All3 is transforming how buildings are conceived, developed and delivered. We combine AI-powered design with robotic prefabrication and on-site assembly to build custom architecture at the cost and speed of mass production -- unlocking even the most complex sites. · We're current ...
1 week ago
Reinforcement Learning Control Engineer · All3 is transforming how buildings are conceived, developed and delivered. We combine AI-powered design with robotic prefabrication and on-site assembly to build custom architecture at the cost and speed of mass production -- unlocking ev ...
3 hours ago
All3 is transforming how buildings are conceived, developed and delivered. We combine AI-powered design with robotic prefabrication and on-site assembly to build custom architecture at the cost and speed of mass production -- unlocking even the most complex sites. · We're current ...
6 days ago
Research Scientist, Autonomous Agents — Reinforcement Learning
Only for registered members
· Snapshot · We are looking for Research Scientists to join the Autonomous Agents team, and produce research in the development of next-generation technologies to power increasingly open-ended autonomous agents which strive to assist, support, and supplement humans in their dail ...
6 days ago
Research Scientist, Science of Post-Training and Reinforcement Learning
Only for registered members
Snapshot · We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work. · This ...
3 days ago
AI Research Engineer - Reinforcement Learning (100 Remote)
Only for registered members
We're not just building products, we're pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. · ...
1 month ago
Research Scientist, Science of Post-Training and Reinforcement Learning
Only for registered members
· Snapshot · We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work. · T ...
1 day ago
Research Scientist, Science of Post-Training and Reinforcement Learning
Only for registered members
Snapshot · We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work. · This ...
2 days ago
2026 Machine Learning Center of Excellence (Time Series & Reinforcement Learning) - Summer Associate
Only for registered members
The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey.As a part of CDAO, The Machine Learning Center of Excellence (MLCOE) partners across the firm to shape, create, and deploy Machine Learning Solutions f ...
1 month ago
2026 Machine Learning Center of Excellence (Time Series & Reinforcement Learning) - Summer Associate
Only for registered members
Job Description · The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. As a part of CDAO, The Machine Learning Center of Excellence (MLCOE) partners across the firm to shape, create, and deploy Machine L ...
1 week ago
2026 Machine Learning Center of Excellence (Time Series & Reinforcement Learning) - Summer Associate
Only for registered members
The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. · ...
1 month ago
2026 Machine Learning Center of Excellence (Time Series & Reinforcement Learning) - Summer Associate
Only for registered members
The Chief Data & Analytics Office at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. · As a part of CDAO, The Machine Learning Center of Excellence (MLCOE) partners across the firm to shape, create, and deploy Machine Learning Solutions for o ...
1 month ago
We are on a mission to accelerate reinforcement learning for building superhuman artificial intelligence systems. · We believe that reinforcement learning will form a part of every sophisticated AI system of the future.It already impacts the world we live in, from its use in crea ...
2 weeks ago
At AgileRL, we are on a mission to accelerate reinforcement learning for building superhuman artificial intelligence systems. · We believe that reinforcement learning will form a part of every sophisticated AI system of the future. It already impacts the world we live in, from i ...
1 week ago
At AgileRL, we are on a mission to accelerate reinforcement learning for building superhuman artificial intelligence systems. · We believe that reinforcement learning will form a part of every sophisticated AI system of the future. It already impacts the world we live in, from i ...
4 days ago
At AgileRL, we are on a mission to accelerate reinforcement learning for building superhuman artificial intelligence systems. · We believe that reinforcement learning will form a part of every sophisticated AI system of the future. It already impacts the world we live in, from i ...
4 days ago
We are looking for Members of Technical Staff with strong academic backgrounds to join our team and lead research initiatives on training foundation models for a brand new enterprise agent. ...
1 month ago