Research Scientist, Science of Post-Training and Reinforcement Learning - London, UK

Only for registered members London, UK, United Kingdom

1 day ago

Default job background
Snapshot · We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work. · This ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company

    Reinforcement Learning

    Randstad Technologies

    Reinforcement Learning · We are seeking experts in AI, Machine Learning, MLOps, Software Engineering and Data Science to join our client's team as they define their 2026 technical roadmap. · Achieve breakthroughs in high-DOF autonomous systems and embodied AI through collaborativ ...

    Charing Cross

    2 days ago

  • Work in company

    Reinforcement Learning Engineer

    Only for registered members

    This is a job for a Reinforcement Learning Engineer at a fast-growing AI infrastructure startup. The ideal candidate should possess deep expertise in PyTorch and other relevant skills. · Design and implement scalable RLOps platform architectureIntegrate diverse ML libraries and e ...

    London Full time

    1 month ago

  • Work in company

    Reinforcement Learning Engineer

    Only for registered members

    This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers. · You will lead the development of a first-of-its-kind RLOps platform, · designing scalable infrastructure for RL model training and LLM finetuning. · ...

    London, England

    1 month ago

  • Work in company

    Reinforcement Learning Control Engineer

    Only for registered members

    All3 is transforming how buildings are conceived, developed and delivered. We combine AI-powered design with robotic prefabrication and on-site assembly to build custom architecture at the cost and speed of mass production -- unlocking even the most complex sites. · We're current ...

    London

    1 week ago

  • Reinforcement Learning Control Engineer · All3 is transforming how buildings are conceived, developed and delivered. We combine AI-powered design with robotic prefabrication and on-site assembly to build custom architecture at the cost and speed of mass production -- unlocking ev ...

    London

    3 hours ago

  • Work in company

    Reinforcement Learning Control Engineer

    Only for registered members

    All3 is transforming how buildings are conceived, developed and delivered. We combine AI-powered design with robotic prefabrication and on-site assembly to build custom architecture at the cost and speed of mass production -- unlocking even the most complex sites. · We're current ...

    London Area

    6 days ago

  • Work in company

    Research Scientist, Autonomous Agents — Reinforcement Learning

    Only for registered members

    · Snapshot · We are looking for Research Scientists to join the Autonomous Agents team, and produce research in the development of next-generation technologies to power increasingly open-ended autonomous agents which strive to assist, support, and supplement humans in their dail ...

    London, UK

    6 days ago

  • Snapshot · We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work. · This ...

    London Full time

    3 days ago

  • Work in company

    AI Research Engineer - Reinforcement Learning (100 Remote)

    Only for registered members

    We're not just building products, we're pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. · ...

    London

    1 month ago

  • · Snapshot · We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work. · T ...

    London, UK

    1 day ago

  • Snapshot · We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work. · This ...

    London, England

    2 days ago

  • The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey.As a part of CDAO, The Machine Learning Center of Excellence (MLCOE) partners across the firm to shape, create, and deploy Machine Learning Solutions f ...

    London

    1 month ago

  • Job Description · The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. As a part of CDAO, The Machine Learning Center of Excellence (MLCOE) partners across the firm to shape, create, and deploy Machine L ...

    Greater London, England

    1 week ago

  • The Chief Data & Analytics Office (CDAO) at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. · ...

    London E JP

    1 month ago

  • The Chief Data & Analytics Office at JPMorgan Chase is responsible for accelerating the firm's data and analytics journey. · As a part of CDAO, The Machine Learning Center of Excellence (MLCOE) partners across the firm to shape, create, and deploy Machine Learning Solutions for o ...

    London Full time

    1 month ago

  • Work in company

    Machine Learning Engineer

    Only for registered members

    We are on a mission to accelerate reinforcement learning for building superhuman artificial intelligence systems. · We believe that reinforcement learning will form a part of every sophisticated AI system of the future.It already impacts the world we live in, from its use in crea ...

    London

    2 weeks ago

  • Work in company

    Machine Learning Engineer

    Only for registered members

    At AgileRL, we are on a mission to accelerate reinforcement learning for building superhuman artificial intelligence systems.‍ · We believe that reinforcement learning will form a part of every sophisticated AI system of the future. It already impacts the world we live in, from i ...

    London £50,000 - £90,000 (GBP) per year

    1 week ago

  • Work in company

    Front-end Engineer

    Only for registered members

    At AgileRL, we are on a mission to accelerate reinforcement learning for building superhuman artificial intelligence systems.‍ · We believe that reinforcement learning will form a part of every sophisticated AI system of the future. It already impacts the world we live in, from i ...

    London £38,000 - £75,000 (GBP) per year

    4 days ago

  • Work in company

    DevOps Engineer

    Only for registered members

    At AgileRL, we are on a mission to accelerate reinforcement learning for building superhuman artificial intelligence systems.‍ · We believe that reinforcement learning will form a part of every sophisticated AI system of the future. It already impacts the world we live in, from i ...

    London £50,000 - £90,000 (GBP) per year

    4 days ago

  • Work in company

    Software Engineering LMTS

    Only for registered members

    We are looking for Members of Technical Staff with strong academic backgrounds to join our team and lead research initiatives on training foundation models for a brand new enterprise agent. ...

    London

    1 month ago