- Build and operate large-scale data ingestion systems for pre-training, including web crawling, extraction, and dataset delivery
- Run experiments to evaluate crawling strategies, extraction methods, and ingestion tradeoffs
- Analyze ingested data to identify gaps, redundancy, and areas to improve
- Build ingestion pipelines that scale reliably across large data campaigns
- Develop specialized crawlers for high-priority data sources
- Review code, debug production issues, and continuously improve ingestion infrastructure
- Curious about how training data influences model capabilities, and can iterate quickly based on measurable downstream impact
- Able to collaborate tightly across functions: researchers, infra, operations, and external partners
- Enjoy working in a hybrid research–engineering role
- Experience building web crawling, data ingestion, or large-scale data acquisition systems using Ray, Beam, Spark, or similar technologies
- Familiarity with how LLMs are trained and evaluated, and an intuition for what makes data useful for training
- Comfortable working with very large datasets (multi-TB to PB scale) and building systems that are observable, testable, and maintainable
- Comfortable designing experiments and using data to guide system improvements
- Excellent communication skills. You can explain system behavior. You consider and communicate tradeoffs clearly
- Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally
- Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance
- Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning
- Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time
- Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.
-
Member of Technical Staff
1 month ago
Only for registered members LondonWe are seeking a highly skilled Member of Technical Staff to lead the development of advanced agentic workflows that will transform how scientists interact with our platform. · ...
-
Member of Technical Staff
1 month ago
Only for registered members London, EnglandAt Microsoft AI our health team is on a mission to help millions of users better understand and proactively manage their health and wellbeing. · We're responsible for ensuring that Microsoft AI's models and services are useful trusted and safe across diverse health journeys. For ...
-
Member of Technical Staff
3 weeks ago
Only for registered members LondonWe are looking for a highly skilled Member of Technical Staff to lead the development of cutting-edge workflows. · You will build autonomous systems that can navigate complex scientific tasks entirely through natural language conversation. · ...
-
Member of Technical Staff
4 weeks ago
Only for registered members London AreaWe are seeking a highly skilled Member of Technical Staff to lead the development of advanced agentic workflows that will transform how scientists interact with our platform. · You will design autonomous systems capable of navigating complex scientific tasks — from retrieving str ...
-
Member of Technical Staff
1 month ago
Only for registered members London, EnglandWe are seeking experienced High Performance Computing Engineers to join our team and contribute to the evolution of our personal AI, Copilot This role offers the unique opportunity to work on some of the largest scale supercomputers in the world. · Build secure and performant AI ...
-
Member of Technical Staff
1 month ago
Only for registered members LondonWe are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. · Have proven expertise in areas of interest evidenced by an exceptional publication track record and significant technical leadership in high-i ...
-
Member of Technical Staff
1 month ago
Only for registered members London, England+Develop and implement cutting-edge safety methodologies and mitigations for products that are served to millions of users through Copilot every day. · ...
-
Member of Technical Staff
1 week ago
Only for registered members LondonTessl is building the pioneering platform for AI Native software development. We're looking for a Software Engineer to join our engineering team and build real-time AI Native workflows, · Design and implement new functionality that you discussed and reviewed with the product team ...
-
Member of Technical Staff
1 month ago
Only for registered members London, EnglandWe build open superintelligence and make it accessible to all. · We're developing open weight models for individuals, agents, · enterprises and even nation states.Conduct critical comparative analysis · to advance our understanding · of model capabilities. · Build evaluation sys ...
-
Member of Technical Staff
1 month ago
Only for registered members London, EnglandWe are building open superintelligence and making it accessible to all. · We're developing open weight models for individuals, agents, enterprises, and even nation states. · We want you to do the most impactful work of your career with the confidence that you and the people you c ...
-
Member of Technical Staff
2 weeks ago
Only for registered members London, EnglandWe are seeking passionate and talented Senior Software Engineers to join our multimodal team. · Crafting user experiences that highlight new applications of state-of-the-art AI models shaping the future. · ...
-
Member of Technical Staff
2 days ago
Only for registered members LondonWe are seeking engineers and researchers to join our Pretraining Text Data team · ...
-
Member of Technical Staff
1 week ago
Only for registered members LondonWe are looking for a highly skilled Member of Technical Staff to lead the development of cutting-edge workflows. · You will build autonomous systems that can navigate complex scientific tasks entirely through natural language conversation. · In your role, you will architect and d ...
-
Member of Technical Staff
3 weeks ago
Only for registered members LondonWe aim to create a compact, talent-dense technical team to develop the next generation of frontier training methods. · ...
-
Member of Technical Staff
1 month ago
Only for registered members LondonWe're assembling a world-class team of builders with backgrounds in healthcare, big tech, and frontier AI research labs. Our goal is to translate cutting-edge research—like MAI-DxO )—into transformative products for millions of users across · Collaborate with AI researchers, pro ...
-
Member of Technical Staff
1 week ago
Only for registered members London Full timeWe aim to create a compact, talent-dense technical team to develop the next generation of frontier training methods. · We want to build in Europe, have impact in the Bay Area. We prioritise applicants interested in relocating to Budapest or London.domain expertise in either machi ...
-
Member of the Technical Staff
3 days ago
Only for registered members LondonWe engineer for five-nines reliability and bulletproof consistency, targeting the scale of a Tier-1 bank managing hundreds of millions of transactions daily. · AI isn't an add-on; it's woven into our fabric. We use it to power autonomous operations, create agentic development wor ...
-
Memeber of Technical Staff
1 week ago
Only for registered members LondonTessl is a fast-growing Series A startup based in London founded by Guy Podjarny. At Tessl we believe AI is transforming software development AI Native Developers will define features architecture and workflows in specs not code guiding the work of AI agents We re building the pi ...
-
Member of Technical Staff
1 month ago
Only for registered members London, EnglandWe're responsible for ensuring that Microsoft AI's models and services are useful, trusted and safe across diverse customer health journeys. · Bridge research and product: Translate cutting-edge AI research into prototypes and product opportunities. · Build evaluation systems: De ...
-
Member of Technical Staff
1 month ago
Only for registered members LondonBuild systems that transform powerful pre-trained models into aligned and general agents. · Drive research and engineering initiatives that push the frontier of post-training, from data curation to large-scale optimization. · Collaborate across pre-training and post-training team ...
-
Member of technical staff
1 week ago
Only for registered members London+Job summary · H is hiring the world's best AI talent to shape the future of superintelligent AI. · +Develop and train advanced LLMs and VLMs. · Research and implement training methods for enhanced capabilities. · ...
Member of Technical Staff - Greater London - Reflection AI
Description
Overview
Our Mission Reflection's mission is to build open superintelligence and make it accessible to all. We're developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.
About The Role Data is playing an increasingly crucial role at the frontier of AI innovation. Many of the most meaningful advances in recent years have come not from new architectures, but from better data. As a member of the Data Team, your mission is to build and operate the ingestion systems that turn the open web and other large-scale data sources into reliable, well-structured corpora for training frontier models. You will own the machinery that acquires, extracts, normalizes, versions, and delivers data to our pre-training pipelines. You'll work directly with world-class researchers to close the loop between what we collect and how it impacts model performance.
This role is ideal for engineers who love building robust distributed systems, but who also want to run experiments, reason about tradeoffs in data acquisition, and iterate quickly based on measurable impact.
Working closely with our pre-training and data quality teams, you will:
About You
Skills And Qualifications
What We Offer
#J-18808-Ljbffr
-
Member of Technical Staff
Only for registered members London
-
Member of Technical Staff
Only for registered members London, England
-
Member of Technical Staff
Only for registered members London
-
Member of Technical Staff
Only for registered members London Area
-
Member of Technical Staff
Only for registered members London, England
-
Member of Technical Staff
Only for registered members London
-
Member of Technical Staff
Only for registered members London, England
-
Member of Technical Staff
Only for registered members London
-
Member of Technical Staff
Only for registered members London, England
-
Member of Technical Staff
Only for registered members London, England
-
Member of Technical Staff
Only for registered members London, England
-
Member of Technical Staff
Only for registered members London
-
Member of Technical Staff
Only for registered members London
-
Member of Technical Staff
Only for registered members London
-
Member of Technical Staff
Only for registered members London
-
Member of Technical Staff
Full time Only for registered members London
-
Member of the Technical Staff
Only for registered members London
-
Memeber of Technical Staff
Only for registered members London
-
Member of Technical Staff
Only for registered members London, England
-
Member of Technical Staff
Only for registered members London
-
Member of technical staff
Only for registered members London