Scientific Data Engineer - Boston, England, United Kingdom - Arrayo

    Arrayo
    Arrayo Boston, England, United Kingdom

    2 weeks ago

    Default job background
    Description

    Data Engineer

    We are seeking a skilled Data Engineer to join our cross-disciplinary team and contribute to the development of scalable, metadata-rich data products that adhere to FAIR principles.

    In this role, you will work closely with platform, scientific, and AI/ML teams to build and maintain curated datasets and robust data pipelines. These pipelines will support analytics, modeling, and deep learning within platforms such as AWS, Azure Databricks, and Domino Data Lab.

    Key Responsibilities:

    • Develop, deploy, and maintain scalable, cloud-native data pipelines for molecular modeling and related domains.
    • Operationalize architectural blueprints using modern orchestration frameworks and cloud services (AWS/Azure).
    • Partner with scientists, cheminformaticians, and data scientists to understand domain-specific requirements and deliver efficient, reusable data solutions.
    • Process and integrate molecular property datasets, embedding rich metadata to maximize downstream value for AI/ML applications.
    • Establish data quality, lineage, and governance standards in line with FAIR principles, ensuring reproducibility, traceability, and compliance.
    • Enable interactive dataset exploration through tools like Spotfire.
    • Shape schema design, enrich metadata, and develop APIs for reliable and flexible data access.
    • Optimize storage and compute performance across data lakes and warehouses (e.g., Delta Lake, Parquet, Redshift).
    • Document data contracts, pipeline logic, and operational best practices to ensure long-term sustainability and effective collaboration.

    Required Qualifications:

    • Demonstrated experience as a data engineer in biopharmaceutical or life sciences, particularly supporting drug discovery or translational research.
    • Hands-on work with molecular structure data, computed properties, simulation outputs, or imaging datasets.
    • Proficiency in Python (including Pandas or PySpark) and SQL, with exposure to ETL/orchestration tools such as Airflow or dbt.
    • Strong knowledge of cloud-native services on AWS (e.g., S3, Glue, Lambda, Athena) and Azure (Data Factory, Data Lake).
    • Track record of collaborating with scientific teams and translating research needs into scalable data solutions.

    Preferred Qualifications:

    • Experience with cheminformatics libraries (e.g., RDKit, Open Babel, CDK).
    • Familiarity with scientific data standards, ontologies, and best practices for metadata capture.
    • Understanding of data science workflows in computational chemistry, bioinformatics, or AI/ML-driven research.
    • Orchestration & ETL: Apache Airflow, Prefect
    • Scientific Libraries (Preferred): RDKit, Open Babel, CDK

    Seniority level

    • Mid-Senior level

    Employment type

    • Full-time

    Job function

    • Engineering, Research, and Information Technology
    • Biotechnology Research, Pharmaceutical Manufacturing, and IT Services and IT Consulting


  • Circle Boston, England, United Kingdom

    Job Description: · As a Staff Data Engineer at Circle, you will play a key role in shaping the company's data infrastructure and analytics capabilities. You will lead the design and implementation of scalable data pipelines and warehouses that power business functions, including ...


  • Cyvl Boston, England, United Kingdom

    Job Description · Cyvl is a technology company revolutionizing the way civil engineering firms and governments map and manage transportation infrastructure. Our enterprise-grade hardware and software solutions leverage 3D mapping sensors to capture LiDAR, imagery, and GPS data, r ...


  • Circle Boston, England, United Kingdom

    At Circle, we're pushing the boundaries of financial technology to create an inclusive future with transparency. · We're a global fintech company that enables value transfer like digital data—globally, almost instantly, and at lower costs than traditional systems. · What You'll B ...


  • Circle Boston

    Job Description: · Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data — globally, nearly instantly and less expensively than legacy settlement systems. · This groundbreaking new inter ...


  • Arrayo Boston

    Job Title: Data Engineer · We are seeking a skilled Data Engineer to join our cross-disciplinary team at Arrayo. In this role, you will play a central part in developing scalable, metadata-rich data products that adhere to FAIR principles, working hand-in-hand with platform, scie ...


  • Cyvl Boston

    Job Summary · Cyvl is a Boston-based tech startup revolutionizing the way civil engineering firms and governments map and manage transportation infrastructure. Our enterprise-grade hardware and software solutions leverage 3D mapping sensors to capture LiDAR, imagery, and GPS data ...


  • Circle Boston

    Software Engineer II, Data Platform · Circle is a financial technology company at the forefront of the emerging internet of money. Our infrastructure supports businesses, institutions, and developers in leveraging technological advances. · What You'll Be Part Of · We value visibi ...

  • Compare the Market

    Data Engineer

    1 week ago


    Compare the Market Peterborough, England, United Kingdom

    Our purpose is to make great financial decision making simple for everyone, driving us every day. · We're on a mission to create an automated quoting engine with a seamless experience wrapped in a brand everyone loves. · We change lives by making it easy to switch and save money ...

  • Experis UK

    Data Engineer

    7 hours ago


    Experis UK lincoln, england, United Kingdom £43,000 - £57,000 per year

    Job Description · Experis UK is seeking a skilled Data Engineer to join our Data Platforms team. The ideal candidate will have experience in designing, building, and maintaining scalable data pipelines and infrastructure. · The role requires strong technical expertise, problem-so ...

  • BJSS

    Data Engineer

    1 week ago


    BJSS lincoln, england, United Kingdom

    About Us · We are DataOps advocates who leverage software engineering best practices to build scalable and reusable data solutions that empower clients to derive insights, inform decisions, and drive business value. · Role Overview · This role combines the discipline of software ...


  • J.P. MORGAN-1 Wisbech Full time £90,000 - £108,000 per year

    Job Description · We are seeking a highly skilled Lead Data Engineer to join our Infrastructure Platforms organization. As a key member of an agile team, you will be responsible for enhancing, building, and delivering data collection, storage, access, and analytics solutions in a ...


  • Wakapi East Lindsey, England, United Kingdom

    As a Full Stack Engineer + Data Engineering expert, you will play a key role in shaping our technology landscape at Wakapi. This dynamic position combines hands-on software development with data pipeline design and implementation, contributing to both frontend/backend systems and ...

  • BJSS

    Data Engineer

    4 days ago


    BJSS Lincoln

    Bjss advocates for data operations and applies software engineering best practices to build scalable data solutions. · Our clients rely on us for complex challenges, allowing us to work with a wide range of tools and technologies. · Data engineers at Bjss combine software enginee ...

  • Compare the Market

    Data Engineer

    4 days ago


    Compare the Market Peterborough

    Transforming Financial Decision Making · Our purpose is to make great financial decision making effortless for everyone, driving us every day with a singular focus. · We aim to create an automated quoting engine that offers the simplest of experiences, paired with a brand that ev ...


  • Mace Group Peterborough, England, United Kingdom

    Mace Group Job Description · At Mace, our purpose is to redefine the boundaries of ambition. We believe in creating places that are responsible, bringing transformative impact to our people, communities, and societies across the globe. · Within our consult business, we harness ou ...


  • Jazz Pharmaceuticals Cambridgeshire and Peterborough, England, United Kingdom £100,000 - £120,000 per year

    We are seeking a highly skilled Senior Principal to lead projects related to data engineering requirements and initiatives across our Research and Development team. · Key Responsibilities · Design, develop and maintain data pipelines for processing Research and Development data f ...


  • Phoenix Resourcing Services Peterborough

    Job Title: Data Center Maintenance Technician · Location: Cambridge Area · Job Summary: · We are seeking a highly motivated and skilled individual to join our team as a Data Center Maintenance Technician in the Cambridge area. As a recession-proof industry with excellent growth o ...


  • PRS LTD Peterborough, England, United Kingdom

    Job Opportunity in Data Center · We are seeking a motivated individual to join our team at PRS Ltd, working in the data center and critical site. This is a recession-proof industry with exciting growth opportunities and comprehensive training provided. · Duties and Responsibiliti ...


  • Hamilton Barnes Associates Limited Cambridgeshire and Peterborough, England, United Kingdom

    A leading technology and engineering consultancy, Hamilton Barnes Associates Limited, specializes in product development and digital transformation across healthcare, automotive, and consumer products sectors. · Job Description · We are seeking a talented Data & AI Engineer to de ...


  • Anglian Water Services Lincoln

    Career Opportunity for Enterprise Data Engineer · The Data and Digital Services team at Anglian Water is seeking a talented Enterprise Data Engineer to join our expanding team. As a key member of our squad, you will play a crucial role in transforming the way data and digital ser ...

Jobs
>
Boston