Machine Learning Scientist — Agentic data pipelines
Iambic Therapeutics · Boston, Massachusetts, US
Job description
Job Summary
We are seeking a scientist to join our team at Iambic Therapeutics, working on data acquisition and curation for Enchant, our multimodal transformer model trained at scale on a wide variety of biomedical data. In this role, you will design and build agentic systems that acquire, clean, format, and quality-control the large-scale datasets that power Enchant training. You will work at the intersection of LLM-based automation and biomedical data engineering, developing AI agents that can navigate heterogeneous data sources, enforce quality standards, and operate reliably at scale. This role is ideal for candidates who combine strong software engineering instincts with scientific understanding of biomedical data, and who are excited about using LLMs as tools to solve practical data problems.

Key Responsibilities:
Design, build, and maintain agentic systems for automated data acquisition from public and proprietary biomedical data sources
Develop LLM-based pipelines for data cleaning, normalization, and formatting across diverse data modalities (e.g., molecular, genomic, clinical, literature)
Implement automated quality-control workflows that detect anomalies, flag inconsist...