Member of Technical Staff -Member of Technical Staff - Pretraining Text Data
Microsoft · London, England, GB
Overview We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large langu...
Job description
Overview We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you. In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to: Develop novel data collection strategies Improve dataset quality and integrity: Understand data-driven model behaviors: Train models to understand the impact of data and data mixes Align datasets with ethical and societal values This is a cross-disciplinary, high-impact role ideal for engineers and researchers who want to push the boundaries of what AI can learn from data. Microsoft Superintelligence Team: Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day...