JobMesh

AI Engineer - Reinforcement Learning

Blue Yonder · Paris, Île-De-France, FR

About the AI Studio The AI Studio's mission is to find the fastest possible path to an autonomous supply chain. We're developing AI agents, learning systems,...

Job description

About the AI Studio The AI Studio's mission is to find the fastest possible path to an autonomous supply chain. We're developing AI agents, learning systems, training models, and more to overcome the biggest challenges remaining in the global supply chain. In short, we are having a lot of fun. Your mission in this role: We're looking for an ambitious AI Engineer specialising in Reinforcement Learning to work on environments, evaluations, data pipelines, and tooling for robust training systems. You'll help shape how we approach reward modeling, environment design, and agent training. If you're energised by pushing the boundaries of what’s possible, this is your chance. Responsibilities: - Design and implement RL environments for supply chain decision-making - Develop reward functions that capture what "good" looks like for our agents - Create evaluation frameworks to measure agent performance and catch failure modes - Build data pipelines for training and human feedback collection - Document what works (and what doesn't) so we can compound our learnings - Stay on top of industry trends and cutting edge use cases We want to talk if you: - You've trained or fine-tuned LLMs - Are excit...