JobMesh

Agentic RL Researcher – Distributed Computing

Huawei Technologies Canada Co., Ltd. · Markham, Ontario, CA

Huawei Canada has an immediate permanent opening for a Researcher. About the team: The Distributed Data Storage and Management Lab leads research in distribu...

Job description

Huawei Canada has an immediate permanent opening for a Researcher. About the team: The Distributed Data Storage and Management Lab leads research in distributed data systems, aiming to develop next-generation cloud serverless products that encompass core infrastructure and databases. This lab addresses various data challenges, including cloud-native disaggregated databases, pay-by-query user models, and optimizing low-level data transfers via RDMA. Teams within this lab create advanced cloud serverless data infrastructure and implement cutting-edge networking technologies for Huawei's global AI infrastructure. About the job: Design and develop advanced Agentic Reinforcement Learning (RL) and Multi-Agent Reinforcement Learning (MARL) algorithms for cooperative, competitive, and mixed-agent environments, including CTDE, decentralized learning, and hierarchical agent systems. Build scalable simulation and training platforms for large-scale agent systems, supporting self-play, population-based training, curriculum learning, and emergent behavior analysis. Optimize multi-agent learning performance on distributed compute clusters, improving sample efficiency, credit assignment, agent coo...