JobMesh

Senior Research Scientist, Post-Training LLM and DLM

NVIDIA · Santa Clara, California, US

Job description

We are now looking for a Senior Research Scientist passionate about Large Language Model (LLM) and Diffusion Language Model (DLM) post-training and system optimization. Are you excited to shape the future of large-scale generative AI? NVIDIA is at the forefront of foundation models and generative AI systems, enabling cutting-edge research and real-world deployment at unprecedented scale. Our team is dedicated to advancing post-training algorithms, building efficient large-scale systems, and developing evaluation frameworks to ensure reliability and scalability. Join us to work with world-class researchers and engineers on building the next generation of AI.

What you will be doing:

- Designing and implementing post-training algorithms for LLMs and DLMs.
- Driving efficiency and scalability improvements across training pipelines and serving systems.
- Collaborating with researchers to translate cutting-edge ideas into production-ready implementations.
- Exploring new paradigms for evaluation.
- Demonstrating strong engineering practices and contributing to open-source communities.

What we need to see:

- PhD in Computer Science, Electrical Engineering, or related field, or equivalent research experie...