Senior AI Compiler Engineer, MLIR
NVIDIA · Santa Clara, California, US
NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing.
Job description
NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”. NVIDIA is hiring a Senior AI Compiler Engineer. GPUs are driving rapid progress in deep learning—from LLMs and generative AI to recommendation, vision, and speech. On this team, you’ll build an MLIR-based AI compiler that powers NVIDIA’s inference engine end to end, with a focus on performance, fast builds, low memory use, and Ahead-of-Time and Just-in-Time usability across data center and edge. What you’ll be doing: Develop MLIR-based graph representations and optimizations for future GPU architectures. Partner with framework and hardware teams to enable new model patterns and upcoming GPU architectural features. Define APIs and MLIR dialects, conduct performance optimizations and analysis, implement compiler optimizations and kernel generation for neural networks, and...