JobMesh

Research Engineer, Post-Training for Code Security Analysis

DeepMind · Mountain View, California, US

About Us Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learni...

Job description

About Us Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority. The Role: In this role, you'll work with a team of elite researchers and engineers to design and implement post-training strategies that enhance Gemini’s capabilities in code security analysis . You will bring contributions to our ML innovation, post-training refinement (SFT/RLHF), advanced evaluation, and data generation to ensure our models can reliably perform safe and powerful code security analysis. Key responsibilities: - Design and Implement advanced post-training algorithms (SFT, RLHF, RLAIF) to optimize Gemini for code security tasks and secure coding practices. - Diagnose and interpret training outcomes (regressions in coding ability, gains in security reasoning), and propose solutions to improve model capabilities. - Actively monitor and e...