JobMesh

Research Engineer/Research Scientist - Red Team (Alignment)

AI Security Institute · London, England, GB

About the AI Security Institute The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and transl...

Job description

About the AI Security Institute The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We’re in the heart of the UK government with direct lines to No. 10 (the Prime Minister's office), and we work with frontier developers and governments globally. We’re here because governments are critical for advanced AI going well, and UK AISI is uniquely positioned to mobilise them. With our resources, unique agility and international influence, this is the best place to shape both AI development and government action. Team Description: Risks from misaligned AI systems will grow in importance as AI systems become more capable, autonomous, and integrated into society. Understanding these risks and stress-testing mitigations is hence crucial to ensuring advanced AI systems are developed and deployed safely and beneficially in the future. The Alignment Red Team is a specialised sub-team within AISI's wider Red Team focused on detecting and evaluating misalignment in frontier AI systems. We perform novel research to develop techniques for finding misalignment, and pre- and post-deployment evalua...