JobMesh

Sub Team Lead - Red Team (Control)

AI Security Institute · London, England, GB

About the AI Security Institute The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and transl...

Job description

About the AI Security Institute The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We’re in the heart of the UK government with direct lines to No. 10 (the Prime Minister's office), and we work with frontier developers and governments globally. We’re here because governments are critical for advanced AI going well, and UK AISI is uniquely positioned to mobilise them. With our resources, unique agility and international influence, this is the best place to shape both AI development and government action. Team Description: Risks from misaligned AI systems will grow in importance as AI systems become more capable, autonomous, and integrated into society. AI control measures seek to detect, constrain, and/or counteract potentially misaligned AI models; we expect these measures to become increasingly important in the face of capable AI systems that may be unreliable, deceptive, or misaligned. The Control Red Team partners with leading frontier AI companies to stress-test control measures. The team uses techniques from adversarial ML to develop algorithms to find a range of failure...