Research Scientist/Engineer (Evaluations)
Apollo Research · London, England, GB
Application deadline: We are conducting interviews actively and aim to fill this role as soon as we find someone suitable. ABOUT THE OPPORTUNITY We develop a...
Job description
Application deadline: We are conducting interviews actively and aim to fill this role as soon as we find someone suitable. ABOUT THE OPPORTUNITY: We develop and run evaluations that help assess the risks posed by scheming AIs. You will get to work with frontier labs like OpenAI, Anthropic, and Google DeepMind and be amongst the first to interact with new models before anyone else. The ideal candidate loves rigorously testing frontier AI models, and enjoys building efficient pipelines and automating them. YOU WILL HAVE THE OPPORTUNITY TO: - Run pre-deployment evaluation campaigns on the most capable AI systems in the world. We partner with multiple labs, giving you access to a breadth of models that no single AI lab could offer. You'll be among the first people to interact with new models before anyone else. - Deep dive into AI cognition. Scan through thousands of model transcripts to surface behavioral patterns that no one has ever observed before. These patterns are often deeply surprising and fascinating to study, e.g. the non-standard language and the reward-seeking reasoning described in our anti-scheming paper. - Build new evaluations for frontier risks , from designing novel...