JobMesh

Safeguards Enforcement Lead, Frontier Abuse Enforcement

Anthropic · New York City, New York, US

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and fo...

Job description

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: Anthropic is an AI safety and research company working to build reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our customers and society as a whole. As a Safeguards Enforcement Lead focusing on frontier model abuse, you will serve as the dedicated program owner for one of the most consequential and fast-moving abuse vectors on our platform: actors who misuse Anthropic's models to train competing AI systems in violation of our usage policies and terms of service. Safety is core to our mission, and you'll own how we identify, investigate, and act against this category of harm — from developing the detection playbook to seeing enforcement cases through to resolution. Important Context: In this position, you may be exposed to and engage with explicit content spanning a range of topics, includin...