Safeguards Policy Analyst

Anthropic Pbc•San Francisco, CA

302d•$170,000 - $200,000

About The Position

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. As a Safeguards Policy Analyst, you will be responsible for building and executing enforcement workflows for our products and services, with a focus on detecting and mitigating potential harmful use. In this role, you will have the unique opportunity to function both as a policy owner and develop the enforcement strategy for a suite of policies. As a member of the user Integrity and Authenticity team, your initial focus will be on improving on the current policies and expanding integrity and authenticity enforcement workflows. This role may later expand to include broader areas and methods of harm reduction. Safety is core to our mission and you'll help shape policy enforcement so that our users can safely interact with and build on top of our products in a harmless, helpful and honest way. Important context for this role: In this position you may be exposed to and engage with explicit content spanning a range of topics, including those of a sexual, violent, or psychologically disturbing nature.

Requirements

Experience establishing and scaling policy enforcement, and review workflows
Written and improved policies for tech products and platforms
Excellent written and verbal communication skills, with the ability to explain complex policy topics to various audiences
Used SQL and/or other data analysis tools to draw insights from large datasets
Identified emerging risks and threat actors, and provided feedback to a diverse sets of stakeholders, such as Product, Policy, Engineering, and Legal teams
Worked with generative AI products, including writing effective prompts for content review and enforcement
Navigated and thrived in a fast-paced and dynamic environment
An understanding of the challenges that exist in implementing product policies at scale, including in the content moderation space
Maintained strong collaboration with team members while navigating rapidly evolving priorities and workstreams
Experience as a trust & safety professional or subject matter expert working in one or more of the following focus areas: elections, influence operations, or fraud and abuse

Responsibilities

Design and architect automated enforcement systems and review workflows that scale effectively while maintaining high accuracy
Partner with Product, Engineering, and Data Science teams to optimize detection models for policy violations and automated enforcement systems
Review flagged content to drive enforcement and policy improvements
Work with external experts to gather feedback on policy, product interventions, and harm mitigations
Enforce usage policies with a focus on detecting and mitigating potential harmful use of AI systems
Support the Safeguards policy design team by providing detailed feedback on policy gaps based on real enforcement scenarios
Keep up to date with emerging AI policy enforcement best practices, and use these to inform our decision-making and workflows

Benefits

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Lovely office space in which to collaborate with colleagues

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Career Level

Mid Level

Education Level

Bachelor's degree

Safeguards Policy Analyst

About The Position

Requirements

Responsibilities

Benefits

What This Job Offers

Job Search Resources

Tools

Career Hubs

Guides

Company