Safeguards Policy Analyst

Anthropic
San Francisco, CA Full Time
POSTED ON 3/16/2025
AVAILABLE BEFORE 5/15/2025

As a Safeguards Policy Analyst, you will be responsible for building and executing enforcement workflows for our products and services, with a focus on detecting and mitigating potential harmful use. In this role, you will have the unique opportunity both to serve as a policy owner and to develop the enforcement strategy for a suite of policies. As a member of the User Integrity and Authenticity team, you will initially focus on improving current policies and expanding integrity and authenticity enforcement workflows. This role may later expand to include broader areas and methods of harm reduction. Safety is core to our mission, and you’ll help shape policy enforcement so that our users can safely interact with and build on top of our products in a harmless, helpful, and honest way.

*Important context for this role: In this position you may be exposed to and engage with explicit content spanning a range of topics, including those of a sexual, violent, or psychologically disturbing nature.

Responsibilities

  • Design and architect automated enforcement systems and review workflows that scale effectively while maintaining high accuracy
  • Partner with Product, Engineering, and Data Science teams to optimize detection models for policy violations and automated enforcement systems
  • Review flagged content to drive enforcement and policy improvements
  • Work with external experts to gather feedback on policy, product interventions, and harm mitigations
  • Enforce usage policies with a focus on detecting and mitigating potential harmful use of AI systems
  • Support the Safeguards policy design team by providing detailed feedback on policy gaps based on real enforcement scenarios
  • Keep up to date with emerging AI policy enforcement best practices, and use these to inform our decision-making and workflows

You may be a good fit if you have:

  • Experience establishing and scaling policy enforcement and review workflows
  • Written and improved policies for tech products and platforms
  • Excellent written and verbal communication skills, with the ability to explain complex policy topics to various audiences
  • Used SQL and/or other data analysis tools to draw insights from large datasets
  • Identified emerging risks and threat actors, and provided feedback to a diverse set of stakeholders, such as Product, Policy, Engineering, and Legal teams
  • Worked with generative AI products, including writing effective prompts for content review and enforcement
  • Navigated and thrived in a fast-paced and dynamic environment
  • An understanding of the challenges that exist in implementing product policies at scale, including in the content moderation space
  • Maintained strong collaboration with team members while navigating rapidly evolving priorities and workstreams
  • Experience as a trust & safety professional or subject matter expert working in one or more of the following focus areas: elections, influence operations, or fraud and abuse

Deadline to apply: None. Applications will be reviewed on a rolling basis. 



