Demo

Safeguards Analyst, Cyber Harms

Anthropic
San Francisco, CA Full Time
POSTED ON 2/7/2025
AVAILABLE BEFORE 4/6/2025

About the role

As a Safeguards Analyst focusing on Cyber Harms, you will play a critical role in protecting our platform and users from cyber security risks through consistent policy enforcement and trend analysis.

Important Context: In this position, you may be exposed to and engage with explicit content spanning a range of topics, including those of a sexual, violent, or psychologically disturbing nature. There is also an on-call responsibility across the Policy and Enforcement teams.

Responsibilities:

  • Enforce trust and safety policies with a specific focus on detecting and mitigating potential cyber security risks and harmful use of AI systems
  • Monitor and analyze platform activity to identify emerging cyber threat patterns and trends that may require policy updates or enforcement actions
  • Work with engineers to develop and iterate on safety systems that govern responsible use of our models for emerging capabilities and use cases related to cyber threats
  • Conduct thorough investigations of potential policy violations related to cyber harms, gathering and documenting evidence to support enforcement decisions, and working to escalate cases with investigations and/or Security to identify coordinated activity
  • Collaborate with the Policy team to provide feedback on policy gaps and ambiguities based on real enforcement scenarios involving cyber threats
  • Support the development and refinement of detection methods for cyber-related abuse through data analysis and pattern recognition
  • Work closely with cross-functional teams to ensure consistent application of policies across different use cases and scenarios
  • Maintain detailed documentation of investigation findings and enforcement actions
  • Participate in regular policy reviews and provide insights from an enforcement perspective
  • Operationalize review workflows and determine prioritization of reviews
  • Handle user appeals and communications related to enforcement actions with professionalism and clarity

You may be a good fit if you have:

  • 2 years of experience in cybersecurity, or related field
  • Strong understanding of cybersecurity concepts, web security, and common attack patterns
  • Experience in offensive cybersecurity, CTFs, or penetration testing (OSCP Certification is not required, but valued)
  • Ability to utilize Python and/or other data analysis tools and interact with large databases
  • Demonstrated ability to analyze complex situations and make well-reasoned decisions under pressure
  • Strong attention to detail and ability to maintain accurate documentation
  • Excellent written and verbal communication skills
  • Ability to work independently while maintaining strong collaboration with team members
  • Bachelor's degree in Computer Science, Information Security, or related field (or equivalent practical experience)

Strong candidates may:

  • Have a deep interest in AI safety and responsible technology development
  • Have a background in ethical hacking/pen-testing/malware analysis
  • Can balance competing priorities and handle time-sensitive issues effectively
  • Are comfortable working in ambiguous situations and can make sound judgments based on available information
  • Demonstrate strong analytical thinking and problem-solving skills
  • Are proactive in identifying emerging threats and suggesting improvements to existing processes
  • Have experience with or interest in content moderation and policy enforcement at scale
  • Can effectively communicate technical concepts to both technical and non-technical stakeholders



If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Safeguards Analyst, Cyber Harms?

Sign up to receive alerts about other jobs on the Safeguards Analyst, Cyber Harms career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$46,059 - $58,088
Income Estimation: 
$60,511 - $79,447
Income Estimation: 
$87,093 - $107,335
Income Estimation: 
$111,725 - $147,313
Income Estimation: 
$112,673 - $137,290
Income Estimation: 
$140,233 - $181,029
Income Estimation: 
$161,209 - $233,553
Income Estimation: 
$112,673 - $137,290
Income Estimation: 
$139,945 - $168,577
Income Estimation: 
$140,233 - $181,029
Income Estimation: 
$161,209 - $233,553
Income Estimation: 
$139,945 - $168,577
Income Estimation: 
$164,835 - $201,088
Income Estimation: 
$135,994 - $168,063
Income Estimation: 
$161,209 - $233,553
Income Estimation: 
$70,462 - $84,818
Income Estimation: 
$77,991 - $108,747
Income Estimation: 
$87,093 - $107,335
Income Estimation: 
$140,233 - $181,029
Income Estimation: 
$161,209 - $233,553
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Anthropic

Anthropic
Hired Organization Address New York, NY Full Time
Anthropic Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable ...
Anthropic
Hired Organization Address New York, NY Full Time
About Anthropic Anthropic is an AI safety and research company that’s working to build reliable, interpretable, and stee...
Anthropic
Hired Organization Address San Francisco, CA Full Time
About the role As an Economist at Anthropic, you'll lead our work to measure and understand AI's effects on the global e...
Anthropic
Hired Organization Address San Francisco, CA Full Time
About the role: We are looking for backend / platform software engineers to join our Product Foundations org. We build f...

Not the job you're looking for? Here are some other Safeguards Analyst, Cyber Harms jobs in the San Francisco, CA area that may be a better fit.

Safeguards Senior Analyst, Bio Harms

Anthropic, San Francisco, CA

AI Assistant is available now!

Feel free to start your new journey!