What are the responsibilities and job description for the Safeguards Analyst, Cyber Harms position at Anthropic?
About the role
As a Safeguards Analyst focusing on Cyber Harms, you will play a critical role in protecting our platform and users from cyber security risks through consistent policy enforcement and trend analysis.
Important Context: In this position, you may be exposed to and engage with explicit content spanning a range of topics, including those of a sexual, violent, or psychologically disturbing nature. There is also an on-call responsibility across the Policy and Enforcement teams.
Responsibilities:
- Enforce trust and safety policies with a specific focus on detecting and mitigating potential cyber security risks and harmful use of AI systems
- Monitor and analyze platform activity to identify emerging cyber threat patterns and trends that may require policy updates or enforcement actions
- Work with engineers to develop and iterate on safety systems that govern responsible use of our models for emerging capabilities and use cases related to cyber threats
- Conduct thorough investigations of potential policy violations related to cyber harms, gathering and documenting evidence to support enforcement decisions, and working to escalate cases with investigations and/or Security to identify coordinated activity
- Collaborate with the Policy team to provide feedback on policy gaps and ambiguities based on real enforcement scenarios involving cyber threats
- Support the development and refinement of detection methods for cyber-related abuse through data analysis and pattern recognition
- Work closely with cross-functional teams to ensure consistent application of policies across different use cases and scenarios
- Maintain detailed documentation of investigation findings and enforcement actions
- Participate in regular policy reviews and provide insights from an enforcement perspective
- Operationalize review workflows and determine prioritization of reviews
- Handle user appeals and communications related to enforcement actions with professionalism and clarity
You may be a good fit if you have:
- 2 years of experience in cybersecurity, or related field
- Strong understanding of cybersecurity concepts, web security, and common attack patterns
- Experience in offensive cybersecurity, CTFs, or penetration testing (OSCP Certification is not required, but valued)
- Ability to utilize Python and/or other data analysis tools and interact with large databases
- Demonstrated ability to analyze complex situations and make well-reasoned decisions under pressure
- Strong attention to detail and ability to maintain accurate documentation
- Excellent written and verbal communication skills
- Ability to work independently while maintaining strong collaboration with team members
- Bachelor's degree in Computer Science, Information Security, or related field (or equivalent practical experience)
Strong candidates may:
- Have a deep interest in AI safety and responsible technology development
- Have a background in ethical hacking/pen-testing/malware analysis
- Can balance competing priorities and handle time-sensitive issues effectively
- Are comfortable working in ambiguous situations and can make sound judgments based on available information
- Demonstrate strong analytical thinking and problem-solving skills
- Are proactive in identifying emerging threats and suggesting improvements to existing processes
- Have experience with or interest in content moderation and policy enforcement at scale
- Can effectively communicate technical concepts to both technical and non-technical stakeholders