Demo

Research Scientist, Frontier Red Team (Autonomy)

Anthropic
San Francisco, CA Full Time
POSTED ON 1/19/2025
AVAILABLE BEFORE 3/19/2025

We are looking for Research Scientists to develop and productionize advanced autonomy evaluations on our Frontier Red Team. Our goal is to develop and implement a gold standard of advanced autonomy evals to determine the AI Safety Level (ASL) of our models. This will have major implications for the way we train, deploy, and secure our models, as detailed in our Responsible Scaling Policy (RSP)

We believe that developing autonomy evals is one of the best ways to study increasingly capable and agentic models. If you’ve thought particularly hard about how models might be agentic and associated risks, and you’ve built an eval or experiment around it, we’d like to meet you.

Please note:

  • We will be prioritizing candidates who can start ASAP and can be based in either our San Francisco or London office.
  • We’re still iterating on the structure of our team. It is possible that this role might end up being the people manager of a few other individual contributors (ICs). If you would be interested in people management, you may express interest in the application.

Responsibilities:

  • Lead the end-to-end development of autonomy evals and research. This starts with risk and capability modeling, and includes designing, implementing, and regularly running these evals.
  • Quickly iterate on experiments  to evaluate autonomous capabilities and forecast future capabilities.
  • Provide technical leadership to Research Engineers to scope build scalable and secure infrastructure to quickly run large-scale experiments.
  • Communicate the outcomes of the evaluations to relevant Anthropic teams, as well as policy stakeholders and research collaborators, where relevant.
  • Collaborate with other projects on the Frontier Red Team, Alignment, and beyond to improve infrastructure and design safety techniques for autonomous capabilities.

You may be a good fit if you:

  • Have an ML background and experience leading experimental research on LLMs/multimodal models and/or agents
  • Have strong Python-based engineering skills
  • Are driven to find solutions to ambiguously scoped problems
  • Design and run experiments and iterate quickly to solve machine learning problems
  • Thrive in a collaborative environment (we love pair programming)
  • Have experience training, working with, and prompting models

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Research Scientist, Frontier Red Team (Autonomy)?

Sign up to receive alerts about other jobs on the Research Scientist, Frontier Red Team (Autonomy) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$102,775 - $137,396
Income Estimation: 
$153,127 - $203,425
Income Estimation: 
$139,626 - $193,276
Income Estimation: 
$164,650 - $211,440
Income Estimation: 
$130,030 - $173,363
Income Estimation: 
$103,625 - $127,928
Income Estimation: 
$88,975 - $120,741
Income Estimation: 
$68,121 - $81,836
Income Estimation: 
$71,928 - $87,026
Income Estimation: 
$125,958 - $157,570
Income Estimation: 
$151,423 - $191,781
Income Estimation: 
$224,177 - $300,651
Income Estimation: 
$213,290 - $266,052
Income Estimation: 
$225,010 - $318,974
Income Estimation: 
$182,205 - $244,055
Income Estimation: 
$125,958 - $157,570
Income Estimation: 
$120,989 - $162,093
Income Estimation: 
$74,806 - $91,633
Income Estimation: 
$71,928 - $87,026
Income Estimation: 
$145,337 - $174,569
Income Estimation: 
$145,337 - $174,569
Income Estimation: 
$153,127 - $203,425
Income Estimation: 
$139,626 - $193,276
Income Estimation: 
$164,650 - $211,440
Income Estimation: 
$130,030 - $173,363
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Anthropic

Anthropic
Hired Organization Address San Francisco, CA Full Time
About the role As an Enterprise Account Executive at Anthropic, you’ll drive adoption of safe, frontier AI by securing s...
Anthropic
Hired Organization Address San Francisco, CA Full Time
About the role As a member of our Strategic Product Management (SPM) team at Anthropic, you’ll own and lead strategic in...
Anthropic
Hired Organization Address San Francisco, CA Full Time
About the role We are seeking an experienced backend software engineer to join Anthropic’s Cloud Platform team. You will...
Anthropic
Hired Organization Address San Francisco, CA Full Time
About the Role: Anthropic is seeking an experienced recruiting leader to scale our G&A, Communications, and Policy recru...

Not the job you're looking for? Here are some other Research Scientist, Frontier Red Team (Autonomy) jobs in the San Francisco, CA area that may be a better fit.

Research Scientist, Frontier Red Team (Cyber)

Menlo Ventures, San Francisco, CA

AI Assistant is available now!

Feel free to start your new journey!