Demo

Research Scientist/Engineer, Alignment Finetuning

Anthropic
San Francisco, CA Full Time
POSTED ON 2/7/2025
AVAILABLE BEFORE 4/6/2025

About the role:

As a Research Scientist/Engineer on the Alignment Finetuning team at Anthropic, you'll lead the development and implementation of techniques aimed at training language models that are more aligned with human values: that demonstrate better moral reasoning, improved honesty, and good character. You'll work to develop novel finetuning techniques and to use these to demonstrably improve model behavior.

Responsibilities:

  • Develop and implement novel finetuning techniques using synthetic data generation and advanced training pipelines
  • Use these to train models to have better alignment properties including honesty, character, and harmlessness
  • Create and maintain evaluation frameworks to measure alignment properties in models
  • Collaborate across teams to integrate alignment improvements into production models
  • Develop processes to help automate and scale the work of the team

You may be a good fit if you:

  • Have an MS/PhD in Computer Science, ML, or related field, or equivalent experience
  • Possess strong programming skills, especially in Python
  • Have experience with ML model training and experimentation
  • Have a track record of implementing ML research
  • Demonstrate strong analytical skills for interpreting experimental results
  • Have experience with ML metrics and evaluation frameworks
  • Excel at turning research ideas into working code
  • Can identify and resolve practical implementation challenges

Strong candidates may also have:

  • Experience with language model finetuning
  • Background in AI alignment research
  • Published work in ML or alignment
  • Experience with synthetic data generation
  • Familiarity with techniques like RLHF, constitutional AI, and reward modeling
  • Track record of designing and implementing novel training approaches
  • Experience with model behavior evaluation and improvement

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Research Scientist/Engineer, Alignment Finetuning?

Sign up to receive alerts about other jobs on the Research Scientist/Engineer, Alignment Finetuning career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$102,775 - $137,396
Income Estimation: 
$153,127 - $203,425
Income Estimation: 
$139,626 - $193,276
Income Estimation: 
$164,650 - $211,440
Income Estimation: 
$130,030 - $173,363
Income Estimation: 
$82,813 - $108,410
Income Estimation: 
$120,989 - $162,093
Income Estimation: 
$74,806 - $91,633
Income Estimation: 
$71,928 - $87,026
Income Estimation: 
$145,337 - $174,569
Income Estimation: 
$70,310 - $88,223
Income Estimation: 
$88,950 - $110,401
Income Estimation: 
$84,958 - $111,603
Income Estimation: 
$88,950 - $110,401
Income Estimation: 
$109,186 - $139,009
Income Estimation: 
$115,336 - $159,446
Income Estimation: 
$102,775 - $137,396
Income Estimation: 
$153,127 - $203,425
Income Estimation: 
$139,626 - $193,276
Income Estimation: 
$164,650 - $211,440
Income Estimation: 
$130,030 - $173,363
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Anthropic

Anthropic
Hired Organization Address San Francisco, CA Full Time
About the role As a Startup Account Manager at Anthropic, you'll drive expansion and retention of our fastest-growing st...
Anthropic
Hired Organization Address San Francisco, CA Full Time
About the role As a Enterprise Technical Success Manager at Anthropic, you will be a strategic partner and the go-to tec...
Anthropic
Hired Organization Address San Francisco, CA Full Time
About the Role As the Product Manager for Core Product at Claude.ai, you will shape the evolution of Claude from an AI a...
Anthropic
Hired Organization Address San Francisco, CA Full Time
About the role As a Research Engineer in Researcher Productivity, you'll design and build critical infrastructure that e...

Not the job you're looking for? Here are some other Research Scientist/Engineer, Alignment Finetuning jobs in the San Francisco, CA area that may be a better fit.

AI Assistant is available now!

Feel free to start your new journey!