Demo

Research Scientist, Reinforcement Learning (Training)

OpenAI
OpenAI Salary
San Francisco, CA Full Time
POSTED ON 2/27/2025
AVAILABLE BEFORE 5/25/2025

About the Team

The Training Core Algorithms / Reinforcement Learning team is responsible for researching and developing the next generation of algorithms to power our RLHF stack (reinforcement learning from human feedback). The algorithms we develop are used in ChatGPT consumer product and the OpenAI API.

About the Role

As a Member of Technical Staff on our team, you will research and develop improvements to all components of our RLHF stack, including data collection, supervised finetuning, reward modeling, off- and on-policy learning, active learning, and evaluations. The ultimate test for our algorithms is how useful they are to our users, and we often deploy our algorithms into new ChatGPT models.

We're looking for people who have extensive background in reinforcement learning research, are able to iterate quickly, and are proficient at coding.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will :

  • Come up with improvements to RLHF
  • Prototype and evaluate these ideas
  • Scale up your innovations to ChatGPT scale

You might thrive in this role if you :

  • Love being on the cutting edge of RL and language model research
  • Can iterate fast on lots of ideas
  • Like doing research that has real-world impact
  • About OpenAI

    OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

    We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

    OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

    For US Based Candidates : Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

    We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

    OpenAI Global Applicant Privacy Policy

    At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Research Scientist, Reinforcement Learning (Training)?

    Sign up to receive alerts about other jobs on the Research Scientist, Reinforcement Learning (Training) career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $102,775 - $137,396
    Income Estimation: 
    $153,127 - $203,425
    Income Estimation: 
    $139,626 - $193,276
    Income Estimation: 
    $164,650 - $211,440
    Income Estimation: 
    $130,030 - $173,363
    Income Estimation: 
    $130,030 - $173,363
    Income Estimation: 
    $194,895 - $259,743
    Income Estimation: 
    $192,057 - $260,440
    Income Estimation: 
    $249,515 - $311,938
    Income Estimation: 
    $155,477 - $213,492
    Income Estimation: 
    $68,606 - $89,684
    Income Estimation: 
    $88,975 - $120,741
    Income Estimation: 
    $68,121 - $81,836
    Income Estimation: 
    $71,928 - $87,026
    Income Estimation: 
    $125,958 - $157,570
    Income Estimation: 
    $82,813 - $108,410
    Income Estimation: 
    $120,989 - $162,093
    Income Estimation: 
    $74,806 - $91,633
    Income Estimation: 
    $71,928 - $87,026
    Income Estimation: 
    $145,337 - $174,569
    Income Estimation: 
    $102,775 - $137,396
    Income Estimation: 
    $153,127 - $203,425
    Income Estimation: 
    $139,626 - $193,276
    Income Estimation: 
    $164,650 - $211,440
    Income Estimation: 
    $130,030 - $173,363
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at OpenAI

    OpenAI
    Hired Organization Address Washington, DC Full Time
    About the Team Join the engineering teams that bring OpenAI's ideas safely to the world! The Applied Engineering team wo...
    OpenAI
    Hired Organization Address Washington, DC Full Time
    About the Team Join the engineering teams that bring OpenAI's ideas safely to the world!! The Applied Engineering team w...
    OpenAI
    Hired Organization Address New York, NY Full Time
    About the Team The Corporate Security team at OpenAI is dedicated to ensuring the safety and security of our people and ...
    OpenAI
    Hired Organization Address San Francisco, CA Full Time
    About the Team Our team brings OpenAI's most capable technology to the world through our products. Most recently, we rel...

    Not the job you're looking for? Here are some other Research Scientist, Reinforcement Learning (Training) jobs in the San Francisco, CA area that may be a better fit.

    AI Engineer, Reinforcement Learning

    Mytra, South San Francisco, CA

    AI Assistant is available now!

    Feel free to start your new journey!