Demo

Machine Learning Research Scientist / Research Engineer, Post-Training

The Rundown AI, Inc.
San Francisco, CA Full Time
POSTED ON 4/20/2025
AVAILABLE BEFORE 5/17/2025

Scale works with the industry’s leading AI labs to provide high quality data and accelerate progress in GenAI research. We are looking for Research Scientists and Research Engineers with expertise in LLM post-training (SFT, RLHF, reward modeling). This role will focus on optimizing data curation and algorithmic improvements to enhance LLM capabilities in core areas such as instruction following, factuality, coding, multilingual and multimodal understanding.In this role, you will develop novel methods to improve the alignment and generalization of large-scale generative models. You will collaborate with researchers and engineers to define best practices in data-driven AI development. You will also partner with top foundation model labs to provide both technical and strategic input on the development of the next generation of generative AI models.You will : Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in areas of instruction following, factuality, coding, multilingual and multimodal understanding.Design and experiment new approaches to preference optimization.Analyze model behavior, identify weaknesses, and propose solutions for bias mitigation and model robustness.Publish research findings in top-tier AI conferences.Ideally you’d have : Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or a related field.Deep understanding of deep learning, reinforcement learning, and large-scale model fine-tuning.Experience with post-training techniques such as RLHF, preference modeling, or instruction tuning.Excellent written and verbal communication skills.Published research in areas of machine learning at major conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, etc.) and / or journals.Previous experience in a customer facing role.#J-18808-Ljbffr

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Machine Learning Research Scientist / Research Engineer, Post-Training?

Sign up to receive alerts about other jobs on the Machine Learning Research Scientist / Research Engineer, Post-Training career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$146,673 - $180,130
Income Estimation: 
$108,245 - $136,486
Income Estimation: 
$136,683 - $171,343
Income Estimation: 
$82,813 - $108,410
Income Estimation: 
$120,989 - $162,093
Income Estimation: 
$74,806 - $91,633
Income Estimation: 
$71,928 - $87,026
Income Estimation: 
$145,337 - $174,569
Income Estimation: 
$102,775 - $137,396
Income Estimation: 
$153,127 - $203,425
Income Estimation: 
$139,626 - $193,276
Income Estimation: 
$164,650 - $211,440
Income Estimation: 
$130,030 - $173,363
Income Estimation: 
$136,683 - $171,343
Income Estimation: 
$178,466 - $212,939
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at The Rundown AI, Inc.

The Rundown AI, Inc.
Hired Organization Address New York, NY Full Time
About the Team As part of the Growth team, you'll be at the forefront of bringing OpenAI's technology to the world. This...
The Rundown AI, Inc.
Hired Organization Address San Francisco, CA Full Time
Waymo is an autonomous driving technology company with the mission to be the most trusted driver. Since its start as the...
The Rundown AI, Inc.
Hired Organization Address San Francisco, CA Full Time
About Anyscale : At Anyscale, we're on a mission to democratize distributed computing and make it accessible to software...
The Rundown AI, Inc.
Hired Organization Address San Francisco, CA Full Time
About the Team The Corporate Security team at OpenAI is dedicated to ensuring the safety and security of our people and ...

Not the job you're looking for? Here are some other Machine Learning Research Scientist / Research Engineer, Post-Training jobs in the San Francisco, CA area that may be a better fit.

AI Assistant is available now!

Feel free to start your new journey!