Demo

Senior Research engineer - Multimodal Language Models

Luma AI
Palo Alto, CA Full Time
POSTED ON 3/30/2025
AVAILABLE BEFORE 5/29/2025

Luma’s mission is to build multimodal AI to expand human imagination and capabilities.

We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will come from vision. So, we are working on training and scaling up multimodal foundation models for systems that can see and understand, show and explain, and eventually interact with our world to effect change.


We are looking for engineers with significant experience solving hard problems in PyTorch, multimodal data, and distributed systems. You will work as a team to end-to-end build cutting edge multimodal language models with strong emphasis on audio and visual data. Your contributions will be pivotal in shaping various research projects and product roadmaps.

\n


Responsibilities
  • Design and develop large-scale annotation efforts for model post-training.
  • Build tools to evaluate and benchmark multimodal language models.
  • Develop large-scale AI training and inference methods.
  • Ensure efficient implementation of models & systems for data processing and training.
  • Build tools to visualize, evaluate and filter datasets.
  • Collaborate with research and engineering teams across Luma to transfer research to products and services.
  • Implement cutting-edge product prototypes based on multimodal generative AI.


Experience
  • Expertise in Python & Pytorch, including practical experience working with the full development pipeline from data processing, preparation & data loading to training and inference.
  • Experience processing large-scale text data, or (bonus) interleaved data spanning audio, video, image, and/or text.
  • Hands-on experience in developing or benchmarking at least one of the following topics: LLMs, Vision Language Models, Audio Language Models, generative video models.

Good to have
  • Experience in design and development of annotation tools
  • Experience in synthetic data


Compensation
  • The pay range for this position in California is $200,000 - $300,000/yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan. 


\n
$200,000 - $300,000 a year
The pay range for this position in California is $200,000 - $300,000yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan.
\n

Your application is reviewed by real people.

Salary : $200,000 - $300,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Senior Research engineer - Multimodal Language Models?

Sign up to receive alerts about other jobs on the Senior Research engineer - Multimodal Language Models career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$131,953 - $159,624
Income Estimation: 
$169,825 - $204,021
Income Estimation: 
$166,631 - $195,636
Income Estimation: 
$162,237 - $199,353
Income Estimation: 
$181,083 - $218,117
Income Estimation: 
$103,228 - $139,671
Income Estimation: 
$116,726 - $151,072
Income Estimation: 
$124,724 - $161,246
Income Estimation: 
$124,724 - $161,246
Income Estimation: 
$147,901 - $186,323
Income Estimation: 
$147,901 - $186,323
Income Estimation: 
$170,841 - $219,163
Income Estimation: 
$81,769 - $104,543
Income Estimation: 
$89,551 - $118,439
Income Estimation: 
$103,228 - $139,671
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Luma AI

Luma AI
Hired Organization Address Palo Alto, CA Full Time
Luma AI's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality ...
Luma AI
Hired Organization Address Stanford, CA Full Time
Luma is looking for an engineer to lead our AI Agent workstream. Luma's future product uses Agents as a Creative Partner...
Luma AI
Hired Organization Address Palo Alto, CA Full Time
We are looking for our first Data Scientist. You are a highly motivated individual contributor. You will define a data-d...
Luma AI
Hired Organization Address Stanford, CA Full Time
Luma is looking for a Technical Artist to join our Applied team. Luma's Applied team takes our underlying foundation mod...

Not the job you're looking for? Here are some other Senior Research engineer - Multimodal Language Models jobs in the Palo Alto, CA area that may be a better fit.

AI Assistant is available now!

Feel free to start your new journey!