Demo

Research Scientist - Multimodal Language Models

Luma AI
Stanford, CA Full Time
POSTED ON 3/4/2025
AVAILABLE BEFORE 6/4/2025

Luma's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will come from vision and audio. So, we are working on training and scaling up multimodal foundation models for systems that can see, hear and understand, show and explain, and eventually interact with our world to effect change.

We are looking for researchers with significant experience solving hard problems in multimodal language models. You will work end-to-end on cutting edge multimodal language models with strong emphasis on audio and visual data. Your contributions will be pivotal in shaping various research projects and product roadmaps.

Responsibilities

  • Design and implement novel AI algorithms and architectures for multimodal language models.
  • Build tools to evaluate and benchmark multimodal language models.
  • Develop large-scale AI training and inference methods.
  • Ensure efficient implementation of models & systems for data processing and training.
  • Build tools to analyze and process multimodal data.
  • Collaborate with research and engineering teams across Luma to transfer research to products and services.
  • Implement cutting-edge product prototypes based on multimodal generative AI.

Experience

  • Expertise in Python & Pytorch, including practical experience working with the full development pipeline from data processing & data loading to training, inference, and optimization.
  • Experience working with large-scale text data, or (bonus) interleaved data spanning audio, video, image, and / or text.
  • Hands-on experience in developing or benchmarking at least one of the following topics : LLMs, Vision Language Models, Audio Language Models, generative video models .
  • Compensation

  • The pay range for this position in California is $200,000 - $300,000yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan.
  • 200,000 - $300,000 a year

    The pay range for this position in California is $200,000 - $300,000yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan.

    Your application is reviewed by real people.

    Salary : $200,000 - $300,000

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Research Scientist - Multimodal Language Models?

    Sign up to receive alerts about other jobs on the Research Scientist - Multimodal Language Models career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $108,245 - $136,486
    Income Estimation: 
    $136,683 - $171,343
    Income Estimation: 
    $82,813 - $108,410
    Income Estimation: 
    $120,989 - $162,093
    Income Estimation: 
    $74,806 - $91,633
    Income Estimation: 
    $71,928 - $87,026
    Income Estimation: 
    $145,337 - $174,569
    Income Estimation: 
    $102,775 - $137,396
    Income Estimation: 
    $153,127 - $203,425
    Income Estimation: 
    $139,626 - $193,276
    Income Estimation: 
    $164,650 - $211,440
    Income Estimation: 
    $130,030 - $173,363
    Income Estimation: 
    $151,423 - $191,781
    Income Estimation: 
    $224,177 - $300,651
    Income Estimation: 
    $213,290 - $266,052
    Income Estimation: 
    $225,010 - $318,974
    Income Estimation: 
    $182,205 - $244,055
    Income Estimation: 
    $68,606 - $89,684
    Income Estimation: 
    $88,975 - $120,741
    Income Estimation: 
    $68,121 - $81,836
    Income Estimation: 
    $71,928 - $87,026
    Income Estimation: 
    $125,958 - $157,570
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Luma AI

    Luma AI
    Hired Organization Address Palo Alto, CA Full Time
    Luma AI's mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality ...
    Luma AI
    Hired Organization Address Stanford, CA Full Time
    Luma is looking for an engineer to lead our AI Agent workstream. Luma's future product uses Agents as a Creative Partner...
    Luma AI
    Hired Organization Address Palo Alto, CA Full Time
    We are looking for our first Data Scientist. You are a highly motivated individual contributor. You will define a data-d...
    Luma AI
    Hired Organization Address Palo Alto, CA Full Time
    Luma’s mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is ...

    Not the job you're looking for? Here are some other Research Scientist - Multimodal Language Models jobs in the Stanford, CA area that may be a better fit.

    AI Assistant is available now!

    Feel free to start your new journey!