Demo

Senior Research Engineer- Performance Optimization

Luma AI
Palo Alto, CA Full Time
POSTED ON 1/20/2025
AVAILABLE BEFORE 4/17/2025

We are looking for engineers with significant problem-solving experience in PyTorch, CUDA, and distributed systems. You will work with Research Scientists to build & train cutting-edge foundation models on thousands of GPUs.

Responsibilities

  • Ensure efficient implementation of models & systems for data processing, training, inference, and deployment.
  • Identify and implement optimization techniques for massively parallel and distributed systems.
  • Identify and remedy efficiency bottlenecks (memory, speed, utilization) by profiling and implementing high-performance CUDA, Triton, C , and PyTorch code.
  • Work closely together with the research team to ensure systems are planned to be as efficient as possible from start to finish.
  • Build tools to visualize, evaluate, and filter datasets.
  • Implement cutting-edge product prototypes based on multimodal generative AI.

Experience

  • Experience training large models using Python & PyTorch, including practical experience working with the entire development pipeline from data processing, preparation & data loading to training and inference.
  • Experience optimizing and deploying inference workloads for throughput and latency across the stack (inputs, model inference, outputs, parallel processing, etc.).
  • Experience with profiling CPU & GPU code in PyTorch, including Nvidia Nsight or similar.
  • Experience writing & improving highly parallel & distributed PyTorch code, with familiarity in DDP, FSDP, Tensor Parallel, etc.
  • Experience writing high-performance parallel C . Bonus if done within an ML context with PyTorch, like for data loading, data processing, inference code.
  • Experience with high-performance Triton / CUDA and writing custom PyTorch kernels. Top candidates will be able to utilize tensor cores; optimize performance with CUDA memory and other similar skills.
  • Good to have experience working with Deep learning concepts such as Transformers & Multimodal Generative models such as Diffusion Models and GANs.
  • Good to have experience building inference / demo prototype code (incl. Gradio, Docker, etc.).
  • Please note this role is not meant for recent grads.
  • 175,000 - $250,000 a year

    In addition to cash base pay, you'll also receive a sizable grant of Luma's equity.

    The pay range for this position is for Bay Area. Base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience.

    Your applications are reviewed by real people.

    J-18808-Ljbffr

    Salary : $175,000 - $250,000

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Senior Research Engineer- Performance Optimization?

    Sign up to receive alerts about other jobs on the Senior Research Engineer- Performance Optimization career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $131,953 - $159,624
    Income Estimation: 
    $169,825 - $204,021
    Income Estimation: 
    $166,631 - $195,636
    Income Estimation: 
    $162,237 - $199,353
    Income Estimation: 
    $181,083 - $218,117
    Income Estimation: 
    $131,953 - $159,624
    Income Estimation: 
    $169,825 - $204,021
    Income Estimation: 
    $166,631 - $195,636
    Income Estimation: 
    $162,237 - $199,353
    Income Estimation: 
    $181,083 - $218,117
    Income Estimation: 
    $113,077 - $147,784
    Income Estimation: 
    $135,356 - $164,911
    Income Estimation: 
    $153,902 - $198,246
    Income Estimation: 
    $85,996 - $102,718
    Income Estimation: 
    $111,859 - $131,446
    Income Estimation: 
    $110,457 - $133,106
    Income Estimation: 
    $105,809 - $128,724
    Income Estimation: 
    $122,763 - $145,698
    Income Estimation: 
    $105,809 - $128,724
    Income Estimation: 
    $136,611 - $163,397
    Income Estimation: 
    $135,163 - $163,519
    Income Estimation: 
    $131,953 - $159,624
    Income Estimation: 
    $150,859 - $181,127
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Luma AI

    Luma AI
    Hired Organization Address Stanford, CA Full Time
    We are looking for people with strong product sense and a great taste for design. This role involves building performant...
    Luma AI
    Hired Organization Address Stanford, CA Full Time
    We are looking for people with strong product sense and a great taste for design. This role involves inventing new kinds...
    Luma AI
    Hired Organization Address Palo Alto, CA Full Time
    Luma is looking for a Technical Artist to join our Applied team. Luma’s Applied team takes our underlying foundation mod...
    Luma AI
    Hired Organization Address Stanford, CA Full Time
    The SRE role at Luma AI sits with the Infrastructure and Research teams and is responsible for our GPU clusters. Luma ru...

    Not the job you're looking for? Here are some other Senior Research Engineer- Performance Optimization jobs in the Palo Alto, CA area that may be a better fit.

    AI Assistant is available now!

    Feel free to start your new journey!