
ML Performance Engineer

Oumi
Palo Alto, CA Full Time
POSTED ON 2/24/2025
AVAILABLE BEFORE 3/22/2025
About Oumi

Why we exist: Oumi is on a mission to make frontier AI truly open for all. We are founded on the belief that AI will have a transformative impact on humanity, and that developing it collectively, in the open, is the best path forward to ensure that it is done efficiently and safely.

What we do: Oumi provides an all-in-one platform for building state-of-the-art AI models end to end, from data preparation to production deployment, empowering innovators to build cutting-edge models at any scale. Oumi also develops open foundation models in collaboration with academic partners and the open community.

Our Approach: Oumi is fundamentally an open-source-first company, with open collaboration across the community as a core principle. Our work is:

  • Open Source First: Our platform and core technology are fully open source
  • Research-driven: We conduct and publish original research in AI, working with our community of academic research labs and other collaborators
  • Community-powered: We believe in the power of open collaboration and welcome contributions from researchers and developers worldwide

Role Overview

The ML Performance Engineer will be an integral part of Oumi's research team, focusing on optimizing and accelerating AI model training and inference. This role involves developing efficient CUDA/Triton kernels, contributing to open-source projects, and collaborating with researchers and engineers to improve model performance. Engineers at Oumi work on several aspects of model acceleration, including kernel optimization, memory management, and performance profiling.

What you’ll bring:

  • ML Performance: Demonstrated experience optimizing models and training and inference pipelines, plus familiarity with profiling tools (Nsight, nvprof)
  • Programming Skills: Strong programming skills in one of Python, C, or Rust
  • Systems Knowledge: Familiarity with low-level operating system foundations, PyTorch internals, and GPU architectures is highly desirable
  • ML Expertise: Deep understanding of machine learning and deep learning concepts, with specific knowledge of large language models (LLMs).
  • Open Source: Familiarity with open-source projects and a passion for contributing to the open-source community.
  • Values: Share Oumi's values: Beneficial for all, Customer-obsessed, Radical Ownership, Exceptional Teammates, Science-grounded.

Benefits

  • Competitive salary: $120,000 - $220,000
  • Equity in a high-growth startup
  • Comprehensive health, dental and vision insurance
  • 21 days PTO
  • Regular team offsites and events

Compensation Range: $140K - $220K





