Demo

Software Engineer - GPU Kernel

Scout AI
New York, NY Full Time
POSTED ON 2/2/2025
AVAILABLE BEFORE 4/2/2025

Intro

Scout AI is a new hiring platform that connects software engineers to opportunities with world-class companies. On Scout, you get a more relevant and growthful interviewing experience, you receive feedback on your performance, and you also get end-to-end support to improve your chances of getting hired.

If you perform well on the Scout interview, you become eligible for opportunities with all companies in the Scout network (only complete the interview once).

This role is with our partner company that is actively hiring:
Mako

About the company

Mako’s AI platform reduces AI compute costs by up to 70%

Our breakthrough technology eliminates the need for expensive and manual GPU optimization, automatically generating high-performance code that runs efficiently on any hardware. Two core capabilities drive immediate business value:

Cost Optimization: Deploy AI models with up to 70% lower computing costs, directly improving your bottom line.

Universal Deployment: Run your existing AI models at peak performance across any GPU infrastructure, eliminating vendor lock-in and scaling constraints.

Mako delivers continuous, automated performance improvements without requiring changes to your existing code or hiring specialized engineers. Our intelligent compiler automatically optimizes your AI workloads 24/7, ensuring you maintain peak efficiency as your models and infrastructure evolve.

Technical Innovation

At the core of our platform is an innovative compiler that leverages hardware-aware deep learning-based search to automatically select from the growing ecosystem of vendor-provided and open-source GPU kernel libraries. Our compiler extends beyond library selection with optimization passes for both vertical and horizontal kernel fusion, enabling the generation of novel kernels outside the original search space.

Our roadmap includes extending the compiler to generate entirely new kernels from scratch. By integrating cutting-edge AI technologies into the compilation pipeline from day one, Mako is pioneering the next generation of modern compilation.

About the role

Summary

Our R&D team is focused on creating the most efficient engine for deploying generative AI models, with efforts ranging from precise GPU kernel tuning to comprehensive system optimizations.

We're looking for an expert level engineer with a strong background in either CUDA, ROCm, or Triton kernel optimization. Your role will involve leading substantial improvements in GPU performance and playing a key role in pioneering AI and machine learning initiatives.

Our tech

Our team builds software infrastructure for high-performance AI inference and training on any hardware. There are three core components:

  1. Mako Compiler automatically selects, tunes, and generates GPU kernels for any hardware platform
  2. Mako Runtime serves compiled models at high performance
  3. Mako Platform enables users to easily deploy and manage deployments across any cloud (you’ll be working on this!)

Responsibilities

  • Explore and analyze performance bottlenecks in ML training and inference.
  • Develop and optimize high-performance computing kernels in Triton, CUDA, and/or ROCm.
  • Implement programming solutions in C/C and Python.
  • Deep dive into GPU performance optimizations to maximize efficiency and speed.
  • Collaborate with the team to extend and improve existing machine learning compilers or frameworks such as MLIR, Pytorch, Tensorflow, ONNX Runtime, TensorRT. (This is optional but beneficial)

Qualifications

  • Bachelor's, Master’s or PhD’s degree in Computer Science, Electrical Engineering, or a related field.
  • Strong programming skills in C/C and Python.
  • Deep understanding and experience in GPU performance optimizations.
  • Proven experience with kernel optimizations on CUDA, ROCm, or other accelerators.
  • General experience with the training and deployment of ML models
  • Experience with distributed systems development or distributed ML workloads

Bonus Points

  • Experience with innovative OSS projects like FlashAttention, mlc-llm, vllm.
  • Experience with machine learning compilers or frameworks such as TVM, MLIR, Pytorch, Tensorflow, ONNX Runtime, TensorRT.

Our Benefits

  • Competitive salary package
  • Performance-based bonuses and incentives
  • Comprehensive health insurance coverage for you and your family
  • Flexible working hours and remote work options
  • Professional development opportunities, including training programs and conferences
  • Generous vacation and paid time off policy
  • Company-sponsored social activities and team-building events
  • Modern and comfortable work environment with state-of-the-art equipment and facilities

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Software Engineer - GPU Kernel?

Sign up to receive alerts about other jobs on the Software Engineer - GPU Kernel career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$97,257 - $120,701
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$146,673 - $180,130
Income Estimation: 
$176,149 - $220,529
Income Estimation: 
$97,257 - $120,701
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$77,657 - $95,021
Income Estimation: 
$97,257 - $120,701
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$146,673 - $180,130
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Scout AI

Scout AI
Hired Organization Address Seattle, WA Full Time
Intro Scout AI is a new hiring platform that connects software engineers to opportunities with world-class companies. On...
Scout AI
Hired Organization Address Seattle, WA Full Time
Intro Scout AI (scoutnow.ai) is a new hiring platform that connects software engineers to opportunities with world-class...
Scout AI
Hired Organization Address Seattle, WA Full Time
Intro Scout AI is a new hiring platform that connects software engineers to opportunities with world-class companies. On...
Scout AI
Hired Organization Address Boston, MA Full Time
Intro Scout AI (scoutnow.ai) is a new hiring platform that connects software engineers to opportunities with world-class...

Not the job you're looking for? Here are some other Software Engineer - GPU Kernel jobs in the New York, NY area that may be a better fit.

Senior GPU Kernel Engineer

Alldus, New York, NY

GPU Kernel Engineer

Mako, New York, NY

AI Assistant is available now!

Feel free to start your new journey!