Demo

Staff Software Engineer - Machine Learning Platform (San Francisco)

Replicate
San Francisco, CA Full Time
POSTED ON 3/26/2025
AVAILABLE BEFORE 4/24/2025
Replicate makes it easy for software engineers to run and customize machine learning models in the cloud. With a library of thousands of open-source models, you can get started with one line of code—or fine-tune and deploy your own models when you need something custom. We handle the infrastructure, so you can focus on building. Our team comes from places like Docker, GitHub, and NVIDIA, and we’re obsessed with making AI as intuitive as deploying a web app. We build in public, ship fast, and care about getting the details right.

The Platform team at Replicate oversees the entire lifecycle of models, from packaging and deployment to serving, scaling, and monitoring. You’ll be developing the infrastructure that supports thousands of models and powers millions of predictions daily. This is a chance to build something truly innovative, where each decision you make has a tangible impact and allows your creativity to shine.

What You’ll Be Doing

  • Designing and building our deployment and model-serving platform.
  • Building technology to operate the latest advancements in the ML and AI space.
  • Designing systems to maximize the utilization and reliability of our Kubernetes clusters and GPUs, including multi-regional traffic shifting and failover capabilities.
  • Owning and optimizing fair and reliable task allocation and queuing across a diverse set of customers with heterogeneous workloads.
  • Working with our Models team to speed up model inference through techniques like caching, weights management, machine configurations, and runtime optimizations in Python and PyTorch.
  • Working with technologies such as
    • Python, Go, and Node.js
    • Kubernetes and Terraform
    • Redis, Google BigQuery, and PostgreSQL
We're looking for the right person, not just someone who checks boxes, but it’s likely you have…

  • Experience building platforms at scale.
  • Worked in complex systems with many moving parts; you have opinions on monoliths vs. services.
  • Designed and implemented developer-friendly APIs to enable scalable and reliable integration.
  • Hands-on experience setting up and operating Kubernetes.
  • A passion for building tools that empower developers.
  • Strong communication and collaboration skills, with the ability to understand customer needs and distill complex topics into clear, actionable insights. We believe that most of programming isn’t just about writing code; building a platform requires a collaborative approach.
  • At least 10 years of full-time software engineering experience.

These aren’t hard requirements, but we definitely want to talk with you if…

  • You have worked on machine learning platform teams in the past.
  • You have experience working with or on teams that have put ML/AI into production, even though this role does not entail building ML models directly.
  • You have some exposure to serving Generative AI features where GPUs are costly commodities and workloads can take significant time to finish.

You'll be working from our beautiful office in the Mission, San Francisco, at least 3 days a week.

Compensation Range: $230K - $280K

Salary : $230,000 - $280,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Staff Software Engineer - Machine Learning Platform (San Francisco)?

Sign up to receive alerts about other jobs on the Staff Software Engineer - Machine Learning Platform (San Francisco) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$97,257 - $120,701
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$97,257 - $120,701
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$146,673 - $180,130
Income Estimation: 
$146,673 - $180,130
Income Estimation: 
$176,149 - $220,529
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Replicate

Replicate
Hired Organization Address San Francisco, CA Full Time
You’ll be helping our customers build cool stuff on Replicate. You’ll be pointing them in the right direction, solving g...
Replicate
Hired Organization Address San Francisco, CA Full Time
Replicate lets you run machine learning models in the cloud. We’re not just another AI company; we’re a team of develope...
Replicate
Hired Organization Address San Francisco, CA Full Time
Replicate helps developers build with AI. We're building the infrastructure and tools that make it easy to discover, run...

Not the job you're looking for? Here are some other Staff Software Engineer - Machine Learning Platform (San Francisco) jobs in the San Francisco, CA area that may be a better fit.

Software Engineer - Machine Learning Platform

Snowflake Computing, San Mateo, CA

Staff Software Engineer - Machine Learning

Hivemapper, San Francisco, CA

AI Assistant is available now!

Feel free to start your new journey!