Senior Software Engineer, AI Platform, Distributed Systems
Compensation: $180,000 - $260,000 USD
Location: San Francisco Bay Area (Hybrid, 2 days in-office)
Who Are We?
We are building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises.
Why Join Us?
- High-Impact Work: Work on technology that directly enables the training of AI models for some of the world's leading research labs and companies.
- Technical Excellence: Push the boundaries of distributed systems and AI infrastructure at scale.
- Fast-Paced Innovation: We reward individuals who take ownership, move quickly, and drive measurable results.
- Career Growth: Your impact directly correlates with career advancement, with opportunities to take on expanded responsibilities and leadership roles.
- Clear Ownership: Drive end-to-end solutions with autonomy and clear accountability.
Role Overview
As a Senior Software Engineer, AI Platform, Distributed Systems, you will design and develop the core backend infrastructure that powers large-scale AI training workflows. Your expertise in distributed systems, cloud infrastructure, and scalable data pipelines will be critical in optimizing data flow, improving system reliability, and supporting AI model training.
Your Impact
- Develop Scalable Systems: Design and implement high-throughput data processing, storage, and streaming capabilities using distributed databases and messaging systems.
- Optimize Performance: Improve the efficiency and reliability of backend infrastructure, ensuring seamless AI data workflows.
- Architect AI-Driven Workflows: Build scalable APIs and cloud-native solutions that integrate with machine learning pipelines and customer data workflows.
- Enhance Reliability: Work with support teams to troubleshoot and resolve system issues, ensuring robust backend operations.
- Collaborate Across Teams: Work closely with product managers, engineers, and stakeholders to deliver customer-driven solutions.
- Stay Ahead of Trends: Keep up with emerging technologies in AI infrastructure, distributed computing, and cloud platforms to drive innovation.
What You Bring
- 5 years of backend engineering experience, specializing in distributed systems and scalable architectures.
- Strong experience with databases (Relational, NoSQL, Key-Value Stores) and message queues for large-scale data processing.
- Expertise in backend APIs (REST, GraphQL) and development with Node.js (NestJS preferred), TypeScript, Java, or Python.
- Proficiency in cloud infrastructure, particularly Google Cloud Platform (GCP preferred), AWS, or Azure.
- Experience designing high-throughput, large-scale data pipelines and optimizing database performance.
- Strong problem-solving skills, with the ability to break down complex challenges and execute methodically.
- A self-driven mindset, thriving in fast-paced environments and taking full ownership of engineering challenges.
- Strong communication and collaboration skills, working effectively across cross-functional teams.
- Experience using AI-powered development tools like Cursor and GitHub Copilot.
Nice to Have
- Knowledge of search technologies such as Elasticsearch.
- Experience with containerized infrastructure (Kubernetes) and DevOps tools (ArgoCD, Datadog).
- Familiarity with AI model training workflows and cloud-based ML infrastructure.
Engineering at Our Company
Our engineering team is at the forefront of AI infrastructure development, creating scalable and high-performance systems that support the next generation of AI models. We emphasize rapid iteration, technical excellence, and collaborative problem-solving, allowing engineers to work on cutting-edge challenges that shape the future of artificial intelligence.
Technology Stack:
- Frontend: React.js, TypeScript
- Backend: Node.js, TypeScript, Python, Java & Kotlin
- APIs: GraphQL
- Cloud & Infrastructure: Google Cloud Platform (GCP), Kubernetes
- Databases: MySQL, Spanner, PostgreSQL
- Queueing / Streaming: Kafka, PubSub
Join us and shape the foundation for the next wave of AI innovation.