What are the responsibilities and job description for the Staff Software Engineer - ML Systems & Performance position at Alldus?
My client is looking for an experienced Staff Software Engineer to play a key role in optimizing machine learning infrastructure for generative models. This position involves designing and implementing innovative model-serving solutions on a proprietary inference engine, with a focus on improving efficiency, reducing latency, and maximizing throughput. You’ll also be responsible for developing monitoring and profiling tools to diagnose performance bottlenecks and drive system-level optimizations. This role offers the opportunity to collaborate with applied ML researchers and industry leaders to ensure their workloads are fully optimized for high-performance acceleration.
Responsibilities :
- Contribute to advancing the performance of generative media models by optimizing model-serving infrastructure.
- Design and implement next-generation model-serving architectures that improve efficiency, reduce processing delays, and optimize resource usage.
- Develop performance analysis tools to identify system inefficiencies and propose enhancements.
- Work closely with ML researchers and technical teams to ensure optimal acceleration for demanding workloads.
Requirements :
This role is ideal for someone passionate about pushing the limits of ML performance and working on next-generation AI infrastructure.