Machine Learning Engineer - Inference Systems

Alldus
San Francisco, CA Full Time
POSTED ON 2/21/2025
AVAILABLE BEFORE 5/16/2025

We are looking for a skilled and enthusiastic Machine Learning Engineer to join our ML Inference team. In this role, you will contribute to the development of cutting-edge technologies in ML/LLM inference and serving, working alongside a talented team dedicated to building and enhancing next-generation Large Language Model (LLM) Inference Engines.

Key Responsibilities:

  • Develop and Enhance Inference Engine: Design, implement, and optimize a state-of-the-art LLM Inference Engine. Integrate the latest inference techniques from AI research to reduce latency and increase throughput.
  • Performance Optimization: Execute deep performance optimizations across the technology stack, including PyTorch, C++, and CUDA. Analyze and enhance system performance to address diverse use cases effectively.
  • Customer Collaboration: Engage with clients to understand their specific performance needs and tailor solutions. Provide technical expertise to ensure seamless deployment and operation of inference systems.
  • Technical Leadership: Shape the roadmap and vision for our inference technologies. Spearhead initiatives aimed at fostering innovation and ensuring our solutions remain competitive.
  • Infrastructure Development: Collaborate with team partners to build and sustain scalable, multi-replica serving infrastructure. Ensure the reliability and scalability of our LLM serving systems to accommodate growing workloads.

Qualifications:

  • Technical Skills: Proficient in systems programming with languages like C++. Strong experience with machine learning frameworks, particularly PyTorch. Expertise in GPU programming and CUDA for optimizing performance. Solid grasp of AI/ML concepts, especially those related to large language models.
  • Experience: Proven track record in developing and optimizing ML/LLM inference systems. Demonstrated ability to translate research advancements into effective production systems. Experience in performance tuning and profiling across various tech stacks. Familiarity with vLLM is a plus.