
Software Engineer, LLM Inference Engine and Product

Waveforms AI, Inc
San Francisco, CA Full Time
POSTED ON 2/16/2025
AVAILABLE BEFORE 5/7/2025

Job title: Software Engineer, LLM Inference Engine and Product / Member of Technical Staff

Who We Are

WaveForms AI is an audio large language model (LLM) company building the future of audio intelligence through advanced research and products. Our models will transform human-AI interactions, making them more natural, engaging, and immersive.

Role Overview

The Software Engineer, LLM Inference Engine and Product will develop and optimize a real-time inference engine for multimodal LLMs that handle audio and text inputs. This role uses technologies such as LiveKit, RTC engines, WebRTC, and FastAPI to build an efficient, real-time API layer. You will contribute to AI systems that deliver smooth user experiences across platforms, including iOS, Android, and desktop.

Key Responsibilities

  • Real-time Inference Development: Build and optimize a robust inference engine that supports multimodal LLMs, handling real-time audio and text inputs.
  • Technology Integration: Leverage tools like LiveKit, RTC engines, WebRTC, and FastAPI to enable low-latency, real-time communication and inference.
  • End-to-End Pipeline Design: Create and maintain the complete inference pipeline, from data ingestion to model serving, ensuring real-time performance.
  • Cross-platform Compatibility: Ensure the inference engine operates efficiently across various platforms, including mobile (iOS/Android) and desktop.
  • Optimization & Performance Tuning: Optimize the inference system to reduce latency, improve throughput, and enhance user experience.
  • API Development: Design and maintain scalable APIs to support real-time LLM interaction for diverse applications.

Required Skills & Qualifications

  • Inference Engine Expertise: Proven experience in building and optimizing inference engines for multimodal AI systems, particularly those combining audio and text inputs.
  • Technical Proficiency: Strong experience with LiveKit, RTC engines, WebRTC, and FastAPI for real-time communication and model inference.
  • Real-time System Design: Expertise in creating real-time pipelines and maintaining low-latency performance in production systems.
  • Cross-platform Development: Familiarity with iOS, Android, and desktop app development, ensuring seamless integration with inference systems.
  • Performance Optimization: Proficiency in optimizing inference engines to reduce latency and improve computational efficiency.
  • API Development: Experience in designing and maintaining APIs for real-time AI applications.
Minimum Experience

  • 4-5 years of relevant professional experience is required.


