Senior ML Platform Engineer (Serving Infrastructure)

Vlink
Sunnyvale, CA Full Time
POSTED ON 4/26/2025
AVAILABLE BEFORE 5/22/2025

Job Title : ML Engineer

Location : Remote

Employment Type : (Full-time or contract)

Duration : Long Term

About VLink : Started in 2006 and headquartered in Connecticut, VLink is one of the fastest-growing digital technology services and consulting companies. Since its inception, our innovative team members have been solving the most complex business and IT challenges of our global clients.

Job Description :

Role Overview : We're looking for an experienced engineer to build our ML serving infrastructure. You'll create the platforms and systems that enable reliable, scalable model deployment and inference. This role focuses on the runtime infrastructure that powers our production ML capabilities.

Key Responsibilities :

  • Design and implement scalable model serving platforms for both batch and real-time inference
  • Build model deployment pipelines with automated testing and validation
  • Develop monitoring, logging, and alerting systems for ML services
  • Create infrastructure for A/B testing and model experimentation
  • Implement model versioning and rollback capabilities
  • Design efficient scaling and load balancing strategies for ML workloads
  • Collaborate with data scientists to optimize model serving performance

Technical Requirements :

  • 7 years of software engineering experience, with 3 years in ML serving/infrastructure
  • Strong expertise in container orchestration (Kubernetes) and cloud platforms
  • Experience with model serving technologies (TensorFlow Serving, Triton, KServe)
  • Deep knowledge of distributed systems and microservices architecture
  • Proficiency in Python and experience with high-performance serving
  • Strong background in monitoring and observability tools
  • Experience with CI/CD pipelines and GitOps workflows

Nice to Have :

  • Experience with model serving frameworks :
      • TorchServe for PyTorch models
      • TensorFlow Serving for TF models
      • Triton Inference Server for multi-framework support
      • BentoML for unified model serving
  • Expertise in model runtime optimizations :
      • Model quantization (INT8, FP16)
      • Model pruning and compression
      • Kernel optimizations
      • Batching strategies
      • Hardware-specific optimizations (CPU/GPU)
  • Experience with model inference workflows :
      • Pre/post-processing pipeline optimization
      • Feature transformation at serving time
      • Caching strategies for inference
      • Multi-model inference orchestration
      • Dynamic batching and request routing
  • Experience with GPU infrastructure management
  • Knowledge of low-latency serving architectures
  • Familiarity with ML-specific security requirements
  • Background in performance profiling and optimization
  • Experience with model serving metrics collection and analysis

Employment Practices :

EEO, ADA, FMLA Compliant

VLink is an equal opportunity employer. At VLink, we are committed to embracing diversity, multiculturalism, and inclusion. VLink does not discriminate on the basis of race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. All aspects of employment, including the decision to hire, promote, or discharge, will be decided on the basis of qualifications, merit, performance, and business needs.
