Demo

Staff Machine Learning Infrastructure Engineer

Vlink
Sunnyvale, CA Full Time
POSTED ON 2/23/2025
AVAILABLE BEFORE 5/21/2025

Job Title : Staff Machine Learning Infrastructure Engineer

Location : Remote

Employment Type : (Full-time or contract)

Duration : Long Term

About VLink : Started in 2006 and headquartered in Connecticut, VLink is one of the fastest growing digital technology services and consulting companies. Since its inception, our innovative team members have been solving the most complex business, and IT challenges of our global clients.

Job Description :

Department : ML Platform Engineering

About the Role : As a Staff Machine Learning Infrastructure Engineer, you will architect and lead the technical vision for our ML platform initiatives, focusing on building scalable infrastructure that powers our ML capabilities. You will design and drive the evolution of our ML platforms, data systems, and serving infrastructure that enable teams to efficiently develop, deploy, and operate ML models at scale.

Key Responsibilities :

  • Architect end-to-end ML infrastructure spanning data processing, feature management, and model serving
  • Design and lead implementation of next-generation ML platforms that support diverse ML workloads
  • Drive technical excellence in ML infrastructure through standardization and automation
  • Build scalable data processing systems and feature platforms that handle massive-scale ML workloads
  • Design robust ML serving architectures supporting both real-time and batch inference
  • Establish best practices for ML observability, monitoring, and operational excellence
  • Lead cross-functional technical initiatives and mentor platform engineers
  • Drive infrastructure decisions that impact the entire ML lifecycle

Technical Leadership :

  • Define technical strategy and roadmap for ML infrastructure
  • Drive architectural decisions for complex ML systems
  • Lead design reviews and provide technical mentorship
  • Collaborate with data science teams to understand and address infrastructure needs
  • Establish standards for reliability, scalability, and performance
  • Build frameworks and platforms that accelerate ML development
  • Required Qualifications :

  • 10 years of software engineering experience, with 5 years focusing on ML infrastructure
  • Deep expertise in distributed systems and data processing at scale
  • Strong background in ML platform development and MLOps practices
  • Experience building production ML infrastructure supporting critical business applications
  • Proven track record of leading complex technical initiatives
  • Expert-level knowledge in :
  • Large-scale data processing systems (Spark, Beam)

  • Feature store architectures and implementations
  • ML serving platforms and inference optimization (TorchServe, Tensorflow Serving and Triton)
  • Container orchestration and cloud platforms
  • Data pipeline design and optimization
  • ML system monitoring and observability
  • Technical Expertise :

  • Data Infrastructure :
  • Feature stores and feature computation systems

  • Data quality and validation frameworks
  • Dataset versioning and lineage tracking
  • Efficient data storage and access patterns
  • Serving Infrastructure :
  • Model deployment and serving platforms

  • Inference optimization and scaling
  • Load balancing and traffic management
  • Model versioning and lifecycle management
  • Platform Development :
  • MLOps tooling and automation

  • Experimentation platforms
  • Monitoring and observability systems
  • Resource management and optimization
  • Preferred Qualifications :

  • Experience with GPU infrastructure and optimization
  • Background in high-performance computing
  • Contributions to open-source ML infrastructure projects
  • Experience with ML-specific security and compliance requirements
  • Master's degree in Computer Science or related field
  • Impact :

  • Shape the technical direction of ML infrastructure across the organization
  • Drive innovation in ML platforms and tools
  • Mentor and grow the technical capabilities of the team
  • Establish architectural patterns and best practices
  • Enable rapid ML development and deployment at scale
  • Employment Practices :

    EEO, ADA, FMLA Compliant

    VLink is an equal opportunity employer. At VLink, we are committed to embracing diversity, multiculturalism, and inclusion. VLink does not discriminate on the basis of race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. All aspects of employment including the decision to hire, promote, or discharge, will be decided on the basis of qualifications, merit, performance, and business needs.

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Staff Machine Learning Infrastructure Engineer?

    Sign up to receive alerts about other jobs on the Staff Machine Learning Infrastructure Engineer career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $258,641 - $455,625
    Income Estimation: 
    $884,710 - $2,266,655
    Income Estimation: 
    $103,114 - $138,258
    Income Estimation: 
    $118,163 - $145,996
    Income Estimation: 
    $120,777 - $151,022
    Income Estimation: 
    $129,363 - $167,316
    Income Estimation: 
    $86,891 - $130,303
    Income Estimation: 
    $129,363 - $167,316
    Income Estimation: 
    $145,845 - $177,256
    Income Estimation: 
    $147,836 - $182,130
    Income Estimation: 
    $154,597 - $194,610
    Income Estimation: 
    $86,891 - $130,303
    Income Estimation: 
    $81,253 - $112,554
    Income Estimation: 
    $89,966 - $112,616
    Income Estimation: 
    $95,407 - $122,738
    Income Estimation: 
    $103,114 - $138,258
    Income Estimation: 
    $86,891 - $130,303
    Income Estimation: 
    $154,597 - $194,610
    Income Estimation: 
    $172,688 - $210,712
    Income Estimation: 
    $170,589 - $211,671
    Income Estimation: 
    $178,619 - $225,190
    Income Estimation: 
    $86,891 - $130,303
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Vlink

    Vlink
    Hired Organization Address Tallahassee, FL Full Time
    Job Description: The Consultant will work with the Child Support Program Operational Procedures and Training Process aug...
    Vlink
    Hired Organization Address Salt Lake, UT Contractor
    Job Title: Senior Calypso Developer Location: Salt Lake City, UT (Client Location) Duration: 12 Month Contract Senior Ca...
    Vlink
    Hired Organization Address Sunnyvale, CA Full Time
    Job Title : ML Platform Engineering Location : Remote Employment Type : (Full-time or contract) Duration : Long Term Abo...
    Vlink
    Hired Organization Address Atlanta, GA Full Time
    Short Description : Under broad supervision, designs, codes, tests, modifies & debugs computer software. Researches & an...

    Not the job you're looking for? Here are some other Staff Machine Learning Infrastructure Engineer jobs in the Sunnyvale, CA area that may be a better fit.

    Staff Machine Learning Engineer

    1st. Creative Learning Academy Inc., Palo Alto, CA

    AI Assistant is available now!

    Feel free to start your new journey!