
Principal Software Engineer, Model Inference - Hybrid - 5414

Benchmark IT - Technology Talent
Raleigh, NC Full Time
POSTED ON 2/18/2025
AVAILABLE BEFORE 3/13/2025



About the Company:

Our client's scalable artificial intelligence (AI) and machine learning (ML) platform enables enterprises to create and deliver AI-enabled applications at scale across hybrid cloud environments. Built using open-source technologies, OpenShift AI provides trusted, operationally consistent capabilities for teams to experiment, serve models, and deliver innovative apps.


Role Overview:

The OpenShift AI team seeks a Principal Software Engineer with Kubernetes and Model Inference Runtimes experience to join their rapidly growing engineering team. This team focuses on making machine learning model deployment and monitoring seamless and scalable across the hybrid cloud and the edge. This is a fascinating opportunity to build and impact the next generation of hybrid cloud MLOps platforms.


What will you do?

  • Develop and maintain a high-quality, high-performing ML inference runtime platform for multi-modal and distributed model serving.
  • Contribute directly to upstream inference runtime communities such as vLLM, TGI, PyTorch, OpenVINO, and others.
  • Maintain CI/CD build pipelines for container images that allow faster, more secure, reliable, and frequent releases.
  • Coordinate and communicate with various stakeholders.
  • Apply a growth mindset by staying up to date with AI and ML advancements.


What will you bring?

  • Extensive experience programming in Python and PyTorch
  • Familiarity with model parallelization, quantization, and memory optimization using vLLM, TGI, and other inference libraries.
  • Experience with Python packaging, such as PyPI libraries
  • Development experience with C, especially with the CUDA APIs, is a big plus
  • Solid understanding of the fundamentals of model inferencing architectures
  • Experience with Jenkins, Git, shell scripting, and related technologies
  • Experience with the development of containerized applications in Kubernetes
  • Experience with Agile development methodologies
  • Experience with cloud computing on at least one of the following cloud infrastructures: AWS, GCP, Azure, or IBM Cloud
  • Ability to work across a large distributed hybrid engineering team
  • Experience with open-source development is a plus


This position follows a hybrid work model, requiring 3 days per week on-site in Raleigh, NC.


This is a fantastic opportunity to work on cutting-edge AI/ML technology and contribute to innovative solutions in hybrid cloud environments. If this role excites you and matches your background, we encourage you to apply!



