AI Platform Support Engineer

ITTConnect
Atlanta, GA Full Time
POSTED ON 2/21/2025
AVAILABLE BEFORE 5/19/2025

Job Description

ITTConnect is seeking an AI Platform Specialist to work for one of our clients. This is a role with a global leader in consulting, digital transformation, technology, and engineering services that is present in nearly 50 countries. The end client is in the telecom business.

We are building a new team of platform specialists to support and enhance high-performance AI services. These are highly technical, hands-on roles focused on customer, application, and platform support of AI-focused workloads.

AI Platform Specialists will provide application and GPU support. The team will deliver Tier 1 and Tier 2 support to developers and engineers while collaborating closely with Tier 3 and 4 platform teams and vendors on issue resolution. The roles require user-level knowledge of Kubernetes, virtualization, and cloud-native technologies as well as operator-level knowledge of GPUs and other AI-supporting services. Each specialist should focus on customer service along with reliability, scalability, and performance goals.

The AI Platform Specialist will report to the Senior AI Architect and work alongside peers within a specialized AI support team. Collaboration with internal VM and container support teams as well as NVIDIA, Codeium, and other vendor specialists will be essential for supporting customers, troubleshooting, and optimizing AI workloads.

Key Responsibilities:

  • Platform Support & Incident Response

    o Provide Tier 1 & Tier 2 support for AI-driven applications and workloads.

    o Troubleshoot and resolve issues related to Kubernetes deployments, GPU utilization, and service performance.

    o Collaborate with Tier 3 teams, including Kubernetes engineers and external vendors, to escalate and resolve complex issues.

  • Kubernetes & Cloud-Native Operations
    o Full adoption, creation, and integrations into automated services using Helm, Ansible, Terraform, etc.

    o Deploy, manage, and support containerized AI workloads on Google Anthos-powered Kubernetes clusters.

    o Ensure adherence to pod security policies, automated rollouts/rollbacks, and best practices for scalable and secure Kubernetes environments.

  • GPU Infrastructure & AI Services Management
    o Optimize and support GPU-enabled workloads including CUDA and other AI acceleration frameworks.

    o Assist in the installation, configuration, and support of AI coding assistants (e.g., Codeium).

  • Observability & Documentation
    o Maintain detailed operational documentation, runbooks, and troubleshooting guides.

    o Use monitoring/logging tools such as New Relic, BigPanda, Prometheus, Grafana, and other observability frameworks.

  • Process Improvement & Collaboration
    o Work cross-functionally with developers, IT teams, and vendors to ensure seamless deployment and support of AI services.

    o Contribute to CI/CD pipelines, automation, service, and security best practices.

    o Track and communicate work through task management platforms (ServiceNow and Jira).

Requirements

  • Hybrid Cloud – In-depth knowledge of private (on-premises) and public (GCP & AWS) cloud architectures and services.
  • AI/ML Software – Developer experience with DevOps practices (Git, Jenkins, etc.) as well as experience working with AI/ML engineers and data scientists.
  • AI/ML Hardware – Experience deploying, supporting, and optimizing on-premises and cloud GPU-enabled (NVIDIA & AMD) infrastructure (VMs & containers).
  • Kubernetes Expertise – Hands-on experience with deploying and managing containerized workloads in Kubernetes.
  • Technical Support & Troubleshooting – Proven ability to diagnose and resolve customer and platform issues in production environments.
  • Strong Communication & Documentation – Ability to clearly document procedures, write knowledge base articles, and collaborate with customers and teams.
  • Time Management & Accountability – Ability to work independently, prioritize tasks, and manage workload effectively.

Preferred Qualifications

  • Experience with GPU orchestration tools like Run:ai, NVIDIA AI Enterprise, VMware Private AI Foundation, etc.
  • Exposure to AI coding assistants like Codeium, Copilot, or Tabnine.
  • Proficiency with development tools like Python, PyTorch, TensorFlow, Jupyter Notebooks, etc.

Requirements

  • 10 years of experience as an IT Business Analyst
  • MUST HAVE experience in Financial Services / Banking
  • Must have previous technical background, preferably as a Developer
  • Highly desirable previous experience with core banking platforms
  • Experience defining functional architecture, business requirements and functional tests
  • Seasoned in business analysis for the banking sector with exceptional analytical and conceptual thinking skills
  • Domain Experience on Karate Framework; Test Engineering, Kafka, Event Driven and REST APIs
  • Domain knowledge on Postman
  • Domain knowledge on AWS
  • Experience creating detailed reports and giving presentations
  • Excellent planning, organizational, and time management skills
  • Demonstrated ability to work well with different business users and stakeholders groups gathering business requirements and writing functional specification documents
  • BS in Computer Science, Computer Engineering or similar
