Demo

AI DevOps Support Engineer - REMOTE

NTT DATA, Inc.
Atlanta, GA Remote Full Time
POSTED ON 3/10/2025
AVAILABLE BEFORE 6/10/2025

Req ID : 316845

NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.

We are currently seeking a AI DevOps Support Engineer - REMOTE to join our team in Atlanta, Georgia (US-GA), United States (US).

As an AI Platform Specialist, these roles will provide application and GPU support. The team will deliver Tier 1 and Tier 2 support to developers and engineers while collaborating closely with Tier 3 and 4 platform teams and vendors for issue resolution. The roles require user knowledge of Kubernetes, virtualization, and cloud-native technologies as well as operator knowledge of GPUs and other AI supporting services. Each specialist should have a focus on customer service along with goals of reliability, scalability, and performance.

Day to Day Responsibilities :

  • Platform Support & Incident Response

Provide Tier 1 & Tier 2 support for AI-driven applications and workloads.

  • Troubleshoot and resolve issues related to Kubernetes deployments, GPU utilization, and service performance.
  • Collaborate with Tier 3 teams, including Kubernetes engineers and external vendors, to escalate and resolve complex issues.
  • Kubernetes & Cloud-Native Operations
  • Full adoption, creation, and integrations into automated services using Helm, Ansible, Terraform, etc.

  • Deploy, manage, and support containerized AI workloads on Google Anthos-powered Kubernetes clusters.
  • Ensure adherence to pod security policies, automated rollouts / rollbacks, and best practices for scalable and secure Kubernetes environments.
  • GPU Infrastructure & AI Services Management
  • Optimize and support GPU-enabled workloads including CUDA and other AI acceleration frameworks.

  • Assist in the installation, configuration, and support of AI coding assistants (e.g., Codeium).
  • Observability & Documentation
  • Maintain detailed operational documentation, runbooks, and troubleshooting guides.

  • Utilize monitoring / logging tools like New Relic, Big Panda, Prometheus, Grafana, and other observability frameworks.
  • Process Improvement & Collaboration
  • Work cross-functionally with developers, IT teams, and vendors to ensure seamless deployment and support of AI services.

  • Contribute to CI / CD pipelines, automation, service, and security best practices.
  • Track and communicate work through task management platforms (ServiceNow and Jira).
  • Minimum Requirements :

  • 5 years with hybrid Cloud - In-depth knowledge of private (on-premises) and public (GCP & AWS) cloud architectures and services.
  • 5 years developer experience with DevOps practices (Git, Jenkins, etc.) as well as working with AI / ML engineers and data scientists.
  • 5 years experience deploying, supporting, and optimizing on-premises and cloud GPUs (NVIDIA & AMD) enabled infrastructure (VMs & Containers).
  • 5 years of Kubernetes Expertise, including hands-on experience with deploying and managing containerized workloads in Kubernetes.
  • Preferred Qualifications

  • Experience with GPU orchestration tools like Run : AI, NVIDIA AI Enterprise, VMWare Private AI Foundation, etc.
  • Exposure to AI coding assistants like Codeium, Copilot, or Tabnine.
  • Proficient in development tools like Python, PyTorch, TensorFlow, Jupyter Notebooks, etc.
  • Technical Support & Troubleshooting - Proven ability to diagnose and resolve customer and platform issues in production environments.
  • Strong Communication & Documentation - Ability to clearly document procedures, write knowledge base articles, and collaborate with customers and teams.
  • Time Management & Accountability - Ability to work independently, prioritize tasks, and manage workload effectively.
  • About the Team & Reporting Structure

    These positions will report to the Senior AI Architect and work as peers within a specialized AI support team. Collaboration with internal VM and container support teams as well as NVIDIA, Codeium, and other vendor specialists will be essential for supporting customers, troubleshooting, and optimizing AI workloads.

    Where required by law, NTT DATA provides a reasonable range of compensation for specific roles. The starting pay range for this remote role is $109,275 - $227,656. This range reflects the minimum and maximum target compensation for the position across all US locations. Actual compensation will depend on a number of factors, including the candidate's actual work location, relevant experience, technical skills, and other qualifications.

    INDHCLSMC

    LI-PAS

    About NTT DATA

    NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies.Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at us.nttdata.com

    NTT DATA endeavors to make https : / / us.nttdata.com accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact us at https : / / us.nttdata.com / en / contact-us . This contact information is for accommodation requests only and cannot be used to inquire about the status of applications. NTT DATA is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status. For our EEO Policy Statement, please click here . If you'd like more information on your EEO rights under the law, please click here . For Pay Transparency information, please click here .

    Salary : $109,275 - $227,656

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a AI DevOps Support Engineer - REMOTE?

    Sign up to receive alerts about other jobs on the AI DevOps Support Engineer - REMOTE career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $70,491 - $94,370
    Income Estimation: 
    $87,186 - $108,041
    Income Estimation: 
    $78,935 - $89,377
    Income Estimation: 
    $79,311 - $112,035
    Income Estimation: 
    $184,796 - $233,226
    Income Estimation: 
    $179,606 - $233,815
    Income Estimation: 
    $158,960 - $205,707
    Income Estimation: 
    $154,509 - $200,187
    Income Estimation: 
    $117,024 - $149,811
    Income Estimation: 
    $137,568 - $176,908
    Income Estimation: 
    $119,030 - $151,900
    Income Estimation: 
    $149,493 - $192,976
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at NTT DATA, Inc.

    NTT DATA, Inc.
    Hired Organization Address Washington, DC Full Time
    Req ID: 320241 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If ...
    NTT DATA, Inc.
    Hired Organization Address Nashville, TN Full Time
    Job Title : Accounts Receivable Clerk- (Temporary 6 months or longer) Industry : Healthcare FSLA status : Non-Exempt Dep...
    NTT DATA, Inc.
    Hired Organization Address Missouri, MO Full Time
    Make an impact with NTT DATA Join a company that is pushing the boundaries of what is possible. We are renowned for our ...
    NTT DATA, Inc.
    Hired Organization Address Atlanta, GA Full Time
    Req ID: 319731 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If ...

    Not the job you're looking for? Here are some other AI DevOps Support Engineer - REMOTE jobs in the Atlanta, GA area that may be a better fit.

    AI DevOps Support Engineer - REMOTE

    NTT DATA Services, Atlanta, GA

    DevOps Engineer

    Arize AI, Inc, Marietta, GA

    AI Assistant is available now!

    Feel free to start your new journey!