What are the responsibilities and job description for the AI Platform Support Engineer position at ITTConnect?
Job Description
ITTConnect is seeking an AI Platform Specialist to work for one of our clients. This is a role with a global leader in consulting, digital transformation, technology, and engineering services that is present in nearly 50 countries. The end client is in the telecom business.
We are building a new team of platform specialists to support and enhance high-performance AI services. These are highly technical, hands-on roles focused on customer, application, and platform support of AI-focused workloads.
AI Platform Specialists will provide application and GPU support. The team will deliver Tier 1 and Tier 2 support to developers and engineers while collaborating closely with Tier 3 and Tier 4 platform teams and vendors on issue resolution. The roles require user-level knowledge of Kubernetes, virtualization, and cloud-native technologies, as well as operator-level knowledge of GPUs and other AI supporting services. Each specialist should focus on customer service, with reliability, scalability, and performance as goals.
The AI Platform Specialist will report to the Senior AI Architect and work as a peer within a specialized AI support team. Collaboration with internal VM and container support teams, as well as with NVIDIA, Codeium, and other vendor specialists, will be essential for supporting customers, troubleshooting, and optimizing AI workloads.
Key Responsibilities:
- Platform Support & Incident Response
o Provide Tier 1 & Tier 2 support for AI-driven applications and workloads.
o Troubleshoot and resolve issues related to Kubernetes deployments, GPU utilization, and service performance.
o Collaborate with Tier 3 teams, including Kubernetes engineers and external vendors, to escalate and resolve complex issues.
o Adopt, create, and integrate automated services using tools such as Helm, Ansible, and Terraform.
o Deploy, manage, and support containerized AI workloads on Google Anthos-powered Kubernetes clusters.
o Ensure adherence to pod security policies, automated rollouts/rollbacks, and best practices for scalable and secure Kubernetes environments.
o Optimize and support GPU-enabled workloads including CUDA and other AI acceleration frameworks.
o Assist in the installation, configuration, and support of AI coding assistants (e.g., Codeium).
o Maintain detailed operational documentation, runbooks, and troubleshooting guides.
o Use monitoring/logging tools such as New Relic, BigPanda, Prometheus, Grafana, and other observability frameworks.
o Work cross-functionally with developers, IT teams, and vendors to ensure seamless deployment and support of AI services.
o Contribute to CI/CD pipelines, automation, service, and security best practices.
o Track and communicate work through task management platforms (ServiceNow and Jira).
Requirements
- 10 years of experience as an IT Business Analyst
- Must have experience in Financial Services / Banking
- Must have a previous technical background, preferably as a Developer
- Previous experience with core banking platforms is highly desirable
- Experience defining functional architecture, business requirements, and functional tests
- Seasoned in business analysis for the banking sector, with exceptional analytical and conceptual thinking skills
- Domain experience with the Karate framework, test engineering, Kafka, and event-driven and REST APIs
- Domain knowledge of Postman
- Domain knowledge of AWS
- Experience creating detailed reports and giving presentations
- Excellent planning, organizational, and time management skills
- Demonstrated ability to work well with different business users and stakeholder groups, gathering business requirements and writing functional specification documents
- BS in Computer Science, Computer Engineering, or similar