What are the responsibilities and job description for the Sr Server Support Tech position at Quanta Manufacturing Nashville, LLC?
Overview:
We seek multiple highly skilled NVIDIA GB200 Sr Server Support Technicians to join our team. You will install, configure, maintain, and troubleshoot NVIDIA GB200 servers and associated hardware in this role. The ideal candidate will have substantial experience in server hardware support, specifically with NVIDIA products, and a passion for working in a fast-paced, dynamic environment.
Key Responsibilities:
- Server Installation & Configuration: Install, configure, and deploy NVIDIA GB200 servers in data center environments, ensuring they are correctly set up for optimal performance and scalability.
- Hardware Maintenance: Perform regular maintenance and health checks on NVIDIA GB200 servers, including monitoring hardware performance, updating firmware, and replacing or upgrading components.
- Troubleshooting & Repairs: Diagnose and resolve hardware and software issues related to the NVIDIA GB200 servers, ensuring minimal downtime and maintaining system integrity.
- Performance Optimization: Monitor server performance and implement corrective actions to optimize the efficiency, stability, and reliability of NVIDIA GB200 hardware.
- System Updates & Patches: Apply firmware updates, patches, and drivers to NVIDIA servers, ensuring compatibility with the latest software and hardware environments.
- Integration Support: Help integrate NVIDIA GB200 servers with other systems and software, ensuring compatibility and smooth communication across the network.
- Documentation & Reporting: Maintain accurate records of server configurations, maintenance schedules, and troubleshooting efforts. Generate regular reports on server health, performance, and issues.
- Collaboration: Work closely with IT infrastructure teams, network engineers, and other technical staff to ensure seamless server operations and integration with existing infrastructure.
- Data Center Operations: Support data center operations, ensuring that NVIDIA GB200 servers are properly rack-mounted, cabled, and positioned for optimal airflow and cooling.
Required Skills and Qualifications:
- Bachelor’s degree in Information Technology, Computer Science, or a related field, or equivalent technical certifications and experience.
- Proven experience working with NVIDIA GB200 servers or similar high-performance computing hardware.
- Strong understanding of server hardware, including CPU, memory, storage, networking components, and cooling systems.
- Familiarity with server operating systems (Linux, Windows Server) and server management tools.
- Experience with server virtualization, data center management, and cloud-based environments.
- Solid understanding of networking concepts, protocols, and configurations (TCP/IP, DNS, DHCP, etc.).
- Proficiency with server diagnostics tools and hardware monitoring software.
- Excellent troubleshooting and problem-solving skills with attention to detail.
- Ability to work in a fast-paced environment and handle multiple tasks simultaneously.
- Strong communication skills, both written and verbal, with the ability to explain technical issues to non-technical personnel.
Preferred Qualifications:
- Experience with NVIDIA-specific hardware and software solutions, including GPUs, CUDA, and other NVIDIA technologies.
- Familiarity with GPU server configurations and use cases, particularly in AI, machine learning, and high-performance computing environments.
- Knowledge of server management frameworks like IPMI, iLO, or similar.
- IT certifications (e.g., CompTIA A , Cisco CCNA, or similar) are a plus.
- Familiarity with cloud platforms (AWS, Google Cloud, Azure) and their interaction with on-premises server infrastructure.
Additional Information:
- Ability to lift heavy hardware components and perform physical installations and repairs in a data center environment.
- Willingness to work on-call or during non-business hours for emergency maintenance or system downtime.