What are the responsibilities and job description for the Technical Infrastructure Engineer position at Strategic Business Systems (SBS)?
Strategic Business Systems (SBS) is seeking a highly motivated and skilled engineer to join our team and manage a diverse fleet of server hardware across network, storage, compute, and AI domains. The ideal candidate will possess expertise in IBM Storage Fusion, Spectrum Scale, and Spectrum Protect. A background in server hardware validation, failure analysis, lab infrastructure management, and network administration is highly desired, as is experience with LLMs, containerization, and virtualization. This role is onsite daily in Fremont, CA.
Roles and Responsibilities:
- Server Hardware Management: Maintain and oversee server racks from multiple OEMs, ensuring optimal performance across network, storage, compute, and AI hardware.
- Failure Analysis & Debugging: Support hardware validation by analyzing system- and module-level failures from customers' data centers. Debug firmware, BIOS, CPLD, and related platform issues.
- Vendor & Firmware Coordination: Interface with OEM vendors for firmware and driver updates, ensuring compliance and system stability.
- Network Infrastructure Management: Maintain and configure switches, routers, firewalls, and core networking protocols (TCP/IP, DNS, DHCP).
- AI & Compute Frameworks: Work with LLMs and popular AI frameworks such as TensorFlow and PyTorch to support AI infrastructure needs.
- Containerization & Virtualization: Deploy and manage containerized applications using Docker & Kubernetes and maintain virtual machines on VMware/KVM.
- Lab Operations & Inventory: Manage lab infrastructure, conduct safety audits, maintain inventory control, and oversee access to critical server hardware.
- Firmware Debugging & Dediprog Tools: Utilize Dediprog tools for FW/BIOS debugging and collaborate on failure analysis initiatives.
- Cross-Functional Collaboration: Work closely with failure analysis leads and engineering teams on mission-critical projects.
Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related field.
- Experience working with IBM Storage Fusion, Spectrum Scale (Storage Scale), and Spectrum Protect.
- 5 years of experience in server rack management, lab infrastructure management, or related fields.
- Expertise in debugging/troubleshooting complex hardware issues (storage, compute, and AI systems).
- Strong Linux/Unix background (RedHat, Fedora, CentOS, etc.).
- Proficiency in scripting languages (Python, PowerShell, Perl, PHP, etc.).
- Hands-on experience with Kubernetes, Docker, and virtualization platforms (VMware, KVM, etc.).
- Experience with validation of failed server hardware, including BIOS/CPLD firmware debugging.
- Strong networking knowledge, including TCP/IP, DNS, and DHCP.
- Deep understanding of server hardware components (motherboards, power distribution, storage systems).
- Excellent problem-solving skills, ability to work independently, and strong communication/documentation abilities.
Estimated compensation range: $150,000-$185,000
About SBS
SBS is pleased to offer a comprehensive benefits package to eligible, full-time employees. This includes Medical, Dental, Vision, 401k, Life Insurance/Disability, and Paid Time Off (PTO). SBS provides the opportunity to work with the best in the industry on a wide range of cutting-edge enterprise technologies in a fast-paced culture that rewards leadership and creative thinking.
Strategic Business Systems, Inc. (SBS) is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin.
If interested in learning more about this opportunity, please send your resume to recruiting@sbsplanet.com.