Sr. Staff, AI Cluster Engineering

SK hynix America
San Jose, CA Full Time
POSTED ON 3/25/2025
AVAILABLE BEFORE 5/25/2025
Job Title: Sr. Staff, AI Cluster Engineering
Office Location: San Jose, CA
Work Model: Onsite
      
About SK hynix America      
At SK hynix America, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data centers. As a global leader in DRAM and NAND flash technologies, we advance mobile technology, empower cloud computing, and pioneer future technologies. Our cutting-edge memory technologies are essential in today's most advanced electronic devices and IT infrastructure, enabling enhanced performance and user experiences across the digital landscape.
We're looking for innovative minds to join our mission of shaping the future of technology. At SK hynix America, you'll be part of a team that's pioneering breakthrough memory solutions while maintaining a strong commitment to sustainability. We're not just adapting to technological change – we're driving it, with significant investments in artificial intelligence, machine learning, and eco-friendly solutions and operational practices. As we continue to expand our market presence and push the boundaries of what's possible in semiconductor technology, we invite you to be part of our journey to create the next generation of memory solutions that will define the future of computing.
 
Job Overview:
As an AI Cluster Engineer, you will work on the development and operation of high-performance computing clusters supporting AI/ML workloads. You will be responsible for the development, implementation, operation, and optimization of AI data center IT environments to ensure scalability, performance, reliability, and cost-effectiveness. This role requires collaboration with cross-functional teams to align computing infrastructure with the organization's strategic direction.
Responsibilities:

Computing Cluster Infrastructure Development

  • Design and implement distributed computing cluster infrastructure to support large-scale AI/ML model training and inference jobs with a focus on transformer-based AI models.
  • Build and maintain distributed systems to ensure scalability, efficient resource allocation, and high throughput.
  • Optimize cluster performance through hardware selection, equipment configuration, network engineering, and performance analysis.
  • Deploy and operate data center networking infrastructure using software systems for automation, design validation, deployment, and operational support.
  • Implement tools and processes to maintain high uptime and ensure infrastructure reliability during both model training and inference phases.
  • Identify and resolve performance bottlenecks, improving overall system throughput and response times.


Team Leadership & Collaboration

  • Collaborate with cross-functional teams, including research, security, and benchmark test engineering, to integrate infrastructure with AI workflows, ensuring seamless deployment and operation.
  • Engage with technology vendors and partners to evaluate new solutions that drive innovation in AI computing infrastructure.
Qualifications:
  • Bachelor’s degree in Computer Science, Engineering, or a related field (Master’s degree preferred).
  • 5 years of hands-on experience in computing cluster and backend server system engineering.
  • 3 years of experience in cloud computing (AWS, Azure, GCP).
  • Strong familiarity with AI/ML infrastructure requirements, best practices, and industry trends.
  • Experience in designing and managing distributed systems for large-scale AI training and inference.
  • Strong background in building and optimizing infrastructure for real-time AI systems.
  • Expertise in optimizing resource utilization, improving system throughput, and reducing latency in both training and inference.

 

Total Rewards:       
  • Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. The U.S. pay range for this position is base $110,000 - $140,000. Pay within this range varies by work location and may also depend on job-related skills and experience. Your Recruiter can share more about the specific salary range for the job location during the hiring process.
    • Our benefits include:
      • Top-tier health insurance at no employee cost
      • Paid days off: PTO, Company Holidays, Happy Fridays
      • Paid Parental Leave Program
      • 401k Matching
      • Educational reimbursement up to $10,000 per year
      • Donation Matching and volunteering opportunities
      • Corporate discount programs
      • Free Breakfast/Lunch/Dinner provided to employees
Equal Employment Opportunity:

SKHYA is an Equal Employment Opportunity Employer. We provide equal employment opportunities to all qualified applicants and employees and prohibit discrimination and harassment of any type without regard to race, sex, pregnancy, sexual orientation, religion, age, gender identity, national origin, color, protected veteran or disability status, genetic information, or any other status protected under applicable federal, state, or local laws.
      

      

Salary : $110,000 - $140,000
