Demo

HPC System Site Lead

California Creative Solutions Inc.
Los Alamos, NM Full Time
POSTED ON 3/4/2025
AVAILABLE BEFORE 4/27/2025

Job Responsibilities:

  • Maintain the HPC systems availability to the customer.
  • Lead technical output of on-site client HW technicians, system admins, and system analysts.
  • Serve as primary customer focal point for system support of systems and on-site activities.
  • Full-time 100% presence on customer site for standard business hours.
  • Routine face-to-face and group interaction with site team to organize tasks, follow up, and assist with challenges they encounter.
  • Track system health and Cases, review regularly (weekly) with customers and HPC leadership.
  • Maintaining availability reports for tracking SLA's.
  • Pre-plan system upgrades; review plans with team and customers, arrange for staffing and equipment, including pre-arrange open lines of communication in case of issues.
  • Escalate Cases and assist team members escalating Cases to next-tier support, and follow up to drive closure via escalation processes.
  • Manage on-site parts inventory using business tools.
  • Manage site tools and equipment.
  • Maintaining the on-call schedule to support our 365 24x7 contracts.
  • Assisting with hardware and system installation activities in new systems.

Team Support

  • Build strong working relationships with teammates, leadership, and customers.
  • Maintain awareness of upcoming training and prompt team members to complete trainings.
  • Maintain a team calendar of planned leave including on-call schedule for operational issues.
  • Provide performance review input to the District Service Manager (DSM) and suggestions for team member performance and development.
  • Escalate to DSM any personnel issues, risk of missing SLA, or customer satisfaction concerns.
  • Maintain a clean and safe working environment.
  • Support DSM in on-boarding new team members by providing site-specific details (e.g., customer network accounts, badge, parking, etc.).

Required Qualifications & Experience:

  • 8 years of professional experience and a Bachelor of Arts/Science or equivalent degree in computer science or related area of study; without a degree, three additional years of relevant professional experience (11 years in total).
  • In-depth knowledge of high-performance computing (HPC) systems.
  • Proficiency in managing and optimizing HPC environments, including system configuration, performance tuning, and troubleshooting.
  • Strong understanding of parallel computing, cluster management, and distributed computing technologies.
  • Experience with HPC workload managers and schedulers such as SLURM, PBS, or similar.
  • Advanced knowledge of Linux operating systems.
  • Familiarity with software development tools and environments commonly used in HPC, including compilers, debuggers, and performance analysis tools.
  • Experience with various scripting languages such as Python or Bash.
  • Proven experience in system administration, including hardware and software installation, maintenance, and upgrades.
  • Knowledge of network architecture, storage solutions, and data management within HPC environments.
  • Ability to implement and manage security protocols and best practices in a high-performance computing context to maintain customer security posture.
  • Strong project management skills, including planning, execution, and monitoring of HPC projects.
  • Ability to lead and coordinate a team of technical professionals, ensuring timely and successful project delivery.
  • Experience in resource allocation, budgeting, and performance metrics tracking for HPC projects.
  • Excellent problem-solving abilities, with a focus on identifying root causes and implementing effective solutions.
  • Strong analytical skills to assess system performance and make data-driven decisions for optimization.
  • Ability to troubleshoot complex technical issues in a high-stakes HPC environment.
  • Exceptional communication skills, both written and verbal, to effectively interact with team members, stakeholders, and clients.
  • Ability to convey complex technical information in a clear and concise manner to non-technical audiences.
  • Strong collaboration skills to work effectively within a multidisciplinary team and across organizational boundaries.
  • Extensive experience in HPC system management and administration, with a track record of successful project and team leadership.
  • Willingness to participate in ongoing professional development and training opportunities which may require travel.

Preferred Qualifications:

  • CompTIA A or Server Certification
  • Security Certification
  • Linux Certification
  • PMP or Project
  • Vendor Certifications
  • Experience with ticket-tracking software (Salesforce, SmartSheets; any ticket tracking is good)

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a HPC System Site Lead?

Sign up to receive alerts about other jobs on the HPC System Site Lead career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$101,597 - $131,824
Income Estimation: 
$104,896 - $133,785
Income Estimation: 
$123,198 - $153,566
Income Estimation: 
$142,209 - $179,056
Income Estimation: 
$177,932 - $225,503
Income Estimation: 
$83,086 - $106,052
Income Estimation: 
$83,298 - $131,726
Income Estimation: 
$101,020 - $131,637
Income Estimation: 
$177,932 - $225,503
Income Estimation: 
$208,896 - $274,954
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at California Creative Solutions Inc.

California Creative Solutions Inc.
Hired Organization Address Force, UT Full Time
Job Description : We are seeking a Transport / IP Network Controller to join our team at Hill Air Force Base. This role ...
California Creative Solutions Inc.
Hired Organization Address Springfield, TN Full Time
Job Requirement : Deep knowledge and experience with Geospatial Information Systems (GIS). Experience with Intelligence ...
California Creative Solutions Inc.
Hired Organization Address Chantilly, VA Full Time
Job Responsibilities : Design, develop, and support cloud initiatives to support customer-driven needs. Responsible for ...
California Creative Solutions Inc.
Hired Organization Address Herndon, VA Full Time
Job Responsibilities : Develop an understanding of the most effective and efficient processes to accomplish tasks focusi...

Not the job you're looking for? Here are some other HPC System Site Lead jobs in the Los Alamos, NM area that may be a better fit.

AI Assistant is available now!

Feel free to start your new journey!