WE ARE ARCARITHM, and we are changing the world!
If you are ready to grow your career and change the world with us, then join the Arcarithm team!
We are located in beautiful downtown Huntsville, AL, one of the fastest-growing cities in the U.S.! At Arcarithm, we cultivate and foster an environment of integrity, open communication, work-life balance, and career development. We are committed to investing in our employees by offering comprehensive health insurance options, a generous 401(k) plan, competitive salaries, continuous career growth opportunities, flexible schedules including remote work, mentoring, and performance incentives.
Arcarithm is currently seeking top talent in the areas of full-stack software development, artificial intelligence, optimization, and data analytics. You will work in a dynamic and challenging environment alongside our customers, which include Lockheed Martin, General Dynamics, Northrop Grumman, Raytheon, the US Army, US Navy, US Air Force, the Missile Defense Agency, and NASA, on cutting-edge technologies including machine learning, augmented and virtual reality, big data analytics, and more!
We are excited to continue to change and improve the world through innovation and technology!
Contact us today to hear more about Arcarithm and all we offer!
Job Title: High Performance Computing (HPC) Expert
Job Location: Tullahoma, TN
Job Duties:
- Design, set up, and maintain large-scale Linux clusters for high performance computing applications.
- Troubleshoot and resolve complex technical issues that may arise within the HPC environment.
- Collaborate with cross-functional teams to ensure seamless integration of HPC systems into broader IT infrastructure.
- Develop, implement, and manage policies and procedures for HPC system administration.
- Monitor system performance and make recommendations for improvements or upgrades as needed.
- Stay abreast of the latest developments in HPC technology and apply this knowledge to improve our systems.
- Provide technical support and guidance to less experienced team members.
- Participate in project planning, execution, and post-mortem analysis for HPC initiatives.
- Contribute to the continuous improvement of our IT infrastructure by identifying areas for optimization and implementing solutions.
- Adhere to all company policies and procedures, as well as relevant industry standards and best practices.
Qualifications:
- Degree: HPC candidates should hold a Bachelor's or Master's degree in Computer Science, Electrical Engineering, Mathematics, Statistics, Robotics, Artificial Intelligence, Machine Learning, Data Science, or a related technical field from an accredited university.
- 6 to 10 years of experience in HPC system administration.
- Current U.S. citizenship is required.
- Strong knowledge of Linux operating systems and large-scale cluster management tools (e.g., Slurm, LSF).
- Proficiency in scripting languages such as Python, Perl, or Bash.
- Experience with parallel computing paradigms and high-performance interconnects (e.g., InfiniBand, RoCE).
- Strong problem-solving skills and the ability to work effectively under pressure.
- Excellent communication skills, both written and verbal.
- Ability to work collaboratively in a team environment as well as independently.
- Familiarity with cloud computing platforms (e.g., AWS, Azure) is a plus.
- Proactive approach to learning new technologies and staying current with industry trends.