What are the responsibilities and job description for the Infrastructure Software Engineer position at Enfabrica?
Join an ambitious, experienced team of silicon and distributed systems experts as an Infrastructure Software Engineer. You have the opportunity to build a groundbreaking new category of product that revolutionizes the performance and scalability of next-generation distributed computing systems, and to help solve key infrastructure challenges facing our customers.
We are looking for talented, motivated candidates with experience building groundbreaking, novel computing and networking systems for large-scale distributed computing environments, who are looking to grow in a fast-paced, dynamic startup environment.
The Infrastructure Software Engineer is responsible for designing, implementing, and maintaining infrastructure solutions for the company, such as our HPC build cluster, Kubernetes and Nomad clusters, and remote developer environments. This role requires a solid understanding of cloud computing platforms, infrastructure technologies, and Linux operating systems. Our Infrastructure team supports both hardware and software teams within the organization.
This position is eligible for remote work.
Responsibilities:
- Work closely with hardware and software teams to understand their needs and develop solutions that meet them. This requires being able to learn and understand multiple domains (hardware design, verification, embedded software, etc.) at a high level.
- Manage, maintain, and monitor cloud-based solutions as well as on-prem systems
- Collaborate on our HPC system which runs our EDA workloads
- Troubleshoot and resolve infrastructure issues such as degraded performance, loss of connectivity, and build failures
- Participate in code reviews and provide feedback to other engineers
Qualifications:
- Bachelor's degree in Computer Science, Software Engineering, or a related field, or applicable experience
- 3 years of experience working with cloud computing platforms (AWS or GCP)
- 3 years of experience with infrastructure technologies (Kubernetes, CI/CD, etc.)
- Fluent in Go
- Proficient with scripting languages such as Bash and Python
- Some proficiency with Terraform and/or Puppet
- Familiar with Grafana and Prometheus
- Solid understanding of Docker containers
- Experience with Linux operating systems (Debian and RHEL)
- Strong problem-solving and analytical skills
- Excellent communication and teamwork skills
Experience supporting hardware teams is a plus!
About Us:
Enfabrica is on a mission to revolutionize AI compute systems and infrastructure at scale through the development of superior-scaling networking silicon and software, which we call the Accelerated Compute Fabric. Founded and led by an executive team assembled from first-class semiconductor and distributed systems/software companies throughout the industry, Enfabrica sets itself apart from other startups with a very strong engineering pedigree, a proven track record of delivering, deploying and scaling products in data center production environments, and significant investor support for our ambitious journey! With our differentiated approach to solving the I/O bottlenecks in distributed AI and accelerated compute clusters, Enfabrica is unleashing the revolution in next-gen computing fabrics.