Sorry! This job is no longer available. Please explore similar jobs listed on the left.
Anthropic is Hiring a Software Engineer, Systems Near San Francisco, CA
About the role:
Anthropic is seeking an experienced engineer for our Systems team. You'll lead initiatives supporting some of the largest, most sophisticated clusters in industry used to train, research, and ultimately serve AI models. Your work will be crucial in ensuring Anthropic is able to continue reliably and safely training frontier models!
Responsibilities:
Lead build out of industry-leading AI clusters (thousands to hundreds of thousands of machines), partnering closely with cloud service providers on cluster build out and required features
Consult with different stakeholders to deeply understand infrastructure and compute needs, identifying potential solutions to support frontier research and product development
Set technical strategy and oversee development of high scale, reliable infrastructure systems
Mentor top technical talent
Design processes (e.g. postmortem review, incident response, on-call rotations) that help the team operate effectively and never fail the same way twice
You may be a good fit if you:
8 industry experience, as well as 3 years of experience leading large scale, complex projects or teams as an engineer or tech lead
Are obsessed with infrastructure reliability, scalability, security, and continuous improvement
Have a passion for supporting internal partners like research to understand their needs
Have excellent communication skills to build consensus with stakeholders, both internally and externally
Possess deep knowledge of modern cloud infrastructure including Kubernetes, Infrastructure as Code, AWS, and GCP
Strong candidates may also:
Have security and privacy best practice expertise
Experience with machine learning infrastructure like GPUs, TPUs, or Trainium, as well as supporting networking infrastructure like NCCL
Low level systems experience, for example linux kernel tuning and eBPF
Technical expertise: Quickly understanding systems design tradeoffs, keeping track of rapidly evolving software systems
Deadline to apply: None. Applications will be reviewed on a rolling basis.