What are the responsibilities and job description for the Software Engineer, AI Networking, Machine Learning Infrastructure position at Tesla?
As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and the Tesla Bot.
At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critical to our mission.
We are optimizing the communication collectives used in AI training and inference workloads to ensure they are robust and performant while improving observability.
Responsibilities
- Identify gaps and optimize the performance of the collective communication libraries used in the training software stack
- Build infrastructure to improve observability into the collective communication libraries to significantly reduce cognitive load in debugging massively distributed training jobs
- Optimize the AI network software stack with respect to the network topology of our AI supercomputing clusters
- Develop and integrate various health checks to the fault tolerance training infrastructure
- Collaborate with the supercomputing and research team to ensure requirements on network bandwidth and topology for modern AI workloads are met
Requirements
Compensation and Benefits
Benefits
Along with competitive pay, as a full-time Tesla employee, you are eligible for the following benefits at day 1 of hire :
2 medical plan options with $0 payroll deduction
Expected Compensation
104,000 - $360,000 / annual salary cash and stock awards benefits
Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.
Salary : $104,000 - $360,000