What are the responsibilities and job description for the Staff ML Architect: Assembly Programming and Performance Engineer position at Arm limited?
Job Overview :
High-performance ML workloads on Arm CPUs requires the co-development of algorithms and highly optimized CPU kernels. In CT-ML (Central Technology, Machine Learning), rapid kernel prototyping is crucial for exploring algorithms and assessing trade-offs between model accuracy and performance. Successful prototypes are essential to drive future CPU architecture development and also deliverables to Central Engineering for final production.
Increase your chances of reaching the interview stage by reading the complete job description and applying promptly.
Responsibilities :
This position is part of a dedicated team within the CT-ML group to focus on analyzing ML workload, rapid prototyping of highly optimized CPU kernels to drive model performance and accuracies.
Required Skills and Experience :
- Strong interest and passion for implementing high-performance kernel code in a dynamic environment.
- 4 years experience in implementing high-performance CPU kernel with vector and matrix extensions.
- Experience measuring and understanding performance.
- Experience in creating an efficient kernel code development framework including tools and testing.
- Deep understanding of CPU architecture.
Nice To Have” Skills and Experience :
Salary Range : $185,491-$250,958 per year
We value people as individuals and our dedication is to reward people competitively and equitably for the work they do and the skills and experience they bring to Arm. Salary is only one component of Arm's offering. The total reward package will be shared with candidates during the recruitment and selection process.
J-18808-Ljbffr
Salary : $185,491 - $250,958