Demo

ML Quantization Engineer

POSTED ON 4/22/2025 AVAILABLE BEFORE 5/1/2025
SEMRON Dresden, Saxony Full Time

About the Role



We’re SEMRON, a venture-backed startup focused on redefining AI hardware for Edge devices. If you’re deep into quantization and enjoy working at the intersection of machine learning and hardware, we’d like to hear from you. In this role, you will be responsible for building a highly scalable inference framework for our future chip generations. You will participate in fundamental architectural decisions and have the opportunity to contribute to upstream open-source projects.



What you will do:

  • Develop and maintain an inference framework that’s tightly tuned for SEMRON hardware
  • Collaborate directly with ML, compiler, and hardware teams to refine and adapt quantization algorithms for our specific needs
  • Apply and innovate on the latest quantization methods like AdaRound, BRECQ, GPTQ, and QuaRot, bringing fresh ideas to SEMRON’s approach



What you should bring in:

  • Solid skills in PyTorch and experience with torch.FX, plus the know-how to write efficient, custom CUDA kernels
  • A solid understanding of current quantization research and hands-on experience with techniques that push performance.



Helpful but not required:

  • Experience with State-of-the-art NN compression methods like Adaround, QDrop, QUIP or GPTQ
  • Experience with typical tools used in ML environments like HuggingFace’s transformers or DeepSpeed

Popular Search Topics

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at SEMRON

SEMRON
Hired Organization Address Dresden, Full Time
About The Role In this role, you will be building the digital part of the SEMRON’s future chip generations. You will des...
SEMRON
Hired Organization Address Dresden, Full Time
About The Role In this role you are responsible for system level and integration verification of the designs, as they ar...
SEMRON
Hired Organization Address Dresden, Saxony Intern
About The Role In this position you will be responsible for designing test procedures and equipment for our custom semic...
SEMRON
Hired Organization Address Dresden, Full Time
Über Den Job Wir bei SEMRON sind ein junges Tech-Startup und entwickeln leistungsstarke Hardware, die es ermöglicht, KI-...

Not the job you're looking for? Here are some other ML Quantization Engineer jobs in the Dresden, Saxony area that may be a better fit.