Demo

Machine Learning Research Engineer - Etched Labs

Etched
Cupertino, CA Full Time
POSTED ON 1/26/2025
AVAILABLE BEFORE 4/23/2025

Job Description

Job Description

About Etched

Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Etched Labs is the organization within Etched whose mission is to democratize generative AI, pushing the boundaries of what will be possible in a post-Sohu world.

Key responsibilities

  • Propose and implement novel research to do things that would be impossible on GPUs
  • Translate the mathematical operations of the most popular Transformer-based models into instructions that optimally run the models on Sohu
  • Develop deep knowledge of the architecture and design of Sohu in collaboration with HW architects and designers.
  • Co-design and finetune specific model architectures to further their efficiency on Sohu
  • Contribute to the design of the Sohu software stack and the tools and abstractions needed to implement models using Python and some Rust.

Representative projects

  • Propose and implement a novel test time compute algorithm that leverages Sohu's unique capabilities to unlock a product could never be achieved on a typical GPU
  • Implement a diffusion model on Sohu in such a way that achieves X% utilization, Y token / sec throughput and Z seconds of TTFT latency
  • Optimize the model instructions and scheduling algorithm to optimize for latency, throughput, or specific situations such as speculative decoding.
  • Implement model-specific inference-time acceleration techniques such as speculative decoding, tree search, KV cache sharing, etc.
  • You may be a good fit if you have

  • An ML Research background with interests in HW co-design
  • Experience with Python, Pytorch, and / or JAX
  • Familiarity with transformer model architectures and inference serving stacks (vLLM, SGLang, etc.) or experience working in distributed inference / training environments
  • Experience working cross-functionally in diverse software and hardware organizations
  • Strong candidates may also have

  • ML Systems Research and HW Co-design backgrounds
  • Published inference time compute research and / or efficient ML research
  • Experience with Rust
  • Familiarity with GPU kernels, the CUDA compilation stack and related tools, or other hardware accelerators
  • Benefits

  • Full medical, dental, and vision packages, with 100% of premium covered
  • Housing subsidy of $2,000 / month for those living within walking distance of the office
  • Daily lunch and dinner in our office
  • Relocation support for those moving to Cupertino
  • How we're different

    Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

    We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

    Salary : $2,000

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Machine Learning Research Engineer - Etched Labs?

    Sign up to receive alerts about other jobs on the Machine Learning Research Engineer - Etched Labs career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $123,167 - $152,295
    Income Estimation: 
    $146,673 - $180,130
    Income Estimation: 
    $135,356 - $164,911
    Income Estimation: 
    $153,053 - $187,211
    Income Estimation: 
    $153,902 - $198,246
    Income Estimation: 
    $113,077 - $147,784
    Income Estimation: 
    $135,356 - $164,911
    Income Estimation: 
    $153,902 - $198,246
    Income Estimation: 
    $98,763 - $126,233
    Income Estimation: 
    $116,330 - $143,011
    Income Estimation: 
    $113,077 - $147,784
    Income Estimation: 
    $116,330 - $143,011
    Income Estimation: 
    $135,356 - $164,911
    Income Estimation: 
    $153,902 - $198,246
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Etched

    Etched
    Hired Organization Address Cupertino, CA Full Time
    About Etched : Etched is building AI chips that are hard-coded for individual model architectures. Our first product (So...
    Etched
    Hired Organization Address Cupertino, CA Full Time
    Job Description Job Description About Etched Etched is building AI chips that are hard-coded for individual model archit...
    Etched
    Hired Organization Address Cupertino, CA Full Time
    About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu...
    Etched
    Hired Organization Address Cupertino, CA Full Time
    About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu...

    Not the job you're looking for? Here are some other Machine Learning Research Engineer - Etched Labs jobs in the Cupertino, CA area that may be a better fit.

    Machine Learning Research Engineer

    Etched, Cupertino, CA

    Machine Learning Researcher

    ETCHED LLC, Cupertino, CA

    AI Assistant is available now!

    Feel free to start your new journey!