Demo

Machine Learning Research Engineer

Etched
Cupertino, CA Full Time
POSTED ON 1/26/2025
AVAILABLE BEFORE 4/24/2025

Job Description

Job Description

About Etched

Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Etched Labs is the organization within Etched whose mission is to democratize generative AI, pushing the boundaries of what will be possible in a post-Sohu world.

Key responsibilities

  • Propose and conduct novel research to achieve results on Sohu that are unviable on GPUs
  • Translate core mathematical operations from the most popular Transformer-based models into maximally performant instruction sequences for Sohu
  • Develop deep architectural knowledge informing best-in-the-world software performance on Sohu HW, collaborating with HW architects and designers.
  • Co-design and finetune emerging model architectures for highest efficiency on Sohu
  • Guide and contribute to the Sohu software stack, performance characterization tools, and runtime abstractions by implementing frontier models using Python and Rust.

Representative projects

  • Propose and implement a novel test time compute algorithm that leverages Sohu's unique capabilities to unlock a product could never be achieved on a typical GPU
  • Implement diffusion models on Sohu to achieve GPU-impossible latencies that allow for real-time image generation
  • Optimize model instructions and scheduling algorithms to optimize for utilization, latency, throughput, and / or a mix of these metrics.
  • Implement model-specific inference-time acceleration techniques such as speculative decoding, tree search, KV cache sharing, priority scheduling, etc by interacting with the rest of the inference serving stack.
  • You may be a good fit if you have

  • An ML Research background with interests in HW co-design
  • Experience with Python, Pytorch, and / or JAX
  • Familiarity with transformer model architectures and / or inference serving stacks (vLLM, SGLang, etc.) and / or experience working in distributed inference / training environments
  • Experience working cross-functionally in diverse software and hardware organizations
  • Strong candidates may also have

  • ML Systems Research and HW Co-design backgrounds
  • Published inference-time compute research and / or efficient ML research
  • Experience with Rust
  • Familiarity with GPU kernels, the CUDA compilation stack and related tools, or other hardware accelerators
  • Benefits

  • Full medical, dental, and vision packages, with 100% of premium covered
  • Housing subsidy of $2,000 / month for those living within walking distance of the office
  • Daily lunch and dinner in our office
  • Relocation support for those moving to Cupertino
  • How we're different

    Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

    We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

    Salary : $2,000

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Machine Learning Research Engineer?

    Sign up to receive alerts about other jobs on the Machine Learning Research Engineer career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $123,167 - $152,295
    Income Estimation: 
    $146,673 - $180,130
    Income Estimation: 
    $103,114 - $138,258
    Income Estimation: 
    $118,163 - $145,996
    Income Estimation: 
    $120,777 - $151,022
    Income Estimation: 
    $129,363 - $167,316
    Income Estimation: 
    $86,891 - $130,303
    Income Estimation: 
    $113,077 - $147,784
    Income Estimation: 
    $135,356 - $164,911
    Income Estimation: 
    $153,902 - $198,246
    Income Estimation: 
    $98,763 - $126,233
    Income Estimation: 
    $116,330 - $143,011
    Income Estimation: 
    $113,077 - $147,784
    Income Estimation: 
    $116,330 - $143,011
    Income Estimation: 
    $135,356 - $164,911
    Income Estimation: 
    $153,902 - $198,246
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Etched

    Etched
    Hired Organization Address Cupertino, CA Full Time
    About Etched : Etched is building AI chips that are hard-coded for individual model architectures. Our first product (So...
    Etched
    Hired Organization Address Cupertino, CA Full Time
    Job Description Job Description About Etched Etched is building AI chips that are hard-coded for individual model archit...
    Etched
    Hired Organization Address Cupertino, CA Full Time
    About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu...
    Etched
    Hired Organization Address Cupertino, CA Full Time
    About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu...

    Not the job you're looking for? Here are some other Machine Learning Research Engineer jobs in the Cupertino, CA area that may be a better fit.

    Back-End / Machine-Learning Engineer

    Protogon Research, Menlo, CA

    Senior Machine Learning Engineer

    OPPO US Research Center, Palo Alto, CA

    AI Assistant is available now!

    Feel free to start your new journey!