Demo

Founding Data Infrastructure Engineer (Query Engine)

zaimler
San Mateo, CA Full Time
POSTED ON 2/26/2025
AVAILABLE BEFORE 5/23/2025

About Us

We are on a mission to bridge the gap between enterprise business knowledge and data, democratizing data discovery and curation to prepare organizations for the era of generative AI. Today's data tools are overly complex, poorly integrated, and siloed, forcing AI Practitioners and data scientists alike to spend more time wrestling with tools, relying on tribal knowledge, and navigating data lakes rather than doing meaningful data science work. The current landscape of data tools and processes is heavily manual and needs to catch up with the vast amount of data generated daily. With the advent of Gen AI and multi-modality, this challenge has only grown more complex and broken.

Backed by top VC funds, we are committed to making enterprise data AI-ready faster, more reliably, and with a stronger foundation of factual semantic knowledge. This leads to more accurate models, superior outcomes, and better business results. Our team of seasoned data infrastructure and machine learning experts (from LinkedIn, Visa, Truera, Hive, and Branch) has spent the past two decades building bespoke systems to solve these very challenges.

Join our growing team of ML research and data infrastructure experts. We're committed to empowering AI and data scientists to seamlessly integrate semantic learning with generative AI. Be part of our journey to shape the future of enterprise AI.

Who You Are

  • Thrives in early-stage environments, eager to build robust systems from scratch.
  • Passionate about distributed systems and solving complex data challenges at scale.
  • Able to navigate ambiguity and adapt to changing requirements in a fast-paced startup.
  • Advocates for engineering efficiency and continuous improvement.
  • A leader who enjoys mentoring others and fostering a strong engineering culture.
  • Excited to work cross-functionally with a team that values transparency, purpose-driven innovation, and collective leadership.

What You Will Be Doing

  • Design and Develop : Build scalable, fault-tolerant query engines optimized for performance and resource efficiency.
  • Optimize Performance : Apply advanced techniques such as vectorized processing, cost-based optimization, and caching to enhance query execution.
  • Integrate Seamlessly : Develop integrations with modern data lake formats (e.g., Apache Iceberg, Delta Lake, and Hudi) and semantic layers.
  • Innovate in Distributed Systems : Architect solutions to handle concurrency, scalability, and reliability in distributed environments.
  • Leverage Open Source : Contribute to or extend platforms like Apache Spark, Presto, and Trino to meet unique product requirements.
  • Collaborate : Work with product, data science, and engineering teams to align technical solutions with business needs.
  • Stay Current : Research and implement the latest advancements in query processing and distributed systems.
  • Prior Experience

  • Proficiency in programming languages such as Java, Scala, Rust, or C .
  • Deep understanding of query engine internals, distributed systems architecture, and parallel query processing.
  • Experience with modern big data technologies (e.g., Apache Spark, Presto, Trino) and data formats like Parquet, ORC, or Avro.
  • Proven ability to build and optimize scalable systems capable of processing petabyte-scale datasets.
  • Strong grasp of SQL semantics, execution plans, and query optimization techniques.
  • Nice to Have

  • Experience in building AI / ML infrastructure and ML production systems at scale.
  • Hands-on experience with Linux, Docker, and other containerization technologies.
  • Prior experience at an early-stage startup, developing systems and processes from scratch.
  • Why Join Us?

    We're a fast-moving, well-funded startup based in San Mateo, working onsite with flexible hours because the best ideas happen when smart people collaborate in person. We take ownership of our work, move with urgency while maintaining quality, and focus on delivering real results-not just effort. We offer competitive compensation, equity, full benefits (Medical, Dental, Vision, 401k), and a workspace built for collaboration, transparency, and deep technical problem-solving.

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Founding Data Infrastructure Engineer (Query Engine)?

    Sign up to receive alerts about other jobs on the Founding Data Infrastructure Engineer (Query Engine) career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $154,597 - $194,610
    Income Estimation: 
    $172,688 - $210,712
    Income Estimation: 
    $170,589 - $211,671
    Income Estimation: 
    $178,619 - $225,190
    Income Estimation: 
    $86,891 - $130,303
    Income Estimation: 
    $103,114 - $138,258
    Income Estimation: 
    $118,163 - $145,996
    Income Estimation: 
    $120,777 - $151,022
    Income Estimation: 
    $129,363 - $167,316
    Income Estimation: 
    $86,891 - $130,303
    Income Estimation: 
    $85,996 - $102,718
    Income Estimation: 
    $111,859 - $131,446
    Income Estimation: 
    $110,457 - $133,106
    Income Estimation: 
    $105,809 - $128,724
    Income Estimation: 
    $122,763 - $145,698
    Income Estimation: 
    $105,809 - $128,724
    Income Estimation: 
    $136,611 - $163,397
    Income Estimation: 
    $135,163 - $163,519
    Income Estimation: 
    $131,953 - $159,624
    Income Estimation: 
    $150,859 - $181,127
    Income Estimation: 
    $129,363 - $167,316
    Income Estimation: 
    $145,845 - $177,256
    Income Estimation: 
    $147,836 - $182,130
    Income Estimation: 
    $154,597 - $194,610
    Income Estimation: 
    $86,891 - $130,303
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Not the job you're looking for? Here are some other Founding Data Infrastructure Engineer (Query Engine) jobs in the San Mateo, CA area that may be a better fit.

    AI Assistant is available now!

    Feel free to start your new journey!