Senior Software Engineer - Hail Team

Broad Institute
Cambridge, MA Full Time
POSTED ON 2/25/2025
AVAILABLE BEFORE 5/13/2025

Description & Requirements

At the Broad Institute broadly and within the Neale Lab specifically, we leverage statistical and software techniques to understand the mechanisms of disease from extremely large datasets generated by scalable sequencing technologies. The lab and Institute are entering an age of one million sequences, millions of transcriptomes, tens of thousands of medical images, and complete medical records. The development of scalable scientific assays has transformed biological engineering problems into software engineering ones. We seek a senior software engineer to help solve those problems.

This team develops, maintains, and operates Hail, a suite of libraries, data systems, and services for analyzing the world's largest genome sequencing datasets. Hail supports scientists from individual sequences through the production of a sequencing matrix, the calculation of per-row and per-column statistics, distributed matrix multiplications to search for genetic relatedness, preparation of thousands of phenotypes per sequence, regression to search for genetic associations with phenotypes, and subsetting and export for distribution to collaborators. It also serves as a data store for web-based data browsers and rare disease diagnostic support systems.

The team faces three major challenges in the coming years. First, the largest sequencing callset has doubled every year since 2003 and the next doubling is anticipated in 2025. Second, the phenotypes have grown from binary disease status tables to medical records, medical images, and cellular assays. Third, the project must adapt to the changing hardware landscape, new scientific-analytical techniques, and new analytical databases.

Hail's two core products are Query and Batch, both of which are open source and openly developed. We are seeking a Senior Software Engineer to focus primarily on Batch. Batch is a cost-metered, multi-tenant, spot-tolerant, elastic, horizontally-scalable compute engine. The team operates an installation of Batch as a Software-as-a-Service for a community of hundreds of scientists within the Broad Institute.

Batch is implemented in Python. The control plane is deployed on Kubernetes, and the compute plane is a directly managed set of VMs. Batch relies on many technologies, including OCI container images, crun, Google and Azure cloud storage, Google and Azure VM APIs, Google and Azure container registry APIs, Grafana, Prometheus, OAuth2, MySQL, Envoy, and asyncio.

Responsibilities

  • Work with scientists and software engineers to realize transformative scientific goals.
  • Design, implement, test, tune, document, deploy, operate, maintain and support new features, analysis methods, and infrastructure.
  • Operate and maintain computing infrastructure and software deployments.
  • Participate in constructive code review and share best practices with team members.
  • Mentor junior engineers and interns.
  • Contribute to system architecture and design.
  • Refine software development processes and best practices.

Requirements

  • Expertise in Python as our primary programming language.
  • B.S. or B.A. in Computer Science or related field.
  • 5 years of industry experience working as part of a software team.
  • Experience designing and developing one of: compilers, query planners, or distributed systems.
  • Understanding of computer science fundamentals.
  • Facility with "tools of the trade", e.g., Unix system administration, shell scripting, build and deployment tools, version control, etc.
  • Ability to meet deadlines and work cooperatively in a small, collaborative team with limited formal processes.
  • In addition to Python, our current technology stack also includes the JVM, Scala, GCP, Azure, and C++. Our domain knowledge includes machine learning, bioinformatics, statistical genetics, compilers, and theoretical math. Hires need not have experience with every aspect of our technologies and domains.

    Our website: https://hail.is. Our GitHub: https://github.com/hail-is/hail.
