Demo

Principal Data Engineer- Remote, USA

Ambry Genetics Corporation
California, MO Remote Full Time
POSTED ON 1/23/2025
AVAILABLE BEFORE 4/23/2025

Compensation : $180,000 - $200,000 per year. You are eligible to a Short-Term Incentive Plan with the target at 7.5% of your annual earnings, terms and conditions apply.

Read on to find out what you will need to succeed in this position, including skills, qualifications, and experience.

Principal Data Engineer - Remote USA

The Principal Data Engineer is responsible for driving the design, development, and implementation of Ambry's data infrastructure and solutions. This role will play a pivotal part in building and maintaining scalable, reliable, and efficient data pipelines, data warehouses, and data lakes. The Principal Data Engineer will collaborate closely with data architects, scientists, and analysts to ensure that data is accessible, secure, and aligned with business objectives. As a Principal Data Engineer at Ambry, you’ll approach tasks with a customer-based, cloud-first mindset to support and enhance various data platform products, including Ambry’s data lakes, streams, and warehouses. This role will be primarily responsible for building, monitoring, and operationalizing our data streams which are hydrated via CDC (change data capture) from a suite of 20 on-prem and cloud databases.

Essential Functions

  • Build Kafka connectors to sync updates from source data stores
  • Build partitioned Kafka topics to sync updates to destination data marts
  • Build multiplexed data analytics workloads using Apache Flink to monitor streaming metrics and perform real-time data transformations
  • Build dashboards using Datadog and Cloudwatch to ensure system health and user support
  • Build opinionated but accommodating schema registries that ensure data governance
  • Work closely with your West Coast based scrum team to submit and review PRs daily, maintain documentation and backlogs, validate builds across multiple environments, and deploy at a 2–4-week sprint cadence
  • Design reasonable database schemas with query access patterns as the forethought
  • Build and maintain CI / CD pipelines using infrastructure-as-code
  • Iteratively migrate on-prem ETL jobs written in PHP into AWS Flink and Glue processes
  • Partner with QA Engineers in building automated test suites
  • Partner with end-users to resolve service disruptions and evangelize our data product offerings
  • Vigilantly oversee data quality and alert upstream data producers of all disparities, latency, and defects
  • Develop and maintain the overall data platform architecture strategy, roadmap, and implementation plans to support the company's data-driven initiatives and business objectives.
  • Design and implement scalable, secure, and high-performance data architectures, including data warehouses, data lakes, and data pipelines, leveraging both on-premises and cloud technologies.
  • Establish data governance policies, standards, and best practices for data management, data quality, data security, and data privacy across the organization.
  • Lead the development and implementation of real-time data streaming solutions, including event-driven architectures, data ingestion, transformation, and consumption using technologies like Apache Kafka, Apache Flink, and AWS Managed Streaming for Kafka (MSK).
  • Oversee the creation and maintenance of Business Intelligence (BI) platforms, data visualization tools, and self-service analytics capabilities to enable data-driven decision-making across the organization.
  • Lead and manage a team of data engineers, database administrators, and data analysts, fostering their professional growth, promoting best practices, and ensuring adherence to organizational standards and processes
  • Other duties as assigned

Qualifications

  • Basic understanding of genomic concepts and terminology
  • Experience with PyFlink
  • Experience with AWS Kinesis
  • Willing to work PST hours between 8 : 00 AM - 5 : 00 PM or 9 : 00 AM – 6 : 00 PM
  • Strong familiarity with any combination of our tech stacks in order of importance : Apache Kafka (MSK flavor preferred), Debezium, Python, Apache Flink or PySpark Streaming, MySQL (RDS flavors preferred), Python, CDK or Terraform, Athena, Glue, Lambda, Appflow, HANA / 4, PHP, Redis, Docker, Javascript
  • Experience building data APIs and offering Data as a Service
  • Experience integrating with SaaS platforms such as SAP and Salesforce
  • Experience or willingness to learn working with PHP MVC frameworks such as Symfony
  • Experience with Atlassian products, i.e. Jira, Confluence, Bamboo
  • Experience with system diagramming tools such as Miro, LucidCharts, or Visio
  • 6 years’ experience working with professional scrum teams and / or equivalent schooling
  • 4 years’ experience using Git versioning control
  • 3 years’ experience designing and indexing relational databases
  • 2 years’ experience building and operationalizing real-time data streams
  • Bachelor’s or master’s degree in computer, data, math, or life sciences or equivalent work experience
  • Preferred

  • AWS Associate Solution Architect certification
  • AWS Data Engineer certification
  • About Us :

    Ambry Genetics Corporation is a CAP-accredited and CLIA-licensed molecular genetics laboratory based in Aliso Viejo, California. We are a genetics-based healthcare company that is dedicated to open scientific exchange so we can work together to understand and treat all human disease faster.

    At Ambry, everyone is welcome. A career at Ambry Genetics is a chance to be part of a dynamic company that aims to improve health by understanding the relationships between genetics and human disease. We earned our reputation as industry leaders by responsibly introducing cutting-edge genetic testing solutions and continually sharing what we learn with the global scientific community.

    At Ambry you will be learning, challenging yourself, and having fun while collaborating with teammates through the open exchange of ideas. Our outstanding benefits program includes medical, dental, vision, 401k with a 4% employer match, FSA, paid sick leave and generous paid time off (PTO) program.

    Ambry Genetics is an Equal Opportunity Employer (EOE) and we maintain a drug-free work environment.

    The Company believes in second chance employment. Qualified applicants with arrest or conviction history will be considered regardless of their arrest or conviction history, consistent with local laws such as Los Angeles County Fair Chance Ordinance and the California Fair Chance Act.

    For the purpose of the above job description, “Essential Functions” are “Material Job Duties”.

    Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

    All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity, gender expression, national origin, ancestry, age, marital status or protected veteran status and will not be discriminated against on the basis of disability, protected medical condition as defined by applicable state or local law, genetic information, or any other characteristic protected by applicable federal, state, or local laws and ordinances. If you have a disability or special need that requires accommodation, please contact us at careers@ambrygen.com

    Ambry does not accept unsolicited resumes from individual recruiters, third party recruiting agencies, outside recruiters or firms without an executed contract in place. We are not responsible for any fees related to resumes that are unsolicited or are received by Ambry. Such resumes will be deemed the sole property of Ambry and will be processed accordingly.

    PRIVACY NOTICES

    To review Ambry’s Privacy Notice, Click here : https : / / www.ambrygen.com / legal / privacy-policy

    To review the California privacy notice, click here : California Privacy Notice | Ambry Genetics

    To review the UKG privacy notice, click here : California Privacy Notice | UKG

    J-18808-Ljbffr

    Remote working / work at home options are available for this role.

    Salary : $180,000 - $200,000

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Principal Data Engineer- Remote, USA?

    Sign up to receive alerts about other jobs on the Principal Data Engineer- Remote, USA career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $168,522 - $211,152
    Income Estimation: 
    $189,259 - $248,928
    Income Estimation: 
    $168,522 - $211,152
    Income Estimation: 
    $189,259 - $248,928
    Income Estimation: 
    $71,122 - $96,652
    Income Estimation: 
    $92,929 - $122,443
    Income Estimation: 
    $92,929 - $122,443
    Income Estimation: 
    $122,257 - $154,284
    Income Estimation: 
    $122,257 - $154,284
    Income Estimation: 
    $143,391 - $179,890
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Ambry Genetics Corporation

    Ambry Genetics Corporation
    Hired Organization Address Aliso Viejo, CA Full Time
    Ambry Genetics Corporation dba Ambry Genetics seeks multiple positions (2) for Sr. Software Engineer in Aliso Viejo, CA....
    Ambry Genetics Corporation
    Hired Organization Address Aliso Viejo, CA Full Time
    Responsibilities include interpreting diagnostic test results, review and summary of relevant medical literature, and / ...
    Ambry Genetics Corporation
    Hired Organization Address Aliso Viejo, CA Full Time
    Compensation : $125,000-$,170,000 per year. You are eligible to a Short-Term Incentive Plan with the target at 7.5% of y...
    Ambry Genetics Corporation
    Hired Organization Address Greenville, SC Full Time
    Compensation: $95,000 - $110,000 per year. You are eligible for a Sales Incentive Plan with the target of $75,000 annual...

    Not the job you're looking for? Here are some other Principal Data Engineer- Remote, USA jobs in the California, MO area that may be a better fit.

    Principal Engineer

    7501 Vantive US Healthcare LLC USA, Mountain Home, AR

    Senior Data Partnerships Manager, USA (Remote)

    Incode Technologies, California, MO

    AI Assistant is available now!

    Feel free to start your new journey!