Demo

DataHub Developer

Veridic Solutions
Austin, TX Full Time
POSTED ON 2/20/2025
AVAILABLE BEFORE 5/17/2025

Position Overview

We are looking for an experienced DataHub Developer with Committer Experience to join our team and contribute to the design, development, and optimization of enterprise metadata management and data lineage solutions. The ideal candidate will have strong expertise in data cataloging , data lineage , data governance , and hands-on experience with DataHub , Spark-based frameworks , and machine learning for anomaly detection. This role demands a mix of open-source contribution, technical problem-solving, and metadata management expertise.

Key Responsibilities

  • DataHub Development and Integration
  • Lead projects involving metadata cataloging using the DataHub open-source framework.
  • Design and develop custom APIs to integrate ETL pipelines and enable real-time metadata ingestion.
  • Ingest metadata from multiple systems, including data lakes, upstream, and downstream systems, to provide a holistic metadata ecosystem.
  • Customize and extend DataHub to enrich impact analysis by identifying pipelines reading / writing to data assets.
  • Data Lineage and Governance Implementation
  • Provide end-to-end data lineage solutions for PII identification, governance, and compliance reporting.
  • Develop and implement processes to enhance impact analysis and ensure seamless data governance practices.
  • Spark-Based Framework Development
  • Design, develop, and maintain Spark-based custom frameworks for config-as-code mechanisms to facilitate data enrichment and transfer.
  • Improve the performance and scalability of Spark applications to ensure seamless data processing.
  • Provide recommendations and guidance on the design and development of ETL pipelines using Spark.
  • Machine Learning Integration for Anomaly Detection
  • Collaborate with ML engineers to create features from profiled batch data.
  • Develop and integrate machine learning models for anomaly detection in data patterns.
  • AWS Cost Optimization and Platform Efficiency
  • Lead AWS cost optimization initiatives to enhance platform-wide efficiency.
  • Successfully support Spark version upgrades and ensure the platform's scalability and performance.
  • Community Engagement and Contributions
  • Act as a committer to the DataHub open-source community by contributing new features, fixing issues, and enhancing documentation.
  • Participate in open-source discussions, propose architectural improvements, and represent the organization in community events.

Required Qualifications

  • Experience :
  • 5 years in metadata management, data lineage, or data governance roles.
  • Proven track record as a committer or active contributor to the DataHub open-source project.
  • Technical Skills :
  • Proficiency in Java , Python , and REST API development.
  • Strong experience with Apache Spark for ETL pipeline design and custom framework development.
  • Expertise in metadata ingestion from systems like data lakes, databases, and ETL tools.
  • Hands-on experience with AWS services and cost optimization strategies.
  • Familiarity with machine learning techniques for anomaly detection.
  • Other Skills :
  • Strong analytical and problem-solving skills.
  • Excellent communication and collaboration abilities.
  • Preferred Qualifications

  • Knowledge of data governance regulations like GDPR , CCPA , or HIPAA .
  • Experience with infrastructure-as-code tools such as Terraform or Helm .
  • Familiarity with other metadata management tools like Amundsen , Collibra , or Alation .
  • Understanding of version control, CI / CD pipelines, and open-source development practices.
  • If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a DataHub Developer?

    Sign up to receive alerts about other jobs on the DataHub Developer career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $91,609 - $116,575
    Income Estimation: 
    $115,838 - $142,817
    Income Estimation: 
    $114,981 - $143,201
    Income Estimation: 
    $88,359 - $121,264
    Income Estimation: 
    $93,716 - $124,745
    Income Estimation: 
    $118,976 - $146,289
    Income Estimation: 
    $112,672 - $149,113
    Income Estimation: 
    $98,475 - $115,895
    Income Estimation: 
    $114,981 - $143,201
    Income Estimation: 
    $129,640 - $165,363
    Income Estimation: 
    $112,672 - $149,113
    Income Estimation: 
    $115,719 - $153,093
    Income Estimation: 
    $137,343 - $165,639
    Income Estimation: 
    $135,811 - $184,429
    Income Estimation: 
    $120,390 - $162,969
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Veridic Solutions

    Veridic Solutions
    Hired Organization Address Dallas, TX Full Time
    Job Details Role Title: Manufacturing Application Managed Services Specialist Location : Dallas, TX Client Special Requi...
    Veridic Solutions
    Hired Organization Address North Reading, MA Contractor
    REQUIRED SKILLS: BSc/MSc in Aerospace, Mechanical Engineering, Mechatronics, Robotics, or equivalent Experience designin...
    Veridic Solutions
    Hired Organization Address St Louis, MO Contractor
    Key Responsibilities: Requirement Analysis: Review and analyze business requirements, user stories, and technical specif...
    Veridic Solutions
    Hired Organization Address Charlotte, NC Contractor
    Preferred Qualifications Certification in RPA tools (UiPath Advanced Developer, Blue Prism Developer, etc.). Experience ...

    Not the job you're looking for? Here are some other DataHub Developer jobs in the Austin, TX area that may be a better fit.

    Site Surveyor

    Ion Developer Llc, Kyle, TX

    Assistant Project Manager - Multifamily Construction

    Multifamily Real Estate Owner and Developer, Austin, TX

    AI Assistant is available now!

    Feel free to start your new journey!