Demo

Principal Data Engineer

Nava Software Solutions
Houston, TX Full Time
POSTED ON 2/22/2025
AVAILABLE BEFORE 4/22/2025

Job Details

NAVA Software solutions is looking for a Principal Data Engineer

Details:

Principal Data Engineer

Location: Houston, TX 4 days/week onsite

Duration: Full time / Direct Hire


The Principal Data Engineer within the Data Science and Analytics team, plays a crucial role in architecting, implementing, and managing robust, scalable data platforms. This position demands a blend of cloud data engineering, systems engineering, data integration, and machine learning systems knowledge to enhance GST's data capabilities, supporting advanced analytics, machine learning projects, and real-time data processing needs. You will guide other team members and collaborate closely with cross-functional teams to design and implement modern data solutions that enable data-driven decision-making across the organization.

As a Principal Data Engineer you will:

  • Collaborate with Business, and IT functional experts to gather requirements or issues, perform gap analysis and recommend/implement process and/or technology improvements to optimize data solutions.
  • Design data solutions on Databricks including Delta Lake, Data Warehouse, Data Mart and others to support the data science and analytical needs of the organization.
  • Design and implement scalable and reliable data pipelines to ingest, process, and store diverse data at scale, using technologies such as Databricks, Apache Spark, Kafka, Flink, AWS Glue or other AWS services.
  • Work within cloud environments like AWS to leverage services including but not limited to EC2, RDS, S3, Athena, Glue, Lambda, EMR, Kinesis, and SQS for efficient data handling and processing.
  • Develop and optimize data models and storage solutions (SQL, NoSQL, Key-Value DBs, Data Lakes) to support operational and analytical applications, ensuring data quality and accessibility.
  • Utilize ETL tools and frameworks (e.g., Apache Airflow, Talend) to automate data workflows, ensuring efficient data integration and timely availability of data for analytics.
  • Implement pipelines with a high degree of automation for data workflows and deployment pipelines using tools like Apache Airflow, Terraform, and CI/CD frameworks.
  • Collaborate closely with business analysts, data scientists, machine learning engineers, and optimization engineers, providing the data infrastructure and tools needed for complex analytical models, leveraging Python, scala or R for data processing scripts.
  • Ensure compliance with data governance, compliance and security policies, implementing best practices in data encryption, masking, and access controls within a cloud environment.
  • Establish best practices for code documentation, testing, and version control, ensuring consistent and reproductive data engineering practices across the team.
  • Monitor and troubleshoot data pipelines and databases for performance issues, applying tuning techniques to optimize data access and throughput.
  • Ensure efficient usage of AWS and Databricks resources to minimize costs while maintaining high performance and scalability.
  • Cross functional work understanding data landscape, developing proof of concepts, and demonstrating to stakeholders.
  • Leads one or more data projects and support with internal and external resources. Coach and mentor junior data engineers.
  • Stay abreast of emerging technologies and methodologies in data engineering, advocating for and implementing improvements to the data ecosystem.

What We Need From You

  • Bachelor's Degree Computer Science, Data Science, MIS, Engineering, Mathematics, Statistics or other quantitative discipline with 5-8 years of hands-on experience in data engineering, with a proven track record in designing and operating large-scale data pipelines and architectures Req
  • Proven experience designing scalable, fault-tolerant data architecture and pipelines on Databricks delta lake, lakehouse, unity catalog, streaming, AWS, ETL/ELT development and data modeling, with a focus on performance optimization and maintainability Required
  • Deep experience of platforms and services like Databricks, and AWS native data offerings Required
  • Solid experience with big data technologies (Databricks, Apache Spark, Kafka) and AWS cloud services related to data processing and storage Required
  • Strong hands-on experience with ETL/ELT pipeline development using AWS tools and Databricks Workflows Required
  • Strong experience in AWS cloud services, with hands-on experience in integrating cloud storage and compute services with Databricks Required
  • Proficient in SQL and programming languages relevant to data engineering (Python, Java, Scala Required
  • Hands on RDBMS and data warehousing experience (data modeling, analysis, programming, stored procedures) Required
  • Good understanding of system architecture and design patterns to design and develop applications using these principles Required
  • Proficiency with version control systems like Git and experience with CI/CD pipelines for automating data engineering deployments Required
  • Familiarity with machine learning model deployment and management practices is a plus Preferred
  • Experience with SAP, BW, HANA, Tableau, or Power BI is a plus Preferred
  • Experience with auto, manufacturing, or supply chain industries is a plus Preferred
  • Project life-cycle leadership and support for requirement workshop, design, development, test cycles and production cutover, post-go live support, and environment strategy. Strong knowledge of agile methodologies Required
  • Strong communication skills, capable of collaborating effectively across technical and non-technical teams in a fast-paced environment. Required
  • AWS Certified Solution Architect Preferred
  • Databricks Certified Associate Developer for Apache Spark Preferred or other relevant certifications. Preferred

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Principal Data Engineer?

Sign up to receive alerts about other jobs on the Principal Data Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$168,522 - $211,152
Income Estimation: 
$189,259 - $248,928
Income Estimation: 
$168,522 - $211,152
Income Estimation: 
$189,259 - $248,928
Income Estimation: 
$71,122 - $96,652
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$143,391 - $179,890
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Nava Software Solutions

Nava Software Solutions
Hired Organization Address Berkeley Heights, NJ Full Time
Job Description Job Description NAVA Software solutions is looking for a Full Stack .Net Web Developer Details : Full St...
Nava Software Solutions
Hired Organization Address Houston, TX Full Time
Job Details NAVA Software solutions is looking for a SAP Solutions Architect -Finance Details: SAP Solution Architect - ...
Nava Software Solutions
Hired Organization Address Sugar, TX Full Time
Job Details NAVA Software solutions is looking for an Integration Developer Details: Senior Developer Integrations Locat...
Nava Software Solutions
Hired Organization Address Houston, TX Full Time
Job Details NAVA Software solutions is looking for a Data and Analytics Analyst Details: Data and Analytics Analyst - Te...

Not the job you're looking for? Here are some other Principal Data Engineer jobs in the Houston, TX area that may be a better fit.

Principal Data Engineer

Gulf States Toyota, Missouri, TX

Principal Data Engineer

Gulf States Toyota, Simonton, TX

AI Assistant is available now!

Feel free to start your new journey!