What are the responsibilities and job description for the Lakehouse Developer - Remote position at Nava Software Solutions?
Job Details
NAVA Software is looking for a Lakehouse Developer
Details:
Lakehouse Developer
Location: Remote
Duration: 6 months CTH
Position Summary:
The Lakehouse Developer is responsible for designing, implementing, and maintaining data lakehouse solutions that integrate data storage and analytics. This role involves collaborating with data engineers and analysts to enable data workflows and ensure data integrity. The developer will leverage tools and technologies for efficient data processing and analysis.
Minimum Qualifications:
- 6 Years overall IT experience with minimum 4 years of work experience in below tech skills
- Work experience in data lakehouse in any of these: Apache Iceberg, Databricks, Delta Lake.
- Proficient in Python scripting and PySpark for data processing tasks.
- Strong SQL capabilities, with hands-on experience managing big data using ETL tools.
- Experience with the AWS cloud platform and its data services
- Skilled in BASH/Shell scripting.
- Preferred: Experience with Kafka and Mulesoft API.
- Understanding of healthcare data systems is a plus.
- Experience in Agile methodologies.
- Strong analytical and problem-solving skills.
- Effective communication and teamwork abilities.
Responsibilities:
- Implement lakehouse architecture to integrate and optimize data storage and analytics processes.
- Develop ETL pipelines for efficient data ingestion, transformation, and loading from various sources.
- Optimize query performance by analyzing and tuning data workflows to minimize latency.
- Implement data governance policies to ensure compliance and protect sensitive data.
- Collaborate with data teams to gather requirements and deliver solutions that enhance data accessibility.