What are the responsibilities and job description for the Data Engineer position at DPP Tech, Inc.?
Job Details
We're Hiring: Data Engineer, Pleasanton, CA (Hybrid, moving to fully Onsite from September)
Are you a skilled Data Engineer with expertise in Python, PySpark, and Databricks? Do you thrive in a fast-paced, high-volume data environment? If so, we have an exciting opportunity for you!
Location: Pleasanton, CA (Onsite 3 days/week, transitioning to 5 days from September)
Work Hours: PST hours, with occasional weekend on-call rotation
Key Responsibilities:
Design, implement, and optimize high-volume, low-latency data pipelines
Work with Python, PySpark, and Databricks for large-scale data processing
Build and manage batch/analytics pipelines on cloud platforms (Azure preferred)
Develop RESTful APIs using Java & Spring Framework
Write complex SQL queries and work with relational and NoSQL databases (PostgreSQL, HBase)
Implement CI/CD workflows using Gradle, Argo CD, GitHub Actions, and Kubernetes
Collaborate with cross-functional teams to improve data infrastructure
What We're Looking For:
4-8 years of experience in Data Engineering
Strong expertise in Python, PySpark, Databricks, and Spark
Experience with Azure or other major cloud platforms
Knowledge of HDFS, distributed computing, and analytics pipelines
Proficiency in CI/CD workflows and DevOps tools
Ability to work in a fast-paced, agile environment
Strong communication & collaboration skills