What are the responsibilities and job description for the Databricks Architect position at GeorgiaTEK Systems Inc.?
Databricks Architect – Hybrid position
Location: Houston, TX
Rate: DOE
US Citizen, Green Card, TN, GC EAD, and H4 EAD only No Third-party agencies Corp to Corp
Solution Design:
Develop comprehensive data architectures on Databricks, considering data ingestion, processing, transformation, storage, and analytics needs, aligning with business requirements.
Technical Leadership:
Provide expert guidance on Databricks features, best practices, and optimization techniques to data engineering and science teams.
Customer Engagement:
Collaborate with clients to understand their data challenges, propose Databricks solutions, and guide them through implementation processes.
Proof-of-Concept Development:
Build prototypes and demonstrations to showcase the capabilities of Databricks for specific use cases.
Integration with Cloud Services:
Integrate Databricks with other cloud services like Azure, AWS, or GCP to create a unified data ecosystem.
Performance Optimization:
Monitor and optimize data pipelines on Databricks to ensure scalability and performance.
Data Governance and Security:
Design and implement data governance strategies, including access controls, data quality checks, and compliance measures within the Databricks platform.
Required Skills:
Deep understanding of the Databricks platform:
Extensive knowledge of Databricks core components like Delta Lake, Spark, SQL, and MLflow.
Data Engineering Expertise:
Proficiency in data pipeline design, ETL/ELT processes, and data transformation techniques.
Programming Languages:
Strong coding skills in Python, Scala, or other languages supported by Databricks.
Cloud Computing Knowledge:
Familiarity with cloud platforms like Azure, AWS, or GCP.
Machine Learning Understanding:
Basic knowledge of machine learning algorithms and MLOps practices to integrate with Databricks.
Communication and Collaboration:
Ability to effectively communicate technical concepts to both technical and non-technical stakeholders.