What are the responsibilities and job description for the Data Engineer - Databrick, ETL, Python position at Addison Group?
Title: Data Engineer - Databrick, ETL, Python
Salary: $110-120K bonus
Location: Remote USA
No sponsorship available
Essential Functions: which may be representative but not all inclusive of those commonly associated with this position.
Education: Bachelor’s degree preferred.
Experience: Minimum five years of experience building and maintaining data pipelines. Minimum five years of experience working in a large data environment. Minimum five years of experience with data modeling tools (e.g. Erwin).
Skills And Abilities
Which may be representative but not all inclusive of those commonly associated with this position.
Salary: $110-120K bonus
Location: Remote USA
No sponsorship available
Essential Functions: which may be representative but not all inclusive of those commonly associated with this position.
- Evaluates new data sources to ensure alignment with ongoing projects.
- Maintains knowledge of existing datasets, sources, structure and quality.
- Translates business cases into technical requirements.
- Collaborates with data engineering and data science teams to ensure alignment.
- Cleans, transforms and preprocesses data for advanced analytics.
- Develops and implements data quality checks and collaborates with engineering to resolve issues.
- Conducts exploratory analysis to identify trends, patterns and anomalies.
- Creates summaries and visualization for the data science team.
- Develops and maintains ETL (Extract, Transform, Load) pipelines to ensure data availability for downstream processes.
- Demonstrates and promotes a workplace culture of innovation, high performance, accountability, commitment, respect and teamwork.
- Regular attendance.
- Other duties as assigned.
Education: Bachelor’s degree preferred.
Experience: Minimum five years of experience building and maintaining data pipelines. Minimum five years of experience working in a large data environment. Minimum five years of experience with data modeling tools (e.g. Erwin).
Skills And Abilities
Which may be representative but not all inclusive of those commonly associated with this position.
- ETL processes with Databricks experience a plus.
- Experience in normalized, dimensional, star schema and snow-flake models.
- Proficient in Python (especially Pandas, NumPy, Matplotlib, Seaborn, etc.), PySpark experience preferred, and SQL.
- Familiarity with the Databricks workspace, notebooks, and jobs, including automation and orchestration using Azure Data Factory.
- Experience working with Parquet files, Delta Lake, Unity Catalog, and optimizing data storage within Azure.
- Understanding of data modeling concepts, normalization techniques.
- Ability to develop and implement data quality checks.
- Experience with version control systems like Git to manage code in a collaborative environment.
- Ability to create data visualizations using tools like Power BI or Plotly to support preliminary analysis.
- Strong communication skills to collaborate effectively with data scientists, data engineers, and stakeholders across the business.
- Well versed with software engineering processes (especially Agile).
- Strong analytical and problem-solving skills.
- Organized, goal-oriented, self-starter, can manage multiple tasks from start to completion with limited supervision.
- Ability to effectively present information and respond to questions.
- Knowledge of entertainment industry a plus.
Salary : $110,000 - $120,000