What are the responsibilities and job description for the Data Engineer (Databricks, ETL, Big Data, Azure & Data Pipeline Optimization) position at Open Systems Inc.?
Title: Data Engineer
Location: Bentonville, AR, 72712 (Hybrid)
Type: 6 Months, Long-term Contract.
Industry: Retail.
Job Description:
- Data Engineer to join our growing team of analytics experts.
- The role will be responsible for expanding and optimizing our data and data pipeline architecture, as well as optimizing data flow and collection for cross-functional teams.
- The ideal candidate is an experienced data pipeline builder and data wrangler who enjoys optimizing data systems and building them from the ground up.
- The Data Engineer will support our software developers, database architects, data analysts, and data scientists on data initiatives and will ensure optimal data delivery architecture is consistent throughout ongoing projects.
- They must be self-directed and comfortable supporting the data needs of multiple teams, systems, and products.
- The right candidate will be excited by the prospect of optimizing or even re-designing our data architecture to support our next generation of products and data initiatives.
Responsibilities for Data Engineer
- Create and maintain optimal data pipeline architecture for data-intensive applications.
- Assemble large, complex data sets that meet functional / non-functional business requirements.
- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
- Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using Azure SQL, Cosmo DB, Databricks, and other legacy databases.
- Build analytics Dashboard/Visualizations utilizing the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.
- Work with stakeholders, including the Executive, Product, Data, and Design teams, to assist with data-related technical issues and support their data infrastructure needs.
- Keep our data separated and secure across national boundaries through multiple data centers and Azure regions.
- Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader.
- Work with data and analytics experts to strive for greater functionality in our data systems.
Qualifications for Data Engineer:
- Strong Python programming skills, expert level on using Python to process Big Data.
- Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL), as well as working familiarity with a variety of databases.
- Extensive Experience with Databricks on the Azure Cloud platform and a deep understanding of Delta Lake and Lake House Architecture.
- Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
- Strong analytic skills related to working with Data Visualization Dashboard, Metrics, etc, experience with Tableau, Power BI, or Looker tools.
- Build processes supporting data transformation, data structures, metadata, dependency, and workload management.
- A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
- Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
- Familiar with Deployment tools like Docker and building CI/CD pipelines.
- Experience supporting and working with cross-functional teams in a dynamic environment.
- 8 years’ experience in software development, Data engineering, and
- Bachelor’s degree in computer science, Statistics, Informatics, Information Systems, or another quantitative field. A postgraduate/master’s degree is preferred.
Experienced Data Engineer with 8 years in software development and data engineering, specializing in big data processing, cloud platforms (Azure), and data pipeline optimization. Proficient in Python, SQL, and Databricks, with expertise in Delta Lake, Lake House Architecture, and ETL processes. Skilled in building scalable data infrastructure, automating workflows, and optimizing data flow for cross-functional teams. Strong background in data visualization using Power BI, Tableau, or Looker, and hands-on experience with Docker, CI/CD pipelines, and stream processing. Adept at collaborating with data scientists, analysts, and software developers to ensure efficient data delivery and analytics solutions. Holds a Bachelor’s/Master’s degree in Computer Science, Statistics, or a related field.