What are the responsibilities and job description for the Azure Data Engineer position at Building Services 32BJ Benefit Funds?
Position: Azure Data Engineer
Reporting To: Manager, Data Engineering
FLSA Status: Exempt
Position Summary:
As an Azure Data Engineer you will get to play a key and a collaborative role in the delivery of powerful data-driven products that support 32BJ Health Fund’s mission of providing high-quality and low-cost healthcare to its union members. The Data Engineer will be responsible for providing internal analysts with accurate datasets by implementing best practices in data collection, movement, storage, and transformation of large datasets. This individual will work with both current ETL/Data Warehousing and provide direction for future development of data storage, streaming and pipeline architectures.
Primary Duties and Responsibilities:
- Work with Health Fund Analytics, Operations and IT to create and maintain an optimal Azure Cloud data pipeline; implementing new data infrastructure and migrating existing data from disparate sources into a robust data warehouse, based on best practices of integrating data into a consolidated repository
- Solid SQL skills using Azure Blob/Data Lake, Azure Data Factory, and Data Bricks to transform data; Previous experience with other data science and data transformation tools such as Azure AI and R, as well as languages such as Python for developing optimized centralized data warehouse and pipelines for use by Health Fund Operations and Analytics
- Design, build, and manage automated test frameworks and scripts that support a continuous integration/continuous delivery (CI/CD) approach
- Generate subsets of data, data models, and provide APIs and variables needed for interfacing with internal tools as well as public-facing websites
- Work with IT and Operations, to evaluate business cases for use of scalable cloud solutions such as Azure and Dynamics 365
- Support the implementation and maintenance of Data Governance policy requirements and standards for data and data systems within your domain
- Interface with internal teams and vendors’ IT to work through data quality issues and champion HIPAA compliant best practices in data handling and transfer
- Create clear documentation (user guides, quick starts, schemas, and data dictionaries) of established data structures and use cases
- Provide tutorials and working sessions to empower analysts in creating queries and accessing right level of data and establish oneself as an expert resource on internal and external data sources and management
- Support data engineering team in end-to-end data classification technical implementation and ongoing maintenance
Qualifications:
- Bachelor’s degree or equivalent experience; Advanced degree preferred
- 3 years of full-time experience or demonstrated accomplishments in relevant data engineering areas
- Prior experience in working with healthcare claims is a must
- Strong knowledge of SQL & Python with hands-on MS Azure industry experience is a must
- Knowledge of methods for handling non-relational JSON and XML formatted data
- Familiarity with Azure cloud performance optimization techniques
- Experience with statistical packages such as STATA or SAS is preferred