What are the responsibilities and job description for the Data Engineer (Python, PySpark, AWS) position at Donato Technologies, Inc.?
Job Description
Donato Technologies, established in 2012, excels as a comprehensive IT service provider renowned for delivering an exceptional staffing experience and prioritizing the needs of both clients and employees. We specialize in staffing, consulting, software development, and training, catering to small and medium-sized enterprises. While our core strength lies in Information Technology, we also deeply understand and address the unique business requirements of our clients, leveraging IT to effectively meet those needs. Our commitment is to provide high-quality, customized solutions using the optimal combination of technologies.
Title: Data Engineer (Python, PySpark)
Location: Columbus, OH (candidates must be local to OH; in-person interview required)
Job Summary
A Data Engineer at CMS is a software engineer with proficiency in data. The data engineer will
build and maintain the CMS data warehouse which is used for both reporting and analytics
across the company. The individual works cross-functionally with technical and business teams
to identify opportunities to better leverage data. The data comes from a variety of sources, and it
is the responsibility of the data engineer to make sense of the data using cloud-based systems
(AWS) and provide a reliable and structured format to meet the different business needs at
CMS.
Duties And Responsibilities
- Collaborate with the team to build out features for the data platform and consolidate data
- Build, maintain, and optimize Spark data pipelines
- Advise, consult, and coach other data professionals on standards and practices
- Work with the team to define company data assets
- Migrate CMS’ data platform into Chase’s environment
- Partner with business analysts and solutions architects to develop technical
- Build libraries to standardize how we process data
- Loves to teach and learn, and knows that continuous learning is the cornerstone of every
- Has a solid understanding of AWS tools such as EMR or Glue, their pros and cons and
- Implement automation on applicable processes
- 5 years of experience in a data engineering position
- Proficiency in Python (or similar) and SQL
- Strong experience building data pipelines with Spark
- Strong verbal and written communication skills
- Strong analytical and problem-solving skills
- Experience with relational datastores, NoSQL datastores and cloud object stores
- Experience building data processing infrastructure in AWS
- Bonus: Experience with infrastructure as code solutions, preferably Terraform
- Bonus: Cloud certification
- Bonus: Production experience with ACID-compliant formats such as Hudi, Iceberg or
- Bonus: Familiar with data observability solutions, data governance frameworks
- Bachelor’s Degree in Computer Science/Programming or similar is preferred
Right to work
- Must have legal right to work in the USA