Data Engineer

Global Applications Solution
Plano, TX Full Time
POSTED ON 2/19/2025
AVAILABLE BEFORE 5/16/2025

Job Title : Data Engineer (CDC, Apache Spark, ETL, AWS)

Client : KFroce

Location : Plano, Texas and Reston, VA (candidates must be local to Texas or Virginia)

Type : Onsite / Hybrid Opportunity

Job Summary :

We are seeking a highly skilled Data Engineer with expertise in Change Data Capture (CDC) and data pipeline development to join our team at KFroce. The ideal candidate will have experience setting up and managing CDC for multiple types of databases to hydrate a data lake, along with proficiency in building ETL transformations using Apache Spark. This role requires a solid understanding of both batch and streaming data pipelines, as well as hands-on experience with data processing, optimization, and performance tuning in a Big Data environment. Familiarity with AWS services and cloud-based data architectures is essential. This is an onsite / hybrid opportunity, and candidates must be local to Texas.

Key Responsibilities :

  • Design and implement Change Data Capture (CDC) solutions using Debezium or other CDC tools for various databases.
  • Build and maintain data pipelines for streaming and batch processing with Apache Spark using DataFrames, Spark SQL, and Spark Streaming.
  • Perform data transformations and develop ETL jobs to ensure efficient data movement and integration into a data lake.
  • Collaborate with data teams to design scalable, optimized solutions for large-scale data processing.
  • Work with Apache Airflow to orchestrate data pipelines and automate workflows.
  • Utilize AWS cloud services to build robust and scalable data pipelines.
  • Work with AWS services like S3, EMR, Glue Data Catalog, Step Functions, Lambda, MWAA, and AWS Batch to optimize data workflows.
  • Troubleshoot performance issues and optimize the processing of large datasets to ensure high-performance ETL workflows.
  • Keep up to date with emerging technologies in Big Data and cloud services.

Skills & Qualifications :

Technical Skills :

  • Java : Mid- to senior-level proficiency in Java.
  • Python (PySpark) : Mid-level experience working with Python and PySpark for data processing.
  • Apache Spark : Strong experience with Spark DataFrames, Spark SQL, Spark Streaming, and building ETL pipelines.
  • Apache Airflow : Experience in managing and automating workflows using Apache Airflow.
  • Big Data Concepts : Understanding of performance tuning and optimization in large-scale data processing environments.
  • Scala (Optional) : Familiarity with Scala is a plus.
  • Apache Hudi & Apache Griffin (Optional) : Knowledge of Apache Hudi or Apache Griffin is a plus.
  • AWS Services :
      • Extensive knowledge of AWS S3, including CRUD operations.
      • Experience with AWS EMR & EMR Serverless.
      • Familiarity with AWS Glue Data Catalog.
      • Knowledge of AWS Step Functions for orchestration.
      • Experience with AWS MWAA (Managed Workflows for Apache Airflow).
      • Proficient in AWS Lambda (Python).
      • Experience with AWS Batch for running jobs.
      • Familiarity with AWS Deequ (optional).

Desired Experience :

  • 12 years of experience in data engineering or related roles, with hands-on experience in CDC, Apache Spark, and AWS-based data pipeline development.
  • Familiarity with big data tools and techniques for processing and optimizing large datasets.

Education & Certifications :

  • Bachelor’s degree in Computer Science, Engineering, or related field.
  • Relevant certifications (AWS, Apache Spark) are a plus.

Location :

Plano, Texas (candidates must be local to Texas; onsite / hybrid opportunity available)
