Demo

Data Engineer

Global Applications Solution
Plano, TX Full Time
POSTED ON 2/19/2025
AVAILABLE BEFORE 5/16/2025

Job Title : Data Engineer (CDC, Apache Spark, ETL, AWS)

Client : KFroce

Location : Plano, Texas And Reston, VA (Candidates must be local to Texas or Virginia)

Type : Onsite / Hybrid Opportunity

Job Summary :

We are seeking a highly skilled Data Engineer with expertise in Change Data Capture (CDC) and data pipeline development to join our team at KFroce. The ideal candidate will have experience setting up and managing CDC for multiple types of databases to hydrate a data lake, along with proficiency in building ETL transformations using Apache Spark. This role requires a solid understanding of both batch and streaming data pipelines, as well as hands-on experience with data processing, optimization, and performance tuning in a Big Data environment. Familiarity with AWS services and cloud-based data architectures is essential. This is an onsite and hybrid opportunity, and candidates must be local to Texas.

Key Responsibilities :

  • Design and implement Change Data Capture (CDC) solutions using Debezium or other CDC tools for various databases.
  • Build and maintain data pipelines for streaming and batch processing with Apache Spark using DataFrames, Spark SQL, and Spark Streaming.
  • Perform data transformations and develop ETL jobs to ensure efficient data movement and integration into a data lake.
  • Collaborate with data teams to design scalable, optimized solutions for large-scale data processing.
  • Work with Apache Airflow to orchestrate data pipelines and automate workflows.
  • Utilize AWS cloud services to build robust and scalable data pipelines.
  • Work with AWS services like S3, EMR, Glue Data Catalog, Step Functions, Lambda, MWAA, and AWS Batch to optimize data workflows.
  • Troubleshoot performance issues and optimize the processing of large datasets to ensure high-performance ETL workflows.
  • Keep up to date with emerging technologies in Big Data and cloud services.

Skills & Qualifications :

Technical Skills :

  • Java : Mid to senior level proficiency in Java.
  • Python (Pyspark) : Mid-level experience working with Python and Pyspark for data processing.
  • Apache Spark : Strong experience with Spark DataFrames, Spark SQL, Spark Streaming, and building ETL pipelines.
  • Apache Airflow : Experience in managing and automating workflows using Apache Airflow.
  • Big Data Concepts : Understanding of performance tuning and optimization in large-scale data processing environments.
  • Scala (Optional) : Familiarity with Scala is a plus.
  • Apache Hudi & Apache Griffin (Optional) : Knowledge of Apache Hudi or Apache Griffin is a plus.
  • AWS Services :

  • Extensive knowledge of AWS S3, including CRUD operations.
  • Experience with AWS EMR & EMR Serverless.
  • Familiarity with AWS Glue Data Catalog.
  • Knowledge of AWS Step Functions for orchestration.
  • Experience with AWS MWAA (Managed Workflows for Apache Airflow).
  • Proficient in AWS Lambda (Python).
  • Experience with AWS Batch for running jobs.
  • Familiarity with AWS Deequ (optional).
  • Desired Experience :

  • 12 years of experience in data engineering or related roles, with hands-on experience in CDC, Apache Spark, and AWS-based data pipeline development.
  • Familiarity with big data tools and techniques for processing and optimizing large datasets.
  • Education & Certifications :
  • Bachelor’s degree in Computer Science, Engineering, or related field.
  • Relevant certifications (AWS, Apache Spark) are a plus.
  • Location :

    Plano, Texas (Candidates must be local to Texas; onsite / hybrid opportunity available)

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Data Engineer?

    Sign up to receive alerts about other jobs on the Data Engineer career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $92,929 - $122,443
    Income Estimation: 
    $122,257 - $154,284
    Income Estimation: 
    $92,929 - $122,443
    Income Estimation: 
    $122,257 - $154,284
    Income Estimation: 
    $122,257 - $154,284
    Income Estimation: 
    $143,391 - $179,890
    Income Estimation: 
    $71,122 - $96,652
    Income Estimation: 
    $92,929 - $122,443
    Income Estimation: 
    $168,522 - $211,152
    Income Estimation: 
    $189,259 - $248,928
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Global Applications Solution

    Global Applications Solution
    Hired Organization Address Houston, TX Full Time
    Job Details Oil and Gas domain is must.. Job Requirements Manage mechanical and systems technical scope for supply of ne...
    Global Applications Solution
    Hired Organization Address Reston, VA Full Time
    Job Details Job Title: UI Engineer Location: Reston, VA (Hybrid) Duration: 12 Months Final Discussion: In-person at Rest...
    Global Applications Solution
    Hired Organization Address Chicago, IL Full Time
    Urgently Hiring - RN - Emergency Room - IL - Lucrative Pay 🔥🔥RN - Emergency Room - $54/hr🔥🔥 🚀 Chicago, IL ⏰ Shift: ...
    Global Applications Solution
    Hired Organization Address Reston, VA Full Time
    Job Details Need: Only W2 Need: IT Asset Management Support Technician Location: Reston, VA (Onsite - 5 Days/Week) Durat...

    Not the job you're looking for? Here are some other Data Engineer jobs in the Plano, TX area that may be a better fit.

    Senior Data Engineer

    NTT DATA, Irving, TX

    Distinguished Data Engineer

    Verizon Data Services, Irving, TX

    AI Assistant is available now!

    Feel free to start your new journey!