Demo

Big Data Engineer

Eateam
Jersey, NJ Full Time
POSTED ON 2/12/2025
AVAILABLE BEFORE 5/8/2025

Job Overview :

We're seeking a highly skilled Data Engineer, Big Data Engineer to build scalable data pipelines, develop ML models, and integrate big data systems. You'll work with structured, semi-structured, and unstructured data, focusing on optimizing data systems, building ETL pipelines, and deploying AI models in cloud environments.

Key Responsibilities :

Data Ingestion : Build scalable ETL pipelines using Apache Spark, Talend, AWS Glue, Google Dataflow, Apache NiFi. Ingest data from APIs, file systems, and databases.

Data TransformationValidation : Use Pandas, Apache Beam, and Dask for data cleaning, transformation, and validation. Automate data quality checks with Pytest, Unittest.

Big Data Systems : Process large datasets with Hadoop, Kafka, Apache Flink, Apache Hive. Stream real-time data using Kafka, Google Cloud PubSub.

Task Queues : Manage asynchronous processing with Celery, RQ, RabbitMQ, or Kafka. Implement retry mechanisms and track task status.

Scalability : Optimize for performance with distributed processing (Spark, Flink), parallelization (joblib), and data partitioning.

CloudStorage : Work with AWS, Azure, GCP, Databricks. Store and manage data with S3, BigQuery, Redshift, Synapse Analytics, and HDFS.

Required Skills :

ETL Data Processing : Expertise in Apache Spark, AWS Glue, Google Dataflow, Talend.

Big Data Tools : Proficient with Hadoop, Kafka, Apache Flink, Hive, Presto.

Databases : Strong experience with MySQL, PostgreSQL, MongoDB, Cassandra.

Machine Learning : Hands-on with TensorFlow, PyTorch, Scikit-learn, XGBoost.

Cloud Platforms : Experience with AWS, Azure, GCP, Databricks.

Task Management : Familiar with Celery, RQ, RabbitMQ, Kafka.

Version Control : Git for source code management.

Desirable Skills :

Real-time Data Processing : Experience with Apache Pulsar, Google Cloud PubSub.

Data Warehousing : Familiarity with Redshift, BigQuery, Synapse Analytics.

Scalability Optimization : Knowledge of load balancing (NGINX, HAProxy) and parallel processing.

Data Governance : Use of MLflow, DVC, or other tools for model and data versioning.

Tools Technologies :

ETL : Apache Spark, Talend, AWS Glue, Google Dataflow.

Big Data : Hadoop, Kafka, Apache Flink, Presto.

Databases : MySQL, PostgreSQL, MongoDB, Cassandra.

Cloud : AWS, GCP, Azure, Databricks.

Storage : S3, BigQuery, Redshift, Synapse Analytics, HDFS.

Version Control : Git.

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Big Data Engineer?

Sign up to receive alerts about other jobs on the Big Data Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$71,122 - $96,652
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$143,391 - $179,890
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Eateam

Eateam
Hired Organization Address Shreveport, LA Full Time
Software Development Engineer 3- 134667 Location : Hillsboro, LA 71118 Duration : 10 Months Job Description : Purpose of...
Eateam
Hired Organization Address Minneapolis, MN Full Time
Primary : SQL, Azure, Azure Data Factory, Azure DevOps, Azure DataBricks Snowflake, SQL Server, GitHub Secondary : Pytho...
Eateam
Hired Organization Address Colorado, CO Full Time
Firmware development for next generation NVMe based Solid State Drives. Developing new features, change requests with hi...
Eateam
Hired Organization Address Evergreen, CO Full Time
Title : Plant Health Care (PHC) Manager Location : Evergreen, CO Job Type : Fulltime (Onsite) Job Description : Client i...

Not the job you're looking for? Here are some other Big Data Engineer jobs in the Jersey, NJ area that may be a better fit.

Big Data Platform Engineer

JS Consulting, West New York, NJ

Big data engineer

Randstad, West New York, NJ

AI Assistant is available now!

Feel free to start your new journey!