What are the responsibilities and job description for the Big Data Engineer position at Eateam?
Job Overview :
We're seeking a highly skilled Data Engineer / Big Data Engineer to build scalable data pipelines, develop ML models, and integrate big data systems. You'll work with structured, semi-structured, and unstructured data, focusing on optimizing data systems, building ETL pipelines, and deploying AI models in cloud environments.
Key Responsibilities :
Data Ingestion : Build scalable ETL pipelines using Apache Spark, Talend, AWS Glue, Google Dataflow, or Apache NiFi. Ingest data from APIs, file systems, and databases.
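To illustrate the extract/transform/load shape of such a pipeline, here is a minimal standard-library sketch; in practice the same three stages would run as a Spark or Glue job against real sources, and the CSV input, field names, and SQLite sink here are purely hypothetical stand-ins.

```python
import csv
import io
import sqlite3

def extract(raw_csv: str) -> list[dict]:
    """Extract: parse rows from a CSV feed (stand-in for an API or file source)."""
    return list(csv.DictReader(io.StringIO(raw_csv)))

def transform(rows: list[dict]) -> list[dict]:
    """Transform: coerce types and drop rows missing required fields."""
    out = []
    for row in rows:
        if not row.get("user_id"):
            continue  # skip incomplete records
        out.append({"user_id": int(row["user_id"]),
                    "amount": round(float(row["amount"]), 2)})
    return out

def load(rows: list[dict], conn: sqlite3.Connection) -> None:
    """Load: write cleaned rows into a warehouse table (SQLite as a stand-in)."""
    conn.execute("CREATE TABLE IF NOT EXISTS events (user_id INTEGER, amount REAL)")
    conn.executemany("INSERT INTO events VALUES (:user_id, :amount)", rows)

raw = "user_id,amount\n1,19.994\n,5.00\n2,7.5\n"
conn = sqlite3.connect(":memory:")
load(transform(extract(raw)), conn)
print(conn.execute("SELECT COUNT(*), SUM(amount) FROM events").fetchone())
# → (2, 27.49): the row with no user_id is dropped, amounts are rounded
```

The same extract/transform/load separation carries over directly when each stage is swapped for a distributed equivalent.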
Data Transformation & Validation : Use Pandas, Apache Beam, and Dask for data cleaning, transformation, and validation. Automate data quality checks with pytest, unittest.
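A small Pandas sketch of this cleaning-plus-checks pattern (assuming pandas is installed; the column names and sample data are hypothetical, and the assertion-style checks are the kind a pytest or unittest suite would automate):

```python
import pandas as pd

def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Drop duplicate orders, coerce amounts to numeric, discard unparseable rows."""
    out = df.drop_duplicates(subset="order_id")
    out = out.assign(amount=pd.to_numeric(out["amount"], errors="coerce"))
    return out.dropna(subset=["amount"]).reset_index(drop=True)

def check_quality(df: pd.DataFrame) -> None:
    """Data quality checks, written as assertions a test runner would execute."""
    assert df["order_id"].is_unique, "duplicate order ids"
    assert (df["amount"] >= 0).all(), "negative amounts"
    assert not df["amount"].isna().any(), "missing amounts"

raw = pd.DataFrame({
    "order_id": [1, 1, 2, 3],
    "amount": ["10.0", "10.0", "bad", "4.5"],
})
cleaned = clean(raw)
check_quality(cleaned)
print(cleaned.to_dict("records"))
# → [{'order_id': 1, 'amount': 10.0}, {'order_id': 3, 'amount': 4.5}]
```

Keeping the checks separate from the cleaning step means they can run both in CI and as a gate inside the production pipeline.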
Big Data Systems : Process large datasets with Hadoop, Kafka, Apache Flink, Apache Hive. Stream real-time data using Kafka, Google Cloud Pub/Sub.
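The core operation behind real-time stream processing is windowed aggregation. As a conceptual sketch only (a real deployment would run this inside Flink or Kafka Streams against a live topic, not an in-memory list), here is a tumbling-window count over a simulated event stream:

```python
from collections import defaultdict

def tumbling_window_counts(events, window_secs=60):
    """Count events per key within fixed, non-overlapping (tumbling) windows --
    the aggregation a Flink or Kafka Streams job would perform continuously."""
    windows = defaultdict(int)
    for ts, key in events:
        bucket = ts - (ts % window_secs)  # start timestamp of the window
        windows[(bucket, key)] += 1
    return dict(windows)

# Simulated stream of (unix_timestamp, event_type) pairs.
stream = [(0, "click"), (30, "click"), (61, "view"), (65, "click"), (130, "view")]
print(tumbling_window_counts(stream))
# → {(0, 'click'): 2, (60, 'view'): 1, (60, 'click'): 1, (120, 'view'): 1}
```

Real engines add the hard parts this sketch omits: late/out-of-order events, watermarks, and fault-tolerant state.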
Task Queues : Manage asynchronous processing with Celery, RQ, RabbitMQ, or Kafka. Implement retry mechanisms and track task status.
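To make the retry-and-status-tracking requirement concrete, here is a minimal pure-Python sketch of the pattern; Celery provides this out of the box (automatic retries with backoff, a result backend for task state), and the task id, statuses, and flaky task below are illustrative only:

```python
import time

STATUS = {}  # task_id -> "PENDING" | "RETRY" | "SUCCESS" | "FAILURE"

def run_with_retry(task_id, fn, max_retries=3, backoff=0.0):
    """Run fn, retrying on failure with exponential backoff, recording task
    status the way a Celery result backend would."""
    STATUS[task_id] = "PENDING"
    for attempt in range(max_retries + 1):
        try:
            result = fn()
            STATUS[task_id] = "SUCCESS"
            return result
        except Exception:
            if attempt == max_retries:
                STATUS[task_id] = "FAILURE"
                raise
            STATUS[task_id] = "RETRY"
            time.sleep(backoff * (2 ** attempt))  # exponential backoff between attempts

calls = {"n": 0}
def flaky():
    """Simulated task that fails twice with a transient error, then succeeds."""
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("transient")
    return "ok"

print(run_with_retry("task-1", flaky), STATUS["task-1"])
# → ok SUCCESS (after two retries)
```

In a broker-backed setup the same loop is distributed: the queue (RabbitMQ, Kafka) redelivers the message and workers update the shared status store.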
Scalability : Optimize for performance with distributed processing (Spark, Flink), parallelization (joblib), and data partitioning.
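Data partitioning and parallel processing go together: records are hash-partitioned by key so each partition can be processed independently, which is what Spark does before a shuffle. A small sketch with hypothetical record fields, using a thread pool as a stand-in for a worker cluster:

```python
from concurrent.futures import ThreadPoolExecutor

def partition(records, key, num_partitions):
    """Hash-partition records by key so each partition is independent work."""
    parts = [[] for _ in range(num_partitions)]
    for rec in records:
        parts[hash(rec[key]) % num_partitions].append(rec)
    return parts

def summarize(part):
    """Per-partition aggregation (the parallelizable unit of work)."""
    return sum(r["amount"] for r in part)

records = [{"user": f"u{i % 4}", "amount": i} for i in range(100)]
parts = partition(records, "user", 4)
with ThreadPoolExecutor(max_workers=4) as pool:
    totals = list(pool.map(summarize, parts))
print(sum(totals))  # partition-then-aggregate preserves the global total
```

Because the aggregation is associative, the per-partition results combine into the same answer regardless of how records were distributed, which is the property that makes distributed processing correct.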
Cloud & Storage : Work with AWS, Azure, GCP, Databricks. Store and manage data with S3, BigQuery, Redshift, Synapse Analytics, and HDFS.
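One common thread across these stores is Hive-style partitioned layout (`year=/month=/day=` path segments), which lets engines such as Hive, Presto, and Spark prune partitions at query time. A sketch of generating such keys, with the bucket and prefix names purely hypothetical:

```python
from datetime import date

def object_key(prefix: str, event_date: date, filename: str) -> str:
    """Build a Hive-style partitioned object key (year=/month=/day=),
    the layout S3, HDFS, and most SQL-on-files engines can prune on."""
    return (f"{prefix}/year={event_date.year}"
            f"/month={event_date.month:02d}"
            f"/day={event_date.day:02d}/{filename}")

# Bucket "my-data-lake" and prefix "analytics/events" are hypothetical names.
key = object_key("analytics/events", date(2024, 3, 7), "part-0000.parquet")
print(f"s3://my-data-lake/{key}")
# → s3://my-data-lake/analytics/events/year=2024/month=03/day=07/part-0000.parquet
```

Writing data under a consistent partition scheme up front is what later makes date-ranged queries cheap, since the engine never opens files outside the requested partitions.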
Required Skills :
ETL Data Processing : Expertise in Apache Spark, AWS Glue, Google Dataflow, Talend.
Big Data Tools : Proficient with Hadoop, Kafka, Apache Flink, Hive, Presto.
Databases : Strong experience with MySQL, PostgreSQL, MongoDB, Cassandra.
Machine Learning : Hands-on with TensorFlow, PyTorch, Scikit-learn, XGBoost.
Cloud Platforms : Experience with AWS, Azure, GCP, Databricks.
Task Management : Familiar with Celery, RQ, RabbitMQ, Kafka.
Version Control : Git for source code management.
Desirable Skills :
Real-time Data Processing : Experience with Apache Pulsar, Google Cloud Pub/Sub.
Data Warehousing : Familiarity with Redshift, BigQuery, Synapse Analytics.
Scalability Optimization : Knowledge of load balancing (NGINX, HAProxy) and parallel processing.
Data Governance : Use of MLflow, DVC, or other tools for model and data versioning.
Tools & Technologies :
ETL : Apache Spark, Talend, AWS Glue, Google Dataflow.
Big Data : Hadoop, Kafka, Apache Flink, Presto.
Databases : MySQL, PostgreSQL, MongoDB, Cassandra.
Cloud : AWS, GCP, Azure, Databricks.
Storage : S3, BigQuery, Redshift, Synapse Analytics, HDFS.
Version Control : Git.