What are the responsibilities and job description for the Data Engineer (Python, ML) Full-Time position at Baanyan Software Services, Inc.?
Job Details
Job Title: Data Engineer (Python, ML) Full-Time
Location: New York City, NY (Hybrid/Onsite depending on client needs)
Employment Type: Full-time
About the Role:
We are seeking a skilled and motivated Data Engineer with strong Python programming skills and experience in Machine Learning pipelines to join our team in New York City. This role involves building robust data infrastructure and working closely with Data Scientists and Analysts to operationalize ML models in a production environment.
Key Responsibilities:
Design, build, and maintain scalable data pipelines and ETL processes.
Develop and deploy machine learning models using Python frameworks.
Collaborate with cross-functional teams to gather data requirements and deliver actionable solutions.
Optimize data workflows and ensure high data quality and integrity.
Implement monitoring and alerting for data pipelines and ML models.
Work with large datasets and integrate data from various sources (structured and unstructured).
Required Qualifications:
4 years of experience in Data Engineering with strong Python programming skills.
Solid experience with ETL tools and with building data pipelines in workflow orchestrators such as Apache Airflow or Luigi.
Hands-on experience with ML frameworks such as scikit-learn, TensorFlow, or PyTorch.
Proficiency with SQL and experience with relational and NoSQL databases.
Familiarity with cloud platforms (AWS, Google Cloud Platform, or Azure), especially their data services (e.g., S3, Redshift, BigQuery).
Experience with containerization and orchestration tools (Docker, Kubernetes) is a plus.
Preferred Qualifications:
Experience working with big data tools like Spark or Hadoop.
Exposure to MLOps concepts and tools.
Background in statistics or applied machine learning.
Compensation:
Competitive salary
Full benefits package