Demo

MLOps Support Engineer

Octigo Solutions Inc
Reading, PA Full Time
POSTED ON 4/17/2025
AVAILABLE BEFORE 6/17/2025

Job Details

MLOps L2 Support Engineer to provide 24/7 production support for machine learning (ML) and data pipelines. The role requires on-call support, including weekends, to ensure high availability and reliability of ML workflows. The candidate will work with Dataiku, AWS, CI/CD pipelines, and containerized deployments to maintain and troubleshoot ML models in production.

Incident Management & Support:

Provide L2 support for MLOps production environments, ensuring uptime and reliability.

Troubleshoot ML pipelines, data processing jobs, and API issues.

Monitor logs, alerts, and performance metrics using Dataiku, Prometheus, Grafana, or CloudWatch.

Perform root cause analysis (RCA) and resolve incidents within SLAs.

Escalate unresolved issues to L3 engineering teams when needed.

Dataiku Platform Management:

Manage Dataiku DSS workflows, troubleshoot job failures, and optimize performance.

Monitor and support Dataiku plugins, APIs, and automation scenarios.

Collaborate with Data Scientists and Data Engineers to debug ML model deployments.

Perform version control and CI/CD integration for Dataiku projects.

Deployment & Automation:

Support CI/CD pipelines for ML model deployment (Bamboo, Bitbucket etc).

Deploy ML models and data pipelines using Docker, Kubernetes, or Dataiku Flow.

Automate monitoring and alerting for ML model drift, data quality, and performance.

Cloud & Infrastructure Support:

Monitor AWS-based ML workloads (SageMaker, Lambda, ECS, S3, RDS).

Manage storage and compute resources for ML workflows.

Support database connections, data ingestion, and ETL pipelines (SQL, Spark, Kafka).

Security & Compliance:

Ensure secure access control for ML models and data pipelines.

Support audit, compliance, and governance for Dataiku and MLOps workflows.

Respond to security incidents related to ML models and data access.

Required Skills & Experience:

Experience: 5 years in MLOps, Data Engineering, or Production Support.

Dataiku DSS: Strong experience in Dataiku workflows, scenarios, plugins, and APIs.

Cloud Platforms: Experience with AWS ML services (SageMaker, Lambda, S3, RDS, ECS, IAM).

CI/CD & Automation: Familiarity with GitHub Actions, Jenkins, or Terraform.

Scripting & Debugging: Proficiency in Python, Bash, SQL for automation & debugging.

Monitoring & Logging: Experience with Prometheus, Grafana, CloudWatch, or ELK Stack.

Incident Response: Handle on-call support, weekend shifts, and SLA-based issue resolution.

Preferred Qualifications:

Containerization: Experience with Docker, Kubernetes, or OpenShift.

ML Model Deployment: Familiarity with TensorFlow Serving, MLflow, or Dataiku Model API.

Data Engineering: Experience with Spark, Databricks, Kafka, or Snowflake.

ITIL/DevOps Certifications: ITIL Foundation, AWS ML certifications; Dataiku certification

Work Schedule & On-Call Requirements:

Rotational on-call support (including weekends and nights).

Shift-based monitoring for ML workflows and Dataiku jobs.

Flexible work schedule to handle production incidents and critical ML model failures.

Mentor and Knowledge transfer to client project team members

Participate as primary, co and/or contributing author on any and all project deliverables associated with their assigned areas of responsibility

Participate in data conversion and data maintenance

Provide best practice and industry specific solutions

Advise on and provide alternative (out of the box) solutions

Provide thought leadership as well as hands on technical configuration/development as needed.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a MLOps Support Engineer?

Sign up to receive alerts about other jobs on the MLOps Support Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$79,311 - $112,035
Income Estimation: 
$103,503 - $129,573
Income Estimation: 
$93,066 - $107,206
Income Estimation: 
$97,332 - $126,185
Income Estimation: 
$71,122 - $96,652
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$143,391 - $179,890
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Octigo Solutions Inc

Octigo Solutions Inc
Hired Organization Address Irving, TX Full Time
Job Details Network Architect with expertise in telecommunication packet broker solutions to lead a critical migration p...
Octigo Solutions Inc
Hired Organization Address Austin, TX Full Time
Job Details Senior Golang Engineer will work with the Identity Management Services team to design, develop, and maintain...
Octigo Solutions Inc
Hired Organization Address Raleigh, NC Full Time
Job Details Key Areas of Responsibilities Lead technical consultants and developers in delivering services in compliance...
Octigo Solutions Inc
Hired Organization Address Jersey, NJ Full Time
Job Details Strong technical discipline with proven experience coding in languages including Java /spring boot. 15 years...

Not the job you're looking for? Here are some other MLOps Support Engineer jobs in the Reading, PA area that may be a better fit.

ML Ops Support Engineer

Cosqube, Reading, PA

AI Assistant is available now!

Feel free to start your new journey!