What are the responsibilities and job description for the Operations Infrastructure DevOps Engineer position at Abidi Solutions?
Job Title: Operations Infrastructure DevOps Engineer
Location: Fort Worth, TX (Hybrid - 50% Onsite)
Contract Duration: 12 Months (Possible Extension)
About The Role
We are seeking an experienced Operations Infrastructure DevOps Engineer to join our team on a 12-month contract basis. This hybrid role requires a dedicated professional to work 50% onsite in Fort Worth, TX. The ideal candidate will have extensive experience in DevOps and Infrastructure Support, with a strong background in incident and system management, system monitoring and optimization, and incident response.
Key Responsibilities
Location: Fort Worth, TX (Hybrid - 50% Onsite)
Contract Duration: 12 Months (Possible Extension)
About The Role
We are seeking an experienced Operations Infrastructure DevOps Engineer to join our team on a 12-month contract basis. This hybrid role requires a dedicated professional to work 50% onsite in Fort Worth, TX. The ideal candidate will have extensive experience in DevOps and Infrastructure Support, with a strong background in incident and system management, system monitoring and optimization, and incident response.
Key Responsibilities
- Incident and System Management: Collaborate with internal teams and suppliers to analyze and resolve critical IT and Telecom service interruptions. Ensure system availability through effective incident, problem, and change management.
- System Monitoring and Optimization: Continuously monitor systems for faults, identify optimization opportunities, and implement tools and process changes to enhance monitoring and alerting.
- Incident Response and Root Cause Analysis: Participate in major incident response teams, managing escalations and monitoring during significant incidents.
- Self-Motivation and Planning: Define, develop, and execute plans to manage system outages and handle high-stress situations efficiently.
- Availability and Support: Provide on-call support in a 24/7 environment.
- Experience: Minimum 5 years in Event Monitoring and Alerting, DevOps, Infrastructure Support, or IT Major Incident Management.
- Technical Skills: Proficiency in monitoring tools such as Dynatrace and CloudWatch. Experience in application performance tuning and distributed systems/administration (Windows, Unix, Linux, VMWare).
- Scripting/Programming: General scripting skills in Python, Node.js, Ruby, Perl, Bash/sh.
- Tools and Technologies: Familiarity with Zabbix or SCOM preferred. Experience with ITIL best practices (certification is a plus), ServiceNow proficiency, and familiarity with SDLC lifecycle.
- Certifications: Technical certifications related to the field. Cloud certifications (AWS, Azure, etc.) and ITIL V3 or V4 certification are preferred.
- Education: Bachelor’s degree in Computer Science, Information Systems, or Engineering preferred.
- Advanced technical skills in various operating systems and environments.
- Experience with infrastructure as code tools (Terraform, Ansible, etc.).
- Proven ability to improve monitoring and alerting processes.