What are the responsibilities and job description for the IT Engineer position at Smart IT Frame LLC?
Hi All
Job Title: IT Engineer
Location: Reston, VA or Plano TX (Hybrid)
Contract
Must Have:
- Information Technology Infrastructure Library (ITIL) Certification is required.
- Use of Office products including Excel, Word, Outlook)
Job Description/ Responsibilities
- IT Engineer IV
Description:
- Provide support for complex or specialized application or infrastructure tasks, incidents, changes and requests.
- Coordinate and manage changes to the production environment ensuring safety and soundness while working with an agile mindset.
- Lead the operational implementation and maintenance of complex IT infrastructure and application projects
- Troubleshoot and resolve advanced and complex system, service, application and network/connectivity issues
- Provide guidance, training, etc. to junior members and new members of the team
- Identify and drive efforts for automation initiatives
- Identify system performance enhancements, propose and implement solutions for those enhancements
- Identify and resolve monitoring and alerting issues, including threshold updates, new monitors and resolve other problematic alerts
- Handle compliance activities including access and password management, compliance with all policies and procedures.
- Handle escalated incidents from other tiers or teams and quickly triage and remediate issues to minimize any business impact. - Lead critical incident response efforts and root cause analysis
- Lead critical application releases and implementations that include interfacing application changes and coordination
- Update configurations, including testing and peer reviewing.
- Create/modify code, scripts, and monitors to resolve, prevent or monitor application incidents - Provide recommendation on continuous improvement and execute the same
- Troubleshoot the incidents and identify root cause quickly using operations, wire data analytics, application performance management and event correlation monitoring tools.
- Act as primary point of contact to management and stakeholders during business impacting incidents.
- Partnering with the Correction of Errors team, drive the root cause analysis, lessons learned and implementation of enhancements for all ECC communicated incidents.
- Collaborate with other teams to enhance proactive monitoring and observability tools and processes
- Provide reporting and analysis on business impacting incident trends and impacts. ?
- Good communication skills (oral and written)
- Attention to detail
- Ability to multi-task
- Knowledge and use of ticketing systems
- Proven experience with:
- Unix/Linux
- Expert knowledge of AWS cloud platforms, Certification required
- Proficient in the use of the AWS console and CLI
- Advanced scripting and automation skills
- Strong problem-solving abilities
- System and data analysis using common tools (PowerBI, Tableau, Excel, etc.)
- Hands on experience with transaction level monitoring using Splunk and other transaction level dashboarding tools.
- Monitoring experience with tools like Extrahop, SolarWinds, and Catchpoint.
- Ability to analyze dashboards and reporting/monitoring tools to look at trends and patterns in application health and performance.
Top Must Have Skills:
- · Information Technology Infrastructure Library (ITIL) Certification is required.
- · Use of Office products including Excel, Word, Outlook)
Education/Experience:
- · 5-10 year's experience with cloud operations and engineering
- · Bachelor's degree
Notes:
- This resource will be required to travel to either Reston VA or Plano TX to obtain a laptop and a security badge. Additionally, the resource will be required to travel to a Client site quarterly to meet with their POC