AWS Incident Management Specialist

Avance Consulting
Reston, VA Contractor
POSTED ON 2/20/2025
AVAILABLE BEFORE 3/20/2025

Job Description


Key Job Functions

• Manage IT production incidents through to resolution in a 24/7/365 environment using incident management processes, and communicate incident status, impact, and resolution actions.

• Hands-on experience managing and monitoring applications deployed on Amazon Web Services (AWS).

• Troubleshoot and resolve incidents on the AWS cloud infrastructure.

• Experience with building tools for monitoring and troubleshooting system resources in an AWS environment. Ability to triage AWS-related incidents using monitoring tools on AWS Cloud.

• Experience with performance engineering of AWS Cloud applications.

• Hands-on experience working with AWS services such as EC2, ELB, RDS, Redshift, DynamoDB, Aurora, Route 53, ECS, Lambda, S3, Batch, CloudWatch, CloudTrail, and WAF.

• Hands-on experience with transaction-level monitoring using Dynatrace and Splunk.

• Ability to perform transaction-level monitoring and troubleshooting in the AWS cloud platform.

• Monitor the health of applications and the underlying infrastructure.

• Monitoring experience with tools such as ExtraHop, SolarWinds, the Netcool suite, Catchpoint, and Moogsoft.

• Analyze dashboards and reporting/monitoring tools to identify trends and patterns in application health and performance.

• Proactively look for hardware, software, and environmental alerts or malfunctions.

• Effectively lead and guide incident triage calls from a technical perspective, analyzing different components of the infrastructure and application environment using a variety of monitoring tools and processes.

• Troubleshoot incidents and identify root causes quickly using operations, wire data analytics, application performance management, and event correlation monitoring tools.

• Perform analysis of data, evaluating multiple application protocols including web, database, storage, and supporting infrastructure such as AWS, UNIX, DNS, LDAP, SSL, SMTP, and FTP.

• Collaborate with technical teams and articulate troubleshooting steps effectively.

• Participate in technical follow-up calls for critical incidents.

• Assist with documenting Root Cause Analysis (RCA) or Correction of Error (COE) reports and with data quality for all communicated incidents.

• Ensure appropriate functional and management escalation takes place in line with standards and procedures.

• Follow up on items that could negatively impact production operations, assist with postmortem activities, and support efforts related to operational improvements.

• Implement new and improved processes, change processes, perform new tasks, create reports, and address ad-hoc requests based on management recommendations.

• Participate in the on-call rotation and work any shift as needed, including weekends and nights.

• Report incident details and metrics to senior leadership.

Minimum Experience, Specialized Knowledge & Skills

• 6 years of working experience with IT infrastructure components such as Unix/Linux servers, Wintel servers, AWS, networks, firewalls, routers, load balancers, VPN, Apache, WebLogic, LDAP, Active Directory, Exchange, Oracle/MS SQL databases, SAN, virtualization, email systems, enterprise monitoring, and access management solutions for single sign-on. Experience with at least eight of the above is preferred.

• Mid-level hands-on working experience with Amazon Web Services (AWS).

• Understanding of the different layers of the AWS infrastructure, e.g., WAF, Route 53, CloudFront, load balancing, and high-availability (HA) features.

• Proven methodical approach to problem identification, monitoring, problem-solving, and resolution.

• Ability to analyze different components of the infrastructure and application environments during incident triage calls.

• Ability to trace transaction failures and debug the root cause in various layers of the AWS infrastructure and services.

• Aptitude to influence other technical teams on incident calls and articulate troubleshooting steps effectively.

• Experience and confidence working with all levels of management; excellent written and verbal skills.

• Ability to communicate technical issues to senior management quickly and concisely in non-technical terms, and to run large conference calls during incidents with a wide range of personnel and management levels.

• Strong relationship management skills and the ability to multitask and work well in a high-stress environment, both within teams and independently.

• AWS Solutions Architect Associate or higher certification.

• Monitoring and observability experience.

• Experience with monitoring dashboards for incident detection and alerting.

• Ability to perform end-to-end analysis of transactions in an observability environment.

• Troubleshoot incidents and identify root causes quickly using wire data analytics, application performance management, and event correlation monitoring tools.

• Diagnose and resolve incidents by providing factual data from various monitoring and instrumentation systems.

• Monitor applications and infrastructure using tools such as Splunk, Dynatrace, OpenTelemetry, Catchpoint, xMatters, SignalFx, SolarWinds, and ExtraHop.
