Demo

Observability Engineer /Monitoring Engineer

ACL Digital
Roswell, GA Contractor
POSTED ON 2/24/2025
AVAILABLE BEFORE 3/22/2025

Datadog Engineer / Observability Engineer

Location: Roswell, GA

Mode: Work from Office (4 days)

Duration: 11 Months contract

The NOC Lead will manage cloud infrastructure operations and ensure optimal performance of the project’s cloud environment. This role demands strong expertise in cloud platforms, incident management, operations best practices, and hands-on experience with monitoring tools and observability. The NOC Lead will collaborate closely with the customer’s stakeholders to drive operational excellence, streamline processes, and enhance cloud systems' reliability, scalability, and security.

Key Responsibilities:

Cloud Infrastructure Management:

Manage, monitor, and optimize cloud infrastructure across platforms (e.g., AWS, Azure).

Ensure high availability, scalability, and cost-efficiency of cloud systems.

Oversee deployment and maintenance of applications and services in the cloud environment.

Monitoring Expertise:

Design and implement advanced monitoring, alerting, and observability solutions using Monitoring tools like (Datadog, Grafana, Prometheous).

Configure dashboards, custom metrics, and anomaly detection to provide deep insights into system performance.

Conduct training sessions for the customer’s team on effective Datadog usage.


Incident and Problem Management:

Take ownership of incident management, ensuring rapid detection, escalation, and resolution of issues.

Oversee real-time incident detection, escalation, and resolution processes.

Perform root cause analysis and implement long-term solutions to prevent recurrence.

Develop and enforce operational playbooks for handling critical incidents.


Security and Compliance:

Ensure adherence to security best practices and compliance with customer and industry standards.

Collaborate with security teams to implement identity and access controls, encryption, and vulnerability management.


Reporting and Optimization:

Generate and present regular operational reports to customer stakeholders, including SLA adherence and performance metrics.

Analyze trends to identify areas for optimization and proactively recommend improvements.

Leadership and Collaboration:

Lead and mentor the CloudOps team to deliver top-notch operational performance.

Collaborate with development, QA, and security teams to align operations with business goals.

Present operational reports and insights, including SLA adherence and mobile application performance metrics.


Process Improvement and Automation:

Continuously analyze current processes and identify areas for improvement.

Implement automation tools and techniques to enhance efficiency.

Establish and document standard operating procedures (SOPs) for NOC operations.

Implement Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or Ansible.

Automate repetitive operational tasks to enhance efficiency.

Required Qualifications:

Technical Skills:

Bachelor’s degree in Computer Science, IT, or a related field; a Master’s degree is a plus.

8 years of experience in cloud operations, with at least 3 years in a leadership role.

Strong expertise in monitoring tools like Datadog, Grafana, Prometheous including advanced configuration and monitoring setup.

Proficiency in cloud platforms such as AWS, Azure, or Google Cloud Platform.

Hands-on experience with automation tools like Terraform, Ansible, or CloudFormation.

Solid understanding of DevOps practices, CI/CD pipelines, and container orchestration (e.g., Docker, Kubernetes).

Certifications:

Datadog certifications or proven expertise in the platform is a significant advantage.


Soft Skills:

Strong leadership and team management skills with the ability to work onsite in a customer-facing role.

Excellent communication and interpersonal skills for effective collaboration with stakeholders.

Proactive and solution-oriented mindset to drive improvements and resolve challenges.

Ability to work under pressure, prioritize tasks, and manage multiple priorities effectively

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Observability Engineer /Monitoring Engineer?

Sign up to receive alerts about other jobs on the Observability Engineer /Monitoring Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$146,673 - $180,130
Income Estimation: 
$81,253 - $112,554
Income Estimation: 
$89,966 - $112,616
Income Estimation: 
$95,407 - $122,738
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$161,406 - $211,884
Income Estimation: 
$188,022 - $236,092
Income Estimation: 
$205,940 - $255,928
Income Estimation: 
$199,907 - $266,531
Income Estimation: 
$195,700 - $270,403
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$145,845 - $177,256
Income Estimation: 
$147,836 - $182,130
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$86,891 - $130,303
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at ACL Digital

ACL Digital
Hired Organization Address Minneapolis, MN Full Time
Domestic Quals AI Engineer - build scaled AI solution including Gen AI Experience with Gen AI / AI techniques, Experienc...
ACL Digital
Hired Organization Address Longmont, CO Full Time
Job Description : Day to Day Responsibilities of this Position and Description of Project : Designs, develops, tests, an...
ACL Digital
Hired Organization Address Nashville, TN Contractor
Duration: 1 Year Description: People & Organizational Development is looking for a dynamic, organized, self-starter for ...
ACL Digital
Hired Organization Address Seattle, WA Full Time
Outline of the Role : Solutioning of high-performing and thoughtfully architected software applications that satisfy our...

Not the job you're looking for? Here are some other Observability Engineer /Monitoring Engineer jobs in the Roswell, GA area that may be a better fit.

AI Assistant is available now!

Feel free to start your new journey!