Principal Observability and Reliability Tooling Engineer

Diligente Technologies
Alpharetta, GA Full Time
POSTED ON 1/22/2025
AVAILABLE BEFORE 2/19/2025

Employment Type: Full time

City: Alpharetta

State: Georgia

Onsite / Hybrid role


About the Role:

Key Responsibilities:

  • Develop and implement best-in-class observability strategies that align with business objectives while staying current with industry trends and best practices.
  • Design and maintain secure, scalable, and highly available platforms for metrics, logging, tracing, real user monitoring, and synthetic monitoring.
  • Lead the organization's adoption of OpenTelemetry, focusing on instrumentation and best practices.
  • Collaborate with cross-functional teams to gather requirements and ensure observability is integrated into the development lifecycle with an emphasis on quality and governance.
  • Research, analyze, and recommend technical solutions to address complex observability challenges and improve system performance.
  • Build and enhance self-service capabilities for monitoring, alerting, and self-healing frameworks to empower engineering teams.
  • Implement and manage observability infrastructure and configurations as code (IaC) using Terraform and Ansible.
  • Troubleshoot and resolve issues related to observability pipelines and platforms.
  • Create and maintain comprehensive documentation for observability practices and tools.
  • Mentor and guide teams on observability principles and best practices.


About You:

Basic Requirements:

  • Bachelor's degree in Engineering, a related technical discipline, or equivalent work experience.
  • At least 8 years of hands-on experience in Observability, with a minimum of 4 years dedicated to developing comprehensive observability strategies and leading their practical implementation.
  • At least 5 years of experience with Observability platforms such as Grafana, Splunk, PagerDuty, Datadog, and SolarWinds.
  • Strong expertise in the adoption and implementation of OpenTelemetry.
  • Experience creating automation scripts and tools, including Terraform and Ansible for infrastructure management and configuration as code, and familiarity with GitHub for version control.
  • Experience with public cloud providers, preferably Google Cloud.
  • Excellent problem-solving skills and the ability to thrive in a fast-paced environment.
  • Strong communication and collaboration skills to work effectively with cross-functional teams.
