
Grafana SRE Architect

VRK IT Vision Inc
Bernards, NJ Full Time
POSTED ON 3/3/2025
AVAILABLE BEFORE 5/27/2025

Job Description

Job Summary

The Grafana SRE Architect will lead the design, implementation, and management of scalable, reliable, and performant Grafana-based observability solutions. This role bridges Site Reliability Engineering (SRE) practices with Grafana's ecosystem (Loki, Mimir, Tempo, etc.) to ensure robust monitoring, logging, tracing, and alerting for mission-critical systems. You will collaborate with DevOps, engineering, and infrastructure teams to align technical strategies with business objectives, driving automation, resilience, and cost efficiency across cloud and on-premises environments.

Key Responsibilities

  • Architecture & Design
    • Design end-to-end Grafana solutions for metrics, logs, traces, and dashboards, ensuring scalability, security, and compliance.
    • Architect integrations with Prometheus, Loki, Mimir, Tempo, and third-party tools (e.g., AWS CloudWatch, Datadog).
    • Define best practices for Grafana deployment (self-managed vs. Grafana Cloud) and optimize data storage and retention strategies.
  • SRE Leadership
    • Implement SRE principles: SLAs/SLOs/SLIs, error budgets, and blameless post-mortems.
    • Build automated monitoring and alerting systems to preemptively identify system bottlenecks and failures.
    • Lead incident response, root cause analysis, and remediation for observability-related outages.
  • Collaboration & Integration
    • Partner with DevOps teams to embed Grafana into CI/CD pipelines and automate provisioning via IaC (Terraform, Ansible).
    • Work with developers to instrument applications for observability (OpenTelemetry, custom exporters).
    • Advise stakeholders on cost-effective monitoring strategies and resource optimization.
  • Performance Optimization
    • Tune Grafana dashboards, queries, and data sources for high-performance environments.
    • Optimize PromQL and Loki LogQL queries and manage large-scale time-series databases (Mimir).
    • Conduct capacity planning and disaster recovery testing for Grafana ecosystems.
  • Governance & Security
    • Ensure compliance with security policies (RBAC, SSO, encryption) and audit requirements.
    • Monitor Grafana stack health, perform upgrades, and enforce version control.
  • Mentorship & Innovation
    • Mentor SRE and engineering teams on Grafana best practices and SRE culture.
    • Stay ahead of Grafana and observability trends and pilot new tools (e.g., AI-driven anomaly detection).

Education & Experience

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • 10 years in SRE/DevOps roles, with 5 years of hands-on Grafana experience.
  • Proven track record in designing large-scale observability solutions.
  • Experience managing offshore teams.
  • Willingness to work hours that overlap with offshore teams.

Technical Skills

  • Expertise in Grafana: dashboards, plugins, alerting, and integrations (Prometheus, Loki, Mimir, Tempo).
  • Cloud Platforms: AWS/GCP/Azure, Kubernetes, and serverless architectures.
  • Automation: Terraform, Ansible, Python/Go scripting.
  • Monitoring Tools: Thanos, Cortex, Jaeger, OpenTelemetry.
  • Database Optimization: time-series data (Mimir), log management (Loki).

Certifications (Preferred)

  • Grafana Certified: Observability Engineer/Administrator.
  • AWS/GCP/Azure Architect or DevOps certifications.

Soft Skills

  • Leadership in cross-functional teams and crisis management.
  • Strong communication for technical and non-technical audiences.
  • Analytical problem-solving and strategic thinking.

Preferred Qualifications

  • Contributions to Grafana/Prometheus open-source projects.
  • Experience with AI/ML model monitoring.
  • Knowledge of regulatory frameworks (GDPR, HIPAA).

