Demo

SRE Lead Engineer

Kaav Inc
Austin, TX Full Time
POSTED ON 4/9/2025
AVAILABLE BEFORE 4/28/2025

Job Description

Job Description

Job Title : SRE Lead Engineer-Fulltime

Location : Austin , TX OR Fort Mill, SC (Hybrid)

Implementation-My manager will let you the Client Details before Submitting to the Client.

Job Description :

We are currently seeking a highly skilled SRE hands-on Lead Engineer with solid experience to help lead transformational initiatives within IT operations, encompassing development as well. As a crucial figure in this role, you will participate / help designing and implementing cutting-edge SRE solutions, driving the transformation of IT operations organizations to adopt an engineering-centric approach.

Responsibilities :

Participate in design, architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance.

Primary skillset to be expertise in Observability as service, Telemetry data collection using Dynatrace APM, SolarWinds, Open-Source tools (Prometheus and Grafana), Log Aggregations (Kibana or Splunk) and AIOPS Tools

Deeper understanding of Login authentication mechanisms using Ping, ForgeRock and SiteMinder technologies (session management and cookie management)

Correlation mechanisms and dashboards to have end to end visibility of requests from external to internal applications.

Evangelize SRE evolution within IT operations and promoting a culture of engineering excellence and best practices.

Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation.

Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind.

Implement monitoring systems to assess the performance of applications and infrastructure, and proactively identifying areas for optimization.

Understanding incident and problem management process, post-mortems, and driving improvements to prevent future incidents.

Analyze resource utilization patterns and forecasting future capacity needs to ensure optimal performance and cost-efficiency.

Ensure that SRE practices align with security and compliance requirements and implementing measures to protect systems and data.

Operational excellence with focus on automation and developing tools to streamline operational tasks and increase efficiency.

Provide guidance and mentorship to SRE teams, fostering skill development, and building a strong and capable SRE practice.

Ability to develop close relationship with other operational teams to integrate SRE practices and drive overall operational improvements across enterprise.

Stay up to date on industry trends, new technologies, and best practices in SRE and applying relevant advancements to the organization.

Qualifications :

Around 10-12 years of SRE hands on experience with cloud technologies, development, SRE toolsets and automation

Primary skillset to be expertise in Observability as service, Telemetry data collection using Dynatrace APM, SolarWinds, Open-Source tools (Prometheus and Grafana), Log Aggregations (Kibana or Splunk) and AIOPS Tools

Deeper understanding of Login authentication mechanisms using Ping, ForgeRock and SiteMinder technologies (session management and cookie management)

Correlation mechanisms and dashboards to have end to end visibility of requests from external to internal applications .

Strong hands-on experience with any Cloud Technology (AWS) : Control Tower, Project Setup, Creating Accounts, RDS, SSO

Solid understanding and hands on experience with Docker / Kubernetes

Should have good experience with Linux Commands, GitLab CICD Setup and Terraform (state management, etc)

Monitoring & alerting setup experience with Splunk, Prometheus, Grafana, Kibana, ELK etc.

Hands on APM Tool / s experience, preferably Datadog or AppDynamics or Dynatrace

Good understanding of Observability Framework leveraging programmatic SLI / SLO blueprints to standardize the collection of golden signals.

Should have automation (data refresh, releases, DB snapshots) experience using Ansible or any other scripting languages

Experience with following languages (Groovy-DSL, Java, Python, Yaml and microservices architecture)

Good understanding and hands on experience with MQ, Kafka

Experience with Databases (Oracle, MySQL)

Good to have :

Any of the relevant professional certifications Certified Site Reliability Engineer (CSRE), Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer Professional, , Google Cloud Professional; DevOps Engineer

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a SRE Lead Engineer?

Sign up to receive alerts about other jobs on the SRE Lead Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$137,568 - $176,908
Income Estimation: 
$71,493 - $96,419
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$137,568 - $176,908
Income Estimation: 
$158,960 - $205,707
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Kaav Inc

Kaav Inc
Hired Organization Address Austin, TX Full Time
Job Details Job Title: Senior Frontend Developer Location: Onsite (Austin, TX) Contract Duration: 12 Months Job Overview...
Kaav Inc
Hired Organization Address Plano, TX Full Time
Job Details Job Title: Sr. Systems Analyst Healthcare & Insurance IT Location: Plano, TX (On-Site) Experience: 9 to 12 y...
Kaav Inc
Hired Organization Address Seattle, WA Full Time
Job Description Job Description Senior Data Scientist Advanced Machine Learning & AI Location : Seattle, WA / Remote Exp...
Kaav Inc
Hired Organization Address New York, NY Full Time
Job Details Job Title: Fullstack AI Principal Consultant/Lead Location: NYC Metro (2 days onsite in a week) Employment T...

Not the job you're looking for? Here are some other SRE Lead Engineer jobs in the Austin, TX area that may be a better fit.

SRE Engineer

Diverse Lynx, Austin, TX

AI Assistant is available now!

Feel free to start your new journey!