Demo

Site Reliability Engineering(SRE)- Lead

Aptiva Healthcare
Overland Park, KS Full Time
POSTED ON 4/12/2025
AVAILABLE BEFORE 6/11/2025

Location: Overland Park, KS

Key Responsibilities:

  • Team Leadership: Leading and mentoring the SRE team, ensuring they have the resources and guidance needed to perform their roles effectively.
  • System Design and Architecture: Overseeing the design and architecture of reliable systems, ensuring scalability, fault tolerance, and high availability.
  • Incident Management: Coordinating response to incidents, conducting post-mortems, and implementing measures to prevent recurrence.
  • Monitoring and Performance: Setting up and maintaining monitoring tools and dashboards to track system performance and detect issues proactively.
  • Automation: Developing and promoting automation for repetitive tasks to reduce human error and improve efficiency.
  • Collaboration: Working closely with development, operations, and other cross-functional teams to ensure smooth integration and deployment of new features.
  • Capacity Planning: Analyzing system capacity and planning for future growth to ensure the infrastructure can handle increased demand.
  • SLA/SLO Management: Defining and managing Service Level Agreements (SLAs) and Service Level Objectives (SLOs) to meet business requirements.
  • Continuous Improvement: Identifying areas for improvement in system reliability and performance and driving initiatives to address them.
  • Documentation: Ensuring proper documentation of systems, processes, and incident responses to maintain knowledge sharing and consistency. Have a good understanding about APIs.

Example Daily Activities:

  • Reviewing system performance metrics and addressing any anomalies.
  • Leading incident response calls and coordinating with relevant teams.
  • Meeting with stakeholders to discuss reliability goals and progress.
  • Developing scripts and automation tools for system maintenance tasks.
  • Conducting training sessions for team members on best practices.
  • Planning and executing system upgrades and infrastructure improvements.

Qualifications

  • Minimum 10 years of experience in relevant area.
  • Team Leadership: Strong ability to mentor and manage teams using collaborative platforms like Jira, Teams, and Confluence. Excellent communication and collaboration skills.
  • System Design and Architecture: Expertise in designing scalable and reliable systems using tools like Kubernetes, Docker, and cloud services (AWS, Azure, GCP). Experience with Kafka, Cassandra, and other infrastructure tools. Familiarity with middleware technologies such as Kafka, APIs, and Microservices architecture.
  • Incident Management: Proficiency in managing incidents using tools like PagerDuty, xMatters, alongside conducting effective post-mortems.
  • Monitoring and Analytics: Experience with monitoring tools such as Splunk, AppDynamics, Grafana, Prometheus, etc for proactive issue detection.
  • Automation: Skilled in using automation tools like Terraform, Ansible, and scripting languages (Python, Bash, ShellScript) to streamline workflows.
  • Capacity Planning: Familiarity with performance analysis and forecasting tools to ensure infrastructure scalability.
  • SLA/SLO Management: Defining and tracking reliability goals using SRE best practices and tools like ServiceNow.

Continuous Improvement: Ability to assess system reliability with tools like ELK Stack (Elasticsearch, Logstash, Kibana) and implement enhancements.

Job Types: Full-time, Contract

Pay: $55.41 - $60.06 per hour

Expected hours: 40 per week

Schedule:

  • 8 hour shift

Work Location: On the road

Salary : $55 - $60

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineering(SRE)- Lead?

Sign up to receive alerts about other jobs on the Site Reliability Engineering(SRE)- Lead career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$120,143 - $165,703
Income Estimation: 
$182,708 - $261,704
Income Estimation: 
$154,184 - $199,940
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$145,845 - $177,256
Income Estimation: 
$147,836 - $182,130
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$81,253 - $112,554
Income Estimation: 
$89,966 - $112,616
Income Estimation: 
$95,407 - $122,738
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$172,688 - $210,712
Income Estimation: 
$170,589 - $211,671
Income Estimation: 
$178,619 - $225,190
Income Estimation: 
$86,891 - $130,303
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Aptiva Healthcare

Aptiva Healthcare
Hired Organization Address Anchorage, AK Full Time
Home Health LMSW - Anchorage AK and surrounding areas - Start Date: ASAP - Shift: 8hr Days (*This facility cannot accomm...
Aptiva Healthcare
Hired Organization Address Chandler, AZ Full Time
Licensure & Certification Requirements: License: Active Arizona RN license or Compact license Certifications Required: A...
Aptiva Healthcare
Hired Organization Address Renton, WA Full Time
Licensure & Certification Requirements: License: Active GA RN license or Compact Certifications Required: BLS (AHA) NIHS...
Aptiva Healthcare
Hired Organization Address Americus, GA Full Time
Company Overview: Aptiva Healthcare is a Joint Commission accredited healthcare staffing agency providing high-quality c...

Not the job you're looking for? Here are some other Site Reliability Engineering(SRE)- Lead jobs in the Overland Park, KS area that may be a better fit.

Head of Global Site Reliability Engineering

CSC Cboe Services Company, Lenexa, KS

Sr. Director, Global Site Reliability Engineering

Cboe Global Markets, Inc., Lenexa, KS

AI Assistant is available now!

Feel free to start your new journey!