Demo

Senior Site Reliability Engineer

Kontakt.io
New York, NY Full Time
POSTED ON 4/13/2025
AVAILABLE BEFORE 6/13/2025

Kontakt.io is building the platform that care operations run on.


We reduce waste, cut costs, and improve revenue by improving throughput, asset utilization and staff productivity. Our platform uses AI, RTLS, and EHR data to enable self-learning agents to automate workflows, adapt in real-time, and orchestrate all of care delivery operations.


Easy to deploy and scale, it gives a clear picture of spaces, equipment, and people, eliminating inefficiencies and enhancing the patient experience. With measurable 10X ROI and over 20 use cases, Kontakt.io is the go-to platform for better and faster care delivery operations.


We’re looking for a Senior Site Reliability Engineer to help ensure the reliability, performance, and automation of our cloud-based, real-time platform. This role will focus on keeping our platform running smoothly 24/7, minimizing downtime, improving observability, incident response, and self-healing automation. You will work closely with engineering teams to optimize infrastructure, scale systems efficiently, and ensure our platform meets the needs of our growing healthcare customers.

\n


Responsibilities:
  • Ensure 99.99% uptime of our cloud platform by maintaining highly reliable and resilient infrastructure.
  • Design and implement self-healing, fault-tolerant systems to proactively prevent failures.
  • Define and maintain SLIs, SLOs, and SLAs to ensure proactive performance monitoring and rapid incident resolution.
  • Architect and optimize scalable cloud infrastructure (AWS) for real-time, high-throughput data processing.
  • Improve and manage containerized environments (Kubernetes, Docker) to support multi-region deployments.
  • Implement and enhance infrastructure as code (Terraform) for fully automated infrastructure management.
  • Develop and refine a robust monitoring, alerting, and logging system using Prometheus, Grafana, OpenTelemetry, and Datadog.
  • Participate in incident response and on-call rotations, driving down mean time to detection (MTTD) and mean time to resolution (MTTR).
  • Conduct blameless postmortems and implement lessons learned to improve system resilience.
  • Automate deployment, scaling, and failover mechanisms to reduce manual intervention.
  • Contribute to disaster recovery and business continuity planning to maintain availability of critical healthcare services.
  • Work closely with Product, Engineering, and Infrastructure teams to align SRE initiatives with business goals.


Our requirements:
  • 5 years of experience in Site Reliability Engineering or Cloud Infrastructure.
  • Proven success scaling high-traffic, mission-critical platforms in SaaS, IoT, or healthcare.
  • Deep expertise in cloud platforms (AWS), Kubernetes, and distributed systems.
  • Strong background in monitoring, logging, and observability with Prometheus, OpenTelemetry, or similar tools.
  • Deep knowledge of CI/CD automation, GitOps, and infrastructure as code (Terraform, etc.).
  • Strong understanding of network security, access management, and compliance frameworks (HIPAA, SOC 2).

Bonus Points If You Have:
  • Experience with healthcare IT, including EHR data, FHIR, and HL7 interoperability.
  • Expertise in real-time distributed systems, event-driven architectures, or large-scale data pipelines.


Why You'll Love It Here
  • Own Mission-Critical Reliability – Ensure hospitals and care facilities always stay online with a 99.99% uptime healthcare platform.
  • Scale AI-Powered Infrastructure – Work on real-time automation and self-healing cloud systems that orchestrate care delivery.
  • Drive Big Impact in Healthcare – Help reduce waste, optimize resources, and improve patient care with technology that delivers 10X ROI.
  • Automation-First Culture – Minimize manual ops with cutting-edge automation, observability, and incident response strategies.
  • Join a High-Performing Team – Work with top engineers, AI experts, and healthcare innovators solving real-world challenges


\n

Ready to Build the Future of Healthcare?

Apply now and help scale the platform that care operations run on. 🚀

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Senior Site Reliability Engineer?

Sign up to receive alerts about other jobs on the Senior Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$172,688 - $210,712
Income Estimation: 
$170,589 - $211,671
Income Estimation: 
$178,619 - $225,190
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$145,845 - $177,256
Income Estimation: 
$147,836 - $182,130
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$81,253 - $112,554
Income Estimation: 
$89,966 - $112,616
Income Estimation: 
$95,407 - $122,738
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$86,891 - $130,303
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Kontakt.io

Kontakt.io
Hired Organization Address Newtown, PA Full Time
Kontakt.io is building the platform that care operations run on. We reduce waste, cut costs, and improve revenue by impr...
Kontakt.io
Hired Organization Address Reno, NV Full Time
Kontakt.io is building the platform that care operations run on. We reduce waste, cut costs, and improve revenue by impr...
Kontakt.io
Hired Organization Address Poland, IN Full Time
Kontakt.io offers hospitals an advanced care delivery operations platform. Our solution leverages intelligent data analy...
Kontakt.io
Hired Organization Address New York, NY Full Time
Kontakt.io is building the platform that care operations run on. We reduce waste, cut costs, and improve revenue by impr...

Not the job you're looking for? Here are some other Senior Site Reliability Engineer jobs in the New York, NY area that may be a better fit.

Senior Site Reliability Engineer

Nayya Health, New York, NY

AI Assistant is available now!

Feel free to start your new journey!