Demo

Site Reliability Engineer

aKUBE
Salt Lake, UT Full Time
POSTED ON 2/17/2025
AVAILABLE BEFORE 3/15/2025

City: Las Vegas, NV or Calabasas, CA or Atlanta, GA

Onsite/ Hybrid/ Remote: Hybrid

Work Authorization: US Citizens ONLY

Overview

We are seeking a Site Reliability Engineer (SRE) to work at the intersection of Security Operations (SecOps), DevOps, Quality Assurance, and IT Operations. This role will be responsible for designing, building, and maintaining scalable, resilient, and secure systems. The ideal candidate will balance development velocity with system reliability, proactively identifying and resolving performance, cost, and infrastructure challenges.

As an SRE, you will drive automation, enhance monitoring and observability, and lead post-incident analysis to improve system reliability. You will also collaborate closely with development teams to optimize system performance, security, and scalability.

Key Responsibilities

  • Automation & Infrastructure Management
  • Design, develop, and implement automation tools to enhance deployment velocity, system reliability, and operational efficiency.
  • Utilize Infrastructure as Code (IaC) solutions (e.g., Terraform, Ansible, Puppet, Chef) to automate IT infrastructure tasks.
  • Continuously optimize cloud infrastructure for cost, scalability, and performance.
  • Monitoring, Incident Response & Reliability Engineering
  • Establish and maintain monitoring, alerting, and incident response strategies to ensure rapid detection and resolution of issues.
  • Own and improve the observability platform to enable proactive issue resolution and minimize downtime.
  • Conduct root cause analysis (RCA) and implement preventive action plans to avoid incident recurrence.
  • Collaboration & Performance Optimization
  • Partner with software development teams to embed reliability, scalability, and security best practices into application architecture.
  • Define and measure Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) to ensure system performance meets business needs.
  • Drive continuous improvement in CI/CD pipelines and deployment strategies.
  • Capacity Planning & Scalability
  • Plan and implement scaling strategies to support growing workloads and user bases.
  • Develop and implement high-availability and disaster recovery solutions.
  • Mentorship & Continuous Learning
  • Stay up to date with emerging technologies, tools, and best practices in site reliability and cloud operations.
  • Mentor junior engineers and foster a culture of knowledge sharing and collaboration.

Required Qualifications

  • Education & Experience
  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 5 years of experience in Site Reliability Engineering, DevOps, or a related role.
  • Technical Skills
  • Proficiency in at least one programming language (e.g., Python, Go, Java, C#) for scripting and automation.
  • Strong understanding of system design, networking, and distributed systems.
  • Hands-on experience with Azure administration, including core services, workloads, security, and subscriptions.
  • Expertise in containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Experience with monitoring and logging tools (e.g., Azure Monitor, Log Analytics, Splunk, Grafana, Opsgenie).
  • Familiarity with CI/CD tools and best practices.
  • Knowledge of disaster recovery and high-availability solutions.
  • Preferred Certifications
  • Azure DevOps Engineer, Solution Architect, or Support Engineer certification is highly desirable.
  • Other relevant cloud and DevOps certifications are a plus.
  • Soft Skills
  • Excellent problem-solving and troubleshooting abilities.
  • Strong collaboration and communication skills, with the ability to convey technical concepts to both technical and non-technical audiences.
  • Ability to work effectively in cross-functional teams.
  • Entrepreneurial mindset with a focus on quality, innovation, and results.

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineer?

Sign up to receive alerts about other jobs on the Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$137,568 - $176,908
Income Estimation: 
$158,960 - $205,707
Income Estimation: 
$71,493 - $96,419
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$137,568 - $176,908
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at aKUBE

aKUBE
Hired Organization Address New York, NY Full Time
Job Description City : NYC, NY / Santa Monica, CA Onsite / Hybrid / Remote : Remote Duration : 2 months Rate Range : $67...
aKUBE
Hired Organization Address Santa Monica, CA Full Time
Job Description City : San Francisco, SF / Los Angeles, CA / New York / Seattle, WA / Orlando, FL Onsite / Hybrid / Remo...
aKUBE
Hired Organization Address Santa Monica, CA Full Time
Job Description City : Santa Monica, CA / NYC, NY Onsite / Hybrid / Remote : Hybrid Duration : 13 months Rate Range : $8...
aKUBE
Hired Organization Address TX Full Time
About Us aKUBE is an established IT staffing company expanding into the Texas market. We specialize in providing top-tie...

Not the job you're looking for? Here are some other Site Reliability Engineer jobs in the Salt Lake, UT area that may be a better fit.

Sr. Site Reliability Engineer

Varo Bank, Salt Lake, UT

Staff Site Reliability Engineer

Varo Bank, Salt Lake, UT

AI Assistant is available now!

Feel free to start your new journey!