Site Reliability Engineer

Prama
Phoenix, AZ Contractor
POSTED ON 1/20/2025 CLOSED ON 2/4/2025

What are the responsibilities and job description for the Site Reliability Engineer position at Prama?


Job Title: Site Reliability Engineer (SRE) - Kubernetes and Systems Automation Specialist

Location: Remote

Job Type: Contract


Position Overview: 

We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with a specialization in Kubernetes and Systems Automation. In this role, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You will work closely with engineering, DevOps, and operations teams to design and implement systems that automate operations, reduce manual intervention, and enhance our platform's overall reliability.

Key Responsibilities: 

1. Kubernetes Management: 

  - Deploy, manage, and maintain Kubernetes clusters in cloud and on-premises environments. 

  - Optimize Kubernetes configurations, including namespaces, pods, services, and networking. 

  - Implement robust CI/CD pipelines for containerized applications. 

  - Monitor and troubleshoot Kubernetes workloads to ensure high availability. 

2. Systems Automation: 

  - Develop and manage Infrastructure as Code (IaC) using tools like Terraform, Ansible, or similar. 

  - Automate routine tasks, including monitoring, deployments, scaling, and incident response. 

  - Write efficient scripts for task automation using Python, Bash, or similar languages. 

  - Collaborate on the design and implementation of automated disaster recovery and failover strategies. 

3. Performance and Reliability: 

  - Set up monitoring and alerting systems using tools like Prometheus, Grafana, or Datadog. 

  - Perform root cause analysis for incidents and implement solutions to prevent future occurrences. 

  - Establish Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure and improve system reliability. 

4. Collaboration and Culture: 

  - Partner with development teams to design and implement scalable and fault-tolerant systems. 

  - Drive a culture of reliability through postmortem analysis and a blameless incident review process. 

  - Provide guidance and training to teams on Kubernetes best practices and systems automation. 


Qualifications: 

Technical Expertise: 

 - Proven experience managing Kubernetes in production environments. 

 - Strong knowledge of containerization technologies (e.g., Docker). 

 - Proficiency in Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Ansible. 

 - Expertise in programming and scripting languages (e.g., Python, Go, Bash).  

Cloud Experience: 

 - Hands-on experience with major cloud platforms (AWS, GCP, Azure). 

 - Knowledge of hybrid and multi-cloud setups is a plus. 

Systems Engineering: 

 - Strong understanding of Linux/Unix systems and networking fundamentals. 

 - Familiarity with logging and monitoring tools like ELK Stack, Prometheus, Grafana, or similar. 

Soft Skills: 

 - Strong problem-solving and analytical skills. 

 - Excellent communication and teamwork abilities. 

 - Ability to document technical solutions clearly and effectively. 

Preferred Qualifications: 

- Certified Kubernetes Administrator (CKA) or similar certifications. 

- Experience with service mesh technologies like Istio or Linkerd. 

- Familiarity with security best practices in Kubernetes and cloud environments.  



This job has expired.