Site Reliability Engineer

Calsoft Labs
San Diego, CA Full Time
POSTED ON 4/22/2025
AVAILABLE BEFORE 6/22/2025

Job Details

Job Role: Site Reliability Engineer

Job Type: W2 Contract (12 Months)

Job Location: San Diego, CA (Hybrid)

Job Description:

  • This Site Reliability Engineer role focuses on delivering all existing services provided by the teams and helping to develop future services. It requires working with multiple global teams to work through access and compatibility issues with various systems.

Responsibilities:

  • Design, build, deploy, and operate a combination of open-source, custom-written, and vendor-provided software to provide services.
  • Analyze, review, & fulfil Identity & Access Management (IAM) requests.
  • Understand, support, and improve Security Groups flows, execution, and troubleshooting.
  • Support a Windows AD service in the cloud.
  • Support an ecosystem of remote systems for remote user access.
  • Collaborate with multiple software & security engineering teams to integrate solutions and contribute to project deliveries.
  • Provide rotational on-call support, where you'll detect, respond to, triage, and resolve production incidents.
  • Collaborate and partner with Security teams that specialize in areas such as compliance, identity & access management (IAM), security groups, and policies.
  • Collaborate to deliver projects on time and within budget.
  • Develop automation pipelines to streamline development, testing, and deployment workflows within an Infrastructure as Code (IaC) framework.
  • Collaborate with engineering teams to investigate and troubleshoot complex problems.
  • Improve system monitoring and analysis of various cloud provider services (AWS, Google Cloud Platform) to speed up error detection and remediation, enhancing performance and reliability.
  • Provide Tier 2 support for all engineering escalations from the operational team (Platform Support).
  • Design solutions and provide architectural and infrastructural requirements that promote uptime, IaC, speed, and security at all phases of the software lifecycle on a global scale.
  • Experience operating in regulated environments such as SOX/PCI.
  • Results-driven person with great energy.

Key Qualifications:

  • BS in Computer Science, Software Engineering, or equivalent experience
  • 4 years of professional experience operating complex systems, with at least 3 years at large scale
  • 3 years of professional Site Reliability experience operating at scale in a high-paced environment
  • 4 years working in AWS
  • 4 years of hands-on experience with Akamai DSA
  • 4 years of hands-on administration experience with AWS, Kubernetes, Google Cloud Platform, and Infrastructure as Code
  • Experience with the following AWS Concepts: Compute Services, Serverless, Identity
  • Experience with Google Cloud Platform and Kubernetes environments
  • Experience with the following AWS systems: AMIs, KMS, IAM, Workspaces, S3, EBS, Security Groups, CloudWatch, CloudTrail, and EC2.
  • Experience with the following systems: Windows AD and Squid Proxy.
  • Infrastructure as Code Tools: Terraform Enterprise, CloudFormation, SAM.
  • Familiarity with the following systems: Wiz, Terraform Enterprise, and observability tools such as Datadog and Splunk.
  • Strong software development experience in Python and GitHub.
  • Build, deploy, and operate services at a fluent level (Linux/Unix).
  • Hands-on experience working with distributed systems and the "ilities" (availability, reliability, scalability, etc.) of the services.
  • Extensive use of automation for Infrastructure as Code preferably via Terraform Enterprise.
  • Should have experience with continuous integration, continuous delivery/deployment tools like Jenkins and ArgoCD.
  • Strong development experience in one of these languages: Python (preferred), Go, or JavaScript.
  • Strong hands-on experience building and maintaining infrastructure for microservices.
  • Design and provide operational and infrastructural requirements that promote uptime, speed and security at all phases of SDLC on a global scale.

Required Foundational Skills:

  • Fluency in running performant distributed services at scale.
  • Proven experience following software engineering best practices.
  • In-depth understanding of Unix/Linux systems internals and networking.
  • Experience with automation and configuration management tools.
  • Experience in AWS public cloud services and deployment.
  • Experience deploying and supporting CI/CD delivery pipelines in a large enterprise environment.
  • Knowledge of the software development lifecycle with experience integrating Open-Source tools.
  • Strong ability to tackle sophisticated issues ranging from system resources to application stack traces.
  • Strong hands-on experience building and maintaining infrastructure for microservices.
  • Experience developing tools for system configuration, deployment, and monitoring.
  • Strong belief in driving operational excellence, with ownership of efficiency and automation at the core of operations.
  • Passionate desire to automate and improve everything, including improving processes and standardizing tools and technologies.
  • Methodical and systematic problem-solving approach.
  • Complete ownership of end-to-end solutions and handling their life cycle.
  • Execution-oriented and results-driven.
  • Customer and peer relationship focused with strong interpersonal and communication skills.
  • Ability to thrive in a fast-paced, collaborative, team environment.
  • Ability to learn new skills/technologies quickly and independently.

Salary: $70 - $80
