What are the responsibilities and job description for the Site Reliability Engineer position at Incode Technologies?
- Role Title: Site Reliability Engineer
- Direct Report: Head of Site Reliability
- Area: Engineering
- Location: United States
We are looking for a highly skilled and proactive Site Reliability Engineer (SRE) to join our growing team. This role is ideal for a professional with a strong background in DevOps, cloud infrastructure, and automation, who thrives in a fast-paced, high-impact environment.
As an SRE, you will work closely with engineering, security, and infrastructure teams to ensure our systems are highly available, scalable, and secure. You will play a crucial role in deployments, incident response, system reliability, and performance optimization, while also contributing to long-term infrastructure strategies. This role also involves working on government-related projects, requiring an understanding of security and compliance frameworks.
Key Responsibilities:
- Partner with the Engineering and Security teams to create, implement and apply SRE principles, processes, and controls.
- Ensure appropriate security practices are communicated and implemented within the teams' application security programs, and support adherence to and awareness of these practices.
- Work with teams to onboard security tools and technologies.
- Build and support the Site Reliability function, and participate in building tools to monitor and report system KPIs.
- Deliver tasks based on project objectives; technically support projects through to completion.
- Provide emergency response, either by being on call or by reacting to symptoms surfaced by monitoring, and escalate when needed.
- Plan and execute configuration change operations both at the application and the infrastructure level.
- Work with teams to bring continuous improvement to SRE processes and tools.
- Contribute to the hiring process by reviewing questionnaires or joining the interview team to qualify SRE candidates.
- Improve documentation throughout, whether application documentation or runbooks, explaining the why and not stopping at the what.
Requirements:
- 5 years of experience in the DevOps/SRE domain.
- Experience in developing or administering the security of cloud environments (AWS, Azure, etc.).
- Practical knowledge of the DevOps toolbox: Configuration Management (Ansible, Terraform, etc.), Containers (Docker, Kubernetes), Continuous Integration & Continuous Delivery (CI/CD) (Jenkins, GitHub CI, CircleCI), Databases (MongoDB, SQL).
- Experience in supporting Linux in production environments, working with Unix firewalls, access controls and disk encryption.
- Experience working with industry standards or programs such as SOC 2, ISO 27001, or PCI is a plus.
- Practical knowledge of security practices in the SDLC and of supporting IT security tools, access control, application security, network security, security architecture, and security strategy.
- Good working knowledge of Java, Python, and JavaScript.