What are the responsibilities and job description for the Site Reliability Engineer position at Dinohead?
Functional Responsibilities:
- Participate in a collaborative Kanban multi-discipline team working closely with customer to accelerate cloud initiatives and improve processes.
- Work with the customer to design and build CI/CD pipelines. Develop and integrate toolchain systems to provide path to production from development Software Factory.
Essential Job Functions:
- Enable Continuous Integration/Continuous Delivery through appropriate design guidelines.
- Maintain traceability between requirements, design, and test cases.
- Work directly with Development and Operations teams to increase velocity, prioritize tasks, implement requirements, and automate.
Experience and Responsibilities:
- Lead SRE and DevOps work initiatives from inception to production
- Knowledge of architecture concepts including microservices, container orchestration, and traditional 3-teir applications
- Design and implement Kubernetes platforms and tools chains
- Implement infrastructure as code using tools such as Ansible Automation Platform, Puppet, and VMware vRealize Automation
- Develop and maintain code (Bash, Python, YAML, PowerShell, Ruby, Groovy)
- Experience with observability tools such as Log Insight, Elastic Stack, Splunk, QRadar, or Prisma Cloud
- Design and implement enterprise on-premises and hybrid cloud deployments
- Lead efforts using Agile methodologies
- Ability to work both independently and in a team environment with clients and vendors, demonstrated technical leadership skills, good verbal and written communication skills
Required Certifications:
- Must obtain/maintain a DoD 8570 IAT Level II certification (Security , CCNA Security, CySA , GICSP, GSEC, CND, SSCP) within 120 Days of hire.
Desired Certifications:
- CNCF/Kubernetes
- Atlassian
- VMware
- Red Hat
- GitLab
- Oracle
- Palo Alto