What are the responsibilities and job description for the Site Reliability Engineer (SRE) position at Glow Networks?
Job Title: Site Reliability Engineer (SRE) - CI/CD Pipelines
Location: Mountain View CA Hybrid (3 days a week onsite)
Job Summary:
We're seeking an experienced Site Reliability Engineer (SRE) to join our team, focusing on designing, implementing, and maintaining scalable CI/CD pipelines. The ideal candidate will have expert-level knowledge of Kubernetes, AWS, and CI/CD tools like Jenkins and Argo. Proficiency in coding languages such as Python, Go, or Java is required. The SRE will work closely with our development teams to ensure seamless deployment of applications and services.
Responsibilities:
1. Design, implement, and maintain scalable CI/CD pipelines using tools like Jenkins, Argo, and GitOps.
2. Collaborate with development teams to integrate CI/CD pipelines with application code, ensuring automated testing, building, and deployment.
3. Develop and maintain scripts and tools for automating deployment, scaling, and management of applications on Kubernetes and AWS.
4. Ensure high availability, scalability, and performance of CI/CD pipelines and applications.
5. Troubleshoot and resolve issues related to CI/CD pipelines, applications, and infrastructure.
6. Implement monitoring, logging, and alerting tools to ensure pipeline and application health.
7. Collaborate with cross-functional teams to identify and prioritize areas for improvement.
8. Participate in on-call rotations for pipeline and application support.
Requirements:
1. 5 years of experience in Site Reliability Engineering, DevOps, or a related field.
2. Expert-level knowledge of Kubernetes, AWS, and CI/CD tools like Jenkins and Argo.
3. Proficiency in coding languages such as Python, Go, or Java.
4. Experience with containerization using Docker.
5. Experience with infrastructure-as-code tools like Terraform or CloudFormation.
6. Excellent problem-solving skills and attention to detail.
7. Strong communication and collaboration skills.
Nice to Have:
1. Experience with GitOps and infrastructure-as-code.
2. Knowledge of security best practices for CI/CD pipelines.
3. Experience with monitoring and logging tools like Prometheus, Grafana, and ELK.
4. Certification in Kubernetes, AWS, or related technologies.