What are the responsibilities and job description for the Sr DevOps Engineer- Core Services position at Jewelers Mutual?
About Us:
Jewelers Mutual is developing a state-of-the-art core microservices platform to support multiple lines of business. We are seeking a Senior DevOps Engineer to join our team in Raleigh. In this role, you’ll be the technical lead in optimizing and scaling our platform’s infrastructure, CI/CD pipelines, and monitoring capabilities. You’ll work closely with engineering teams to drive operational excellence, enhance developer productivity, and ensure the resilience and scalability of our microservices platform.
What You’ll Do:
- Design and Scale CI/CD Pipelines:
- Architect, implement, and optimize CI/CD pipelines that support automated testing and continuous delivery across multiple environments.
- Develop and integrate pipelines with tools like GitHub Actions, Argo CD, GitLab CI, or Azure DevOps, driving a rapid, reliable release process.
- Infrastructure as Code and Cloud Management:
- Use tools Terraform or Spacelift to design, deploy, and maintain infrastructure in cloud environments (Azure, AWS, or GCP), ensuring consistency, scalability, and security.
- Build and manage Kubernetes clusters (e.g., AKS, EKS, GKE), configuring them for optimal performance and reliability.
- Observability and Monitoring:
- Develop and implement observability frameworks using Prometheus, Grafana, Datadog, and similar tools, providing real-time insights into system health and performance.
- Create actionable alerts and dashboards, empowering teams to monitor services effectively and respond proactively to issues.
- Chaos Engineering and Fault Injection:
- Lead gamedays to test system resilience using tools like Gremlin or Chaos Monkey to simulate real-world disruptions.
- Design and execute failure scenarios, ensuring the platform can handle unexpected events and recover with minimal downtime.
- Developer Experience and Automation:
- Collaborate with development teams to optimize workflows, automate repetitive tasks, and create standards that improve code quality, testing, and deployment efficiency.
- Act as a mentor, sharing best practices in DevOps, cloud management, and observability, and cultivating a DevOps-first culture across teams.
What We’re Looking For:
- 5 years of experience in DevOps, Site Reliability Engineering, or a related role with a focus on SaaS platforms and microservices.
- Proven expertise in CI/CD pipeline management, automation tools, and modern deployment strategies in cloud-native environments.
- Advanced skills in Kubernetes, Docker, and container orchestration in cloud platforms like Azure (AKS), AWS (EKS), or GCP (GKE).
- Hands-on experience with Infrastructure as Code (e.g., Terraform) for provisioning and managing cloud resources.
- Strong background in observability tools (e.g., Prometheus, Grafana, Datadog) for monitoring, alerting, and logging.
- Experience with chaos engineering to test and improve system resilience and incident response.
Bonus Points For:
- Experience with serverless architectures, scaling large cloud applications, and security practices within DevOps.
- Familiarity with advanced observability techniques, including distributed tracing for complex microservices.
- Knowledge of security testing, access control, and data protection in cloud environments.
What We Offer:
- Hybrid work arrangements in our brand-new, state-of-the-art office in the North Hills development of Raleigh, NC.
- A collaborative and supportive team culture where contributions matter, and continuous learning is encouraged.
- Competitive compensation and benefits, including healthcare, 401(k) matching, and generous paid time off.
- The opportunity to be part of a transformative project that will have a lasting impact on Jewelers Mutual’s success.
At Jewelers Mutual, we’re committed to building a resilient, scalable platform that supports the future of our business. If you’re passionate about DevOps, cloud technologies, and creating efficient, resilient systems, we’d love to meet you!