What are the responsibilities and job description for the DevOps Manager position at AtScale?
Company Overview
AtScale enables smarter decision-making by unlocking data-driven insights. The company's semantic layer platform simplifies, accelerates, and extends business intelligence and data science capabilities for enterprise customers across all industries. AtScale empowers customers to democratize data, implement self-service BI, and build a more agile analytics infrastructure to make more impactful decisions.
Job Description
You will manage a globally distributed team of highly skilled SRE and DevOps engineers to support and develop infrastructure, systems, and processes required for our cutting edge technologies for data analytics. You will be key in helping us leverage cutting edge technologies and apply best practices.
This is a technical and managerial position working across multiple cloud providers and infrastructure, CI/CD pipelines, and Observability tools, while also partnering closely with our Engineering Managers to ensure we are providing the support and ability to self-serve required to deliver to our customers successfully. This role also has a customer-facing aspect to support custom implementations.
The expectation for this role is to be a leader and complete management responsibilities but also to be hands-on when required (including working directly with clients).
Working Hours
AtScale is a globally distributed company with headquarters in Boston MA with employees spanning disparate timezones. To facilitate collaboration we require overlap with a core set of working hours from 10am to 5pm eastern time.
Responsibilities
- Collaborate on building a vision for where we need to be for DevOps and infrastructure, and assist in managing a plan for how to move towards the vision.
- Help establish best practices, document designs, and mentor team members
- Collaborate with project teams to ensure consistency in solution designs, leveraging architecture boards and other governance mechanisms
- Enhance and support incident management and escalation process
- Resolve support issues in production and non-production environments
- Define requirements, estimate work, track dependencies, report progress, highlight blockers
Requirements
- BA/BS preferred in a technical or engineering field
- 8 years experience in a DevOps culture and/or SRE team
- 3 years experience managing a team of 4 or more
- Experience with cloud technologies from providers like AWS, Azure, or GCP
- Experience with CI/CD and tools like GitHub Actions, Jenkins, CircleCI, SonarCube
- Experience with infrastructure-as-code and technologies such as Terraform
- Experience with Docker and orchestration technologies such as Kubernetes and Helm charts
- Experience designing robust systems for HA, Fail Over, and Disaster Recovery
- Experience with observability and technologies such as OpenTelemetry, ELK, Prometheus, etc
- Familiarity with KeyCloak or other identity and access management solutions
- Familiarity with cyber security concerns
- Familiarity with core aspects of networking and linux system administration