What are the responsibilities and job description for the Cloud & Site Reliability Engineer position at Bold Penguin, Inc?
As a Cloud & Site Reliability Engineer, you will be a subject matter expert in building highly reliable, highly scalable features and infrastructure. You’ll use DevOps principles to ensure that Bold Penguin’s software systems are always available and ready to scale to meet growing demands.
WHAT YOU'LL DO
WHAT YOU'LL DO
- Ensure reliability, performance, and availability of our platform by working as part of a cross-functional product team.
- Participate in agile ceremonies such as iteration planning, retrospectives, and daily standups.
- Be part of the shared on-call rotation and proactively research possible issues affecting the availability of our platform.
- Understand and clearly articulate tradeoffs in architecture decisions with regards to cost, security, operational efficiencies, performance, and availability.
- Build and maintain infrastructure with executable code (IaC) and automated delivery pipelines.
- Be passionate about Cloud/DevOps/SRE concepts such as Immutable Infrastructure, Cattle vs Pets, Infrastructure as Code, Delivery Pipelines.
- Additional duties and responsibilities as assigned.