What are the responsibilities and job description for the Site Reliability Engineer position at Allara?
About Allara
Allara is a telemedicine platform delivering expert, multidisciplinary healthcare for women with hormonal conditions, covered by insurance. Allara's comprehensive telehealth platform pairs patients with a dedicated care team to provide expert medical guidance, nutrition and lifestyle coaching, and personalized treatment plans that go beyond the basics. As one of the fastest-growing women's health platforms in the US, Allara provides innovative, high-quality care while addressing long-overlooked gaps in the healthcare system for women with hormonal health needs. Allara is backed by leading investors including Google Ventures and Index.
About the role
As a key member of our engineering team and our first dedicated Site Reliability Engineer, you'll play a crucial role in ensuring the reliability, performance, and scalability of our care delivery platform. You'll be instrumental in building and maintaining the infrastructure that supports our mission to deliver accessible, personalized care. This is a chance to be a foundational member of our engineering team, with significant opportunities for professional growth and leadership as we scale. We value innovation, reliability, collaboration, and a commitment to health equity. Join us in ensuring the stability and scalability of our critical platform!
We believe that in-person collaboration is essential for our team's success. We aim for in-person work at least three days per week in our NYC office, with flexibility where needed.
What you'll do
- Collaborate with development teams to design and implement resilient and scalable infrastructure.
- Develop and maintain automation for deployment, monitoring, and incident response.
- Implement and manage monitoring and alerting systems to proactively identify and address potential issues.
- Conduct post-incident reviews to identify root causes and implement preventative measures.
- Optimize system performance and reliability through capacity planning and performance tuning.
- Support code quality and maintainability through infrastructure as code, automated testing, and documentation.
- Drive continuous improvement in our engineering practices, including incident response, deployment processes, and system design.
- Investigate and resolve platform incidents and bugs.
- Support cost-conscious best practices for our systems.
We are looking for someone with proven experience in Site Reliability Engineering or a similar role (Platform Engineer, DevOps Engineer), and a passion for building reliable and scalable systems. We value diverse experiences and perspectives. You are:
- Experienced with cloud platforms (AWS, GCP)
- Experienced with containers and container orchestration technologies (Docker, ECS, EKS)
- Familiar with infrastructure as code tools (e.g., Terraform, CloudFormation)
- Experienced with monitoring and alerting tools (e.g., Datadog, New Relic, Prometheus, Grafana)
- Experienced with automation
- Knowledgeable about networking and system administration
- Excited to work across all layers of a platform, diving into whatever is needed
- An excellent collaborator and communicator, with a demonstrated ability to work effectively with cross-functional teams and other engineers
- Able to analyze complex systems and troubleshoot issues effectively
- Passionate about improving system reliability and performance
- Experience in regulated industries, even outside of healthcare
- Experience with container orchestration (e.g., ECS)
- Experience working in fast-paced startup environments
- Experience with database optimization
- Experience ensuring security and compliance of our infrastructure
- Compensation: Base Salary and Equity
- Comprehensive Medical, Dental, Vision, and Disability Benefits
- 401(k)
- Unlimited PTO
- Additional perks (monthly in-office lunches, annual wellness stipend, etc.)