What are the responsibilities and job description for the Site Reliability Engineer position at Quindar?
What You’ll Be Doing
As a Site Reliability Engineer (SRE), you will be laser-focused on deploying, maintaining, and rapidly resolving issues for customers and internal users of Quindar’s SaaS product. You will perform a variety of functions including supporting nominal releases and fixes, taking a leadership role in on-call support and incident remediation, and writing software / infrastructure code alongside development teams.
You will work daily alongside software development, test, and leadership stakeholders throughout project lifecycles to understand deployment and maintenance strategy, minimize potential for architectural disconnects, and build out systems for monitoring and maintaining product capabilities upon release. This will require you to exercise a combination of skillsets spanning software engineering, platform engineering, systems engineering, and software testing.
You will spend time working directly with Quindar’s customers and product / customer success teams to understand common pain points, mitigate them in existing capabilities, and pre-empt them in future Quindar development.
You will be a part of the engineering team working in a highly iterative agile product development environment.
Tech Skills
- Proficiency in Python and experience working with software teams developing distributed microservice software capabilities
- Experience with cloud services in AWS and ability to diagnose and remediate issues encountered in cloud-native deployments in AWS
- Experience with Kubernetes, containerized applications, and serverless architecture and ability to diagnose and remediate issues encountered in software services deployed in k8s
- Experience with monitoring tools such as DataDog, Splunk, AWS CloudWatch
- Familiarity with Infrastructure as Code tools and engineering fundamentals and ability to work alongside Platform team on both nominal platform architecture and incident remediation
- Strong knowledge of developing and troubleshooting API services, distributed NoSQL and relational databases, caching systems, event-driven and multi-tier architectures
- Strong knowledge of task automation and CI/CD pipeline building; GitHub Actions experience preferred but not required
- Understanding of Unix/Linux operating systems
- Experience with Git and strong focus on git hygiene and release management
- Deep technical analysis and troubleshooting skills
- Strong sense of urgency and ability to drive rapid remediation of issues while working with stakeholders to build robust and maintainable long-term solutions
- Independent self-starter; able to complete projects on time with minimal guidance
- Strong communication skills and ability to engage effectively across development teams, program management, and customers
- Bachelor’s degree in Computer Science (or related field)
- 5 years of professional experience as a software development, platform, and/or reliability engineer
- US Citizenship and ability to obtain a security clearance
- To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (aka green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State.
- We are a remote-first workplace with flexible hybrid work schedules in our offices in Seattle and Denver - we provide work from home benefits so you always have a nice place to work, speedy internet, and of course coffee/tea!
- We take work-life balance very seriously. We provide unlimited PTO, require employees to take at least 15 days off, and follow most US federal government holidays.
- Mental health is just as important as physical health, so we provide quarterly health & wellness benefits.
- Comprehensive health insurance for you and your family with 100% coverage for employees.
- We encourage employees to save for retirement and provide 4% 401(k) matching.
- Each quarter we have a 4-day company offsite. Previous locations include San Francisco, Nashville, Denver, Santa Fe, New Orleans, San Diego, and Bozeman.
- Our culture and company are evolving. You will be key in creating the next major or minor version!
Salary: $130,000 - $180,000