What are the responsibilities and job description for the Site Reliability Engineer (Rustici) US, Franklin, Remote position at Learning Technologies Group?
The Site Reliability Engineer (SRE) at Rustici Software contributes to the success of the Site Reliability team in deploying, monitoring, and maintaining multiple large applications along with hundreds of customer environments hosted in AWS. The SRE is an individual contributor reporting to the Director of DevSecOps and will participate in an on-call rotation schedule.
Duties and Responsibilities
- Help improve internal automation, specifically through the use of “Infrastructure as Code” tools
- Help maintain the infrastructure control plane and add new features to it
- Act as the primary contact to monitor, troubleshoot, and resolve production issues as part of an on-call rotation of roughly one week per month, in order to meet a 24/7/365 SLA
- Collaborate with the Director of DevSecOps and other members of the SRE team to explore, plan, and implement improvements to the security posture, reliability, performance, and cost of hosted resources
- Collaborate with one or more product development teams on application development direction as it relates to deployment and operations
- Collaborate with members of the support and integration teams to help support customer environments as it relates to deployment and operations
- Continuously improve knowledge of best practices in site reliability and technical skills related to security, automation, networking, and system operations
Skills and Experience
Experience
Technical Background
We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, colour, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, disability, or any other federal, state, or local protected class.