What are the responsibilities and job description for the Senior Site Reliability Engineer position at United Software Group Inc?
Site Reliability Engineer
REMOTE (Nashville TN)
Description:
We are seeking a highly skilled and proactive Site Reliability Engineer (SRE) to join our team remotely. In this role, you will be responsible for managing build pipelines, troubleshooting issues, and ensuring the reliability and performance of our systems. You will work closely with development teams to enhance process flows, manage releases, and provide 24/7 production support.
Day-to-Day Responsibilities:
- Create and manage build pipelines in Bamboo and GitLab.
- Troubleshoot build and deployment failures.
- Oversee release management processes.
- Resolve performance and scaling issues, collaborating with engineers to manage traffic demands.
- Build and support CI/CD pipelines and production releases.
- Develop and maintain release architectures and monitoring frameworks.
- Create and manage requests and changes within ServiceNow.
- Assist developers through Jira tickets.
- Analyze application logs for errors and performance issues.
- Document processes and procedures.
- Champion process improvements and drive efficiencies.
- Create and deploy patches.
- Work with minimal direction and manage priorities effectively.
- Influence change and provide proactive 24/7 production support.
Requirements:
- 5 years of professional experience in technology or a related field.
- 2 years of CI/CD pipeline experience.
- 2 years of experience with Kubernetes/EKS, including pod life cycle management (readiness and liveness checks).
- Intermediate to advanced BASH shell scripting skills.
- Strong experience with Dynatrace APM and RUM; Dynatrace Associate Certification is a plus.
- Intermediate skills with on-prem GitLab CI pipeline creation, troubleshooting, and configuration.
- Knowledge of complex CDN cached website architecture.
- Familiarity with JavaScript (Node.js).
- Experience with application and web servers like Tomcat, Apache.
- Experience with ServiceNow and Jira for change management.
- Experience working within a DevOps team.
Preferred Skills:
- Knowledge of complex CDN cached website architecture.
- Familiarity with JavaScript (Node.js).
- Experience with application and web servers like Tomcat, Apache.
- Experience with ServiceNow and Jira for change management.
- Experience working within a DevOps team.