What are the responsibilities and job description for the Sr. Site Reliability Engineer position at Judge Group, Inc.?
Job Details
Location: Owings Mills, MD
Salary: $60.00 USD Hourly - $65.00 USD Hourly
Description: Our client is currently seeking a Sr. Site Reliability Engineer
This is going to be a W2 contract and is onsite in Owings Mills, MD - MUST be local
Senior Site Reliability Engineer
Overview
The Technology Engineering team is looking for an experienced Site Reliability Engineer to join us as we reimagine production application and infrastructure management. The team is responsible for engineering scalable and resilient hybrid cloud solutions (both AWS and On-prem). You will create tooling and software that monitors and improves the reliability of our systems. In this role, you will research problems, evaluate modern technologies, create prototypes, develop observability tooling, and provide SRE consulting on complex projects.
Key Responsibilities
Business Knowledge
Requirements
Preferred Qualifications
By providing your phone number, you consent to: (1) receive automated text messages and calls from the Judge Group, Inc. and its affiliates (collectively "Judge") to such phone number regarding job opportunities, your job application, and for other related purposes. Message & data rates apply and message frequency may vary. Consistent with Judge's Privacy Policy, information obtained from your consent will not be shared with third parties for marketing/promotional purposes. Reply STOP to opt out of receiving telephone calls and text messages from Judge and HELP for help.
Contact:
This job and many more are available through The Judge Group. Please apply with us today!
Salary: $60.00 USD Hourly - $65.00 USD Hourly
Description: Our client is currently seeking a Sr. Site Reliability Engineer
This is going to be a W2 contract and is onsite in Owings Mills, MD - MUST be local
Senior Site Reliability Engineer
Overview
The Technology Engineering team is looking for an experienced Site Reliability Engineer to join us as we reimagine production application and infrastructure management. The team is responsible for engineering scalable and resilient hybrid cloud solutions (both AWS and On-prem). You will create tooling and software that monitors and improves the reliability of our systems. In this role, you will research problems, evaluate modern technologies, create prototypes, develop observability tooling, and provide SRE consulting on complex projects.
Key Responsibilities
- Design and implement highly automated systems/services ensuring availability, reliability, and scalability of infrastructure and applications.
- Build and maintain monitoring and alerting systems to provide timely feedback on performance and health of systems, network, and applications.
- Design and implement automation tools to reduce manual toil, streamline repetitive tasks, and enhance overall operational efficiency.
- Design and build Service Level Indicator (SLI) metrics, including Service Level Objectives (SLOs), Error Budget, and Burn Rate Alerts.
- Work closely with development teams to embed reliability best practices into the software development process.
- Provide mentorship and training to cross-functional teams on SRE principles, encouraging shared responsibility for service reliability.
- Collaborate with support, operations, and engineering teams to investigate and troubleshoot complex problems.
- Observe and monitor systems to gain insight into system performance, health, and availability.
- Understand what to monitor based on the systems managed, how monitoring data is stored, and how to analyze the data for future actions.
- Participate in continuous improvement efforts spanning multiple domains and inform the generation of new standards.
- Be part of an on-call rotation, continuously enhance automation & documentation, and mentor others on infrastructure automation best practices.
- Overcome differences of opinion and drive team alignment around specific goals or solutions.
- Hold associates and teams accountable for adhering to practices and policies.
Business Knowledge
- Demonstrate deep knowledge of products/flows within supported businesses.
- Decompose complex problems into discrete work units.
- Identify non-obvious relationships and anomalies often overlooked by others.
- Balance strategic and pragmatic concerns when solving problems.
- Make sound decisions with limited facts or resources.
- Make decisions cognizant of the firm's broader business strategy.
- Articulate broader business concerns and/or regulatory landscape, including key risks and controls (e.g., GDPR, MIFID, SOX).
Requirements
- Prior experience as an SRE.
- Strong experience with Monitoring and Automation tools such as Prometheus, Grafana, New Relic.
- Strong familiarity with the SRE "pillars of visibility" (logs, metrics, and traces).
- Scripting experience with Python (NOT Perl).
- Cloud experience with Amazon CloudWatch, Amazon Elastic Container Service (ECS), and Amazon Elastic Kubernetes Service (EKS).
- A 4-year college degree is mandatory for this role.
- Experience in container orchestration solutions in AWS with ECS, Fargate.
- Docker container development experience.
- Skilled in building and maintaining dashboards using tools like Grafana, Prometheus, and Statsd.
- Experience using automation tools such as Terraform, Ansible.
- Excellent written and oral communication skills.
- Strong interpersonal skills, adaptable, and able to learn quickly.
- Off-hour implementations are required.
- Ability to build positive working relationships with business contacts, IT team, and other IT departments.
- Ability to identify tasks and help develop project plans for medium and large-scale projects.
Preferred Qualifications
- College degree in computer science or related technical field with 7 years of systems design, programming, implementation, and integration experience.
- 3 years of experience within the Amazon Web Services platform.
- AWS, Kubernetes Certifications.
By providing your phone number, you consent to: (1) receive automated text messages and calls from the Judge Group, Inc. and its affiliates (collectively "Judge") to such phone number regarding job opportunities, your job application, and for other related purposes. Message & data rates apply and message frequency may vary. Consistent with Judge's Privacy Policy, information obtained from your consent will not be shared with third parties for marketing/promotional purposes. Reply STOP to opt out of receiving telephone calls and text messages from Judge and HELP for help.
Contact:
This job and many more are available through The Judge Group. Please apply with us today!
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
Salary : $60 - $65