Demo

Senior Site Reliability Engineer

ApTask
Maryland, MD Full Time
POSTED ON 2/21/2025
AVAILABLE BEFORE 5/18/2025

About Client :

The Client is a leading global IT services and consulting company, providing a wide range of services to clients in various industries, including banking, financial services, retail, manufacturing, healthcare, and more. It is one of the largest employers in the IT industry and has a vast and diverse workforce. The company places a strong emphasis on employee training and development. Client is known for its commitment to innovation and invests in research and development to stay at the forefront of technological advancements.

It offers a comprehensive set of services, including :

IT Services : Application development, maintenance, and testing.

Consulting : Business consulting, IT strategy, and digital transformation.

Business Process Outsourcing (BPO) : Outsourcing of business processes to improve efficiency.

Enterprise Solutions : Implementation and support of enterprise-level software solutions. Digital Services : Services related to digital technologies, such as analytics, cloud, and IoT.

Salary Range : $130K-$140K / Annum

Job Description :

  • 6 years of experience as a Site Reliability Engineer or equivalent in a similar role.
  • Proven experience in monitoring, analyzing, and optimizing the performance of large-scale distributed systems.
  • Track record of operating and supporting Kubernetes in production at scale - EKS preferred.
  • Expertise in Linux systems administration, including managing servers, operating systems, and network configurations.
  • Strong scripting and automation skills, preferably with experience in Bash, Python, or similar languages.
  • Familiarity with AWS.
  • Experience with DevOps tools and practices, such as GitLab CI / CD, and Docker.
  • Excellent troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues.
  • Ability to work independently and as part of a collaborative team, effectively communicating technical concepts to both technical and non-technical stakeholders.
  • A passion for maintaining high availability, performance, and reliability of critical systems in a fast-paced financial environment.

Responsibilities : Availability :

  • Proactively monitor and proactively identify potential issues that could impact the availability of our systems.
  • Implement and maintain automated alerting mechanisms to notify the appropriate parties of potential outages or performance degradation.
  • Collaborate with development teams to design and implement solutions that enhance system resilience and reduce downtime.
  • Latency :

  • Analyze performance metrics to identify and resolve latency bottlenecks in our infrastructure.
  • Implement performance optimization techniques and tools to improve the overall responsiveness of our systems.
  • Work with development teams to ensure that new features and code changes do not introduce performance regressions.
  • Performance :

  • Develop and maintain metrics dashboards to track key performance indicators (KPIs) for our critical systems.
  • Identify performance trends and anomalies that may indicate potential issues or areas for improvement.
  • Recommend and implement performance optimization strategies to enhance the overall efficiency of our systems.
  • Efficiency :

  • Optimize resource utilization and minimize unnecessary expenditure on IT infrastructure.
  • Collaborate with development teams to optimize resource allocation for new applications and services.
  • Release Management :

  • Participate in the release planning process to ensure that software releases are conducted smoothly and without disruptions.
  • Develop and implement automated deployment and rollback procedures to mitigate risks associated with software updates.
  • Monitor the performance of new releases and address any issues that arise promptly.
  • Monitoring :

  • Design, implement, and maintain a comprehensive monitoring infrastructure to track the health and performance of our systems.
  • Analyze monitoring data to identify potential issues and proactively troubleshoot problems before they impact users.
  • Develop and implement alerts and notifications for critical events to ensure timely intervention.
  • Emergency Response :

  • Respond promptly to incidents and work collaboratively to resolve them in a timely manner.
  • Analyze root causes of incidents to identify and implement preventive measures to minimize their recurrence.
  • Document incident responses and lessons learned to enhance our incident handling processes.
  • Participate in capacity planning exercises to anticipate future workloads and make proactive recommendations to expand or optimize infrastructure resources.
  • Stay abreast of emerging technologies, trends, and industry best practices in the field of site reliability engineering and contribute to the continuous improvement of our practices and tools.
  • Work with development teams to review architecture design to ensure high availability and proper disaster recovery strategy.
  • Collaborate with reliability and infrastructure engineering team to build synergy in tooling for the implementation of observability, tracing, and alerting.
  • About ApTask :

    ApTask is a leading global provider of workforce solutions and talent acquisition services, dedicated to shaping the future of work. As an African American-owned and Veteran-certified company, ApTask offers a comprehensive suite of services, including staffing and recruitment solutions, managed services, IT consulting, and project management. With a focus on excellence, collaboration, and innovation, ApTask provides unparalleled opportunities for professional growth and development. As a member of the ApTask team, you will have the chance to connect businesses with top-tier professionals, optimize workforce performance, and drive success across diverse industries. Join us at ApTask and be part of our mission to empower organizations to thrive while fostering a diverse and inclusive work environment.

    Applicants may be required to attend interviews in person or by video conference. In addition, candidates may be required to present their current state or government issued ID during each interview.

    Candidate Data Collection Disclaimer :

    At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain Personally Identifiable Information (PII) may be requested by our clients for verification and application purposes. Rest assured, we strictly adhere to confidentiality standards and comply with all relevant data protection laws. Please note that we only collect the necessary information as specified by each client and do not request sensitive details during the initial stages of recruitment.

    If you have any concerns or queries about your personal information, please feel free to contact our compliance team at businessexcellence@aptask.com.

    Salary : $130,000 - $140,000

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Senior Site Reliability Engineer?

    Sign up to receive alerts about other jobs on the Senior Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $114,618 - $136,401
    Income Estimation: 
    $144,264 - $191,312
    Income Estimation: 
    $140,435 - $166,410
    Income Estimation: 
    $114,618 - $136,401
    Income Estimation: 
    $144,264 - $191,312
    Income Estimation: 
    $140,435 - $166,410
    Income Estimation: 
    $140,435 - $166,410
    Income Estimation: 
    $151,875 - $212,356
    Income Estimation: 
    $169,957 - $202,398
    Income Estimation: 
    $76,670 - $90,826
    Income Estimation: 
    $91,609 - $118,978
    Income Estimation: 
    $92,877 - $110,401
    Income Estimation: 
    $92,877 - $110,401
    Income Estimation: 
    $120,933 - $155,034
    Income Estimation: 
    $114,618 - $136,401
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at ApTask

    ApTask
    Hired Organization Address Phoenix, AZ Full Time
    About Client: The Client is a leading global IT services and consulting company, providing a wide range of services to c...
    ApTask
    Hired Organization Address Sunnyvale, CA Full Time
    About Client: The Client is a leading global IT services and consulting company, providing a wide range of services to c...
    ApTask
    Hired Organization Address Kansas, KS Full Time
    About Client : The client is a global technology, consulting, and digital solutions company with problem-solving abiliti...
    ApTask
    Hired Organization Address Norcross, GA Full Time
    About Client : Company is a worldwide provider of legal services, serving law firms, corporations, financial institution...

    Not the job you're looking for? Here are some other Senior Site Reliability Engineer jobs in the Maryland, MD area that may be a better fit.

    Senior Site Reliability Engineer

    Peraton, Annapolis, MD

    Senior Site Reliability Engineer - FedRAMP

    VIKTech LLC, Annapolis, MD

    AI Assistant is available now!

    Feel free to start your new journey!