
Site Reliability Engineer (5667)

MetroStar
Hanscom AFB, MA Full Time
POSTED ON 3/4/2025
AVAILABLE BEFORE 9/10/2025

As Site Reliability Engineer, you’ll lead the design, implementation, and management of highly available and scalable systems, applying industry best practices and reliability engineering principles.


We know that you can’t have great technology services without amazing people. At MetroStar, we are obsessed with our people and have led a two-decade legacy of building the best and brightest teams. Because we know our future relies on our deep understanding and relentless focus on our people, we live by our mission: A passion for our people. Value for our customers.


If you think you can see yourself delivering our mission and pursuing our goals with us, then check out the job description below!


What you’ll do:



  • Collaborate with cross-functional teams to identify performance bottlenecks, troubleshoot complex issues, and optimize system performance to meet defined service level objectives.

  • Design and implement monitoring, alerting, and incident response strategies to proactively identify and mitigate potential issues, ensuring uninterrupted service availability.

  • Drive automation initiatives to streamline deployment, configuration management, and infrastructure provisioning processes.

  • Develop and maintain comprehensive documentation for system configurations, processes, and procedures.

  • Participate in on-call rotations and respond to incidents, working diligently to resolve issues and prevent recurrence.


What you’ll need to succeed:



  • Active Secret U.S. Government security clearance or higher.

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.

  • Minimum of 3 years of professional experience in a Site Reliability Engineering role or similar capacity.

  • Strong experience with cloud technologies (e.g., AWS, Azure, GCP) and infrastructure as code (e.g., Terraform, Ansible).

  • Proficiency in managing, leading, and engineering incident and outage response.

  • Strong engineering experience with network protocols and load balancing (e.g., TCP/IP, DNS, HTTP/HTTPS).

  • Proficiency in programming and scripting languages (e.g., Python, Go, Bash) and RPA tools (e.g., Blue Prism, UiPath) to automate tasks and develop tools.

  • Deep understanding of containerization and orchestration technologies (e.g., Kubernetes, Docker).

  • Expertise in implementing and managing monitoring and logging solutions (e.g., Splunk, Prometheus, Grafana, ELK stack).

  • Familiarity with CI/CD pipeline development and management (e.g., GitLab CI, Azure DevOps, AWS Lambda, Jenkins).

  • Proven track record of designing, building, and maintaining highly available and scalable systems.

  • Expert proficiency in developing automated functional, regression, and performance tests, and in defining automated testing standards for development teams.

  • Experience facilitating change and configuration management processes to drive reliability.

  • Strong problem-solving skills, with the ability to diagnose complex issues and implement effective solutions.

  • Excellent communication skills, with the ability to collaborate effectively across diverse teams.

Like we said, we are big fans of our people. That’s why we offer a generous benefits package, professional growth, and valuable time to recharge. Learn more about our company culture code and benefits. Plus, check out our accolades.


Commitment to Non-Discrimination

All qualified applicants will receive consideration for employment based on merit and without regard to sex, race, ethnicity, age, national origin, citizenship, religion, physical or mental disability, medical condition, genetic information, pregnancy, family structure, marital status, ancestry, domestic partner status, sexual orientation, gender identity or expression, veteran or military status, status as a protected veteran, or any other status protected by applicable federal, state, local, or international law.


What we want you to know:


In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.


Not ready to apply now?


Sign up to join our newsletter here.



