Demo

Site Reliability Engineer

Stanley David and Associates
Edison, NJ Full Time
POSTED ON 4/4/2025
AVAILABLE BEFORE 6/4/2025

Job Details

Skill: Site reliability engineering-Senior Engineer
Must have skills:
  • Python or Java.
  • Splunk Cloud, Thousand Eyes, cloud platforms such as AWS, Google Cloud, or Azure.
  • Docker and Kubernetes.
Responsibilities:
  • System Reliability: Work with production support teams to implement scalable, maintainable systems, continuously seeking improvements and optimizations in infrastructure and application architecture.
  • Toil Reduction - Automation: Build and maintain tools and scripts for automating repetitive tasks, deployment processes, monitoring, and incident responses, reducing manual interventions and minimizing human errors.
  • Incident Management: Participate in major incidents (on-call rotations), respond to incidents and service outages, promptly investigate and resolve system issues, and conduct post-mortems to prevent future incidents through Problem management.
  • Monitoring and Alerting: Establish and maintain monitoring and alerting systems to proactively identify potential issues, ensuring timely notifications to relevant teams during critical situations.
  • Capacity Planning and Performance Optimization: Monitor system performance, identify bottlenecks, collaborate with engineering teams for performance optimization, and plan for future growth.
  • Error Budgeting and Chaos Engineering: Diagnose and recommend optimization opportunities, conducts mock drills to improve stability and resiliency.
  • Documentation: Develop and maintain comprehensive documentation for system configurations, processes, and troubleshooting procedures to enhance knowledge sharing and team efficiency.
Minimum Qualifications -
  • Knowledgeable in cloud platforms such as AWS, Google Cloud, or Azure, and familiar with containerization technologies like Docker and Kubernetes.
  • Proficient in using infrastructure-as-code tools like Terraform and Ansible for automation and configuration management.
Preferred Qualifications -
  • Experienced in software development with proficiency in programming languages like Python or Java.
  • Familiar with monitoring and logging tools such as Splunk Cloud, ThousandEyes.
  • Understands networking principles and protocols.
  • Capable of working collaboratively in a fast-paced, dynamic environment with excellent problem-solving skills.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineer?

Sign up to receive alerts about other jobs on the Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$92,877 - $110,401
Income Estimation: 
$120,933 - $155,034
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$137,568 - $176,908
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Stanley David and Associates

Stanley David and Associates
Hired Organization Address Cincinnati, OH Full Time
Role :: Windows Server Admin Location :: Cincinnati, OH Type :: Fulltime Job Description Extensive experience managing W...
Stanley David and Associates
Hired Organization Address Phoenix, AZ Full Time
Job Details Role :: Google Cloud Platform & AWS Engineer Location :: Phoenix, Az Type :: Fulltime Experience Range - 6 Y...
Stanley David and Associates
Hired Organization Address Sunrise, FL Full Time
Job Details Proficiency programming in more than one object-oriented programming language; JAVA, Python etc. o Experienc...
Stanley David and Associates
Hired Organization Address Nebraska, NE Full Time
Role :: EUC Technician/Site IT Support Location :: Nebraska City, NE Type :: Fulltime Job Description:: Strong in Commun...

Not the job you're looking for? Here are some other Site Reliability Engineer jobs in the Edison, NJ area that may be a better fit.

Site Reliability Engineer

Diverse Lynx, Woodbridge, NJ

Optical Reliability Engineer

Cisco Systems, Inc., Holmdel, NJ

AI Assistant is available now!

Feel free to start your new journey!