You haven't searched anything yet.
Job: Site Reliability Engineer
On-Site (Atlanta, St Louis, or Denver)
Must have Skills Google Cloud Platform, Java, .Net
Looking for a highly motivated Site Reliability Engineer, who is capable of build and run large-scale, massively distributed, fault-tolerant systems. Individual to work with teams across the organization and ensures core services reliability and keep an eye on capacity and performance.
Responsible for blameless postmortems and proactive identification of potential outages factor into iterative improvement.
Experience in Designing and Deploying multi-data center Large Scale Web Applications.
Work closely with dev, and ops teams to build highly available, cost-effective systems.
Create new tools and scripts designed for auto-remediation of incidents.
Design/Implementation of Big Data technologies, including Hadoop, MongoDB, Kafka, RabbitMQ, Zookeeper, Spark, ELK, etc.
Responsible for establishing end-to-end monitoring and alerting on all critical aspects to ensure SLAs and get proactive notifications of possible issues for all systems.
Design platforms for extremely high uptime metrics.
Works well independently and requires little or no supervision.
Work with cloud operations team to resolve trouble tickets, developing and running scripts, and troubleshooting.
Fully understand the application, microservices interactions.
Design/Implementation containers/applications in scalable HA/DR multi-tier cloud environments, including new system design, documentation, implementation, and deployment.
Participate in 24x7 an on-call rotation.
Job Requirements (7 years of experience in the following areas):
Experience in providing L4 technical support for production 24x7.
Strong experience in production support and operations.
Design/Implementation of network and presentation tier technologies, including F5, Apache, Nginx, etc.
Experience in Performance Testing/Tuning/Monitoring, maximizing system uptime and availability, ensuring functional and performance SLAs.
Experience with monitoring Application/Infrastructure Performance, and availability.
Automation Experience with Build/deployment, Software Configuration/Continuous Integration/Continuous Delivery/Release Engineering related tasks in an JavaEE/C Environments.
Experience in automating manual processes using Python, Ruby, Unix Shell (bash, ksh), perl, Ant, etc.
Installing, Configuring, Administering, and Tuning of JavaEE Application Servers/Containers like Tomcat, WebSphere, etc.
Installing/maintaining/Administering software on Unix Linux, Windows servers.
Experience with Web service technologies, including REST, SOAP, JSON, XML.
Experience with Cloud Platforms and virtualization Technologies.
Deploying and automating infrastructure/applications in cloud environment using Chef, RPM, etc.
Working closely with Development, QA, Product Management, and Production Ops teams to make sure Product Releases on-time with quality.
Hands on experience Configuring and Administering SCM (GIT, SVN), Build (CMake, Make files, Maven), CI(Jenkins), CD Automation Tools.
Experience with database (RDBMS, NoSql) technologies is a plus.
Experience with Performance Testing is a plus.
Configuring and maintaining SDLC Environments.
Experience in Agile Methodologies and processes.
Strong Automation, problem-solving skills, and ability to follow through to completion.
Demonstrated leadership skills through a variety of activities, including leading or mentoring technical staff.
Strong verbal/written communication skills.
Participate in 24x7 an on-call rotation.
Full Time
IT Outsourcing & Consulting
$104k-117k (estimate)
06/06/2024
08/05/2024
datumsoftware.com
JOHNS CREEK, GA
25 - 50
2001
Private
LATHA GANESHAN
$10M - $50M
IT Outsourcing & Consulting
Datum Software develops, delivers and supports specialized technology solutions for Fortune 1000 companies and government institutions.
The job skills required for Site Reliability Engineer include Problem Solving, Leadership, Presentation, Agile, Python, Written Communication, etc. Having related job skills and expertise will give you an advantage when applying to be a Site Reliability Engineer. That makes you unique and can impact how much salary you can get paid. Below are job openings related to skills required by Site Reliability Engineer. Select any job title you are interested in and start to search job requirements.
The following is the career advancement route for Site Reliability Engineer positions, which can be used as a reference in future career path planning. As a Site Reliability Engineer, it can be promoted into senior positions as a Corrosion Engineer II that are expected to handle more key tasks, people in this role will get a higher salary paid than an ordinary Site Reliability Engineer. You can explore the career advancement for a Site Reliability Engineer below and select your interested title to get hiring information.
If you are interested in becoming a Site Reliability Engineer, you need to understand the job requirements and the detailed related responsibilities. Of course, a good educational background and an applicable major will also help in job hunting. Below are some tips on how to become a Site Reliability Engineer for your reference.
Step 1: Understand the job description and responsibilities of an Accountant.
Quotes from people on Site Reliability Engineer job description and responsibilities
Similarly to the point above, a site reliability engineer can expect to spend time fixing support escalation cases.
03/16/2022: Little Rock, AR
More times than not, site reliability engineers will need to take on-call responsibilities.
01/31/2022: Lexington, KY
Focuses on the reliability of behind-the-scenes systems that help make other teams' jobs more efficient.
02/24/2022: Tuscaloosa, AL
Site reliability engineers may have to spend a considerable amount of time fixing cases related to support escalation.
02/25/2022: Manchester, NH
Step 2: Knowing the best tips for becoming an Accountant can help you explore the needs of the position and prepare for the job-related knowledge well ahead of time.
Career tips from people on Site Reliability Engineer jobs
The objective was to ensure service reliability and availability within operations management.
12/28/2021: Lima, OH
Step 3: View the best colleges and universities for Site Reliability Engineer.