Demo

Site Reliability Engineer

Saxon Global
Los Angeles, CA Full Time
POSTED ON 2/22/2025
AVAILABLE BEFORE 5/18/2025

Looking for a highly motivated Site Reliability Engineer, who is capable of build and run large-scale, massively distributed, fault-tolerant systems. Individual to work with teams across the organization and ensures core services reliability and keep an eye on capacity and performance.

This is for a migration from AWS into GCP. Knowledge and experience with GCP is mandatory, knowledge of AWS is nice to have.

  • Responsible for blameless postmortems and proactive identification of potential outages factor into iterative improvement.
  • Experience in Designing and Deploying multi-data center Large Scale Web Applications.
  • Work closely with dev, and ops teams to build highly available, cost-effective systems.
  • Create new tools and scripts designed for auto-remediation of incidents.
  • Design / Implementation of Big Data technologies, including Hadoop, MongoDB, Kafka, RabbitMQ, Zookeeper, Spark, ELK, etc.
  • Responsible for establishing end-to-end monitoring and alerting on all critical aspects to ensure SLAs and get proactive notifications of possible issues for all systems.
  • Design platforms for extremely high uptime metrics.
  • Works well independently and requires little or no supervision.
  • Work with cloud operations team to resolve trouble tickets, developing and running scripts, and troubleshooting.
  • Fully understand the application, microservices interactions.
  • Design / Implementation containers / applications in scalable HA / DR multi-tier cloud environments, including new system design, documentation, implementation, and deployment.
  • Participate in 24x7 an on-call rotation.

Job Requirements (7 years of experience in the following areas) :

  • Experience in providing L4 technical support for production 24x7.
  • Strong experience in production support and operations.
  • Design / Implementation of network and presentation tier technologies, including F5, Apache, Nginx, etc.
  • Experience in Performance Testing / Tuning / Monitoring, maximizing system uptime and availability, ensuring functional and performance SLAs.
  • Experience with monitoring Application / Infrastructure Performance, and availability.
  • Automation Experience with Build / deployment, Software Configuration / Continuous Integration / Continuous Delivery / Release Engineering related tasks in an JavaEE / C Environments.
  • Experience in automating manual processes using Python, Ruby, Unix Shell (bash, ksh), perl, Ant, etc.
  • Installing, Configuring, Administering, and Tuning of JavaEE Application Servers / Containers like Tomcat, WebSphere, etc.
  • Installing / maintaining / Administering software on Unix Linux, Windows servers.
  • Experience with Web service technologies, including REST, SOAP, JSON, XML.
  • Experience with Cloud Platforms and virtualization Technologies.
  • Deploying and automating infrastructure / applications in cloud environment using Chef, RPM, etc.
  • Working closely with Development, QA, Product Management, and Production Ops teams to make sure Product Releases on-time with quality.
  • Hands on experience Configuring and Administering SCM (GIT, SVN), Build (CMake, Make files, Maven), CI(Jenkins), CD Automation Tools.
  • Experience with database (RDBMS, NoSql) technologies is a plus.
  • Experience with Performance Testing is a plus.
  • Configuring and maintaining SDLC Environments.
  • Experience in Agile Methodologies and processes.
  • Strong Automation, problem-solving skills, and ability to follow through to completion.
  • Demonstrated leadership skills through a variety of activities, including leading or mentoring technical staff.
  • Strong verbal / written communication skills.
  • Participate in 24x7 an on-call rotation.
  • Required Skills : Looking for an SRE who can assist with moving from AWS into GCP. GCP is a required skill. AWS is just a nic eto have. Design / Implementation of Big Data technologies, including Hadoop, MongoDB, Kafka, RabbitMQ, Zookeeper, Spark, ELK, etc.

    Background Check : Yes

    Drug Screen : Yes

    Notes :

    Selling points for candidate :

    Project Verification Info : The information provided below is for Apex Systems AV use only and is not to be distributed publicly, or to any third party. Any distribution of the below information will result in corrective action from Apex Systems Vendor Management. MSA : Blanket Approval Received Client Letter : Will Provide

    Candidate must be your W2 Employee : Yes

    Exclusive to Apex : No

    Face to face interview required : No

    Candidate must be local : Yes

    Candidate must be authorized to work without sponsorship : : No

    Interview times set : : No

    Type of project : Master Job Title :

    Branch Code :

    If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
    Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

    What is the career path for a Site Reliability Engineer?

    Sign up to receive alerts about other jobs on the Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
    Income Estimation: 
    $92,369 - $122,605
    Income Estimation: 
    $117,024 - $149,811
    Income Estimation: 
    $154,509 - $200,187
    Income Estimation: 
    $188,252 - $252,911
    Income Estimation: 
    $71,493 - $96,419
    Income Estimation: 
    $92,369 - $122,605
    Income Estimation: 
    $117,024 - $149,811
    Income Estimation: 
    $137,568 - $176,908
    View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

    Job openings at Saxon Global

    Saxon Global
    Hired Organization Address Lincoln, NE Full Time
    General duties : Consult with peers and subordinates on application issues Use application development tools and utiliti...
    Saxon Global
    Hired Organization Address Bentonville, AR Full Time
    Apex Systems is seeking candidates for a Sr. React Native Developer with one of our top national clients. If working in ...
    Saxon Global
    Hired Organization Address Bentonville, AR Full Time
    Client : Wal-Mart Role : JAVA Engineers Focused Team of 7-8 ( additional project initiative may be coming as well for an...
    Saxon Global
    Hired Organization Address Bentonville, AR Full Time
    Qualifications / Description : 6 years of Android platform experience with Kotlin, Android Studio, Retrofit, Dagger, Jen...

    Not the job you're looking for? Here are some other Site Reliability Engineer jobs in the Los Angeles, CA area that may be a better fit.

    Site Reliability Engineer

    iVedha, Los Angeles, CA

    Site Reliability Engineer

    Tik Tok, Los Angeles, CA

    AI Assistant is available now!

    Feel free to start your new journey!