
Lead Site Reliability Engineer with Java

Spark Infotech
San Antonio, TX Contractor
POSTED ON 4/5/2025
AVAILABLE BEFORE 5/4/2025

Job Description:

As a Lead Site Reliability Engineer (SRE), you will leverage your extensive experience in SRE practices to maintain and enhance the reliability, performance, and scalability of mission-critical systems. You will play a crucial role in ensuring the continuous availability and optimal functioning of our services.

Key Responsibilities:

Senior-Level SRE Expertise: Apply your deep understanding of SRE principles to lead efforts in improving system reliability and operational efficiency.

Incident Management: Provide expert-level support during incidents, ensuring swift resolution with minimal service disruption. Lead post-incident reviews to drive continuous improvement.

Monitoring & Alerting: Design, implement, and optimize monitoring, alerting, and incident response processes. Ensure the effectiveness of these systems to proactively address potential issues.

Automation: Drive the automation of manual processes to enhance operational efficiency, reduce human error, and increase overall system resilience.

CI/CD Pipeline Management: Develop, maintain, and improve automated CI/CD pipelines using tools such as GitLab CI/CD and Jenkins, ensuring seamless and reliable deployment processes.

Cross-Functional Collaboration: Work closely with cross-functional teams to ensure the reliability, performance, and scalability of our infrastructure. Foster a culture of collaboration and knowledge sharing.

Support Across Time Zones: Provide support across all U.S. time zones, with the flexibility to work weekends, rotational shifts, and overtime as required to maintain service continuity.



Required Skills & Qualifications:

Java Programming: Advanced proficiency in Java, with a deep understanding of contemporary software development practices.

Kubernetes & Containerization: Extensive hands-on experience with Kubernetes, including containerization technologies like Docker and Kubernetes storage solutions such as Portworx.

Linux/Unix Systems: Strong command of Linux/Unix operating systems and shell scripting (Bash), with a focus on system reliability and automation.

Functional & Declarative Programming: Proficiency in functional programming languages such as Haskell and OCaml, and in logic programming languages such as Prolog.

Scripting & Automation: Experience with Python or Go, particularly in the context of scripting and automation tasks.

Virtualization: In-depth knowledge of VMware and other virtualization platforms, with a focus on optimizing virtual environments for reliability and performance.

Streaming Technologies: Expertise with Kafka Stream Generator, ksqlDB, cluster federation, and Spark Streaming, including experience in managing and optimizing streaming data architectures.

Service Mesh & Networking: Familiarity with Istio and Anthos Service Mesh, with the ability to manage and optimize service meshes for complex environments.

Performance Monitoring & Debugging: Proficiency in using eBPF (extended Berkeley Packet Filter) for performance monitoring and debugging.
