What are the responsibilities and job description for the Lead Site Reliability Engineer position at 4Sphere Software Solutions?
Location: San Antonio, TX (Hybrid)
Duration: 12 Months (Contract)
Industry: Banking & Financial Services
We are seeking a Lead Site Reliability Engineer (SRE) with strong Java expertise to support a high-availability banking platform. If you have hands-on experience in Java microservices, Kubernetes, Terraform, and event-driven architectures, this could be a great opportunity for you!
Key Responsibilities:
- Develop & deploy Java-based microservices (Spring Boot, REST APIs, Hibernate, GraphQL)
- Implement & optimize event-driven architectures (Kafka, Spark Streaming)
- Manage Kubernetes (EKS, AKS) clusters with Istio for secure service-to-service communication
- Automate infrastructure using Terraform, Ansible, and CI/CD pipelines (Jenkins, GitHub Actions)
- Enhance monitoring & observability with Prometheus, OpenTelemetry, Grafana, Splunk
- Optimize persistent storage solutions (Portworx, Amazon FSx) for high-availability banking apps
What We're Looking For:
- 8 years of SRE/DevOps experience with a focus on high-scale, cloud-native applications
- Java (Spring Boot, Microservices, GraphQL, REST API Development)
- Kafka, Spark Streaming, Event-Driven Architecture
- Kubernetes, Istio, Terraform, AWS, Azure
- Observability: OpenTelemetry, Grafana, Splunk, Datadog
- Hands-on experience in financial services (Banking, FinTech, or Payments is a plus)
- AWS or Kubernetes certifications (CKA, AWS DevOps Pro) are a plus
Interested? Send your resume to bharat@4spheresolutions.com