Demo

Site Reliability Engineer

HCL Global Systems, Inc.
Roanoke, TX Full Time
POSTED ON 3/9/2025
AVAILABLE BEFORE 6/8/2025

Skills : Datadog

Kubernetes

AWS (EKS) and Azure (AKS) would prefer AWS

On-call experience running incidents

Development background : Ansible, Python, node, Javascript, Jenkins, groovy

The Expertise and Skills we're Looking For

  • Bachelor's degree or higher in a technology related field (e.g. Engineering, Computer Science, etc.) required
  • 5-8 years of hands-on experience deploying and / or supporting highly distributed multi-tiered systems at scale
  • Hands-on experience with Public Cloud environments, preferably AWS and Azure. Certifications a plus
  • Hands-on experience with container orchestration, preferably with Kubernetes
  • Working experience on batch processing using tools like Control M, Informatica etc.
  • Ability to solve application issues on Unix / Linux with J2EE, WebSphere, Tomcat and SQL
  • Exposure to basic OS level scripting languages such as Korn / Bash / Jscript
  • Familiarity with ITIL processes like Incident management, Change / Problem management
  • Balancing delivery with ad hoc workloads and re-evaluating priorities
  • Solid understanding of Cloud Computing and DevOps concepts including CI / CD pipelines
  • Hands on experience with one or more observability tools (Prometheus, Grafana, ELK / OpenSearch, OpenTelemetry, Datadog, etc.)
  • Use Datadog, Catchpoint, Splunk & Grafana for Application Observability and monitoring of app & infrastructure
  • Experienced in Instrumentation with systems skills on building and operating, monitoring, logging, alerting services of distributed systems at scale
  • Proven experience in maintaining scalability and resiliency of complex environment.
  • Proven experience in implementing advanced observability practices and techniques at scale.
  • Provide enterprise Cloud and Platform Engineering support for production environments and ability to participate in on-call rotation to provide solutions.
  • Experience in Cloud development (AWS and Azure) and migration skills; Experience with building and operating highly resilient platforms in public cloud environments
  • Ability to triage, complete root cause analysis, and be decisive under pressure
  • Experience managing and interpreting large datasets using query languages and visualization tools
  • Proficient communication skills with an ability to reach both technical and non-technical audience
  • Ability to learn new software, method and practices and bringing them to our developers
  • Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner and build and maintain effective relationships
  • Proven experience performing chaos testing to build confidence in the system's capability to withstand turbulent conditions in production
  • Strong understanding in API testing tools (SoapUI, Postman)
  • Understanding of Agile Methodology
  • Experience managing systems using infrastructure as code tools (IAM, ARM, Terraform, Chef)
  • Handle a huge fleet of on-prem servers (including security & patching oversight)
  • Handle hundreds of SSL certificates for all applications in scope
  • Use Ansible & Python for automating day-to-day activities, Web development with Django, JavaScript
  • Collaboration and Relationships - Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner and build and maintain effective relationship

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineer?

Sign up to receive alerts about other jobs on the Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$137,568 - $176,908
Income Estimation: 
$137,568 - $176,908
Income Estimation: 
$158,960 - $205,707
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at HCL Global Systems, Inc.

HCL Global Systems, Inc.
Hired Organization Address Jackson, MS Full Time
his position will require a qualified Quality Assurance Analyst to take the lead on the following tasks: Support the pla...
HCL Global Systems, Inc.
Hired Organization Address Salt Lake, UT Full Time
Job Details Senior Core Java Developer Job Location:- Salt Lake City, Utah (Candidate needs to work 5 Days at the Client...
HCL Global Systems, Inc.
Hired Organization Address Richmond, VA Full Time
This individual will be primarily responsible for assisting with development of processes and architecture to support mi...
HCL Global Systems, Inc.
Hired Organization Address Richmond, VA Full Time
Seeking a highly organized and detail-oriented Project Coordinator 2 to support project management for the Health Inform...

Not the job you're looking for? Here are some other Site Reliability Engineer jobs in the Roanoke, TX area that may be a better fit.

Site Reliability Engineer

Charles Schwab, Roanoke, TX

Principal Site Reliability Engineer

Fidelity Investments, Roanoke, TX

AI Assistant is available now!

Feel free to start your new journey!