Demo

Site Reliability Engineer

HCL Global Systems, Inc.
Roanoke, TX Full Time
POSTED ON 3/9/2025
AVAILABLE BEFORE 6/8/2025

Skills : Datadog

Kubernetes

AWS (EKS) and Azure (AKS) would prefer AWS

On-call experience running incidents

Development background : Ansible, Python, node, Javascript, Jenkins, groovy

The Expertise and Skills we're Looking For

  • Bachelor's degree or higher in a technology related field (e.g. Engineering, Computer Science, etc.) required
  • 5-8 years of hands-on experience deploying and / or supporting highly distributed multi-tiered systems at scale
  • Hands-on experience with Public Cloud environments, preferably AWS and Azure. Certifications a plus
  • Hands-on experience with container orchestration, preferably with Kubernetes
  • Working experience on batch processing using tools like Control M, Informatica etc.
  • Ability to solve application issues on Unix / Linux with J2EE, WebSphere, Tomcat and SQL
  • Exposure to basic OS level scripting languages such as Korn / Bash / Jscript
  • Familiarity with ITIL processes like Incident management, Change / Problem management
  • Balancing delivery with ad hoc workloads and re-evaluating priorities
  • Solid understanding of Cloud Computing and DevOps concepts including CI / CD pipelines
  • Hands on experience with one or more observability tools (Prometheus, Grafana, ELK / OpenSearch, OpenTelemetry, Datadog, etc.)
  • Use Datadog, Catchpoint, Splunk & Grafana for Application Observability and monitoring of app & infrastructure
  • Experienced in Instrumentation with systems skills on building and operating, monitoring, logging, alerting services of distributed systems at scale
  • Proven experience in maintaining scalability and resiliency of complex environment.
  • Proven experience in implementing advanced observability practices and techniques at scale.
  • Provide enterprise Cloud and Platform Engineering support for production environments and ability to participate in on-call rotation to provide solutions.
  • Experience in Cloud development (AWS and Azure) and migration skills; Experience with building and operating highly resilient platforms in public cloud environments
  • Ability to triage, complete root cause analysis, and be decisive under pressure
  • Experience managing and interpreting large datasets using query languages and visualization tools
  • Proficient communication skills with an ability to reach both technical and non-technical audience
  • Ability to learn new software, method and practices and bringing them to our developers
  • Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner and build and maintain effective relationships
  • Proven experience performing chaos testing to build confidence in the system's capability to withstand turbulent conditions in production
  • Strong understanding in API testing tools (SoapUI, Postman)
  • Understanding of Agile Methodology
  • Experience managing systems using infrastructure as code tools (IAM, ARM, Terraform, Chef)
  • Handle a huge fleet of on-prem servers (including security & patching oversight)
  • Handle hundreds of SSL certificates for all applications in scope
  • Use Ansible & Python for automating day-to-day activities, Web development with Django, JavaScript
  • Collaboration and Relationships - Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner and build and maintain effective relationship

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineer?

Sign up to receive alerts about other jobs on the Site Reliability Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$92,369 - $122,605
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$117,024 - $149,811
Income Estimation: 
$137,568 - $176,908
Income Estimation: 
$137,568 - $176,908
Income Estimation: 
$158,960 - $205,707
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at HCL Global Systems, Inc.

HCL Global Systems, Inc.
Hired Organization Address Jersey, NJ Full Time
8 years of hands-on experience as an Oracle pl / sql developer. Strong RDBMS skills - SQL querying, writing stored proce...
HCL Global Systems, Inc.
Hired Organization Address Roanoke, TX Full Time
Location : Merrimack, NH / Westlake, TX REQUIRED SKILLS Strong hands-on experience with one or more of the following : K...
HCL Global Systems, Inc.
Hired Organization Address Moscow, ID Full Time
Job Details Install and support Windows environments Experience using ticketing system Strong organization, problem solv...
HCL Global Systems, Inc.
Hired Organization Address Jackson, MS Full Time
Role : Business Analyst / Technical Writer Location : Jackson, MS (first 2 weeks onsite, there after remote) Duration : ...

Not the job you're looking for? Here are some other Site Reliability Engineer jobs in the Roanoke, TX area that may be a better fit.

Sr. Site Reliability Engineer

Charles Schwab, Southlake, TX

Site Reliability Engineer

Lensa, Roanoke, TX

AI Assistant is available now!

Feel free to start your new journey!