What are the responsibilities and job description for the AWS & EKS Engineer position at Galaxy i Technologies, Inc.?
Job Details
AWS & EKS Engineer (Kubernetes & Observability Focus)
Austin, TX / Bay Area, CA
W2 Only
Job Description:
We are looking for a highly skilled AWS & EKS Engineer with strong expertise in Kubernetes
fundamentals, observability, and cloud infrastructure management.
The ideal candidate will have deep experience with Kubernetes CRDs, Istio, Helm Charts, and
Service Accounts/permissions, along with a solid understanding of Grafana, Prometheus, Open
Telemetry, and security best practices.
You will play a key role in managing AWS IAM & EKS, optimizing observability solutions, and
ensuring seamless integration of cloud-native technologies.
Required Skills & Experience:
Strong experience with AWS IAM & EKS.
Deep understanding of Kubernetes (CRDs, Istio, Helm Charts, Service Accounts,
permissions).
Experience with observability tools (Grafana, Prometheus, Open Telemetry).
Proficiency in Git workflows and version control.
Experience with troubleshooting, log analysis, and performance monitoring.
Good understanding of security best practices (certificates, keys, CA management).
Programming experience in Python/JavaScript (OOP in Java is a plus)
Nice-to-Have Skills:
Experience with Infrastructure as Code (Terraform).
CI/CD tools like Spinnaker.
Splunk query language (SPL) knowledge.
Experience with Daemon Sets in Kubernetes.
Key Responsibilities:
AWS IAM & EKS Management: Configure and manage IAM roles, policies, permissions, and EKS
clusters.
Kubernetes Administration:
Deploy and manage CRDs (Custom Resource Definitions), Istio service mesh, and Helm charts.
Configure Service Accounts and fine-tune permissions for security and efficiency.
Observability & Monitoring:
Implement and maintain Open Telemetry Collector, Prometheus (PromQL), and Grafana.
Create dashboards, alerts, and optimize observability for traces & metrics.
Infrastructure as Code (IaC):
Define infrastructure using Terraform (module creation is a plus).
Implement and manage Git-based workflows for FluxCD, Flagger, Spinnaker (nice to have).
Security & Certificates Management:
Manage and troubleshoot keys, certificates, and Certificate Authorities (CAs).
CI/CD & Automation:
Work with Git extensively for version control and automation.
Contribute to CI/CD pipeline improvements (knowledge of FluxCD, Flagger, or Spinnaker is a plus).
Troubleshooting & Log Analysis
Diagnose issues using Splunk Query Language (SPL) and log analysis.
Analyze application & infrastructure logs for performance bottlenecks and errors.
Documentation & Knowledge Sharing
Document processes and best practices for team knowledge transfer.
Note: For Immediate response please reach out to me at suresh2 at galaxy i tech dot com / four eight zero six nine six five three nine four