What are the responsibilities and job description for the AWS & EKS Engineer position at Flexon Technologies Inc.?
Job Details
Job Title: AWS & EKS Engineer
Location: Austin, TX; Sunnyvale, CA; Cupertino, CA (On-site)
Duration: Contract Position
Job Description:
We are looking for a highly skilled AWS & EKS Engineer with strong expertise in Kubernetes fundamentals, observability, and cloud infrastructure management.
The ideal candidate will have deep experience with Kubernetes CRDs, Istio, Helm Charts, and Service Accounts/permissions, along with a solid understanding of Grafana, Prometheus, OpenTelemetry, and security best practices.
You will play a key role in managing AWS IAM & EKS, optimizing observability solutions, and ensuring seamless integration of cloud-native technologies.
Required Skills & Experience:
Strong experience with AWS IAM & EKS.
Deep understanding of Kubernetes (CRDs, Istio, Helm Charts, Service Accounts, permissions).
Experience with observability tools (Grafana, Prometheus, OpenTelemetry).
Proficiency in Git workflows and version control.
Experience with troubleshooting, log analysis, and performance monitoring.
Good understanding of security best practices (certificates, keys, CA management).
Programming experience in Python/JavaScript (OOP in Java is a plus).
Nice-to-Have Skills:
Experience with Infrastructure as Code (Terraform).
Experience with CI/CD tools such as Spinnaker.
Knowledge of Splunk's Search Processing Language (SPL).
Experience with DaemonSets in Kubernetes.
Key Responsibilities:
AWS IAM & EKS Management: Configure and manage IAM roles, policies, permissions, and EKS clusters.
Kubernetes Administration:
Deploy and manage CRDs (Custom Resource Definitions), Istio service mesh, and Helm charts.
Configure Service Accounts and fine-tune permissions for security and efficiency.
Observability & Monitoring:
Implement and maintain OpenTelemetry Collector, Prometheus (PromQL), and Grafana.
Create dashboards and alerts, and optimize observability for traces and metrics.
Infrastructure as Code (IaC):
Define infrastructure using Terraform (module creation is a plus).
Implement and manage Git-based workflows for FluxCD, Flagger, Spinnaker (nice to have).
Security & Certificates Management:
Manage and troubleshoot keys, certificates, and Certificate Authorities (CAs).
CI/CD & Automation:
Work with Git extensively for version control and automation.
Contribute to CI/CD pipeline improvements (knowledge of FluxCD, Flagger, or Spinnaker is a plus).
Troubleshooting & Log Analysis:
Diagnose issues using Splunk's Search Processing Language (SPL) and log analysis.
Analyze application & infrastructure logs for performance bottlenecks and errors.
Documentation & Knowledge Sharing:
Document processes and best practices for team knowledge transfer.