What are the responsibilities and job description for the Kubernetes and AWS EKS Engineer position at NetRoadshow?
Job Description
Kubernetes and AWS EKS Engineer
Role and Responsibilities
Key Responsibilities :
We are looking for a highly skilled Kubernetes and AWS EKS Engineer to join our team. The ideal candidate will have extensive experience managing containerized workloads and scaling distributed systems using Kubernetes and AWS EKS. This role involves designing, implementing, and maintaining Kubernetes-based solutions while ensuring high availability, scalability, and security. You will work closely with cross-functional teams to streamline DevOps processes and improve operational efficiency.
Required Skills :
- 4 years of DevOps experience developing and managing scalable, secure, and reliable cloud infrastructure on AWS.
- 2 years of hands-on experience with Kubernetes and AWS EKS.
- Strong knowledge of Kubernetes architecture, including control planes, nodes, and pod lifecycle management.
- Expertise in AWS services (EKS, VPC, IAM, Route 53, S3, CloudWatch, and RDS).
- Proficiency in containerization technologies, such as Docker.
- Knowledge of Kubernetes networking, including CNI plugins (such as Calico and Weave Net), Ingress controllers, and load balancers.
- Familiarity with monitoring and logging tools (Prometheus, Grafana, Fluentd, AWS CloudWatch).
- Strong understanding of security best practices in Kubernetes, including RBAC, pod security policies, and secrets management.
- Experience with GitOps workflows using tools like ArgoCD or Flux.
- Excellent troubleshooting skills and ability to resolve complex Kubernetes issues.
- Experience with Helm charts for application deployment and customization.
- Hands-on experience with Infrastructure as Code (IaC) tools, such as Terraform or AWS CloudFormation, for managing Kubernetes clusters.
- Experience with scripting and development languages such as Python, Java, C#, or PowerShell.
- Experience in branching, merging, and maintaining code versions using SCM tools like Git and GitHub on both Windows and Linux platforms.
- Knowledge of RESTful APIs, distributed architectures, and cloud-native systems.
- Basic understanding of machine learning and AI technologies (a plus).
- Commitment to continuous learning: staying current with the latest AWS services, trends, and best practices, and recommending improvements to existing infrastructure.
Example Tasks / Workflows :
Automation: Build reusable, testable IaC and pipelines to enhance engineering efficiency.
Incident Resolution: Troubleshoot system issues and perform root-cause analysis to prevent recurrence.
Scalability and Performance: Work with development teams to implement improvements to scalability, performance, and security across applications.
Distributed Workloads: Collaborate with the architecture team to design and deploy distributed systems on Kubernetes.
How Success is Measured :
Delivery of highly available and resilient Kubernetes clusters with minimal downtime.
Successful automation of infrastructure provisioning, deployment, and scaling workflows.
Kubernetes clusters that consistently meet security and compliance standards.
Efficient resource utilization and cost optimization for Kubernetes workloads.
Seamless collaboration with development and security teams to enable cloud-native solutions.
Required Proficiencies :
Leverage AWS services like EKS, Fargate, and EC2 to build and manage scalable, resilient, and cost-effective Kubernetes environments.
Implement and manage CI/CD pipelines using Jenkins, GitHub Actions, ArgoCD, or Flux to automate application deployments to Kubernetes clusters.
Work with development teams to optimize resource allocation, auto-scaling (HPA, VPA, Cluster Autoscaler), and network policies in Kubernetes for better performance and security.
Build reusable, testable Kubernetes manifests, Helm charts, and Kustomize overlays, leveraging Terraform for infrastructure provisioning and consistency.
Implement best practices for RBAC, Pod Security Standards, IAM roles for service accounts, secrets management, and AWS security controls (WAF, GuardDuty, Shield).
Automate infrastructure provisioning, monitoring, and incident resolution using Prometheus, Grafana, Loki, OpenTelemetry, and AWS CloudWatch to improve operational efficiency.
Collaborate with development teams to optimize application performance, scalability, and security while iterating on infrastructure improvements.
Work as part of a cross-functional team, ensuring seamless integration between infrastructure, development, and security processes to deliver high-quality client-facing and internal solutions.
Build reusable, testable infrastructure-as-code (IaC) using Terraform to promote engineering efficiency and standardization across teams.
Drive security best practices across AWS environments, implementing robust access controls and monitoring to meet compliance and operational standards.
Foster a culture of automation, continuously improving processes like infrastructure provisioning, deployment, and incident resolution to minimize manual intervention.
remote work