What are the responsibilities and job description for the Platform Engineer (Kubernetes) position at Kavaliro?
Job Description
Job Requirements
On-Site
- Interface with stakeholders to capture requirements, plan the execution of technical requirements, and provide verbal and written briefs to government stakeholders
- Interface with IA/J6 to provide the necessary documentation and artifacts through the RMF process leading to full ATO
- Assist with the deployment and architecture of clusters in an AWS IL6 environment
- Create a highly available, on-prem Kubernetes RKE2 deployment model for a mission-critical customer
- Deploy a high-availability RKE2 cluster in an air-gapped, on-prem datacenter environment
- Create a Kubernetes DevSecOps environment where third-party applications can be developed, secured and deployed within the ecosystem.
- Deploy and maintain MinIO in distributed mode. This enables data to be stored across multiple nodes, ensuring high availability and fault tolerance
- Implement multiple control-plane nodes with a load balancer in front to distribute the traffic and ensure high availability.
- Ensure a multi-node setup for GitOps tools and store configurations redundantly across nodes. Regular backups of configurations are essential.
- Utilize Patroni / Spilo / Scalyr for automating failover to a replica in case the primary node fails. Regular backups and real-time replication should also be implemented
- Ensure uptime, handle failovers, and scale as needed.
- Update container baselines and apply system patches
- Responsible for the overall functionality and maintenance of the HA cluster
- Work closely with the DevSecOps team to coordinate deployment of updates and other security patches.
- Able to work in secure environments at Hickam AFB and Camp Smith
- Maintain a DoD security clearance
- Travel to the Indo-Pacific region as required in support of government customer requirements
- Deploy and remotely maintain Kernos cluster in Japan and other regions
- Active DoD security clearance of Secret or higher
- Able to live and work in Hawaii
- Linux system administration experience
- Experience architecting, deploying, and maintaining a high-availability Kubernetes cluster in an air-gapped environment
- Familiar with DoD system security requirements and processes such as RMF, DISA STIG, container hardening, security patching, etc.
- Strong technical communication skills
- Ability to generate and brief architecture design and network diagrams
- Maintain, at a minimum, a Security+ certification
- Networking experience is a plus
- Motivated individual, willing to work in a small-team environment and assume different responsibilities when required.
- Problem-solving skills and a strong communication skill set
- Familiarity with AWS cloud automation and deployment tools such as Terraform, GitOps, S3, EKS, AKS, RDS, IAM, CloudWatch, etc.
- Experience architecting high-availability core services such as HA PostgreSQL, MinIO, Ingress NGINX, Keycloak, etc.