What are the responsibilities and job description for the SRE Engineer position at Cordova?
We are seeking a Senior Site Reliability Engineer (SRE). As one of our SREs, you will be 100% hands on with both infrastructure and software development. We support and implement multi-enclave hybrid-cloud software factories for the DoD that includes Kubernetes (K8s) platforms, DevSecOps tools, IaC, Cybersecurity tools, and custom software.
This candidate will have the opportunity to work with technical leaders in Software Development, Cybersecurity, and Operations (DevSecOps) to develop the next generation of software factories. You will work both independently and collaboratively with your team to troubleshoot and resolve highly technical issues that customers encounter while using deploying solutions. You'll partner cross-functionally with product and engineering teams to drive feedback, improve internal & external tooling, launch new products & features, and deliver an exceptional customer experience.
Responsibilities :
- Participate in a collaborative Kanban multi-discipline team working closely with customer to accelerate cloud initiatives and improve processes
- Work with the customer to design and build CI / CD pipelines. Develop and integrate toolchain systems to provide path to production from development Software Factory
- Enable Continuous Integration / Continuous Delivery through appropriate design guidelines.
- Maintain traceability between requirements, design, and test cases
- Work directly with Development and Operations teams to increase velocity, prioritize tasks, implement requirements, and automate.
- Lead SRE and DevSecOps work initiatives from inception to production
- Knowledge of architecture concepts including microservices, container orchestration, and traditional 3-tier applications
- Design and implement Kubernetes platforms and tools chains
- Implement infrastructure as code using tools such as Ansible Automation Platform, and / or VMware vRealize Automation?
- Develop and maintain code (Bash, Python, YAML, PowerShell, Ruby, Groovy)
- Experience with observability tools such as Log Insight, Elastic Stack, Splunk, QRadar, or Prisma Cloud
- Design and implement enterprise on-premises and hybrid cloud deployments
- Lead efforts using Agile methodologies
- Ability to work both independently and in a team environment with clients and vendors, demonstrated technical leadership skills, good verbal and written communication skills
- Provide expertise in integrating and administering Kubernetes (K8s) Platforms (Tanzu, Open Shift, Konvoy), Elastic, Istio, Gitlab and other DevSecOps products.
- Provide expertise in system integration and development in an agile environment
- Troubleshooting and resolving technical support requests created by our customers spanning a growing range of container products and services, including Managed Kubernetes and Container Registries
- Contributing to internal documentation that provides your team with the resources they need to perform in their role and external documentation that allows our customers to self-serve
- Engaging customers and responding to technical questions received through our community Q&A forum
- Representing the voice of support, speaking on behalf of our customers through direct engagement with our product and engineering teams
- Assist customers on-site during release deployment and with periodic system / application patching
Basic Qualifications :
Desired Skills :
Clearance Level :
Education Level :
Do you have the skills to fill this role Read the complete details below, and make your application today.
J-18808-Ljbffr