What are the responsibilities and job description for the Principal Engineer, Cloud SRE/DevOps (SD/DC/Remote) (R3138) position at The Rundown AI, Inc.?
About Shield AI
Founded in 2015, Shield AI is a venture-backed defense technology company focused on protecting service members and civilians with intelligent systems. Its flagship autonomy software, Hivemind, powers aircraft, drones, and other platforms, enabling complex missions with high reliability in contested environments. With offices in San Diego, Dallas, Washington, D.C., and internationally, Shield AI’s products actively support U.S. and allied operations worldwide. For more information, visit www.shield.ai. Follow Shield AI on LinkedIn, Twitter, and Instagram.
As a Cloud SRE / DevOps Engineer on the Forge team, you will be responsible for optimizing Forge’s cloud deployments and owning the processes that enable customers to deploy their own Forge instances. You will manage Shield AI’s internal Hivemind instances, working closely with the software operations and engineering teams to ensure Forge can scale for simulation, testing, and bursts of use. You will also enable seamless upgrades, canary deployments, and system robustness. Additionally, you’ll serve as the primary point of contact for the customer engagement team, providing expert guidance on deploying Forge in customer environments.
What You'll Do :
- Optimize cloud deployments of Forge to ensure scalability, reliability, and cost efficiency.
- Design and document processes for external customers to deploy Forge instances using the SDK in on-premises or hybrid environments.
- Manage and maintain internal Hivemind instances, ensuring their ability to handle large-scale simulation and testing workloads.
- Collaborate with the software operations team to enhance Forge’s ability to scale dynamically, accommodate bursts of use, and support continuous upgrades with minimal disruption.
- Develop tools and processes for canary deployments, ensuring smooth rollouts of new features and updates.
- Serve as the primary technical consultant for the customer engagement team, providing expertise on deploying and managing Forge in external environments.
- Create and maintain detailed, user-friendly documentation and tutorials for deployment processes, catering to both internal teams and external customers.
- Monitor, troubleshoot, and resolve issues related to Forge deployments, ensuring high availability and performance.
Required Qualifications :
Preferred Qualifications :
Shield AI is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender identity or Veteran status. If you have a disability or special need that requires accommodation, please let us know.
J-18808-Ljbffr