What are the responsibilities and job description for the Software Engineer, Data Ingestion position at Torch.AI?
Become Part of the Torch.AI Journey!
Torch.AI is a defense-focused AI software company. Unlike traditional government contractors, our team of experts takes calculated risks to self-fund R&D behind the scenes and then sells complete products "off-the-shelf" to mission owners. We conduct deep research into how AI and new data infrastructures can improve a growing array of national defense needs. This allows us to go from ideation to full capability deployment in weeks and months, instead of years. We’re passionate about solving complex problems.
Torch.AI’s focus on the U.S. defense and national security industry offers you an unparalleled opportunity to contribute directly to the safety and well-being of our nation while building new innovative technologies. As a vital partner to the U.S. government and our allies, helping to shape global stability, we offer a dynamic environment to tackle complex challenges across multidisciplinary domains. With substantial investment in innovation, Torch.AI is at the forefront of developing AI, autonomous systems, and advanced national security solutions, founded on the premise that information is the new battlefield. Join us in our mission to help the most important organizations in the world Unlock Human Potential.
The Role: Unlock Your Potential
As a Software Engineer specializing in Data Ingestion at Torch.AI, you will tackle the challenge of identifying, researching, and acquiring publicly available information (PAI) and open-source intelligence (OSINT) data, which serve as critical sources of intelligence for our customers. You will leverage existing collection capabilities and web crawlers within the Torch.AI platform, enhance related technologies and ML models to support customers at scale, and introduce innovative approaches to data collection.
Each of our customers requires unique technical solutions to enable an asymmetric advantage on the battlefield. Torch.AI’s award-winning, patented software helps remove common obstacles such as manually intensive data processing, parsing, and analysis, thereby reducing the cognitive burden on the warfighter. Our modular, end-to-end data processing, orchestration, and management platform supports a wide variety of military capabilities and operations. Customers enjoy enterprise-grade solutions that meet specialized needs. Torch.AI encourages company-wide collaboration to share context, skills, and expertise across a variety of tools, technologies, and development practices.
You’ll work autonomously while driving coordinated, collaborative decisions across cross-functional product development teams of defense and national security experts, veterans, and experienced AI/ML software engineers. Your code will integrate elegant user experiences with unmatched back-end data processing capabilities by implementing responsive designs, optimizing application performance, and integrating with complex back-end services and APIs. You will have the opportunity to harden and scale existing platform capabilities, tools, and technologies, while also working to innovate and introduce new capabilities and features that benefit our company and customers.
Successful candidates thrive in a fast-paced, entrepreneurial, and mission-driven environment. We hire brilliant patriots. You’ll be encouraged to think creatively, challenge conventional approaches, and identify alternative ways to deliver customer value across complex problem sets. The day-to-day workflow will vary, adapting to the requirements of our customers and the technical needs of respective use cases. One day, you may be supporting the development of a proof-of-capability concept for a new customer program; another, you may be refining system performance to help scale a production deployment; the next, you may be working directly with customers, bringing deep intellectual curiosity to understanding their requirements.
What Sets This Role Apart
- Our decentralized operating model puts every employee at the forefront of our customers’ missions. You’ll work within and across both nimble, customer-centric solutions teams and research and development teams.
- We value customer intimacy, unique perspectives, and dedication to delivering lasting impact and results. You’ll have the opportunity to work on the frontlines of major customer programs and influence lasting success for Torch.AI and your teammates.
- You’ll have the opportunity to work on a wide range of projects, from designing and demonstrating early capabilities and prototypes to deploying large-scale mission systems.
- You’ll contribute directly to Torch.AI’s continued position as a market leader in data infrastructure AI and compete against multi-billion-dollar incumbents and high-tech AI companies.
- We develop solutions directly supporting our nation’s warfighters, national security, and prosperity; the impact of your work is directly visible.
Critical Skills
- B.S. degree in a related field or an equivalent combination of training and experience.
- Extensive experience with Python and JSON.
- Ability to extract data from open-access, publicly available data sources.
- Ability to develop and maintain large-scale web crawlers to access internet-wide data.
- Ability to build data pipelines for the rapid ingestion of data.
- Extract and clean data from websites, ensuring data accuracy and consistency.
- Optimize system performance through indexing, partitioning, and other techniques.
- Familiarity with Infrastructure-as-Code (IaC) tools and configuration management systems.
What We Value
- An entrepreneurial mindset.
- Ability to create targeted, deep collection tools to acquire data from high-value sources and maximize recall.
- Design and manage an internet-scale data retrieval system to collect accessible online information.
- Ability to translate data between various formats, including JSON, Parquet, and Avro.
- Knowledge of Spark, Kafka, and Airflow.
- Proficient in utilizing cloud-based computing and storage platforms.
- Design and maintain database schemas to ensure efficient data storage and retrieval.
- Build tools to enhance the monitoring and troubleshooting of the collection system.
- Work closely with team members to optimize data acquisition workflows.
- Awareness of ethical considerations and responsible AI practices.
- Ensure data quality aligns with established standards and criteria.
- Excellent problem-solving skills, attention to detail, and ability to thrive in a fast-paced, collaborative environment.
- Eligible for Top Secret security clearance.
Professional Ambiance
- This role thrives in a cutting-edge, high-performance workspace.
- Our base of operations is in Leawood, KS.
- We have engineering and forward deployed mission teams in key cities across the U.S.
- This is a full-time on-site role in our Kansas City headquarters.
Equity Program
- All employees are eligible to participate in the company equity incentive program within their first 12 months of employment. We are proud that 100% of our employees are equity-owning partners at Torch.AI.
Incentives and Advantages
- Competitive salary, performance bonus, and benefits package.
- Opportunity to participate in Torch.AI’s employee equity incentive program.
- Unlimited PTO.
- 11 paid holidays each year.
- Dynamic and energetic teammates.
- Incredible chance for professional advancement in a rapidly scaling high-tech environment.
- Weekly in-office catering in our Leawood HQ.
- Access to the company entertainment suite at the Kansas City T-Mobile Center, with tickets to all major events and concerts.
- Exceptional medical, dental, and vision insurance.
- Company-sponsored life and disability coverage.
- Relocation benefits.
Torch.AI is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, national origin, protected veteran status, or status as an individual with a disability.
These positions are being reviewed and filled on a rolling basis, and multiple openings may be available for each role.