Platform & HPC Data Engineer

Rgi Corp
Herndon, VA Full Time
POSTED ON 1/24/2025
AVAILABLE BEFORE 4/23/2025

RGi is searching for a talented Platform and HPC Data Engineer to join our team, where you will play a key role in the design, implementation, and optimization of data management solutions within high-performance computing (HPC) environments. We are looking for a candidate with substantial experience in diverse file systems, data labeling and tagging systems, as well as the configuration of various storage appliances.

In this position you will be responsible for ensuring that data workflows, storage configurations, and metadata management are not only efficient and scalable but also adhere to organizational and government security standards. As part of a dynamic, cross-disciplinary team, you will help address the technical demands of HPC platforms, effective data management, and large-scale computational workflows. Join us in advancing our innovative solutions and shaping the future of high-performance computing!

Clearance:

Active Top Secret clearance with willingness and ability to obtain an SCI and CI polygraph

US Citizenship required

Interested in this role? You can find all the relevant information in the description below.

As a Platform & HPC Data Engineer you will...

  • Design and implement data management systems and architectures for HPC platforms, focusing on optimizing data flow, storage, and access in large-scale computing environments.
  • Oversee the configuration, maintenance, and optimization of distributed file systems (e.g., Lustre, IBM Spectrum Scale (GPFS), NFS) and storage solutions used in HPC environments to ensure efficient performance, scalability, and reliability.
  • Implement and manage metadata-driven systems for data labeling/tagging. This includes the development of strategies for classifying, indexing, and organizing datasets to enhance data discoverability, access control, and auditing.
  • Configure and maintain various storage appliances (e.g., NetApp, Dell EMC, HPE) and integrated storage solutions. Ensure that storage devices are optimized for performance, capacity, and availability within the HPC ecosystem.
  • Integrate data storage and management systems with HPC clusters, ensuring seamless data flow between compute nodes and storage appliances. Optimize data pipelines to support high-throughput workloads and minimize bottlenecks in I/O performance.
  • Monitor and improve the performance of storage systems, focusing on I/O throughput, latency, and efficient resource allocation. Use performance metrics to guide optimizations across storage appliances and file systems.
  • Implement security best practices for data access, protection, and management, ensuring compliance with government regulations and internal data governance policies. Configure encryption, access control, and secure data sharing methods.
  • Develop and maintain automation scripts (e.g., using Python, Bash, or Perl) to streamline storage configurations, data labeling/tagging, and system monitoring tasks. Automate processes related to data integration and HPC platform management.
  • Work closely with data scientists, HPC administrators, software developers, and other technical staff to support ongoing projects. Provide expertise in troubleshooting data storage issues and ensuring optimal system performance.
  • Maintain thorough documentation for storage configurations, file system setups, data labeling/tagging procedures, and performance optimization strategies. Provide regular reports on system health, data management processes, and any improvements made.

Platform & HPC Data Engineer Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field. A Master’s degree or higher is a plus.
  • 7 years of experience in managing data infrastructure in HPC environments, with expertise in file systems, storage appliances, and data workflows.
  • Hands-on experience with distributed file systems, including Lustre, IBM Spectrum Scale (GPFS), NFS, and others commonly used in HPC settings.
  • Proven experience with storage appliance configuration (e.g., NetApp, Dell EMC, HPE, or similar systems), including performance tuning, capacity management, and reliability.
  • Strong experience in implementing data labeling/tagging systems, metadata management, and structuring large datasets for efficient access and compliance.
  • Knowledge of high-performance networking protocols (e.g., InfiniBand, RDMA) and their role in data transfer and storage optimization.
  • Familiarity with data access protocols like GridFTP, rsync, and NFS for large-scale data transfer.

Additional Skills We'd Like to See:

  • Experience with cloud storage integration or hybrid cloud environments, with knowledge of cloud-native storage solutions (e.g., AWS S3, Ceph, OpenShift).
  • Familiarity with high-performance computing (HPC) schedulers (e.g., SLURM, PBS, Torque) and their interaction with data storage systems.
  • Understanding of data protection mechanisms, including data replication, backup strategies, and disaster recovery in HPC environments.
  • Experience with containerization (Docker, Singularity) in an HPC context for data processing and application deployment.
  • Experience with machine learning or data science workflows in HPC environments.

Who we are:

    Reinventing Geospatial, Inc. (RGi) is a fast-paced small business that has the environment and culture of a start-up, with the stability and benefits of a well-established firm. We solve complex problems within geospatial software development and national defense to make an Immediate Impact for our nation’s soldiers and analysts.

    We pride ourselves on giving employees an exceptional life experience, where creativity thrives, and challenges are simply part of the fun. We provide truly excellent benefits, including:

  • 100% paid employee healthcare & dental insurance
  • Paid parental leave
  • 401k with matching
  • Escalating vacation time
  • Referral bonuses
  • Tuition reimbursement
  • Professional development training
  • Free beverages and snacks
  • Weekly catered lunches and breakfast on Fridays

Grow to be our next leader:

    At RGi, fostering a strong and organic corporate culture is paramount and serves as a compass for the decisions we make and how we operate the company. We believe our culture of camaraderie, innovation, and collaboration reflects the caliber of our employees and their dedication to the mission of providing quality software to our customers. As such, we want our employees to feel empowered to seek growth and leadership opportunities within the company and to position us to maintain our culture as we grow. RGi provides opportunities, resources, training, and mentorship to all our employees to let them take control of their careers and become a leader or a crucial member of our company. If this is what you are looking for in a company, then you are what we are looking for in an employee.

    Reinventing Geospatial, Inc. is an Equal Opportunity Employer committed to hiring and retaining a diverse workforce. We are an Equal Opportunity Employer, making decisions without regard to race,
