What are the responsibilities and job description for the Infrastructure Engineer position at Radiant Digital?
Beacon Systems, Inc., a subsidiary of Radiant Digital Solutions, delivers Program Management, Science, Engineering, and Technology Solutions to Federal, Commercial, State, and Local Agencies. Our support extends across leading organizations such as the DoD, NASA, FDA, Voice of America, and several U.S. state governments, including Florida, Rhode Island, Mississippi, North Dakota, Virginia, and West Virginia.
We are currently seeking a DevOps Engineer – Infrastructure Automation for a contract opportunity in Dallas, TX. If you're interested, please ensure your most recent resume highlights all the required skills and experience listed below.
Position: DevOps Engineer – Infrastructure Automation
Location: Dallas, TX
Contract Duration: 12 Months (with possible extension)
Start Date: ASAP
Role Overview
We are looking for a highly skilled and motivated Senior DevOps Engineer to join the Storage and Compute Platform Management team. This contractor role focuses on designing and automating scalable infrastructure systems that support our high-performance computing (HPC) and large-scale storage environments.
You will be a key contributor in building tools and processes that ensure reliability, observability, and performance for multimegawatt-scale CPU and GPU compute farms used in quantitative research and machine learning workloads.
Key Responsibilities
We are currently seeking a DevOps Engineer – Infrastructure Automation for a contract opportunity in Dallas, TX. If you're interested, please ensure your most recent resume highlights all the required skills and experience listed below.
Position: DevOps Engineer – Infrastructure Automation
Location: Dallas, TX
Contract Duration: 12 Months (with possible extension)
Start Date: ASAP
Role Overview
We are looking for a highly skilled and motivated Senior DevOps Engineer to join the Storage and Compute Platform Management team. This contractor role focuses on designing and automating scalable infrastructure systems that support our high-performance computing (HPC) and large-scale storage environments.
You will be a key contributor in building tools and processes that ensure reliability, observability, and performance for multimegawatt-scale CPU and GPU compute farms used in quantitative research and machine learning workloads.
Key Responsibilities
- Design and implement infrastructure automation frameworks for provisioning HPC and storage platforms.
- Apply infrastructure-as-code and configuration management best practices to ensure system consistency and repeatability.
- Collaborate with platform and DevOps teams to enhance system scalability, reliability, and observability.
- Monitor and troubleshoot infrastructure performance and reliability issues across compute and storage components.
- Drive continuous improvement initiatives through performance tuning, automation, and capacity planning.
- Support deployment and operation of distributed systems across the enterprise.
- Extensive experience with infrastructure engineering, especially in compute and storage systems at scale.
- Strong background in Python programming for automation, scripting, and integration tasks.
- Expertise in CI/CD pipelines and tools such as Jenkins, GitLab CI, or ArgoCD.
- Proficient with Infrastructure-as-Code and configuration management tools like Terraform, Ansible, and Puppet.
- Experience with observability and monitoring tools (e.g., Prometheus, Grafana, ELK Stack).
- Solid understanding of Linux system administration and networking principles.
- Hands-on with containerization and orchestration platforms (Docker, Kubernetes).
- Familiarity with public cloud platforms (AWS, Azure, GCP) and hybrid environments.
- Prior exposure to HPC environments or large-scale storage infrastructure is highly desirable.
- Strong communication skills, attention to detail, and a proactive, collaborative work style.
- Experience working in fast-paced, high-availability environments.
- Ability to work independently and manage complex technical projects from start to finish.
- Passion for automation, scalability, and performance optimization.
Salary : $45 - $50