What are the responsibilities and job description for the Software Support - onsite IN position at Royal Communications Consultants Inc?
Software Support - onsite South Bend, INDIANA
• GPFS
- Experience monitoring GPFS , quota sizes, pool sizes and managing it effectively
- Debug GPFS issues with respect to data and metadata corruption, and GPFS performance issues-
• LSF
- manage issues with NHC (node health check) monitoring from LSF-
- Experience with LSF configuration and add remove nodes from LSF and manage downtimes/reservations with LSF
• Puppet/Foreman
- Experience writing puppet code and Hiera, along with running it effectively on hundreds of nodes
- Manage deployments with foreman when necessary, and experience managing pxe, dhcp, dns, kickstart scripts and postscripts
• Icinga/Nagios
- Experience managing Icinga, setting up alerts and downtimes
• Scripting
- Experience writing bash and python scripts with REST API
• User support
- Experience helping users with login issues, managing LDAP/AD accounts, build software and debug software issues
• L3 Linux support
- Monitoring logs and debug OS issues with respect to performance and best practices
• SLA
- less than 10 mins to respond to software alerts
- less than 1 day to resolve the issue (depends on severity and complexity).
- Availability on weekends for Sev1 issues.
Nice to have
• Experience with other scheduling / parallel filesystem technologies
• Scientific computing experience
• Weka experience
• GPFS
- Experience monitoring GPFS , quota sizes, pool sizes and managing it effectively
- Debug GPFS issues with respect to data and metadata corruption, and GPFS performance issues-
• LSF
- manage issues with NHC (node health check) monitoring from LSF-
- Experience with LSF configuration and add remove nodes from LSF and manage downtimes/reservations with LSF
• Puppet/Foreman
- Experience writing puppet code and Hiera, along with running it effectively on hundreds of nodes
- Manage deployments with foreman when necessary, and experience managing pxe, dhcp, dns, kickstart scripts and postscripts
• Icinga/Nagios
- Experience managing Icinga, setting up alerts and downtimes
• Scripting
- Experience writing bash and python scripts with REST API
• User support
- Experience helping users with login issues, managing LDAP/AD accounts, build software and debug software issues
• L3 Linux support
- Monitoring logs and debug OS issues with respect to performance and best practices
• SLA
- less than 10 mins to respond to software alerts
- less than 1 day to resolve the issue (depends on severity and complexity).
- Availability on weekends for Sev1 issues.
Nice to have
• Experience with other scheduling / parallel filesystem technologies
• Scientific computing experience
• Weka experience