What are the responsibilities and job description for the IT Operations Systems Admin position at Desert Financial Credit Union?
The Infrastructure Monitoring and Alerting Specialist is skilled and proactive team member who plays a critical role in ensuring the stability, performance, and security of our organization's infrastructure by managing and optimizing alert and monitoring systems.
What you will do here :
Monitoring System Management :
Design, implement, and maintain an effective infrastructure monitoring system to ensure real-time visibility into the health and performance of critical systems.
Configure and customize monitoring tools to align with organizational needs and specific infrastructure requirements.
Alerting Configuration :
Develop and maintain alerting rules and thresholds to promptly identify and respond to potential issues or anomalies.
Collaborate with cross-functional teams to establish escalation procedures and response protocols.
Incident Response :
Actively monitor alerts and incidents, and provide timely and effective responses to minimize downtime and impact on business operations.
Conduct post-incident reviews to identify root causes and implement preventive measures.
Infrastructure Performance Analysis :
Analyze performance trends, identify bottlenecks, and propose optimizations to enhance the overall efficiency of the infrastructure.
Generate reports and dashboards to communicate key performance metrics to relevant stakeholders.
Integration and Automation :
Integrate monitoring and alerting systems with other infrastructure components to streamline workflows and enhance automation.
Collaborate with development teams to implement monitoring best practices within the software development life cycle.
Documentation :
Maintain comprehensive documentation of monitoring configurations, alerting rules, and incident response procedures.
Create and update documentation for training purposes and knowledge sharing.
Traditional Sys Admin Duties
Support our platform and server operations groups as needed to handle tickets and light project work
Perform other sys admin duties as needed
What you will need :
Bachelor's degree in information technology, Computer Science, or a related field preferred; or Equivalent combination of education and experience required.
3 years proven experience in managing and optimizing infrastructure monitoring and alerting systems required.
Expertise in managing and optimizing infrastructure monitoring and alerting systems required.
Proficiency in using industry-standard monitoring tools such as Solarwinds, Dynatrace, Grafana, or equivalent required.
Demonstrated strong understanding of network protocols, system administration, and infrastructure components required.
Excellent problem-solving skills and the ability to work collaboratively in a fast-paced environment required.
Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes) is a plus preferred.
Expertise with System / Server administration preferred.
Demonstrated competency in scripting and automation skills (e.g., Python, Bash) preferred.
Demonstrated expertise working with API's preferred.
Demonstrated expertise with log management and analysis tools (e.g., ELK Stack) preferred.
Demonstrated knowledge of security best practices and incident response procedures preferred.
We are proud to be an EEO / AA employer M / F / D / V. We maintain a drug-free workplace and perform pre-employment substance abuse testing.
For additional information about our organization, careers, and benefits visit :
LI-Hybrid