What are the responsibilities and job description for the Senior Systems Observability Engineer position at Tech Mahindra (Americas) Inc.?
Job Details
Senior Systems Observability Engineer responsible for designing, implementing, and maintaining observability solutions for IT systems. Focus on enhancing system reliability, performance, and security through monitoring, logging, and tracing. Collaborate with development and operations teams to ensure comprehensive visibility into the infrastructure and applications.
Key Responsibilities:
In this role, you will:
- Define metrics as KPI for each system in coordination with system owners
- Configure alerts and notifications for critical system events to support proactive problem resolution
- Develop dashboards to visualize system performance and health metrics
- Review in-use observability and monitoring tools and recommend changes where needed to ensure optimal coverage
- Provide tiered support to other team members in use of tools supporting observability for systems supporting our network infrastructure
- Apply networking knowledge for seamless infrastructure communication
Skills, Experience and Requirements
As a successful Senior Systems Observability Engineer, you will have:
Education and Experience:
- Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent work experience
Skills and Qualifications:
- 7 years experience with systems engineering responsibilities with a focus on observability and monitoring
- Understanding of networking concepts and protocols
- Strong problem-solving, time management, and multitasking skills
- Excellent communication, teamwork, and adaptability to new technologies
- Technology skills
- Required
- Experience with monitoring tools (Prometheus, Grafana, AWS CloudWatch)
- Strong scripting skills (Ansible, Python, Bash)
- Preferred
- Experience with various Fault/Performance management systems (Solarwinds, Splunk, Vitria Via AIOps)
- Experience with containerization and orchestration (e.g., Docker, Kubernetes)