What are the responsibilities and job description for the SRE(Site Reliability) Architect - Remote (Fulltime) position at The Dignify Solutions, LLC?
- 10 years of Development and Operations experience in building and running applications in production that has uptime over 99%. Related experience and/or training; or equivalent combination of education and experience
- 8 years of experience as a SRE Architect in running large Reliability & Observability Programs for large, complex infrastructure deployments / distributed systems for major Banking customers.
- Good understanding of Observability (monitoring, logging, tracing, metrics), Chaos engineering concepts.
- Proficiency in using Application Performance Monitoring (APM) tool New Relic/Dynatrace for monitoring, logging, tracing and Splunk for Log monitoring.
- Expert level hands on knowledge in cloud platforms like PCF.
- should have implemented solutions around Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for services.
- Understanding of software delivery life cycles, particularly Agile/Lean & DevOps