What are the responsibilities and job description for the Senior applications solution architect - SRE orchestration position at Fortune 500 Companies?
Senior applications solution architect - SRE orchestration
- Job Location: Plano, Texas
- Job Duration: Full Time / Hybrid (2-3 days in office or whenever it’s required)
** NO SPONSORSHIP FROM THE CLIENT **
Job Description
Overview
The Sustain & Operations team, part of the Digital Products and Applications (DPA) organization, delivers and sustains digital products across core strategic and transformational priorities to accelerate digital transformation. Effective digital transformation requires new business processes, innovative digital products, and enhanced operational outcomes. To achieve higher-order outcomes, all applications and underlying infrastructure/services operating independently must be resilient. Additionally, the data interactions across these applications supporting business processes must also maintain resilience. Achieving this requires adopting modern practices that leverage Site Reliability Engineering (SRE) principles to ensure business process resiliency through preemptive detection, diagnosis, and recovery.
Responsibilities
The Senior solutions architect is an advanced subject matter expert of Application architectures, SRE principles, primarily responsible for designing and implementing modern ways of support operations solutions for the landscape of digital products applications while ensuring SRE principles are followed during the design of the products.
The transformation toward modern operations in an SRE construct is for all programs under DPA, per the main purpose
- This involves diagnosing for deviations (anomalies), assessing output accuracy at each process step, and implementing measures to prevent faulty outcomes from impacting business processes and end users.
- To bring this to life, new shift left activities are critical to apply Site Reliability Engineering (SRE) and quality assurance principles within the application design to drive higher first pass yield.
- Once in production, the SRE-driven orchestration toolkit connects all components of the ecosystem and preemptively diagnosing anomalies and remediating through automation, minimizing net business and/or customer impact driving a high second pass yield.
- The Sr. solution architect is responsible for the design & consumption of the SRE practices during design and in production.
- Reporting directly to the SRE & Quality Assurance Sr. director, is responsible to drive the SRE Engineering activities within the application design and building and SRE orchestration platform that enables seamless detect, diagnose and recover minimizing the impact.
- Refine & deliver requirements for service enhancements with the product engineering team and Solution architecture and collaborate with key product managers and stakeholders
- Be a Technical leader in the global DPA team that drives, measures, and optimizes SLA/SLO/SLI and error budgets for the product offerings
- Lead to develop/deliver SRE orchestration software solutions to automate service delivery using techniques such as flash, terraform, ansible or a custom solution
- Collaborate with DevOps / Dev SecOps teams proactively to build reliability and resiliency into their services without disrupting the CI/CD pipelines
- Leads the definition, collection, and analysis of data relevant to products systems and their interactions towards business process resiliency, especially related to impacting customer satisfaction, Revenue, or IT productivity
- Operate in an ever-evolving landscape of product offerings to deliver first-class service to our customers
- Work closely with customer-facing support teams to evolve & empower them with SRE insights
- Participate in on-call support and orchestrating blameless post-mortems and encourage the practise within the organization
- Actively engage and drive AI Ops adoption across teams
Qualifications
- 15 plus years of work experience evolving to a solution architect with 3-5 years of experience in continuously improving and transforming IT operations ways of working
- The ideal Engineer will be highly quantitative, have great judgment, be able to connect dots across workstreams, and efficiently work cross-functionally across teams to ensure SRE orchestrating solutions are meeting customer/end-user expectations
- The candidate will take a pragmatic approach to resolving incidents, including the ability to systemically triangulate root causes and work effectively with external and internal teams to meet objectives.
- Exceptional business relationship skills, including the ability to communicate effectively both internally and externally. You can communicate complex technical data to a non-technical person in a concise, clear, and easily understood manner.
- A firm understanding of SRE (Software Reliability Engineering) and IT Service Management (ITSM) processes with a track record for improving service offerings - resolving incidents, providing a seamless customer/end-user experience, and proactively identifying and mitigating areas of risk
- Experience in leading high-performing teams
- Deep hands-on technical expertise, excellent verbal and written communication skills