What are the responsibilities and job description for the Director Infrastructure Operations and Reliability Engineering position at onsemi?
Director Infrastructure Operations and Reliability Engineering
Job Description
This critical leadership role will lead the transformation of Infrastructure Operations at onsemi into a highly reliable, available, performant, and resilient infrastructure and cloud service to enable successful delivery of IT and business operations supporting our engineering, corporate and manufacturing functions. The role will be responsible for morphing the current function into a hybrid cloud operations function with practices enabling on premise, Cloud, and SaaS based workloads. The role will help shift the function from an infrastructure centric operations team into one that enables operational capabilities across IT in the areas of digital service management, operational intelligence and availability management for critical business functions across the company.
The role will bring significant depth in all facets of Infrastructure Operations and expertise in operational excellence practices across process, technology, people and lead the partnership with strategic outsourcing suppliers. The role will be responsible for a large-scale change in practices and culture that values operational excellence and quality. The right leader will model leadership behaviors to foster accountability, prioritize customer service, create team cohesion, foster continuous learning and improvement, and provide coaching and development of team members.
The role will be responsible for leading a global team of engineers and operations specialists overseeing critical operations, managing infrastructure health, driving continuous improvement efforts, enabling resiliency against operational and security events, ensuring cost effective infrastructure and managing business partnerships across IT and business partners.
The complexities of the role include effectively managing operations for global efficiency while managing site specific delivery, security, and availability commitments.
The role will report into the IT Infrastructure CTO and will partner with peers in Infrastructure Engineering, Architecture, Manufacturing IT, End User Services, Cybersecurity, Corporate Digital Delivery, and Product Engineering.
Qualifications and Required Experience
- 15 years of senior leadership experience in IT Operations.
- 10 of experience leading operational transformation and reliability improvement programs.
- 10 years of experience with outsourced suppliers leading service improvement and maturing partnerships.
- 5 years of experience with hybrid cloud including SaaS, IaaS and private infrastructure cloud.0
- Deep technical expertise in core infrastructure, ITSM practices, Backup and Disaster recovery, monitoring and automation capabilities.
- Leadership and management competencies including inspirational leadership, continuous improvement and metrics driven operational management.
- Working knowledge and ability to implement lean based operating system including daily reviews, problem solving, standards setting and continuous improvement projects and sprints
- Working knowledge of Agile and ability to administer sprint based planning and delivery.
- Ability to optimally organize resources across global and site footprint through intelligent tiering of service levels.
Essential Job Functions And Responsibilities
The sub functions and responsibilities under this role include:
- Execute and optimize compute operations across global and site specific data centers.
- Drive maturity of Storage, backup and Disaster recovery capabilities and processes.
- Optimize and streamline Database Operations
- Develop and mature C loud operations including SaaS and IaaS.
- Oversee Data Center operations across global IT, engineering and manufacturing factories.
- Engineer and enable reliability outcomes with Infrastructure and application observability platforms and tools.
- Optimally organize Level 1, Level 2 and Level support functions for above areas.
- Lead Infrastructure automation services , enabling automated health checks and automation of break fix and provisioning activities.
- Institute Site reliability engineering services to ensure standards definition, assessment, and remediation to necessary standards.
- Definition, measurement, reporting of key metrics around uptime , disaster recovery, infrastructure health, process excellence.
- Cultivate strong partnership with Cybersecurity to drive Security hardening and fortification activities and programs.
- Own Life cycle management of infrastructure including planning, metrics, risk management and execution.
- Responsible for management of infrastructure budget and infrastructure supplier partnerships.