What are the responsibilities and job description for the Senior Site Reliability Engineer position at Providence Partners?
Job Details
Join us and be part of a team that values innovation, collaboration, and growth. We'd love to hear from you if you're passionate about driving reliability, streamlining processes, and leading in a fast-paced environment.
Ready to take your expertise to the next level and make a meaningful impact? Join as a Senior Site Reliability Engineer, where you ll play a pivotal role in maintaining and improving the reliability of critical systems for a billion-dollar enterprise. This is your opportunity to lead innovation, streamline processes, and shape the future of IT infrastructure.
As a Senior Site Reliability Engineer, you'll enhance system reliability, drive automation, and optimize incident response strategies. Collaborating closely with cross-functional teams, you ll guide our transition to cloud infrastructure while championing a culture of reliability and operational excellence.
Key Responsibilities:
- Spearhead initiatives to automate repetitive tasks, enabling self-service capabilities and enhancing system efficiency.
- Take charge of critical production incidents, acting as Senior Incident Commander to ensure rapid resolution and effective communication.
- Conduct post-incident reviews, translating insights into actionable improvements.
- Architect and manage advanced observability systems to detect and address issues proactively.
- Collaborate with stakeholders to define Service Level Objectives (SLOs) and ensure strategies align with organizational goals.
- Mentor and support team members, fostering a culture of continuous learning and technical excellence.
- Lead projects from discovery to execution using Agile methodologies, delivering measurable outcomes.
Must-Have Skills and Experience
- Bachelor's degree in a related field or equivalent professional experience.
- 5 years in site reliability engineering, DevOps, or related disciplines.
- Strong leadership skills in incident response and operational management.
- Proven ability to work independently and deliver results in a dynamic environment.
- Expertise in building and maintaining complex, scalable systems.
Preferred Skills and Experience
- Familiarity with ITIL frameworks in modern IT settings.
- Proficiency in programming languages like Java or C# and scripting with Python, PowerShell, or Bash.
- Hands-on experience with automation tools such as Terraform and Ansible.
- Advanced understanding of CI/CD pipelines and version control systems like Git.
- Proven success with monitoring, logging, and alerting tools.
- Strong interpersonal and communication skills, with the ability to inspire and lead change.
Austin, Texas, residency is required.
Outstanding Benefits
- Annual performance bonuses and merit-based pay increases.
- Generous retirement plans, including a 4% automatic employer contribution and 401k match up to 6%.
- $1,000 Lifestyle Savings Account annually for personal wellness.
- Comprehensive insurance from day one, including health, dental, vision, and more.
- Ample time off: three weeks of vacation, nine holidays, and two personal days yearly.
- Tuition reimbursement and professional development opportunities.
- On-site perks: free gym access, fitness classes, snacks, and wellness resources.
Benefits:
- 401(k)
- 401(k) matching
- Dental insurance
- Flexible schedule
- Flexible spending account
- Health insurance
- Health savings account
- Life insurance
- Paid time off
- Parental leave
- Professional development assistance
- Referral program
- Retirement plan
- Tuition reimbursement
- Vision insurance