What are the responsibilities and job description for the Sr Site Reliability Engineer position at Disney Experiences?
“We Power the Magic!” That’s our motto at Disney Experiences (DX). Our team creates world-class immersive digital experiences for the Company’s premier vacation brands including Disney’s Parks & Resorts worldwide, Disney Cruise Line, Aulani, a Disney Resort & Spa, and Disney Vacation Club.
This team focuses on empowering that storytelling through innovative solutions.
The hiring range for this position in Glendale, CA/Anaheim, CA is $138,800 - $186,100 per year and in Seattle, WA is $145,400 - $195,000 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
This team focuses on empowering that storytelling through innovative solutions.
- Reduce/Eliminate Guest Impacting Incidents/Outages across the Disney Experiences Portfolios
- Allow the product teams to focus on development and enhancement of our Products
- Providing 24/7 on-call support as required
- Monitoring and optimizing site reliability for optimal performance
- Implementing automation to eliminate repetitive tasks
- Managing incidents and conducting root cause analysis
- Remediating security issues of our services in the cloud
- Troubleshooting cloud infrastructure application and network issues
- Maintaining comprehensive technical documentation
- Provisioning reliable cloud infrastructure services
- Architecting active-active multi-region systems
- Eliminating single points of failure
- Planning capacity and auto-scaling cloud services
- Designing, building, and integrating software
- Monitoring cloud applications (VMs/Containers/Kubernetes)
- Mentoring operations team members in their areas of expertise
- You like working with clients - you will work with customers/product engineering to gather requirements. You like hearing stories.
- You have a passion for improvement - you have passion for improving processes (e.g. through less code, fewer manual steps, fewer systems, improving velocity).
- You are law-abiding but agent-of-change - you will advocate compliance with known standards and engage engineers to improve upon processes
- You are a team player - you mentor others and contribute support documentation; here, heroes work at enriching the team
- You can multitask - you are action oriented, capable of working concurrent projects
- You have a developer mindset and are comfortable writing code
- With an operations mindset you have some experience in maintaining production systems
- Responsible for creating breakdown of tasks to meet project objectives
- Responsible for on time ticket and task completion
- Responsible for turning strategy into multiple project objectives
- Accountable for/teaching other engineers how to create breakdowns of tasks
- Accountable for/teaching other engineers how to complete tasks
- Responsible for sharing their work/experiences with the greater org
- 5 years relevant, progressive experience in the reliability engineering space.
- Knowledge of Build/Release skills: Work with product development teams to maintain SDLC pipelines.
- Expert monitoring skills: Ensure monitoring tools effectively notify guest-facing issues.
- Expert team communication skills: Ensure team understands and approves solutions.
- Expert technical fundamentals: Mastery of Unix system administration duties.
- Experience in the public cloud: Proficient with launching products on platforms like Google, AWS, Azure, SalesForce, and private clouds.
- Experience in Infrastructure as Code (IaC): Adopt an IaC mindset (Terraform, Helm, Chef).
- At least one of the following languages: Golang or Python
- Container orchestration: Kubernetes, ECS, AppEngine
- Building docker images
- Alerting and Monitoring: using Appdynamics, Splunk, Grafana, etc
- CICD ( with Jenkins or Harness, GitHub or Gitlab )
- SDLC Build and Release processes
- Languages: NodeJs, Java
- AWS Cloud (Fargate, ECS, Lambdas, ApiGateways, EC2, S3, ALB/ELB, Elasticache, EKS, KMS-Secret Manager, VPCs, IAM)
- Google Cloud Platform (App Engine, Kubernetes ( Helm/Tiller ), Cloud Functions, Firebase, IAM)
- Logging/Monitoring/Alerting (Cloudwatch/Splunk/AppDynamics/Elasticache/Grafana)
- Terraform/Atlantis
- Rundeck, Chef, Ansible, Vault
- MessageQueueing: RabbitMQ, PubSub
- Load balancers
- Bachelor's degree in computer science, Information Systems or related equivalent experience.
- Master’s degree in computer science, Information Systems or related equivalent experience.
The hiring range for this position in Glendale, CA/Anaheim, CA is $138,800 - $186,100 per year and in Seattle, WA is $145,400 - $195,000 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
Salary : $138,800 - $186,100