What are the responsibilities and job description for the Sr Systems Reliability Engineer position at EXPENTOR?
Job Title: Sr Systems Reliability Engineer
Location: Seattle, WA – onsite 3 days a week
The Company
Headquartered in Los Angeles, this leader in the Entertainment & Media space is focused on delivering world-class stories and experiences to it's global audience. To offer the best entertainment experiences, their technology teams focus on continued innovation and utilization of cutting edge technology.
Platform / Stack
You will work with technologies that include Python, AWS, Terraform, and Ansible.
What You'll Do As a Sr Systems Reliability Engineer
You could be a great fit if you have:
Location: Seattle, WA – onsite 3 days a week
The Company
Headquartered in Los Angeles, this leader in the Entertainment & Media space is focused on delivering world-class stories and experiences to it's global audience. To offer the best entertainment experiences, their technology teams focus on continued innovation and utilization of cutting edge technology.
Platform / Stack
You will work with technologies that include Python, AWS, Terraform, and Ansible.
What You'll Do As a Sr Systems Reliability Engineer
- Collaborate and provide technical leadership within and across teams
- Code, and deploy systems, define and establish best practices in cloud hosting environments using self-healing, infrastructure-as-code, security, and automation patterns
- Develop useful telemetry, alerts, and response to identify and address reliability risks
- Participate in on-call rotation with other engineering teams
- Identify, experiment, & evangelize new technologies, ideas, and best practices across the broader engineering community
You could be a great fit if you have:
- 5 years of experience in technical operations or systems reliability engineering
- Minimum 3 years operating complex, large-scale Enterprise guest-facing Applications or web sites
- Configuration management and orchestration experience (e.g. Chef, Terraform, Cloud Formation)
- Experience with one or more languages in your skillset (e.g. GO, Python, Java, Ruby)
- Containerization experience (e.g. Docker, Kubernetes, Mesos, Elastic Container Service)
- Skilled in Cloud/PaaS Environments (e.g. AWS, Google Cloud Compute)
- Thorough knowledge of continuous integration tools (e.g. Jenkins)
- Experience with F5 load balancing helpful
- UNIX/LINUX and some Windows server experience, including expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures
- Web (IIS, Apache) and Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures
- Over 5 years of experience as a Systems Reliability Engineer
- 3 years of experience with AWS
- 3 years of experience with Terraform
- 3 years of experience using GitLab, Ansible or other automation products
- Experience utilizing Python for scripting
Sr. Reliability Engineer (Starlink)
Space Exploration Technologies Corporation -
Redmond, WA
Site Reliability Engineer, C2 Systems
Anduril Industries, Inc. -
Seattle, WA
Sr. Database Reliability Engineer II
Axon -
Seattle, WA