Demo

Site Reliability Engineering (SRE) Specialist

Alibaba Cloud
Seattle, WA Full Time
POSTED ON 4/7/2025 CLOSED ON 4/11/2025

What are the responsibilities and job description for the Site Reliability Engineering (SRE) Specialist position at Alibaba Cloud?

Elastic Compute Service (ECS) is a core product of Alibaba Cloud. The Elastic Compute team is dedicated to building world-leading cloud computing infrastructure. As a key component of Alibaba Cloud's self-developed Apsara operating system , Elastic Compute Service (ECS) provides full-stack computing resources covering virtual machine instances, container services and Heterogeneous computing clusters.



Through technological innovation and product optimization, the Alibaba Cloud Elastic Compute team continuously drives advancements in cloud computing technologies, delivering high-quality computing services to users worldwid

e. Our goal is not only to support enterprises in achieving elastic scalability but also to deeply empower infrastructure innovation in the New era . Our mission is to build an intelligent foundation of "Computing as a Service," enabling developers to focus on businesses to concentrate on breakthroughs, without worrying about the complex engineering implementations from chips to clusters



.

SRE Te

am:The Alibaba Cloud Elastic Compute Service (ECS) SRE (Site Reliability Engineering) team is a critical force in ensuring system stability and reliability. The SRE team focuses on guaranteeing the high availability, high performance, and robust stability of ECS products through technical expertise and innovati


on.
The Alibaba Cloud ECS SRE team is not only a core technical safeguard but also a driver of technological innovation and continuous optimization . By leveraging technical capabilities and collaborative teamwork, we ensure the stability and reliability of ECS products, safeguarding global customers' businesses. Additionally, we are committed to advancing cloud computing technologies through knowledge sharing and industry collaborati


on .
Joining the Alibaba Cloud ECS SRE team offers the opportunity to engage in the development and optimization of world-leading cloud computing technologies, while growing alongside a passionate and creative


  1. team.
    Responsible for the delivery and operation/maintenance of various clusters, and participate in the architecture design and construction of the infrastructure operation pla
  2. tform.Establish and optimize operation/maintenance service systems to achieve product stability and SLA
  3. goals.Develop delivery standards, document maintenance specifications, and enhance daily work efficiency through tool plat
  4. forms.This position involves on-call responsibilities, requiring timely customer response within Service Level Agreement (SLA) timeframes, driving issue resolution and improving customer exper



ience.

Quali
f

  1. ication:5 years of operation and maintenance (O&M) experience in IT, internet, or cloud computing ind
  2. ustries;Proficient in Linux operating systems and mainstream protocols (e.g., TCP/IP), with solid hands-on experience in troubleshooting OS and network
  3. issues.Familiar with containerization and orchestration technologies such as Kubernetes, Slurm,
  4. and LSF.Ability to analyze and document technical issues systematically, develop tools/systems to optimize workflows, and improve operational efficiency through automation and platform-based so
  5. lutions.Strong self-driven learning capabilities, excellent communication skills, and experience leading cross-team projects. Results-driven and action-oriented, with a commitment to exc



ellence.

The pay range for this position at commencement of employment is expected to be between $133,200/year and $219,600/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and e


xperience.
If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and mark


et factors.
Alibaba U.S. based full time regular employees have access to medical, dental, and vision insurance, a 401(k) plan and basic life insurance, and wellbeing benefits like FSA, subject to the terms and conditions of the applicable plans then in effect. U.S. based employees are also eligible to receive up to 12 paid holidays, accrue up to 15 paid vacation days for this position, and receive up to 72 hours paid sick time (front-loaded) per ca


lendar year.

Salary : $133,200 - $219,600

Site Reliability Engineering
Microsoft Legal Department -
Redmond, WA
Alibaba Cloud-Site Reliability Engineering (SRE) Specialist-Seattle
Alibaba -
Seattle, WA
Site Reliability Engineering - Software Development Engineer Sr
Blue Origin Personnel, LLC -
Seattle, WA

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Site Reliability Engineering (SRE) Specialist?

Sign up to receive alerts about other jobs on the Site Reliability Engineering (SRE) Specialist career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$92,877 - $110,401
Income Estimation: 
$120,933 - $155,034
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$114,618 - $136,401
Income Estimation: 
$144,264 - $191,312
Income Estimation: 
$140,435 - $166,410
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$145,845 - $177,256
Income Estimation: 
$147,836 - $182,130
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$86,891 - $130,303
This job has expired.
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Alibaba Cloud

Alibaba Cloud
Hired Organization Address Sunnyvale, CA Full Time
We're seeking a skilled RDMA Ops Engineer to optimize and maintain high-performance networking infrastructure for our co...
Alibaba Cloud
Hired Organization Address Sunnyvale, CA Full Time
Job Description 1. Customer Relationship Establishment & Business Opportunity Expansion Proactively gain insights into k...
Alibaba Cloud
Hired Organization Address Washington, DC Full Time
Job Description We, Alibaba Overseas Engineering & TPM team, are seeking for a highly skilled and experienced Constructi...
Alibaba Cloud
Hired Organization Address Seattle, WA Full Time
In Alibaba Cloud, we provide the fundamental Cloud technology and infrastructure to help merchants, brands and other bus...

Not the job you're looking for? Here are some other Site Reliability Engineering (SRE) Specialist jobs in the Seattle, WA area that may be a better fit.

Director of Site Reliability Engineering

Veeam Software, Seattle, WA

AI Assistant is available now!

Feel free to start your new journey!