What are the responsibilities and job description for the Lead Site Reliability Engineer (AWS/Azure) position at Kforce Technology Staffing?
Job Details
RESPONSIBILITIES:
Kforce has a client that is seeking a Lead Site Reliability Engineer (AWS/Azure) in San Diego, CA.
Overview:
The Lead Site Reliability Engineer is responsible for driving the organizational reliability strategy and conducting resiliency design reviews to ensure the reliability, scalability, and performance of our company's software systems and applications meet organizational service level objectives (SLOs) and error budgets. The role is responsible for leading a team of Site Reliability Engineers in designing, implementing, and maintaining the infrastructure and tools necessary to support our platforms, as well as improving our monitoring, automation, and deployment processes. This role involves strategic planning, technical leadership, and collaboration with various stakeholders including Company's Product Delivery, Data Services, DevOps, DataOps, and Infrastructure teams to support organizational goals.
REQUIREMENTS:
* Bachelor's degree, 8 years of demonstrated work experience, or an equivalent combination of related training and experience; at least three of those years in a leadership-level role are required
* Proven leadership experience and ability to manage a team
* Experienced in cloud-based hosting solutions (AWS, Azure)
* Experienced with Cloud server environments (AWS, Azure)
* Experienced in Agile software development best practices utilizing Continuous Integration & Delivery Pipelines as well as agile tools such as Jira
* Proven experience with large-scale software implementation (high transaction volume, high-availability concepts)
* Deep knowledge of software deployment, versioning (Git), and release management processes
* Deep knowledge of infrastructure design, implementation, and support
* Ability to collaborate with stakeholders to define RPO/RTO for Company's system footprint
* Expert in Cloud-based redundancy, high availability, and reliability strategies
* Expert in reliability, scalability, and performance optimization
* Expert in Linux/Unix (strongly preferred) and Windows systems administration, including provisioning, configuring, monitoring, and troubleshooting web servers in a 24x7 customer-facing environment
* Strong Linux and Windows Administration & scripting
* Solid Database Administration skills (MySQL, MariaDB, RDS, SQL Server, and Azure Storage services)
* Deep knowledge of current methodologies in high performance operations and scalable multi-site implementations
* Proficient at automated provisioning, automated configuration management, and containerization solutions and tools
* Excellent written and verbal communication skills
* Proficient in communicating to both technical and management levels
* Highly adaptable
* Ability to create DR strategies and execute DR drills
* Ability to interact with external customers and staff members
* Ability to work in a fast-paced, constantly expanding environment
The pay range is the lowest to highest compensation we reasonably in good faith believe we would pay at posting for this role. We may ultimately pay more or less than this range. Employee pay is based on factors like relevant education, qualifications, certifications, experience, skills, seniority, location, performance, union contract and business needs. This range may be modified in the future.
We offer comprehensive benefits including medical/dental/vision insurance, HSA, FSA, 401(k), and life, disability & ADD insurance to eligible employees. Salaried personnel receive paid time off. Hourly employees are not eligible for paid time off unless required by law. Hourly employees on a Service Contract Act project are eligible for paid sick leave.
Note: Pay is not considered compensation until it is earned, vested and determinable. The amount and availability of any compensation remains in Kforce's sole discretion unless and until paid and may be modified in its discretion consistent with the law.
This job is not eligible for bonuses, incentives or commissions.
Kforce is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.
By clicking "Apply Today" you agree to receive calls, AI-generated calls, text messages or emails from Kforce and its affiliates, and service providers. Note that if you choose to communicate with Kforce via text messaging the frequency may vary, and message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You will always have the right to cease communicating via text by using key words such as STOP.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.