What are the responsibilities and job description for the DevOps and Site Reliability Engineer (SRE) position at Cloud Bigdata?

Job Details

JOB DESCRIPTION:
Develop and maintain CI/CD pipelines to automate deployments and improve workflow efficiency.
Manage cloud infrastructure (AWS, Google Cloud Platform, Azure) using Infrastructure-as-Code (IaC) tools like Terraform and Ansible.
Ensure system reliability by monitoring, troubleshooting, and improving application performance.
Implement and track Service-Level Indicators (SLIs), Service-Level Objectives (SLOs), and Service-Level Agreements (SLAs).
Automate infrastructure management using scripting languages (Python, Bash) and configuration management tools.
Collaborate with development teams to improve deployment practices and system architecture.
Manage incident response, conduct root cause analysis, and implement post-mortem processes.
Monitor and ensure scalability by analyzing system capacity and performance bottlenecks.
Ensure the security and compliance of cloud environments and deployments.
Continuously improve automation, reliability, and performance of systems.

REQUIRED SKILL SET:
Hands on experience in a DevOps, SRE, or similar role.
Strong knowledge of cloud platforms (AWS, Google Cloud Platform, Azure) and containerization (Docker, Kubernetes).
Experience with Infrastructure-as-Code (Terraform, CloudFormation, Ansible).
Expertise in CI/CD tools (Jenkins, GitLab CI, etc.) and version control (Git).
Solid understanding of monitoring and logging tools (Prometheus, Grafana, ELK Stack).
Proficiency in scripting (Python, Bash) and automation.
Strong problem-solving skills with a focus on system reliability and performance.
Knowledge of microservices architecture and distributed systems is a plus.
Cloud certifications and experience with Agile methodologies are preferred.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Apply for this job

Receive alerts for other DevOps and Site Reliability Engineer (SRE) job openings

Job openings at Cloud Bigdata

Data Architect with Palantir

Cloud Bigdata

Chicago, IL Full Time

Job Details Key Responsibilities: Assess readiness for Palantir Foundry implementation and environment suitability. Revi...

Data Scientist

Cloud Bigdata

Austin, TX Full Time

Job Details Job Description: Role will support 5 business segments); these business segments drive $330m in marketing ge...

Sr. Java Full Stack Developer

Cloud Bigdata

Weehawken, NJ Full Time

Job Details As an Engineer on our group, you will be working with our Agile teams building applications leveraging An ar...

SAP PAYROLL TECHNICAL LEAD

Cloud Bigdata

Calabasas, CA Full Time

Job Details Job Summary : We are seeking a highly skilled Payroll Technical Consultant with extensive experience in US P...

Not the job you're looking for? Here are some other DevOps and Site Reliability Engineer (SRE) jobs in the Texas, TX area that may be a better fit.

DevOps and Site Reliability Engineer (SRE)

What are the responsibilities and job description for the DevOps and Site Reliability Engineer (SRE) position at Cloud Bigdata?

Job Details

What is the career path for a DevOps and Site Reliability Engineer (SRE)?

Job openings at Cloud Bigdata

Not the job you're looking for? Here are some other DevOps and Site Reliability Engineer (SRE) jobs in the Texas, TX area that may be a better fit.

We don't have any other DevOps and Site Reliability Engineer (SRE) jobs in the Texas, TX area right now.

AI Assistant is available now!