What are the responsibilities and job description for the Site Reliability Engineer with Openshift - prefer local from Dallas Texas position at ARK InfoTech Spectrum?
Position : Site Reliability Engineer (SRE)
Location : Dallas, TX (Onsite)
Duration : 6 – 12 Months
Must Have : We are seeking a talented Site Reliability Engineer (SRE) to join our team. As an SRE, you will bridge the gap between development and operations, ensuring our systems are reliable, scalable, and efficient. You will collaborate with cross-functional teams to build and maintain infrastructure, improve system performance, and automate operational processes.
Job Description :
Key Responsibilities :
Reliability and Availability :
Ensure the reliability and availability of mission-critical systems.
Design and implement monitoring, alerting, and incident management strategies.
Performance and Scalability :
Optimize system performance, scalability, and capacity planning.
Conduct performance tuning and load testing to identify bottlenecks.
Automation and CI / CD :
Develop and maintain CI / CD pipelines for automated deployment.
Automate operational tasks and infrastructure management using scripts and tools.
Infrastructure Management :
On-premise infrastructure management and container orchestration platforms using OpenShift and Kubernetes.
Implement infrastructure as code (IaC) using tools like Terraform or other related tool.
Security and Compliance :
Ensure system security and compliance with industry standards.
Implement and maintain backup, disaster recovery, and high-availability solutions.
Collaboration and Communication :
Collaborate with development teams to build reliable and scalable software.
Communicate system status, incidents, and performance metrics to stakeholders.
Qualifications :
Education and Experience :
Bachelors degree in Computer Science, Engineering, or related field (or equivalent experience).
5 ] years of experience in Site Reliability Engineering, DevOps, or Systems Engineering.
Technical Skills :
Proficiency with On-Premise and cloud platforms (AWS, GCP, or Azure)
Experience with containerization and orchestration (OpenShift, Docker, Kubernetes).
Strong hands on experience with OpenShift
Strong scripting skills (e.g., Python, Bash).
Experience with CI / CD tools (Jenkins, GitLab CI, CircleCI).
Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK stack).
Soft Skills :
Excellent problem-solving and analytical skills.
Strong communication and collaboration abilities.
Ability to work in a fast-paced, dynamic environment.
Preferred Qualifications :
Experience with Infrastructure as Code (IaC) tools (Terraform, CloudFormation).
Knowledge of security best practices and compliance standards.