What are the responsibilities and job description for the Site Reliability Engineer position at VeriiPro?
We are unable to sponsor any visas at this point of time! ONLY FULL-TIME UC/GC
We are looking for an experienced Site Reliability Engineer (SRE) with expertise in Linux systems, container orchestration, automation, and cloud platforms. The ideal candidate will have hands-on experience with Kubernetes (including Rancher), storage solutions, and monitoring tools, along with strong scripting and database skills.
- Manage Linux systems (RHEL/CentOS), filesystems, and utilities.
- Work with container orchestration frameworks, Kubernetes objects, and Rancher.
- Handle storage solutions, including ONTAP volumes, backups, and disaster recovery planning.
- Create and maintain automation scripts (Shell/Ansible/Python) for deployments, monitoring, and validations.
- Configure and schedule monitoring tasks using tools like Cron and Airflow.
- Utilize monitoring tools like Dynatrace, Apica, and Grafana for system performance.
- Build and manage CI/CD pipelines (preferred).
- Work with SQL and NoSQL databases.
- Manage cloud-based infrastructure, specifically AWS.
- Handle incidents and perform problem management.
- Extensive experience with Linux systems and distributed computing.
- Proficiency in Kubernetes, automation scripting, and monitoring tools.
- Knowledge of storage systems, databases, and cloud platforms.
- Strong problem-solving and incident-management skills.