What are the responsibilities and job description for the Need SRE / Sr. DevOps Engineer position at Aeron Smith?
Job Details
SRE / Sr. DevOps Engineer
Summary of Experience
- Requires 12 years of SRE / DevOps development engineering.
- Experience in working with cloud environments Azure preferred.
- Experience with Kubernetes, Azure Kubernetes (AKS) preferred.
- Experience with using Kafka, Event Hub, NATS or any messaging broker.
- Experience with Cassandra, PostgreSQL, Mongo, Elastic Search, Cosmos DB
- Experience on Azure DevOps, Jenkins/ Python / Terraform / Ansible
- Experience with Databricks
- Experience with DataDog, Splunk or other logging and APM tools.
- Experience in working with Linux environments.
- Experience building complex, scalable, high-performance software systems that have been successfully delivered to customers
Azure Cloud, AKS Scalability, monitoring, deployment, check logs, ensure node and pod health.
Databases include - Cassandra, Mongo, PostgreSQL, NoSQL
Databricks Notebooks There are a lot of jobs on Databricks experience with Databricks to know how a notebook is created and run - run queries against the database and find discrepancies and perform fixes.
Experience with using Kafka, Event Hub, NATS or any messaging broker.
JAVA Based microservices, responsible for deployment, scripting language is python. Should have an understanding around terraform.
Emphasis on Logs and Monitoring (Datadog and Splunk)
RESPONSIBILITIES
- We are seeking an experienced, self-motivated Senior Engineer who is technically very strong with a strong Linux background.
- deep knowledge in micro services, backend storage design, NoSQL database, distributed systems and very good troubleshooting skills.
- Typical activities include production monitoring, creating monitoring dashboards, setting up alerts, triaging alerts coupled with the ability to drive efforts and solution improvements effectively across various IT and business functions.
- In this role, a person will be responsible for setting up monitoring dashboards, alerts, maintaining production systems, deploying code in Production, monitoring alerts, resolving issues, and leading production troubleshooting calls.
- Working with Product Owners and other developers to implement highly scalable reactive application platform solutions in Cloud based Linux environments.