Job title : Devops Engineer
Location : San Francisco, CA (Onsite)
Job Type : Contract Role
Job Description :
Should have AWS, terraform & Nodejs hands-on experience with Devops
1. Monitoring and Incident Response :
- Set up and maintain monitoring tools to detect performance issues, failures, and other incidents.
- Respond to system outages and incidents, troubleshoot issues, and restore services as quickly as possible.
2. Automation :
Automate repetitive tasks like deployment, scaling, and incident response to reduce manual work.Write scripts or use tools to handle tasks that would otherwise be done by hand.3. Performance and Capacity Planning :
Ensure that systems are optimized for performance and can handle the expected workload.Forecast future needs and plan for scaling infrastructure and services.4. Infrastructure Management :
Manage cloud or on-premise infrastructure, ensuring it is configured and running efficiently.Handle configuration management and deploy infrastructure as code (IaC) where applicable.5. System Reliability :
Design systems with fault tolerance, scalability, and disaster recovery in mind.Conduct regular tests, such as chaos engineering exercises, to improve system robustness.6. Collaboration with Development Teams :
Work closely with software engineers to design and deploy software that is resilient and easy to operate.Implement best practices in DevOps and Continuous Integration / Continuous Delivery (CI / CD).7. Incident Analysis and Postmortems :
Perform root cause analysis of incidents and outages to prevent them from happening again.Write postmortem reports and develop action plans to improve system reliability.8. Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs) :
Define and track SLOs and SLAs to ensure systems meet the required reliability and performance levels.Monitor these metrics and take corrective action when thresholds are breached.9. Security and Compliance :
Ensure that systems comply with security and regulatory requirements.Implement and monitor security controls, patching, and vulnerability management.10. Continuous Improvement :
Proactively identify areas of improvement in system architecture, reliability, and operations processesDiverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.