What are the responsibilities and job description for the Site Reliability Engineer || Hybrid || Can sit in Kansas City, MO or Dallas, TX position at Paradigm Infotech?

Job Details

Role: Site Reliability Engineer

Location: Can sit in Kansas City, MO or Dallas, TX

Work mode :( Hybrid 3 days onsite)

Rate: ---k/year

Visa: USC

Job Description

Job Duties and Responsibilities:

Operational performance and stability: work with other members of assigned value stream to ensure that the in-scope applications/platforms are meeting performance and stability requirements; this includes managing major incidents to mitigation/resolution
Problem management: perform post-incident reviews of all major incidents and determine action items required to avoid similar issues/minimize downtime for future incidents
Monitoring and metrics:
work with application development team to ensure that assigned applications/platforms have appropriate monitoring and metrics in place to appropriately measure performance and stability
identify functional and nonfunctional improvements
act as the operations representative in value stream planning and prioritization sessions to ensure that the operational needs of the assigned applications/platforms are addressed as needed
hold quarterly operational performance reviews with value stream management
Operational readiness:
ensure that applications/platforms in the value stream are operationally ready for production; this includes annual review of all SOPs/knowledge articles
monitor review for any new feature launch or other significant changes that may impact monitoring
Release Planning and Coordination:
work with other members of assigned value stream to ensure that the production releases for their in-scope applications/platforms are properly planned and coordinated; this includes holding change/release implementation reviews to ensure thorough and appropriate implementation plans
provide review and sign-off/approval of change tickets for the assigned value stream
represent the value stream in Change Advisory board meetings
participate in Program Increment Planning sessions as a liaison for operations and infrastructure support
provide information regarding upcoming critical changes to the value stream
The requirements herein are intended to describe the general nature and level of work performed by employee, but is not a complete list of responsibilities, duties, and skills required. Other duties may be assigned.

Requirements

Minimum Requirements:

Education and Experience

Undergraduate degree in Computer Science or related technical field, or equivalent practical experience
5 to 8 years of hands-on, professional software development experience in building scalable applications that includes experience working in a multi-platform environment and multi-cloud hybrid environments; experience with CI/CD pipeline tools; experience with clustering technologies (High Availability, Resiliency, Reliability and Scaling); and experience in DevOps skills and methodologies
Proficiency in design principles of monitoring and alerting systems
Proficiency with one or more general purpose programming languages; one or more scripting languages; automation tools; development tools; API interaction and development; and one or more version control systems

Preferred Technologies

Cloud providers Azure, AWS, CGP
Operating systems Linux, Windows
General programming languages Python, Go, Typescript
Scripting languages Bash, Powershell
CI/CD Azure DevOps, Flux, Github
Automation tools Terraform, Ansible
Development Visual Studio, Git, FastAPI, React, Typescript, Redis
Monitoring and incident response PagerDuty, ServiceNow, Prometheus, Grafana
Infrastructure tools AKS, Kubernetes, Docker, VMware, Infoblox, Active Directory, Nutanix
Network Cisco, Hyperflex, F5, Azure Frontdoor, Azure Application Gateway

Knowledge, Skills and Abilities

Proficiency in monitoring, alerting, analyzing and troubleshooting
Good understanding of defining and executing high availability, disaster recovery, sustained resiliency, and chaos engineering tests
Able to demonstrate a passion for finding and improving efficiencies with infrastructure, development and deployment automation
Able to enhance and maintain complex software components and distributed systems
Able to define and drive adoption of a best-in-class monitoring framework to accomplish end-to-end application or service monitoring
Able to monitor and report on SLA/SLO for given applications services
Able to develop and maintain operational dashboards to establish key performance indicators and trends
Able to lead and participate in non-functional testing (performance and resilience) and identify bottlenecks, opportunities for optimization, and capacity demands
Knowledge of control application log collection and analysis; able to automate processes and systems configuration/deployment
Able to partner with security engineers and develop plans and automation to aggressively and safely respond to new risks and vulnerabilities
Able to design and architect operational solutions for managing applications and infrastructure with the specific goal of increasing the automation, repeatability, and consistency of operational tasks
Able to partner and train infrastructure and operations teams; this includes the creation and/or enhancement of SOPs, knowledge articles, etc.
Able to analyze and participate in periodic on-call duties to prevent, solve and automate the response to problems on mission critical services
Must be able to read, write and speak English

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Apply for this job

Receive alerts for other Site Reliability Engineer || Hybrid || Can sit in Kansas City, MO or Dallas, TX job openings

Site Reliability Engineer || Hybrid || Can sit in Kansas City, MO or Dallas, TX

What are the responsibilities and job description for the Site Reliability Engineer || Hybrid || Can sit in Kansas City, MO or Dallas, TX position at Paradigm Infotech?

Job Details

What is the career path for a Site Reliability Engineer || Hybrid || Can sit in Kansas City, MO or Dallas, TX?

Job openings at Paradigm Infotech

Not the job you're looking for? Here are some other Site Reliability Engineer || Hybrid || Can sit in Kansas City, MO or Dallas, TX jobs in the Kansas, MO area that may be a better fit.

We don't have any other Site Reliability Engineer || Hybrid || Can sit in Kansas City, MO or Dallas, TX jobs in the Kansas, MO area right now.

AI Assistant is available now!