What are the responsibilities and job description for the Site Reliability Engineer position at Momento USA LLC?
Position: Site Reliability Engineer
No of positions- 3
Location- Onsite 3 days a week in Riverwoods, IL- Chicago
We need 3 SRE’s local to Chicago deployed immediately. Please help with the profiles so we can share with the clients.
Expert Application Engineer (SRE)
Job Description
As an Application Reliability Engineer, you’ll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. In our Agile environment, you’ll focus on availability, latency, performance, efficiency, change and problem management, monitoring, emergency response and capacity planning of our services. Your projects will deliver enhanced infrastructure, development, and deployment automation.
At a minimum, here’s what we need: 8 Years – Information Technology, (Software) Engineering, or related
Responsibilities
Dan Norman
IT Recruiter
Momento USA | Exceeding Customer Expectations…
440 Benigno Blvd, Unit#A-5 2nd Floor, Interstate Business Park, Bellmawr, NJ 08031
Phone No: 856-372-4626 Ext: 1026 Fax: (866) 605-1171
Email: dan@momentousa.com Web: www.MomentoUSA.com
Linkedin : linkedin.com/in/dan-norman-b3b184203
Note: Momento USA is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.
No of positions- 3
Location- Onsite 3 days a week in Riverwoods, IL- Chicago
We need 3 SRE’s local to Chicago deployed immediately. Please help with the profiles so we can share with the clients.
Expert Application Engineer (SRE)
Job Description
As an Application Reliability Engineer, you’ll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. In our Agile environment, you’ll focus on availability, latency, performance, efficiency, change and problem management, monitoring, emergency response and capacity planning of our services. Your projects will deliver enhanced infrastructure, development, and deployment automation.
At a minimum, here’s what we need: 8 Years – Information Technology, (Software) Engineering, or related
Responsibilities
- Analyse, design, program, test, and deploy new user stories and features with high quality (security, reliability, operations) to production
- Achieves team commitments (and influence others to do the same) by using informal leadership & highly developed communication skills
- Has an oversight on design decisions and guides team to achieve key results for products assigned to them
- Remediates issues using engineering principles and creates proactive design solutions for potential failures
- Work with a team of site reliability engineers that is responsible for building the continuous reliability mindset, shepherding problem management, and driving key site reliability engineering practices into the organization.
- Design and drive monitoring, alerting, ticket reporting strategies to measure SLA, SLO, MTTI, MTTR. Etc. and align with management expectations to reduce/minimize prod downtime.
- Guide site reliability automation to help eliminate manual toil and create a self-healing capability
- Participate in selection of appropriate automation tools, defining technology, quality, experience and implementation standards and practices within own technical domain.
- Fosters a culture of excellence and continuous learning within the chapter. Establishes and tracks to appropriate OKRs to ensure outcomes are met.
- Creates solutions addressing high impact technology and business priorities
- Competent in multiple contexts, such as programming languages, security, automation, testing, infrastructure, and performance and is the go-to person for many people (inside and outside of their team)
- Proactively identifies and mitigates issues based on intuition and experience in multiple domains
- Experienced with AWS Cloud
- Experienced in building and managing OCP clusters, deploy applications into OCP
- Experience with SRE design to address reliability and resiliency with availability of 5-9s
- Experience in managing caching solutions like Hazelcast, GemFire or Terracota
- Experience in setting up and managing Kafka
- High level of familiarity with the Linux command line and scripting
- Extremely comfortable with production environments, firewalls, and networking
- Strong experience in deploying, observing, altering, logging, and monitoring systems (Splunk, Datadog, AppDynamics, Instana) with a mindset towards predictive analysis.
- Working knowledge of the automation tools such as Ansible, Terraform, or Chef
- Experience in performing RCA, Disaster Recovery activities, Chaos Engineering
- Highly preferred experience working in the payments industry
- Deep knowledge and understanding of emerging trends in the SRE field.
- Experience developing in Java (or other similar languages)
- Studied architectural patterns at scale, including thoughtfully designed APIs, repeatable delivery pipelines, and efficient computer engineering principles.
- Working knowledge of messaging services like RabbitMQ, SQS, Kafka
- Strong Experience with Continuous Integration and Continuous Delivery models including Blue/Green and/or Canary release models
- Open-shift Container Platform
- (Splunk, Datadog, AppDynamics, Instana)
- HazelCast.
- Ansible, Terraform, or Chef
- RabbitMQ, SQS, Kafka
- Linux VMs , Shell Scripting
- AWS CLoud
- Postgress Database
Dan Norman
IT Recruiter
Momento USA | Exceeding Customer Expectations…
440 Benigno Blvd, Unit#A-5 2nd Floor, Interstate Business Park, Bellmawr, NJ 08031
Phone No: 856-372-4626 Ext: 1026 Fax: (866) 605-1171
Email: dan@momentousa.com Web: www.MomentoUSA.com
Linkedin : linkedin.com/in/dan-norman-b3b184203
Note: Momento USA is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.