What are the responsibilities and job description for the Site Reliability Engineer position at Tentek, Inc.?
Site Reliability Engineer
Client location: Orlando, FL
Work location: Orlando, FL (onsite)
Duration of Assignment: 12 months
W2 only position
JOB DESCRIPTION:
Our Mission statement
- Reduce/Eliminate Guest Impacting Incidents/Outages across the Guest Experience portfolio.
- Allow the product teams to focus on development and enhancement of our Products.
Qualities we are looking for:
- You like working with clients - you will work with customers/product engineering to gather requirements. You like hearing stories.
- You have a passion for improvement - you have passion for improving processes (e.g. through less code, fewer manual steps, fewer systems, improving velocity).
- You are law-abiding but agent-of-change - you will advocate compliance with known standards and engage engineers to improve upon processes.
- You are a team player - you mentor others and contribute support documentation; here, heroes work at enriching the team.
- You can multitask - you are action oriented, capable of working with concurrent projects.
- You have a developer mindset and are comfortable writing code.
- With an operations mindset you have some experience in maintaining production systems.
Expectations:
In this job you are:
- Responsible for creating breakdown of tasks to meet project objectives.
- Responsible for on time ticket and task completion.
- Responsible for turning strategy into multiple project objectives.
- Responsible for sharing their work/experiences with the greater org.
You will:
- Create/maintain/improve/troubleshoot SDLC pipelines.
- Create/maintain/improve/troubleshoot monitoring technologies.
- Create/maintain/improve/troubleshoot infrastructure technologies (cloud and on prem).
- Create/maintain/improve documentation on the technologies that the team builds.
- Shadow operation and engineering team members in their areas of subject matter expertise.
Qualifications:
- Have expert Build/Release skills - you will work with product development teams across the enterprise to test in code delivery SDLC pipelines.
- Have expert monitoring skills - you will work on ensuring the tools that keep monitoring are up and effective at notifying guest- facing issues.
- Have expert team communication skills - you will work to ensure that the larger team understands and approves of their solutions.
- Have expert technical fundamentals - you must have an expert level command of Unix system administration duties.
- Have experience in the public cloud - you are proficient with launching products in a variety of hosting solutions, including public (Google, AWS, Azure, Salesforce) and private cloud systems.
- Have experience in Infrastructure as Code ( IAAS ) - you subscribe to Infrastructure as code mindset (Terraform, Helm, Chef).
- Experience with chaos testing and relevant software (Gremlin, FIS).
Preferred Qualifications:
- Previous internship or large-scale project experience.
- Experienced with at least one of the following languages: Golang or Python.
- Familiarity with Node.js, Java.
- Have worked with CI/CD tooling such as Jenkins or Gitlab.
- Preferred experience with alerting and monitoring tools such AppDynamics and Splunk.
- Familiarity with:
○ SDLC Build and Release processes.
○ Building docker images.
○ Container orchestration: Kubernetes and ECS.
- Proficiency with one of the following cloud providers: AWS, Google or Microsoft.
- Proficiency with:
○ Terraform, Helm or Chef.
○ Networking basics (routing, firewalls, AWS security groups).
○ Troubleshooting / analysis of applications: Splunk, AppDynamics, Grafana, etc.
○ OS performance troubleshooting and ability to install and configure operating system packages.
- Familiarity with:
○ Oauth2.
○ Security principles on patching, compliance, and changing control process.
Required Education:
- Pursuing a degree in Computer Science or related technical experience and authorized to work in the U.S. without requiring sponsorship now or in the future.
Salary : $70 - $75