What are the responsibilities and job description for the Site Reliability Engineer position at Tentek, Inc.?

Site Reliability Engineer

Client location: Orlando, FL

Work location: Orlando, FL (onsite)

Duration of Assignment: 12 months

W2 only position

JOB DESCRIPTION:

Our Mission statement

Reduce/Eliminate Guest Impacting Incidents/Outages across the Guest Experience portfolio.
Allow the product teams to focus on development and enhancement of our Products.

Qualities we are looking for:

You like working with clients - you will work with customers/product engineering to gather requirements. You like hearing stories.
You have a passion for improvement - you have passion for improving processes (e.g. through less code, fewer manual steps, fewer systems, improving velocity).
You are law-abiding but agent-of-change - you will advocate compliance with known standards and engage engineers to improve upon processes.
You are a team player - you mentor others and contribute support documentation; here, heroes work at enriching the team.
You can multitask - you are action oriented, capable of working with concurrent projects.
You have a developer mindset and are comfortable writing code.
With an operations mindset you have some experience in maintaining production systems.

Expectations:

In this job you are:

You will:

Create/maintain/improve/troubleshoot SDLC pipelines.
Create/maintain/improve/troubleshoot monitoring technologies.
Create/maintain/improve/troubleshoot infrastructure technologies (cloud and on prem).
Create/maintain/improve documentation on the technologies that the team builds.
Shadow operation and engineering team members in their areas of subject matter expertise.

Qualifications:

Have expert Build/Release skills - you will work with product development teams across the enterprise to test in code delivery SDLC pipelines.
Have expert monitoring skills - you will work on ensuring the tools that keep monitoring are up and effective at notifying guest- facing issues.
Have expert team communication skills - you will work to ensure that the larger team understands and approves of their solutions.
Have expert technical fundamentals - you must have an expert level command of Unix system administration duties.
Have experience in the public cloud - you are proficient with launching products in a variety of hosting solutions, including public (Google, AWS, Azure, Salesforce) and private cloud systems.
Have experience in Infrastructure as Code ( IAAS ) - you subscribe to Infrastructure as code mindset (Terraform, Helm, Chef).
Experience with chaos testing and relevant software (Gremlin, FIS).

Preferred Qualifications:

Previous internship or large-scale project experience.
Experienced with at least one of the following languages: Golang or Python.
Familiarity with Node.js, Java.
Have worked with CI/CD tooling such as Jenkins or Gitlab.
Preferred experience with alerting and monitoring tools such AppDynamics and Splunk.
Familiarity with:

○ SDLC Build and Release processes.

○ Building docker images.

○ Container orchestration: Kubernetes and ECS.

Proficiency with one of the following cloud providers: AWS, Google or Microsoft.
Proficiency with:

○ Terraform, Helm or Chef.

○ Networking basics (routing, firewalls, AWS security groups).

○ Troubleshooting / analysis of applications: Splunk, AppDynamics, Grafana, etc.

○ OS performance troubleshooting and ability to install and configure operating system packages.

○ Oauth2.

○ Security principles on patching, compliance, and changing control process.

Required Education:

Pursuing a degree in Computer Science or related technical experience and authorized to work in the U.S. without requiring sponsorship now or in the future.

Salary : $70 - $75

Apply for this job

Receive alerts for other Site Reliability Engineer job openings

Site Reliability Engineer