What are the responsibilities and job description for the AWS Cloud Engineer- Remote near Cleveland, OH or Austin, TX position at OEC?
Candidates must reside near Cleveland, OH or Austin, TX for an occasional on-site visit. Relocation not available.
Job Summary/Objective
Plans and coordinates the design, installation, administration, and usage of server, storage, virtualization, and other distributed technologies in public and private cloud. Conducts system analysis and development to keep systems current with changing technologies. Coordinates communication and support between user community and the technical/operations functions of IT in relationship with all Windows/Linux/Cloud functions. Takes initiative to review and trouble shoot issues. Maintains or creates detailed documentation of configurations, process, and procedures for company infrastructure.
Key Responsibilities & Duties (essential to the job)
Manages day-to-day maintenance tasks, including identifying and resolving problems, conducting root-cause analysis, and responding to live site issues.
Designs, deploys, and manages infrastructure on Azure/AWS/On Premise Systems, ensuring high availability, fault tolerance, and scalability.
Provides 2nd level technical support to team members, and provides technical support and troubleshooting for application and system related issues.
Researches, installs, and tests software updates and patches to supported applications and various operating systems. Supports ongoing efforts, giving and receiving technical information, communicating with various teams across the company.
Creates, maintains, and provides documentation of systems architecture, cloud configurations, troubleshooting procedures, best practices, support guidelines, system metrics, project information and plans, and training information.
Provides technology stewardship for distributed technologies such as DNS, SMTP, domain controllers, active directory applications, ssl certificates, web servers, and FTP services, and stewardship for Azure/AWS Cloud-based solution architectures, leveraging in-depth knowledge of PaaS and IaaS, Azure Active Directory, managed Virtual Networks, DNS, and Logging.
Researches software updates and patches for supported applications and various operating systems, and assists system engineers with installation and testing.
Researches new technologies, as requested, and in coordination with senior level team members or leadership.
Addresses areas where existing policies and procedures require change, or where new ones need to be developed.
Monitors system logs and activity on company servers. Reacts to alerts and escalates accordingly by engaging the necessary resources (internal and external) to OEC.
Develops and maintains infrastructure code using Terraform and CI/CD pipelines to provision and manage cloud resources efficiently.
Utilizes PowerShell, Bash, Ansible or other scripting languages to automate routine tasks.
Education
A bachelor’s degree from an accredited college or university is required, with a focus in Business Information Systems, Computer Science, Information Technology, or related discipline. In the absence of a degree, equivalent work experience directly related to the key responsibilities of the role will be considered as a substitute for the degree.
Experience, Skills and Key Competencies
At least 4 years of experience in troubleshooting hardware/software for servers in production environments, plus 2 or more years of systems engineering experience designing and implementing Azure, AWS, VMWare and Windows/Linux server technologies.
Must also be able to demonstrate the following skills and abilities:
In-depth knowledge of Azure and AWS is preferred.
Broad knowledge of Windows, Linux, VMware, Entra/Active Directory, DNS, SMTP, Hybrid Networking.
Strong interpersonal skills with the ability to establish collaborative working relationships across global teams.
Ability to apply principles of analytical thinking to define problems, identify gaps, collect data, establish facts, and develop recommendations to solve problems based on all considerations.
Able to quickly learn and gain an understanding of OEC technologies.
Special Position Requirements
Must be available for 24-hour ‘on-call’ service to respond to system emergencies.