What are the responsibilities and job description for the Site Reliability Engineer (SRE) position at Datasage Technologies?
1099/C2C
Hybrid Role
Site Reliability Engineer (SRE)
The Site Reliability Engineer (SRE) resource to work closely with management and technical staff to perform system maintenance, operations and monitoring services. The Site Reliability Engineer will provide daily management and maintenance of core ETS' enterprise systems and ensure performance, system monitoring, RCA (root cause analysis), addressing unresolved issues, and providing infrastructure support for modifications and modernization.
Ensure the reliability, scalability, and performance of CAPMAN systems and infrastructure. By proactively addressing incidents, optimizing system performance, automating workflows, and enhancing system resilience, you will play a critical role in maintaining our platform's stability and supporting business continuity.
Perform activities in compliance with IT standards and procedures, audit requirements, reliability, quality, and security. Coordinate all maintenance and server upgrades with CAPMAN and other ETS' teams to ensure system uptime during essential system performance instances.
Provide support for all CAPMAN environments including system upgrades, testing, and troubleshooting assistance ensuring maximum uptime. This effort will require occasional after-hours and weekend work.
Coordinate with various DHCS' sections to ensure all areas of CAPMAN environments are running at maximum potential uptime with minimal downtime.
Provide recommendations, coordination, and support regarding the transition of CAPMAN system to a cloud-hosted environment or managed cloud services.
MANDATORY QUALIFICATIONS
Seven (7) years of FTE experience on a large-scale project, aligning IT systems with organizational business processes.
Five (5) years of FTE experience in the installation/implementation, configuration, and troubleshooting using the following stack:
1. Windows Server Operating System
2. Internet Information Services
3. Windows Services
4. .NET Framework
Three (3) years of FTE experience
1. Experience working on AWS IAM, CLI, EC2 (Windows Server), VPC, Storage, S3, RDS, EBS, Lambda functions, CloudWatch, Network configurations, Security groups
2. DevOps tool experience such as Jenkins or AWS Code pipeline, Code build, Git
3. Python/Boto3, Power Shell scripting,
4. Cloud Native Infrastructure background
Three (3) years of FTE experience working on Datadog configuration and management
DESIREABLE QUALIFICATIONS
Two (2) years of experience in Infrastructure as Code (IaC): Terraform, AWS Cloud formation, Container Orchestration: Docker/Kubernetes, Application Performance Management (APM) tool: Datadog, Code scanning tool: Checkmarx, Confluence, Jira, Splunk, AWS CDK, Active Directory Integration, Biztalk, ServiceNow, Network troubleshooting
Job Type: Full-time
Pay: $40.00 - $55.00 per hour
Schedule:
- 8 hour shift
Work Location: Hybrid remote in Sacramento, CA 95814
Salary : $40 - $55