What are the responsibilities and job description for the Software Engineer (Observability Platform Engineer) position at TekWissen ®?
Job Title: Software Engineer (Observability Platform Engineer)
Job Location: Pleasanton, CA 94588
Job Type: Contract
Duration: 6 Months
Work Type: Hybrid
Pay Rate: $60-70/ Hourly/ W2
Overview:
TekWissen is a global workforce management provider headquartered in Ann Arbor, Michigan that offers strategic talent solutions to our clients world-wide. Client is an American on‑demand financial management, human capital management, and student information system software vendor.
Job Description:
- The Data Platform and Observability (DPOE) team enables insight into the operation and performance of Client's increasingly sophisticated computing infrastructure and application environment.
- Our large-scale distributed data platform digests hundreds of terabytes of structured and unstructured data in the form of billions of messages produced daily by Client applications and infrastructure services.
- We develop software and fully coordinate solutions for the entire data pipeline from data collection to visualization and reporting.
- If you enjoy writing efficient software, crafting innovative capabilities by integrating the latest cloud-native technologies, and tuning and scaling large distributed systems, you will enjoy working with us!
- Observability Platform Engineering (PPE) team deploys and operates Observability cloud infrastructure (Kubernetes, CI/CD tooling, networking, security) for the DPOE observability teams.
- Additional engineers are needed to handle the transition to full availability of all Observability services and the now-growing scalability and performance challenges.
- PPE team is a modern full-service DevOps team responsible for the development, creation, and operation of Client Observability Services (Metrics, Logs, Traces, Stats & other Services).
- As a DPOE team member and specialist in Cloud Platform Engineering, you will play a key role in developing the agile automation that underpins and enables our next-generation observability platform.
- As the common cloud platform infrastructure team, we own everything from virtualized compute and storage to critical infrastructure observability to governance to CI/CD.
You will:
- Collaborate with the security and networking teams to configure and deploy defense-in-depth security controls and highly performant networking infrastructure.
- Develop an architecture, configuration standards, and automation to enable low-risk agile service deployment by all DPOE teams.
- Create infrastructure monitoring and tooling for platform performance tuning and debugging.
- Drive problems impacting critical systems to solution and implement automation to prevent reoccurrence.
- Participate in the on-call rotation to support DPOE critical systems.
- Research, evaluate, and develop new open source and cloud native tools and technologies as needed to meet new requirements.
Basic Qualifications:
- 3 years of software engineering experience using one or more of the following: Java, Python, Golang.
- 3 years of AWS public cloud engineering DevOps experience with extensive expertise in networking and security architecture, configuration, and deployment
- 3 years of Design, Build & Maintain large scale Kubernetes clusters
- 3 years with container orchestration platforms like - EKS/ECS/GKE etc
- 3 years hands on experience with Terraform and/or Ansible and/or Pulumi and/or other IaC technologies.
- 3 years in Cloud native Open Source tools
- 3 years of Hands-on experience with ArgoCD for continuous deployment and orchestration
Other Qualifications:
- Expertise in TCP/IP protocol debugging, IP routing configuration, and firewall/proxy/load balancer configuration
- Deep knowledge in the configuration of security services such as key management systems, AWS IAM policies, Kerberos, and LDAP/AD integration
- Experience with public cloud provider technology stacks at scale, especially VM provisioning, Kubernetes, data storage, and stream processing services
- Distributed system performance analysis and optimization experience
- Development experience with a wide range of programming languages, for example, Java, Python, Golang, etc.
- Experience learning complex open-source service internals via code inspection.
- Experience with modern software development tools including CI/CD and methodologies like Agile.
- Experience with Linux system internals and tuning.
- Strong written and oral communication skills and the ability to explain esoteric technical details clearly to engineers without a similar background.
TekWissen® Group is an equal opportunity employer supporting workforce diversity.
Salary : $60 - $70