What are the responsibilities and job description for the Google Cloud Engineer position at Diverse Lynx?
Skills : Google Cloud Platform, GCP, Cloud Admin, GCP, PySpark
Job Description :
Google / AWS / Azure public cloud, PySpark, Big Query and Google Airflow)
- Participate in 24x7x365 SAP Environment rotational shift support and operations
- As a team lead you will be responsible for maintaining the upstream Big Data environment day in day out where millions of financial data flowing through, consists of PySpark, Big Query , Datgaproc and Google Air flow
- You will be responsible for streamlining and tuning existing Big Data systems and pipelines and building new ones. Making sure the systems run efficiently and with minimal cost is a top priority
- Manage the operations team in your respective shift, You will be making changes to the underlying systems
- This role involves providing day-to-day support, enhancing platform functionality through DevOps practices, and collaborating with application development teams to optimize database operations..
- Architect and optimize data warehouse solutions using Big Query to ensure efficient data storage and retrieval.
- Install / build / patch / upgrade / configure big data applications
- Manage and configure Big Query environments, datasets, and tables.
- Ensure data integrity, accessibility, and security in the Big Query platform.
- Implement and manage partitioning and clustering for efficient data querying.
- Define and enforce access policies for Big Query datasets.
- Implement query usage caps and alerts to avoid unexpected expenses.
- Should be very comfortable with troubleshooting Linux-based systems on issues and failures with good grasp of the Linux command line
- Create and maintain dashboards and reports to track key metrics like cost, performance.
- Integrate Big Query with other Google Cloud Platform (GCP) services like Dataflow, Pub / Sub, and Cloud Storage.
- Enable Big query through tools like Jupiter notebook, Visual Studio code, other CLI's
- Implement data quality checks and data validation processes to ensure data integrity.
- Manage and monitor data pipelines using Airflow and CI / CD tools (e., Jenkins, Screwdriver) for automation.
- Collaborate with data analysts and data scientists to understand data requirements and translate them into technical solutions.
- Provide consultation and support to application development teams for database design, implementation, and monitoring.
- Proficiency in Unix / Linux OS fundamentals, shell / Perl / python scripting, and Ansible for automation.
- Disaster Recovery & High Availability
- Expertise in planning and coordinating disaster recovery principles, including backup / restore operations
- Experience with geo-redundant databases and Red hat cluster
- Accountable for ensuring that delivery is within the defined SLA and agreed milestones (projects) by following best practices and processes for continuous service improvement.
- Work closely with other Support Organizations (DB, Google, PySpark data engineering and Infrastructure teams)
- Incident Management, Change Management, Release Management and Problem Manage me
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.