Data Scientist (6257U), Quantitative Biosciences - 76306
About Berkeley
At the University of California, Berkeley, we are dedicated to fostering a community where everyone feels welcome and can thrive. Our culture of openness, freedom and belonging makes it a special place for students, faculty and staff.
As a world-leading institution, Berkeley is known for its academic and research excellence, public mission, diverse student body, and commitment to equity and social justice. Since our founding in 1868, we have driven innovation, creating global intellectual, economic and social value.
We are looking for applicants who reflect California's diversity and want to be part of an inclusive, equity-focused community that views education as a matter of social justice.
Departmental Overview
The Arkin laboratory for systems and synthetic biology seeks to uncover the evolutionary design principles of cellular networks and populations and to exploit them for applications. To do so, they are developing a framework to effectively combine comparative functional genomics, quantitative measurement of cellular dynamics, biophysical modeling of cellular networks, and cellular circuit design to ultimately facilitate applications in health, the environment, and the circular bioeconomy on Earth and in space. They lead projects that advance a predictive, mechanistic understanding of microbial biology and the impact of microbial communities on their ecosystems.
Position Summary
This position is for an experienced data scientist / engineer to help build a data integration, analysis and management system for a multidisciplinary research program aimed at designing microbial communities ('pre- and probiotics') to protect human airways from infection. The project's approach integrates advanced microbiome engineering, multimodal functional metagenomics, high-throughput microbial isolation and characterization, model-driven synthetic community composition, and iterative design-build-test cycles to move formulations toward clinical translation. The team is highly collaborative and works toward well-defined goals and milestones.
Responsibilities
- Set and maintain standards for data and metadata reporting, software documentation and version control, and protocol reproducibility across the project.
- Develop, implement, and maintain software pipelines and workflows for data integration, centralization, and harmonization, accommodating multiple data types.
- Interact closely with collaborating teams to design and establish an integrated data framework that supports value-added analyses, visualization tools, and user-friendly data access.
- Perform quality checks on datasets, troubleshoot issues, and implement best practices for data storage and version control.
- Collaborate with modeling and machine learning experts to refine data processing methods and incorporate advanced analytics.
- Support training of team members in data handling, informatics tools, and best practices in reproducible research.
- May attend program meetings as required.
Required Qualifications
Preferred Qualifications
Salary & Benefits
This is a full-time (40 hours / week) contract appointment with the possibility of extension, eligible for UC benefits. The budgeted annual salary range is $88,900.00 - $126,400.00.
How to Apply
Other Information
Equal Employment Opportunity
The University of California is an Equal Opportunity / Affirmative Action Employer.