What are the responsibilities and job description for the AI/ML Scientist position at BioTalent?
A clinical-stage bio-pharmaceutical company focused on cancer treatments is seeking a Senior Scientist to enhance its in-house large language model (LLM) for sequence-to-structure-to-activity relationship modeling in antibody discovery, protein engineering, and immuno-oncology applications.
In this role, you will design, develop, and implement AI models, data pipelines, and parallel computing architectures to accelerate the discovery and development of novel therapeutics using in-house LLM implementations to accelerate therapeutic discovery.
Key Responsibilities:
- Fine-tune Llama 3.3 models for sequence-to-structure-to-activity predictions, incorporating domain-specific knowledge.
- Design data generation pipelines and develop algorithms for data processing and augmentation.
- Fine-tune Llama 3.3 using internal R&D data to optimize model performance.
- Integrate language models to store data in a vector database for processing and design.
- Implement data flow management from laboratory systems to AI databases for seamless integration.
- Optimize parallel computing and model training using technologies like MPI, OpenMP, Dask, and Ray.
- Develop software for model training, data processing, and workflow integration.
- Implement data security measures and manage backup strategies for sensitive data.
- Develop disaster recovery and incident response procedures for business continuity.
Requirements:
- Ph.D. or Master’s in Computer Science, AI, Bioinformatics, Computational Biology, or related field.
- 3 years of AI/ML model development experience, especially in natural language processing and database management.
- Proficiency in Python, C , or Julia.
- Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and parallel computing.
- Strong knowledge of antibody discovery, protein engineering, and immuno-oncology.
- Strong communication and collaboration skills.