What are the responsibilities and job description for the Senior Principal AI Product Engineer – Data position at Aitopics?
5000 - Vertex US - Boston, United States
Vertex Pharmaceuticals invests in scientific innovation to create transformative medicines for people with serious diseases.
At Vertex, we are pioneering the use of large language models and generative AI to create transformative solutions across our organization. Our corporate data science team is at the forefront of that effort, and we are looking for a Senior AI Product Engineer – Data to build the foundations for the next generation of human augmented AI products.
The ideal candidate brings strong expertise in data engineering for AI products and is well-versed in the latest Large Language Model (LLM) technologies including Retrieval-Augmented Generation (RAG) and unstructured knowledge bases. We are seeking engineers with strong experience collaborating with data scientists to prepare and organize data for AI solutions, and to help transition these solutions from pilot to production.
You will work on a highly collaborative, centralized team of data scientists, product engineers, and strategists that drive value and impact for our highest priority business needs. You will work side-by-side with internal partners across clinical, commercial, manufacturing and general and administrative areas to develop creative solutions that contribute meaningfully to our business and patients.
You will develop AI focused data products for managing text document libraries including automation of NLP embeddings into vector database assets, automation of RAG procedures, and deployment into a full data pipeline in collaboration with UI developers.
Key Duties and Responsibilities :
- Identify, evaluate, and integrate diverse data sources into data products for the Generative AI program
- Develop and optimize data processing workflows for chunking, indexing, and vectorization for both text and non-text data sources
- Benchmark and implement various vector stores, embedding techniques, and retrieval methods
- Create a flexible pipeline supporting multiple embedding algorithms, vector stores, and search types (e.g., vector search, hybrid search)
- Implement and maintain auto-tagging systems and data preparation processes for LLMs
- Integrate and optimize workflows using various vector store technologies
- Write clean, maintainable data pipelines that generate AI and data science-ready outputs
- Collaborate with data scientists and front-end engineers to design scalable AI-driven solutions
- Communicate the technical approaches and associated benefits or drawbacks to multiple stakeholders via oral presentations or written documentation
- Provide engineering oversight across projects and support data scientists and engineers in optimizing approaches
- Continually scan the external environment for and bring in novel technological innovations that can be applied to our projects
Knowledge and Skills :
Education and Experience :
Flex Designation :
Hybrid-Eligible Or On-Site Eligible
Flex Eligibility Status :
In this Hybrid-Eligible role, you can choose to be designated as :
1. Hybrid : work remotely up to two days per week; or select
2. On-Site : work five days per week on-site with ad hoc flexibility.
Company Information :
Vertex is a global biotechnology company that invests in scientific innovation.
Vertex is committed to equal employment opportunity and non-discrimination for all employees and qualified applicants without regard to a person's race, color, sex, gender identity or expression, age, religion, national origin, ancestry, ethnicity, disability, veteran status, genetic information, sexual orientation, marital status, or any characteristic protected under applicable law. Vertex is an E-Verify Employer in the United States. Vertex will make reasonable accommodations for qualified individuals with known disabilities, in accordance with applicable law.
J-18808-Ljbffr