What are the responsibilities and job description for the Member of Technical Staff, Data Pipeline position at Boson AI?
Boson AI is an early-stage startup building large language tools for everyone to use. Our founders (Alex Smola, Mu Li), and a team of Deep Learning, Optimization, NLP, AutoML and Statistics scientists and engineers are working on high quality generative AI models for language and beyond.
We are seeking machine learning engineers to join our team full-time in our Santa Clara office. As part of your role, you will help us build pipelines of data collection, data filtering, synthetic data generation and data analysis. This will help us build more lifelike AI models. You will work closely with other scientists and engineers to empower our next generation of large multimodal model.
Making sure you fit the guidelines as an applicant for this role is essential, please read the below carefully.
Responsibilities :
- Design and develop data collection pipelines to gather and preprocess diverse datasets (beyond language) from various sources (beyond web crawls).
- Design and develop data processing pipelines, including data labeling, data filtering, data cleaning, data visualization, data auditing, etc.
- Implement machine learning models to improve the quality and diversity of data, e.g., quality classifier, document layout model, speech transcribe model.
You may be a good fit if you have :
Strong candidates may also have :
150,000 - $300,000 a yearBoson AI offers 401k with employer matching, Gold level healthcare, HSA, FSA and free meals (we have dried mangoes, too).
J-18808-Ljbffr
Salary : $150,000 - $300,000