At Ninety, we’re relentlessly focused on helping people build great companies.
Ninety is an innovative, cloud-based platform that provides leadership teams with everything they need to build focused, aligned, and thriving organizations.
Our platform not only simplifies the hard work associated with building great organizations but organizes, stores, and makes easily accessible all the associated information.
As the Ninety organization continues to experience exceptional growth, we are building an LLM-driven experience that will allow our users to interact with our product in a whole new way, and we are looking for a Data Scientist who thrives in a data science and machine learning environment with a profound love for curating, building, and optimizing dataset construction. You will serve as the key departmental cross-functional conduit, adept at sourcing both internal content within other 90 teams, and external content that enriches Ninety’s domain-specific data needs. You will have extensive experience in, and a passion for, wrangling, normalizing, cleaning, and preparing machine-ready datasets from unstructured data sources, and will be
Your key responsibilities will include:
- Identification of opportunities for data acquisition, internally and externally
- Responsibility for Data Quality Assurance. Must understand the principles of data quality
- Creation and management of a Content Management System for efficient data storage and retrieval
- Own the construction of datasets
- Creation, maintenance, and improvement of Golden Record datasets
- Feature engineering, with an understanding of deriving new features and handling standard problems of supervised learning including knowledge of relevant math, e.g. linear algebra
- Creation of synthetic data for LLMs, identifying the reasons for its use, and understanding the decision-making process behind such practices
- Exploration of options when labels are not available in an NLP dataset and knowledge of how to handle sparse data, e.g., when recall is low across the distribution of content
- Ensure data integrity and accuracy through robust quality control processes and data validation
- Remaining well-versed in the latest practices, including advice on the creation of synthetic data for LLMs, identifying the reasons for its use, and understanding the decision-making process behind such practices.
- Extract and source all necessary data for modeling and machine-ready training
To be successful in this role you must have:
- Understanding of the complete lifecycle of data, from sourcing and transformation to modeling and insights generation. Coordinate the data workflow and manage the iterative data feedback loops essential for machine learning model refinement (e.g., RLHF)
- Proven expertise in managing ETL processes and owning the data lifecycle from extraction to modeling preparation
- Strong capabilities in data quality assurance, ensuring the integrity and accuracy of data through sophisticated validation techniques.
- Experience with web crawling/scraping and integrating diverse data sources into cohesive and reliable datasets.
- Strong familiarity with both structured and unstructured data, applying advanced transformation and normalization techniques.
- Expertise in Python, SQL, GIT, CI/CD practices, deep learning libraries (PyTorch, Tensorflow, Hugging Face, advanced statistical libraries). You should feel comfortable writing production source code and building data integrations
- Substantial experience in data cleaning, preprocessing, and transformation techniques
- Experience with evaluation methods for model testing and tuning using Golden Record data
- Understanding of both structured and unstructured data to improve traditional processes
- Deep understanding of ETL processes, data cleaning, transformation techniques, and the construction of datasets optimized for machine learning
- Willingness to engage in an early-stage environment and actively contribute to its growth
Preferred qualifications:
- 3 years of experience in a data science, data engineering, or ML role, Bachelor's or Master's degree in Computer Science, Data Science, Engineering, or a related field.
- APIs, SDKs and other programming processes commonly used in data science and ML
- Experience with cloud services like AWS, Google Cloud Platform, or Azure, specifically their data management and ML services. Comfortable with setting up basic cloud infrastructure
- Experience with statistical analysis for ML; close understanding of data distributions and diagnosis
- Experience with developing validation rules and benchmarks
- Demonstrated ability to develop and optimize data structures for AI applications, including supervised learning methods.
- Excellent skills in technical writing and documentation, maintaining clear records to support team collaboration and project continuity.
- Knowledge of current trends, available models and architectures, and major advancements in the NLP field
About Us as an Employer:
- Ninety’s focus on helping organizations focus, align, and thrive isn't just for our clients. We also believe deeply in helping our employees flourish. At 90, we focus on attracting, developing, and retaining our kind of great people. How?
- We believe in , giving our employees the freedom and flexibility to work from wherever they need to live their best life.
- As-needed vacation. Don’t worry about punching the clock or losing days.
- Health and Dental insurance with employer contribution toward premiums
- Employer Paid Life insurance and Long Term Disability Coverage
- HSA, FSA, and DCFSA accounts available
- Productivity/Wellness allowance
- Generous Paid Parental Leave
- Professional development allowance
- Technology allowance
- Company gatherings with travel allowance.
- Why we love doing what we do: Small to midsize businesses are the foundation of almost every healthy community. They provide not just employment but opportunities for people to learn, grow and become leaders who take responsibility for the well-being of the community.
- Our values:
G …Get Smart Stuff Done
T …Teamwork
R …Resilient
I … Inquisitive
B …Best
E …Extra Mile
The “Seat” we have open
As a part of our team, you'll contribute to a data-driven culture that leverages cutting-edge technology to solve real-world problems.
Once we are confident we have someone who is a great core-value fit and well suited for the seat they are in, we provide a high degree of autonomy so they can master their seat leveraging their Unique Abilities™ and then take on more and more responsibility.
If the above sounds appealing to you, we invite you to apply and see if you are well suited to help us help companies achieve real, healthy, and sustainable growth!
Powered by JazzHR
HMXzisGOKo