What are the responsibilities and job description for the Lead Data Scientist position at Culture Amp?
How you can help make a better world of work
We are looking for an experienced Lead Data Scientist to drive the creation of understandable, enriched and meaningful data assets and products that will power Culture Amps advanced features. We have a unique, world class data set covering employee engagement and performance globally. This role is to lead increasing investment in this data to power actionable insights and personalised AI assistance across our project set. This role will be leading from the front - You will be a master of your craft, lead the delivery from a small cross functional team, as well as managing the growth and performance of several junior Data Scientists.
You will
- Lead an enrichment team that will establish and evolve AI powered data pipelines that create advanced data assets and products to power Culture Amps platform and products.
- Work with stakeholders to ensure clear definition of purpose, value, and dependencies such that our investments are leveraged by multiple consumers
- Be part of the Data Science leadership team that contributes to strategy as well as maturing our data science practice including: insights generation, experimentation, metrics definition, evaluation and monitoring.
- Develop robust data pipelines, and LLM based enrichment processes
- Stay up to date with current research, literature and provider offerings to enable the pragmatic application to Culture Amps customer needs.
You have
- Experience leading a team delivering data assets and products
- Knowledge of analytic engineering modelling practices and experience in creating performant data artefacts at scale
- Experience maturing practice and managing Data Scientists
- Deep experience with SQL, preferably with enterprise database systems like Redshift, Postgres. Ideally experience with data processing tools like DBT and Dask
- Experience with establishing scalable, robust, and reliable data pipelines for enrichment in a platform setting.
- Deep experience in Python data science space, particularly around NLP - LangChain, LangSmith, pandas, numpy, sci kit learn, scipy, hugging face etc.
- Understanding of statistical and machine learning models. Knowledge of experimental design, statistical testing and model validation.
- Experience in data visualization tools such as plotly, seaborn, streamlit etc would be an advantage.
- Experience with data modelling and exposure to tools like metaflow or airflow is desirable.
- Experience with common software practices like version control, continuous deployment, and testing.
- Industry, or equivalent academic experience in researching and developing Generative AI powered Products and services.
You are
- Curious and have a learning mindset.
- Enjoy identifying tough problems that can be solved with scientific thinking
- A pragmatic, critical thinker, asking the right questions, and backing up assertions with logic and data
- Able to make the complex simple, to persuasively communicate with a range of stakeholders
- Excited about making data and AI products that have real impact in people’s lives.
- Someone who shares our passion for making a better world of work.
- Not afraid to learn through making mistakes and asking for help.