What are the responsibilities and job description for the Data Scientist position at Embue?
This is a hybrid position. Candidates must commit to working in our Worcester, MA office 2-3 days per month.
Embue is an early-stage company at the intersection of proptech and climate tech that has created a whole building intelligence, automation and control platform for multifamily portfolios. With Embue, any type of apartment building can quickly become smarter, more profitable and more energy and carbon efficient.
Our software, combined with IoT devices placed throughout an apartment building, does everything from measuring and controlling the temperature of each unit and common area to detecting mold and water leaks to gauging the health of and controlling central equipment.
Our development team is small and growing. Together, the team has built an enterprise-class product that our customers rave about, and now we’re looking to take Embue to the next level with the addition of a Data Scientist.
Responsibilities
- Partners with leadership to understand business problems and translate them into identifiable machine learning problems which can be delivered as technical solutions and actionable recommendations
- Coordinate with business teams to monitor outcomes and refine/ improve the machine learning models
- Work across the spectrum of statistical modeling including supervised, unsupervised, & deep learning techniques to apply the right level of solution to the right problem
- Collaborate with data and software engineers to enable deployment of models that will scale across the Embue’s ecosystem
- Own the code review process to ensure stringent coding guidelines are met
- Choose the right machine learning approach & models while utilizing open source languages such as R, Python, etc.
- Lead data mining and collection procedures for all business use cases and guarantee data quality and integrity
- Utilize data visualization tools to deliver insights to stakeholders and present technical solutions to non-technical audience in a simple and clear manner
- Spot and evaluate emerging/cutting edge, open source, data science/machine learning libraries/big data platforms (e.g. XGBoost, H2Oai, Spark, Hadoop, etc.)
- Build frameworks leveraging APIs to industrialize AI models across the organization
- Adhere to stringent quality assurance and documentation standards using version control and code repositories (e.g., Git, GitHub, Markdown)
Desired Skills & Experience
- You should have a bachelor’s or master’s degree in computer science, statistics informatics, or another quantitative field
- You should have experience in solving real-life problems using machine learning
- Hands-on knowledge of data wrangling, data cleaning/ preparation, dimensionality reduction is required
- Knowledge of linear algebra, statistical and probabilistic modeling is required
- Exploratory data analysis and hypothesis testing to identify ML opportunities is a plus
- Experience in major machine learning frameworks such as Pytorch, Scikit-Learn, Tensorflow, Pandas, SparkML etc.
- Solid proficiency in Python, R, or other relevant languages
- Familiarity with databases like MySQL, Oracle, SQL Server, NoSQL, etc. is desirable
- Experience working with Amazon SageMaker or Azure ML Studio for deployments is a plus
- Strong analytical and critical thinking skills. You should also have a business mindset, swift to identify risk situations and opportunities, and able to generate creative solutions to business problems
Embue offers competitive compensation and benefits. We have a flexible hybrid work model that comes with a home office stipend.
Embue provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.