Data Scientist

Yale School of Medicine
New Haven, CT Full Time
POSTED ON 1/15/2025
AVAILABLE BEFORE 4/11/2025

Position Focus :

The Cardiovascular Data Science (CarDS) Lab at Yale University has an opening for a Data Scientist to participate in a series of projects that focus on leveraging healthcare data to improve patient care. The work spans structured and unstructured data in the electronic health record (EHR), with the opportunity to work on applications of machine learning / deep learning in novel areas of healthcare. The position is open to a driven individual with master’s or doctoral training in data science, computer science, or a similar field, and some experience working with large datasets. Prior experience with healthcare is not required but will be helpful. The ideal candidate will have an interest in broad career development in a dynamic environment that allows them to develop as a leader in healthcare data science and innovation.

The team at the CarDS Lab is a multidisciplinary group of postdoctoral trainees and graduate and undergraduate students from across Yale, with broad collaboration with informatics, computer science, and statistics groups at Yale and at several leading institutions nationally. There will also be a close collaboration with the Yale-New Haven Hospital Center for Outcomes Research and Evaluation (CORE). Our team has developed novel tools to measure and improve the quality of care using data from the EHR and software solutions for the early diagnosis of several cardiovascular disorders. The research in the lab allows unique collaborative opportunities with industry and health technology partners.

Under the direction of the Principal Investigator, the ideal candidate will perform a variety of duties involving the development of data and analytic pipelines for research studies and will work as a member of a research team to provide input on study design, perform data analysis, and lead or assist in drafting analytical sections for peer-reviewed publications for various projects. The candidate is expected to lead several efforts, including working with the team to discover, develop, and apply novel machine learning applications to healthcare. Responsibilities will also include participating in the design, implementation, and maintenance of data pipelines and leading or assisting in building deep learning algorithms in close collaboration with the study team.

Programming experience with Python and / or R is required. Experience with one or more of the following skills will be an asset, and the position offers opportunities to learn them for individuals who are willing to do so : distributed and cluster computing, with a specific focus on PySpark; working with large tabular data in Python / R; basic principles of natural language processing and their applications in Python with PyTorch / Hugging Face / spaCy; applications of computer vision and signal processing in TensorFlow or PyTorch; and the ability to deploy and work in containerized environments.

Essential Duties

Develop and execute new and / or highly complex algorithms and statistical predictive models and determine analytical approaches and modeling techniques to evaluate potential future outcomes. Establish analytical rigor and statistical methods to analyze large amounts of data, using advanced statistical techniques and mathematical analyses. Manage analytical projects from data exploration and model building through performance evaluation and implementation. Develop work plans and monitor progress and project timelines. Document code and changes to work plans using established work group methods in GitHub. Interact with a multidisciplinary team of internal and external peers to regularly, effectively, and openly communicate progress and outcomes of planned work. Attend weekly team meetings to discuss team and project-related activities, issues, changes, communications, and updates.

Required Education and Experience

Master’s Degree in computer science, applied / computational mathematics, engineering, biostatistics, statistics, or a quantitative field such as astronomy or geology, and 2 years of hands-on experience in deep learning or an equivalent combination of education and demonstrated experience.

Required Skill / Ability 1 :

Demonstrated expertise working with Linux, Python, and Java.

Required Skill / Ability 2 :

Ability to work with large structured and unstructured datasets, and GPU-accelerated computing. Proven experience with Large Language Models.

Required Skill / Ability 3 :

Sound background in theoretical and applied machine learning / deep learning with applications to either language, signals, or images.

Required Skill / Ability 4 :

Demonstrated strong ability to communicate technical ideas and results to non-technical customers in written and verbal formats.

Required Skill / Ability 5 :

Strong organizational, time management, and leadership skills. Ability and willingness to work in a highly collaborative team environment and matrixed organization.

Preferred Education, Experience and Skills :

Master’s degree in computer science, applied / computational mathematics, engineering, biostatistics, or statistics and demonstrated hands-on experience in deep learning, or a PhD in any of the previously mentioned fields.

Drug Screen

Health Screening

Background Check Requirements

All candidates for employment will be subject to pre-employment background screening for this position, which may include motor vehicle, DOT certification, drug testing and credit checks based on the position description and job requirements. All offers are contingent upon the successful completion of the background check. For additional information on the background check requirements and process visit "Learn about background checks" under the Applicant Support Resources section of Careers on the It's Your Yale website.

Health Requirements

Certain positions have associated health requirements based on specific job responsibilities. These may include vaccinations, tests, or examinations, as required by law, regulation, or university policy.

Posting Disclaimer

The intent of this job description is to provide a representative summary of the essential functions that will be required of the position and should not be construed as a declaration of specific duties and responsibilities of the particular position. Employees will be assigned specific job-related duties through their hiring departments.



