Demo

Research Engineer Intern - Perception, Vision Language Models

PlusAI
Santa Clara, CA Intern
POSTED ON 3/22/2025
AVAILABLE BEFORE 5/22/2025

As a Research Engineer Intern – Vision-Language Models for E2E Autonomous Driving, you’ll explore the potential of vision-language models to enhance reasoning, scene understanding, and interpretability in end-to-end autonomous driving. You’ll have the opportunity to work towards a publication at a top tier venue by contributing to key areas of model development, including curating both real-world and synthetic training data, fine-tuning foundational vision-language models, and designing robust evaluation frameworks.

\n


Responsibilities:
  • Lead model development efforts using vision-language models for end-to-end autonomous driving systems
  • Curate high-quality training datasets from both real-world trips and synthetic sources
  • Optimize model architectures and fine-tune pre-trained foundational models to enhance performance and adapt to specific challenges
  • Design and implement evaluation frameworks to rigorously assess model performance in real-world driving environments


Required Skills:
  • Pursuing MS or PhD in CS, EE, mathematics, statistics or related field
  • Thorough understanding of deep learning principles and familiarity with vision language models
  • 2-3 years experience with implementing and training deep learning models in at least one deep learning framework (PyTorch, Tensorflow, Jax)


Preferred Skills:
  • Past experiences in projects involving design, training or fine-tuning of vision language models and familiarity with knowledge distillation, quantization, vLLM
  • Past experiences in deep learning projects related to autonomous driving 
  • Publication record in relevant venues (CVPR, ICLR, ICCV, ECCV, NeurIPS, AAAI, SIGGRAPH)


\n
$19 - $65 an hour
Our internship hourly rates are a standard pay determined based on the position and your location, year in school, degree, and experience.
\n

Salary : $19 - $65

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Research Engineer Intern - Perception, Vision Language Models?

Sign up to receive alerts about other jobs on the Research Engineer Intern - Perception, Vision Language Models career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$98,763 - $126,233
Income Estimation: 
$116,330 - $143,011
Income Estimation: 
$113,077 - $147,784
Income Estimation: 
$98,763 - $126,233
Income Estimation: 
$116,330 - $143,011
Income Estimation: 
$113,077 - $147,784
Income Estimation: 
$56,489 - $71,327
Income Estimation: 
$70,310 - $88,223
Income Estimation: 
$66,679 - $90,237
Income Estimation: 
$70,310 - $88,223
Income Estimation: 
$88,950 - $110,401
Income Estimation: 
$84,958 - $111,603
Income Estimation: 
$113,077 - $147,784
Income Estimation: 
$135,356 - $164,911
Income Estimation: 
$153,902 - $198,246
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at PlusAI

PlusAI
Hired Organization Address Santa Clara, CA Full Time
We are seeking a Senior Machine Learning Engineer with expertise in deep learning, data analysis, and vehicle dynamics m...
PlusAI
Hired Organization Address Santa Clara, CA Full Time
Plus is a global provider of highly automated driving and fully autonomous driving solutions with headquarters in Silico...
PlusAI
Hired Organization Address Munich, ND Intern
Plus is a global provider of highly automated driving and fully autonomous driving solutions with headquarters in Silico...
PlusAI
Hired Organization Address Santa Clara, CA Full Time
Plus is a global provider of highly automated driving and fully autonomous driving solutions with headquarters in Silico...

Not the job you're looking for? Here are some other Research Engineer Intern - Perception, Vision Language Models jobs in the Santa Clara, CA area that may be a better fit.

AI Assistant is available now!

Feel free to start your new journey!