
Research Engineer, Media Understanding

DeepMind
Mountain View, CA Full Time
POSTED ON 1/28/2025
AVAILABLE BEFORE 3/28/2025

Research Engineer, Media Understanding - Multimodal Representation Models

Mountain View, CA

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

The Role

As part of the multimodal features team in Media Understanding at Google DeepMind, you will have the opportunity to advance state-of-the-art research in embedding/representation models in the context of large language models. You'll be at the forefront of developing models that power Google products used by billions of people worldwide. Your work will directly impact how these products understand and interact with diverse media, including text, images, audio, and video. This is a unique opportunity to shape the future of multimodal AI and its applications in a dynamic and impactful environment.

We are a team of research/software engineers, research scientists, and machine learning experts working together to enable superhuman understanding of the visual world. We aim to train the most powerful omnimodal embedding model, which can be used for retrieval and other agentic use cases in Google products.

You'll be developing the next SOTA models for multimodal understanding. Your work will include researching new modeling techniques, implementing research ideas, running experiments to evaluate improvements, and identifying new opportunities.

Key Responsibilities

As a member of the Media Understanding team, you will be responsible for conducting core and applied research in computer vision and language understanding to support a multitude of Google products and use cases. Your job responsibilities will include:

  • Conducting core research in the areas of computer vision, language understanding, multimodal models, large-scale AI models and other key computer vision tasks.
  • Training and evaluating AI models for a variety of product use cases.
  • Researching, implementing, and adapting state-of-the-art deep learning approaches for Google’s use cases.
  • Collaborating closely with other GDM and partner teams to make progress towards building the most advanced embedding models.

About You

We are an applied research team that takes on challenging real-world problems and thrives on finding solutions in the presence of ambiguity. In order to set you up for success as a Research Engineer/Scientist at Google DeepMind, we look for the following skills and experience:

  • Ph.D. in Computer Science or related quantitative field, or B.S./M.S. in Computer Science or related quantitative field with 5 years of relevant experience.
  • Innovate and assess new machine learning models and techniques for pilot projects, quickly demonstrating viability and potential impact. Transform successful prototypes into scalable solutions for wider integration within Google's products.
  • Conduct research to identify and address impactful problems inspired by current and future real-world needs. Investigate and develop novel solutions by studying related work, conducting experiments, and constructing prototypes and demonstrations.
  • Collaborate with product teams to drive the implementation of research insights, fostering innovation and the development of new products.

In addition, the following would be an advantage:

  • Strong research experience and publication record in top-tier conferences.
  • Experience with core software engineering and applied implementations of AI.
  • A good team player with a demonstrated ability to work across teams, since image-text work involves collaborating with both research and product teams.
  • Hands-on experience with Google-scale infrastructure would be a plus, e.g. large-scale data mining from various Google data stores, automation pipelines, and client deployment across PAs.

The US base salary range for this full-time position is $215,000 - $250,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Application deadline: Friday, February 28th 2025

Note: In the event your application is successful and an offer of employment is made to you, any offer of employment will be conditional on the results of a background check, performed by a third party acting on our behalf. For more information on how we handle your data, please see our Applicant and Candidate Privacy Policy.




