Santa Clara, CA, United States of America
8 hours ago
Senior Research Scientist, Multi-Modal Language Models

We're now looking for a Senior Research Scientist, Multi-Modal Language Models!

NVIDIA is seeking a Senior Research Scientist passionate about multi modal language models. Our team drives Nemotron Multi-modal technology and with your help, we will continue to drive our models to be state of the art open-source multi-modal models. We have a unique perspective in that we strive for open models, open weights, open data. We want to deliver models that work amazingly well in the real world right out of the box, and we also want to uplift the whole ecosystem of users of multi-modal LLMs.

What you'll be doing:

Driving new abilities into the model

Improving generalization of existing functionalities by understanding weak points, designing a data synthesiis solution, and retraining models

Developing recipes for training models that mix multiple modalities together, such as text, image, video, audio, etc …

Design solutions that improve pareto efficiency

Collaborating with researchers to translate cutting-edge ideas into production-ready implementations.

Exploring new paradigms for evaluation.

Demonstrating strong engineering practices, and contributing to open-source communities.

What we need to see:

PhD in Computer science, Electrical Engineering, or related field, or equivalent research experience in LLMs, systems, or related areas.

4+ years of experiences in computer vision, especially multi-modal LLMs.

Proficiency in Python with hands-on experience in frameworks such as PyTorch.

Solid background in computer science fundamentals: algorithms, data structures, parallel/distributed computing, and systems programming.

Proven ability to collaborate across research and engineering teams in multifaceted environments.

Ways to stand out from the crowd:

Specific multi-modal LLM research experience

Experience developing and scaling large distributed systems for deep learning.

Contributions to open-source LLM systems or large-scale AI infrastructure

Widely considered to be one of the technology world’s most desirable employers, NVIDIA has some of the most forward-thinking and hardworking people in the world inventing the future for us. Are you a creative and collaborative researcher interested in seeking new challenges? If so, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 192,000 USD - 304,750 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until February 8, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Confirm your E-mail: Send Email