NVIDIA Nemotron Nano 2 VL First Look & Testing - An Open Source Multimodal Model
Author: Bijan Bowen
Uploaded: 2025-10-28
Views: 8549
Description:
Timestamps:
00:00 - Intro
00:46 - Technical Look
02:59 - Video Understanding Test
04:08 - PDF Understanding Test
06:43 - Multi-Image Testing
08:35 - Image Location Test
09:14 - Channel Review Test
10:37 - Image To Website Test
11:48 - PDF Summary Test
13:07 - Image Caption Testing
14:12 - Creative Writing Test
16:35 - Invoice Analysis Test
17:31 - Closing Thoughts
HF Link: https://nvda.ws/4qqJ6uN
In this video, we take a look at the new Nemotron Nano 2 VL model from NVIDIA. This 12B hybrid Transformer-Mamba vision-language model delivers strong performance across OCR, image, and video understanding tasks. We begin with a technical overview of the model, then run detailed multimodal tests across a range of scenarios to evaluate its real-world capabilities.
AI Integration & Consulting: https://bijanbowen.com
Join the Discord: / discord