SAM - Segment Anything model for promptable pixel segmentation

Автор: Mak Gaiduk

Загружено: 2024-10-15

Просмотров: 1140

Описание: This video talks about SAM - an attempt to build a foundational segmentation model by Facebook researchers.
Instead of classical segmentation problems like Semantic Segmentation or Instance Segmentation, which rely on existing set of labels, SAM deals with "Promptable Segmentation" - given a bounding box or a point "prompt", return segmentation for the object to which that point belongs, no matter what that object might be.
This unusual approach allowed creating of a huge, diverse dataset for segmentation and training a good model. On the other hand, SAM model cannot solve other segmentation tasks (like instance segmentation) on its own, but it can do that with an extra model to generate prompts, like object detector for example.
The video also dives deep into the architecture of the model - prompt encoding, Decoder with query-to-image cross attention, upscaling with Transposed Convolution, maks segmentation and Dice and IoU loss.
Important links:
Original paper: https://arxiv.org/pdf/2304.02643
DETR paper: https://arxiv.org/pdf/2005.12872
MaskFormer paper: https://arxiv.org/pdf/2107.06278
Tesla Autopilot video with 3d depth estimation: • FULL Andrej Karpathy Tesla Autonomous Driv...
Tesla Optimus video with occupancy detection: https://www.youtube.com/live/ODSJsviD...

00:00 - Intro
01:17 - Segmentation Task
07:04 - Promptable Segmentation
08:58 - Data Engine
14:34 - Model Architecture
15:45 - Prompt Encoding
18:02 - Decoder, Cross Attention
28:04 - Loss
33:08 - Transposed Convolution
37:02 - Results

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

SAM - Segment Anything model for promptable pixel segmentation

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

DETR for panoptic segmentation - algorithm and code reading

DETR for panoptic segmentation - algorithm and code reading

Segment Anything - Model explanation with code

Segment Anything - Model explanation with code

356 final review-part 2

356 final review-part 2

Explaining the Segment Anything Model - Network architecture, Dataset, Training

Explaining the Segment Anything Model - Network architecture, Dataset, Training

01. Distributed training parallelism methods. Data and Model parallelism

01. Distributed training parallelism methods. Data and Model parallelism

NotebookLM тихо обновился. Как делать Инфографику, Презентации, Видеопересказ.

NotebookLM тихо обновился. Как делать Инфографику, Презентации, Видеопересказ.

Объяснение сегментации всего на бумаге: новая модель фундамента от Meta AI впечатляет!

Объяснение сегментации всего на бумаге: новая модель фундамента от Meta AI впечатляет!

Segment Anything Model (SAM) - Foundational Model Deep Dive

Segment Anything Model (SAM) - Foundational Model Deep Dive

Информационная инфляция: конец эпохи знаний

Информационная инфляция: конец эпохи знаний

331 - Fine-tune Segment Anything Model (SAM) using custom data

331 - Fine-tune Segment Anything Model (SAM) using custom data

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

307 — Сегментируйте изображения в Python без обучения с помощью модели Segment Anything (SAM)

307 — Сегментируйте изображения в Python без обучения с помощью модели Segment Anything (SAM)

CoDETR - SOTA object detection with transformers

CoDETR - SOTA object detection with transformers

Segment Anything Model (SAM) from Meta AI: model architecture, data engine, results and limitations

Segment Anything Model (SAM) from Meta AI: model architecture, data engine, results and limitations

Reading Segment Anything source code

Reading Segment Anything source code

NLP Series III: Text Retrieval Embeddings, E5. Algorithm and code read.

NLP Series III: Text Retrieval Embeddings, E5. Algorithm and code read.

RTDETR v2 - Real Time Object Detection: Updates, algorithm and code reading

RTDETR v2 - Real Time Object Detection: Updates, algorithm and code reading

RT DETR - realtime object detection with transformers

RT DETR - realtime object detection with transformers

How does Segment Anything 2 (SAM 2) work? Paper and Network Architecture Explained!

How does Segment Anything 2 (SAM 2) work? Paper and Network Architecture Explained!

Как LLM могут хранить факты | Глава 7, Глубокое обучение

Как LLM могут хранить факты | Глава 7, Глубокое обучение