So you think you know Text to Video Diffusion models?

Автор: Neural Breakdown with AVB

Загружено: 2024-09-14

Просмотров: 8135

Описание: Video Diffusion Generative AI is the next frontier for AI. In this video we discuss the problem, the challenges, the solutions, and the seminal papers in the field like Google's Imagen, Meta's Make-a-video, Nvidia's Video Latent Diffusion Model (LDM), and OpenAI's SORA. On the way, we discuss the core concepts of Image Diffusion models, like Forward and Reverse Diffusion, UNet, convolution, and diffusion transformers. This video is meant to be a quick overview of all the major concepts in the field - hope you guys and gals found it useful for deeper dives.

Buy me a coffee at https://ko-fi.com/neuralavb !
Support us on Patreon to access slides and video material!
patreon.com/NeuralBreakdownwithAVB

Related videos:
What are Conditional Image Diffusion Models?
   • Text to Image Diffusion AI Model from scra...

What is Latent Space?
   • Visualizing the Latent Space: This video w...

How do LLMs generate images? (The answer is not diffusion)
   • If LLMs are text models, how do they gener...

Transformers and Attention Playlist
   • Everything Language Processing and LLMs!

Visit our Patreon for full access to code and other documents/animations:
  / neuralbreakdownwithavb

#generativeai #deeplearning #ai

Useful papers:
Video Diffusion Models: https://arxiv.org/abs/2204.03458
Imagen: https://imagen.research.google/video/
Make A Video: https://makeavideo.studio/
Video LDM: https://research.nvidia.com/labs/toro...
CogVideoX: https://arxiv.org/abs/2408.06072
OpenAI SORA article: https://openai.com/index/sora/
Useful article: https://lilianweng.github.io/posts/20...
Survey Papers: https://arxiv.org/abs/2310.10647 and https://arxiv.org/abs/2405.03150

Timestamps:
0:00 - Intro
0:39 - Text to Image Conditional Diffusion Models
2:16 - Challenges with Video Diffusion Models
3:43 - VDM (2022)
4:50 - Factorized 3D Unet models
5:46 - Meta Make A Video
7:28 - Google Imagen Video
8:07 - Nvidia Video LDM
9:36 - OpenAI SORA

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

So you think you know Text to Video Diffusion models?

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

But how do AI images and videos actually work? | Guest video by Welch Labs

But how do AI images and videos actually work? | Guest video by Welch Labs

Text to Image Diffusion AI Model from scratch - Explained one line of code at a time!

Text to Image Diffusion AI Model from scratch - Explained one line of code at a time!

Diffusion Language Models vs Autoregressive Language Models

Diffusion Language Models vs Autoregressive Language Models

Flow-Matching vs Diffusion Models explained side by side

Flow-Matching vs Diffusion Models explained side by side

Почему «Трансформеры» заменяют CNN?

Почему «Трансформеры» заменяют CNN?

Diffusion Language Models: The Next Big Shift in GenAI

Diffusion Language Models: The Next Big Shift in GenAI

Что НА САМОМ ДЕЛЕ скрывается внутри ИИ? Главная причина успеха нейросетей...

Что НА САМОМ ДЕЛЕ скрывается внутри ИИ? Главная причина успеха нейросетей...

Почему диффузия работает лучше, чем авторегрессия?

Почему диффузия работает лучше, чем авторегрессия?

Рекурсивные языковые модели (РЛМ) — давайте создадим самых крутых агентов! (Теория и код)

Рекурсивные языковые модели (РЛМ) — давайте создадим самых крутых агентов! (Теория и код)

Writing Mixture of Experts LLMs from Scratch in PyTorch

Writing Mixture of Experts LLMs from Scratch in PyTorch

Diffusion Models (DDPM & DDIM) - Easily explained!

Diffusion Models (DDPM & DDIM) - Easily explained!

Лучший документальный фильм про создание ИИ

Лучший документальный фильм про создание ИИ

Прорыв в создании современных генераторов изображений на основе ИИ | Модели диффузии, часть 1

Прорыв в создании современных генераторов изображений на основе ИИ | Модели диффузии, часть 1

How AI Image Generators Work (Stable Diffusion / Dall-E) - Computerphile

How AI Image Generators Work (Stable Diffusion / Dall-E) - Computerphile

Я разобрал всю ИИ-экосистему Google — 7 ключевых инструментов

Я разобрал всю ИИ-экосистему Google — 7 ключевых инструментов

Diffusion Models for AI Image Generation

Diffusion Models for AI Image Generation

The physics behind diffusion models

The physics behind diffusion models

Краткое объяснение больших языковых моделей

Краткое объяснение больших языковых моделей

Момент, когда мы перестали понимать ИИ [AlexNet]

Момент, когда мы перестали понимать ИИ [AlexNet]

Объяснение моделей преобразования текста в видео

Объяснение моделей преобразования текста в видео