Applied Deep Learning – Class 43 | Self Attention Mathematical Formula
Author: gened
Uploaded: 2026-02-19
Views: 2
Description:
In this session of Applied Deep Learning, we explore the mathematical formula of self-attention as presented in the “Attention Is All You Need” paper.
This lecture is theory-only and focuses on deriving and understanding the core equations that make self-attention work in transformer models.
📚 In this lecture, we cover:
🔹 The Self-Attention Equation
We break down the fundamental formula from the paper:
Attention(Q, K, V) = softmax((Q · Kᵀ) / √dₖ) · V
…and explain what each term means, why the scaling factor √dₖ matters, and how softmax transforms similarity scores into attention weights.
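To make the equation concrete, here is a minimal NumPy sketch of scaled dot-product attention. This is not the lecture notebook's code; the shapes and variable names are illustrative assumptions.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Compute Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V.

    Q: (n_queries, d_k), K: (n_keys, d_k), V: (n_keys, d_v)
    """
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # similarity of each query with each key
    scores -= scores.max(axis=-1, keepdims=True)     # numerical stability for softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax: each row sums to 1
    return weights @ V                               # weighted sum of value vectors

# Toy usage: 4 tokens, d_k = d_v = 8; in self-attention Q, K and V come from the same input.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
out = scaled_dot_product_attention(X, X, X)
print(out.shape)   # (4, 8): one contextualized vector per token
```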
🔹 Why This Formula Works
Learn how:
✔ Queries compare with keys to produce relevance scores
✔ Scaling by √dₖ keeps dot products from growing too large and pushing softmax into regions with vanishing gradients (see the sketch after this list)
✔ Softmax transforms scores into probabilities
✔ Weighted values produce contextualized outputs
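A quick numerical illustration of the scaling point above (a sketch with made-up dimensions, not from the lecture): for random vectors, unscaled dot products grow in magnitude with dₖ, and softmax over such large scores collapses onto a single position, where its gradients become nearly zero. Dividing by √dₖ keeps the scores in a useful range.

```python
import numpy as np

rng = np.random.default_rng(1)
softmax = lambda s: np.exp(s - s.max()) / np.exp(s - s.max()).sum()

for d_k in (4, 64, 512):
    q = rng.normal(size=d_k)
    K = rng.normal(size=(5, d_k))        # 5 keys
    raw = K @ q                          # unscaled dot products, variance grows with d_k
    scaled = raw / np.sqrt(d_k)          # scaled as in the paper
    print(d_k, softmax(raw).max().round(3), softmax(scaled).max().round(3))

# For large d_k the unscaled softmax puts almost all weight on one key (max -> 1.0),
# while the scaled version stays spread out, so gradients remain informative.
```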
🔹 Intuition Behind Each Step
Rather than just memorizing equations, we explain the meaning behind them — how words in a sentence attend to each other, how attention weights are computed, and how output vectors are formed.
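As a small illustration of that intuition (toy numbers, not from the lecture), the attention weight matrix for a tiny three-token "sentence" shows how each token distributes its attention over the others, and each output row is the corresponding weighted mix of value vectors:

```python
import numpy as np

tokens = ["the", "cat", "sat"]
rng = np.random.default_rng(2)
X = rng.normal(size=(3, 4))              # pretend 4-dim embeddings for the 3 tokens
d_k = X.shape[-1]

scores = X @ X.T / np.sqrt(d_k)          # every token scored against every token
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)   # each row is a probability distribution
outputs = weights @ X                    # contextualized vector per token

for i, tok in enumerate(tokens):
    print(f"{tok:>4} attends with weights {np.round(weights[i], 2)}")
```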
🔹 Connection to Transformers
This formula is the centerpiece of:
✔ Self-Attention
✔ Scaled Dot-Product Attention
✔ The entire Transformer architecture
This session gives you the mathematical grounding necessary before moving to Multi-Head Attention and full Transformer implementation.
📂 Notebook Link:
https://github.com/GenEd-Tech/Applied...
👍 Like, Share & Subscribe for more AI, Deep Learning & NLP content
💬 Comment if you want the next session on Multi-Head Attention
#DeepLearning #SelfAttention #MathOfAttention #Transformer #NLP #MachineLearning #AI #AppliedDeepLearning