[MAI554] Transformers for Language Modeling | Transformer Block and Architecture
Author: Anis Koubaa
Uploaded: 2025-04-10
Views: 73
Description:
Transformers for Language Modeling: Transformer Block and Architecture | Prof. Anis Koubaa | MAI554 Deep Learning for Language Modeling Course
In this video, Prof. Anis Koubaa dives into the core concepts of the Transformer architecture for language modeling, exploring key components such as masked self-attention, multihead attention, and the crucial role of the feed-forward network (FFN) in the transformer block. 🧠✨
We also discuss how normalization and residual connections stabilize training and improve the model's performance, along with the mixture-of-experts technique, which improves efficiency by activating only a subset of the network for each token. 💡 (Minimal code sketches of these ideas follow below.)
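To make these components concrete, here is a minimal sketch of one decoder-style transformer block, assuming PyTorch; the names (TransformerBlock, d_model, n_heads, d_ff) and the pre-norm layout are illustrative choices, not necessarily the exact formulation used in the lecture.

```python
import torch
import torch.nn as nn

class TransformerBlock(nn.Module):
    """One pre-norm decoder block: masked self-attention + FFN, each wrapped
    in a residual connection with layer normalization."""

    def __init__(self, d_model: int = 512, n_heads: int = 8, d_ff: int = 2048):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        # Position-wise feed-forward network (FFN): expand, nonlinearity, project back.
        self.ffn = nn.Sequential(
            nn.Linear(d_model, d_ff),
            nn.GELU(),
            nn.Linear(d_ff, d_model),
        )
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Causal (masked) self-attention: position i may only attend to positions <= i.
        seq_len = x.size(1)
        causal_mask = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool, device=x.device), diagonal=1
        )
        h = self.norm1(x)
        attn_out, _ = self.attn(h, h, h, attn_mask=causal_mask)
        x = x + attn_out                 # residual connection around attention
        x = x + self.ffn(self.norm2(x))  # residual connection around the FFN
        return x
```

For example, `TransformerBlock()(torch.randn(2, 16, 512))` maps a (batch, seq_len, d_model) tensor to a tensor of the same shape, which is what allows these blocks to be stacked into a deep model.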
🔍 Topics Covered:
• Transformer Block Overview
• Masked Self-Attention
• Multihead Attention Mechanism
• The Role of Feed-Forward Networks
• Normalization & Residual Connections
• Mixture of Experts in Transformers (see the sketch after this list)
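For the last topic, here is a similarly minimal, hypothetical sketch of a mixture-of-experts FFN with top-1 routing; production systems (e.g., Switch Transformer) add load-balancing losses and expert-capacity limits that are omitted here, and all names are illustrative.

```python
import torch
import torch.nn as nn

class MoEFeedForward(nn.Module):
    """Mixture-of-experts FFN: a router picks one expert per token, so only a
    fraction of the parameters are active for any given token."""

    def __init__(self, d_model: int = 512, d_ff: int = 2048, n_experts: int = 4):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)  # gating network
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        ])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model). Softmax router scores, then top-1 routing.
        gate = self.router(x).softmax(dim=-1)   # (batch, seq_len, n_experts)
        weight, idx = gate.max(dim=-1)          # best expert (and its score) per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            routed = idx == e                   # boolean mask of tokens sent to expert e
            if routed.any():
                out[routed] = weight[routed].unsqueeze(-1) * expert(x[routed])
        return out
```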
This lecture is part of the MAI554 Deep Learning for Language Modeling course at Alfaisal University. Don't forget to like, share, and subscribe for more deep learning insights! 🎓💻
#DeepLearning #Transformers #LanguageModeling #AI #MachineLearning #MultiheadAttention #FFN #Normalization #ResidualConnections #MixtureOfExperts