What are Transformer Models and how do they work?
Author: Serrano.Academy
Uploaded: 2023-11-02
Views: 154418
Description:
Check out the latest (and most visual) video on this topic! The Celestial Mechanics of Attention Mechanisms: • Keys, Queries, and Values: The celestial m...
This is the last of a series of 3 videos where we demystify Transformer models and explain them with visuals and friendly examples.
Video 1: The attention mechanism at a high level • The Attention Mechanism in Large Language ...
Video 2: The attention mechanism with math • The math behind Attention: Keys, Queries, ...
Video 3 (This one): Transformer models
If you like this material, check out LLM University from Cohere!
https://llm.university
Get the Grokking Machine Learning book!
https://manning.com/books/grokking-ma...
Discount code (40%): serranoyt
(Use the discount code at checkout)
00:00 Introduction
01:50 What is a transformer?
04:35 Generating one word at a time
08:59 Sentiment Analysis
13:05 Neural Networks
18:18 Tokenization
19:12 Embeddings
25:06 Positional encoding
27:54 Attention
32:29 Softmax
35:48 Architecture of a Transformer
39:00 Fine-tuning
42:20 Conclusion