SLMs are WAY too big
Автор: Machine & Deep Learning Israel
Загружено: 2026-01-26
Просмотров: 46
Описание:
הרצאה זו היא חלק מכנס GenML 2025 של קהילת MDLI.
אתם יכולים לצפות בשאר ההרצאות ובמצגות פה: https://mdli.co.il/en25.
In this talk we will showcase SLMs and their different architecture.
We will start by analyzing the current market state of SLMs vs LLMs, what triggers companies to make the transition and what needs to be considered when doing so. We will show how to train SLMs, discuss current challenges, and explain how knowledge distillation can be used to overcome them. Then, we will examine the different SLMs architectures, encoder-only (bi-directional) vs decoder-only (autoregressive) models, and demonstrate on what use cases encoders prevail.
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: