Игорь Котенков - RLHF Intro: from Zero to Aligned Intelligent Systems
Автор: ODS AI Ru
Загружено: 2023-06-09
Просмотров: 5254
Описание:
A story about Text Summarization
What the Alignment is, and what's the problem?
How RLHF works
Data setup, and why we'd like to follow instructions
Reward Modeling and PPO
Why RLHF works (and when it doesn't)
ChatGPT improvements
What's next and what to expect?
Data Fest 2023:
https://ods.ai/events/datafestonline2023
Трек "Instruct Models":
https://ods.ai/tracks/df23-instruct-m...
Наши соц.сети:
Telegram: https://t.me/datafest
Вконтакте: https://vk.com/datafest
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: