Open-Source Spotlight - Titan Takeoff - Fergus Finn
Автор: DataTalksClub ⬛
Загружено: 2023-08-30
Просмотров: 456
Описание:
Titan Takeoff: This is a server designed for optimized inference of large language models.
00:00 Intro and a few words about Fergus
00:12 Titanml
00:26 The Takeoff Server
00:55 Demo: using The Takeoff Server to quickly and easily deploy large language models locally
04:00 Examples of interacting with a large language model using The Takeoff Server
06:16 Optimisations: getting around needing a big GPU - optimising through quantisation, how it works and what it means for memory usage
07:04 Running a model on a 4GB laptop with 4GB of GPU and 16 GB RAM
07:55 User interfaces of The Takeoff Server: chat and playground
11:00 Running The Takeoff server locally with a CPU
12:13 - Helm charts, Kubernetes deployment, resources
14:34 Models that are supported by The Takeoff Server
17:11 List of things contributors can help with - open source community project & how to get involved
18:24 How to find the Discord channel
19:00 How to give a star on GitHub
19:46 Future plans - generating structured content from large language, more optimisations for more models, ongoing improvements to the ergonomics
23:10 Advice for people interested in this area
Links:
GitHub repository: github.com/titanml/takeoff
GitHub community: / discord
Document: https://docs.titanml.co/blog
Free MLOps course: https://github.com/DataTalksClub/mlop...
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
Повторяем попытку...

Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: