The Future is Sparse: Embedding Compression for Scalable Retrieval in Recommender Systems

Автор: ACM RecSys

Загружено: 2025-10-01

Просмотров: 200

Описание: The speaker motivates embedding compression challenges as embedding tables grow with entity cardinality and dimensionality. They survey alternatives such as low-dimensional approximations, quantization, and offloading, noting quality, portability, and latency issues. CompressSAE uses a linear encoder with top-k activation and a bias-free linear decoder with row normalization to produce sparse embeddings that preserve cosine similarity. Retrieval can use standard sparse dot product or a kernel trick to simulate original-space similarity. Training requires only existing embeddings, not raw data, and converges quickly. Experiments show stronger accuracy–compression trade-offs than Matryoshka, faster inference, and competitive online results. Sparsity is controlled via the hyperparameter k.

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

The Future is Sparse: Embedding Compression for Scalable Retrieval in Recommender Systems

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео