ycliper

Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
Скачать

1-Bit LLM: The Most Efficient LLM Possible?

Автор: bycloud

Загружено: 2025-06-18

Просмотров: 117642

Описание: Download Tanka today https://www.tanka.ai and enjoy 3 months of free Premium!
You can also get $20 / team for each referrals

I've been planning for a bitnet video for the longest time, and with the release of bitnet b1.58 2B4T gave me the perfect chance to brief you on the history of 1-bit LLM! Fun fact, the major bitnet research is mostly done by the same researchers.

My Newsletter
https://mail.bycloud.ai/

my project: find, discover & explain AI research semantically
https://findmypapers.ai/

My Patreon
  / bycloud  


Quantifying the Capabilities of LLMs across Scale and Precision
[Paper] https://arxiv.org/abs/2405.03146v2

BitNet: Scaling 1-bit Transformers for Large Language Models
[Paper] https://arxiv.org/abs/2310.11453v1

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
[Paper] https://arxiv.org/abs/2402.17764v1

BitNet a4.8: 4-bit Activations for 1-bit LLMs
[Paper] https://arxiv.org/abs/2411.04965v1

Efficient Construction of Model Family through Progressive Training Using Model Expansion
[Paper] https://arxiv.org/abs/2504.00623v1

BitNet b1.58 2B4T Technical Report
[Paper] https://arxiv.org/abs/2504.12285
[Web Demo] https://bitnet-demo.azurewebsites.net/
[HuggingFace] https://huggingface.co/microsoft/bitn...
[Code] https://github.com/microsoft/BitNet

[Additional Recs]
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
https://arxiv.org/abs/2407.00088v2

FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
https://arxiv.org/abs/2407.07093v1

Matmul or No Matmul in the Era of 1-bit LLMs
https://arxiv.org/abs/2408.11939v2

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs
https://arxiv.org/abs/2410.16144v2

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
https://arxiv.org/abs/2502.11880v1

Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
https://arxiv.org/abs/2502.11895v1

(NEW!) BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs
https://arxiv.org/abs/2504.18415

(NEW!) BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
https://arxiv.org/abs/2506.07530


Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI

This video is supported by the kind Patrons & YouTube Members:
🙏Nous Research, Chris LeDoux, Ben Shaener, DX Research Group, Poof N' Inu, Andrew Lescelius, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa


[Discord]   / discord  
[Twitter]   / bycloudai  
[Patreon]   / bycloud  
[Business Inquiries] [email protected]
[Profile & Banner Art]   / pygm7  
[Video Editor] Abhay
[Ko-fi] https://ko-fi.com/bycloudai

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...
1-Bit LLM: The Most Efficient LLM Possible?

Поделиться в:

Доступные форматы для скачивания:

Скачать видео

  • Информация по загрузке:

Скачать аудио

Похожие видео

Andrej Karpathy: Software Is Changing (Again)

Andrej Karpathy: Software Is Changing (Again)

The Deceptively Simple Math Problem No One Can Solve

The Deceptively Simple Math Problem No One Can Solve

How do Transistors Work?  How are Transistors Assembled Inside a CPU?

How do Transistors Work? How are Transistors Assembled Inside a CPU?

Nvidia, You’re Late. World’s First 128GB LLM Mini Is Here!

Nvidia, You’re Late. World’s First 128GB LLM Mini Is Here!

But what is quantum computing?  (Grover's Algorithm)

But what is quantum computing? (Grover's Algorithm)

AI prompts are driving me ✨insane✨

AI prompts are driving me ✨insane✨

Можно ли поменять родину так быстро? / вДудь

Можно ли поменять родину так быстро? / вДудь

The Invention That Saved Science

The Invention That Saved Science

The Android Tablet Problem

The Android Tablet Problem

The Most Misunderstood Concept in Physics

The Most Misunderstood Concept in Physics

© 2025 ycliper. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]