I ran 80B model on 16GB GPU - It's surprisingly good! (Qwen 3 Coder Next Review)
Author: Red Stapler
Uploaded: 2026-02-25
Views: 18999
Description:
Can you actually run an 80B parameter AI model on a 16GB GPU? In this video, we push the RTX 5060 Ti to its absolute limits by running Qwen 3 Coder Next 80B A3B entirely locally! We’ll test its coding capabilities, compare it head-to-head with Gemini 3.1 Pro, and see if this quantized model is practical for everyday local AI development.
We use Unsloth's 3-bit iMatrix quantization, tune a 50k context length, and manage VRAM to fit this SOTA model onto a consumer graphics card. Watch as we put Qwen 3 Coder Next to the test with Three.js particles, complex web design layouts, and a Python game to see where it shines and where it breaks.
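For a rough sense of why quantization and VRAM management matter here, the sketch below estimates the nominal size of the weights at a few bit widths. It is back-of-the-envelope arithmetic only: the function name is hypothetical, real 3-bit quantized files mix bit widths per tensor so actual sizes differ, and the estimate ignores the KV cache and runtime overhead. It assumes 80B total parameters and a 16 GB card, as stated in the description.

```python
def weight_footprint_gb(n_params: float, bits_per_weight: float) -> float:
    """Nominal weight size in decimal gigabytes (1 GB = 1e9 bytes).

    Hypothetical helper for illustration; assumes every weight uses the
    same number of bits, which real quantized files do not.
    """
    return n_params * bits_per_weight / 8 / 1e9


TOTAL_PARAMS = 80e9  # 80B total parameters, per the description
VRAM_GB = 16         # 16 GB GPU, per the description

for bits in (16, 8, 4, 3):
    size = weight_footprint_gb(TOTAL_PARAMS, bits)
    verdict = "fits in" if size <= VRAM_GB else "exceeds"
    print(f"{bits:>2}-bit: ~{size:.0f} GB of weights ({verdict} {VRAM_GB} GB VRAM)")
```

Even at a uniform 3 bits per weight, the nominal figure is well above 16 GB, which suggests the weights cannot all sit in VRAM at once. How the video handles that is not stated in this description.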
📌 Timestamps:
0:00 - Intro: Pushing the 16GB GPU to the Limit
0:21 - Qwen 3 Coder Next (80B) Specs & Details
0:47 - The Portable AI PC Setup
1:50 - Test 1: 3D Audio Visualizer Web App (Three.js)
2:37 - Test 2: UI Design Prompt vs. Gemini 3.1 Pro
3:31 - Test 3: Complex Web Layouts & Limitations
4:48 - Final Test: Python Space Shooter Arcade Game
5:18 - Final Verdict & Conclusion
** My Portable AI PC Setup **
CPU: https://amzn.to/3Lx52Vv
GPU: https://amzn.to/3YWMPDS
RAM: https://amzn.to/49l7vdD
Board: https://amzn.to/4poMEfp
PSU: https://amzn.to/4po6xU0
🔗 Links & Resources:
Qwen 3 Coder Next: https://huggingface.co/unsloth/Qwen3-...
Follow Red Stapler on X: https://x.com/redStapler_twit
#LocalAI #Qwen #LLM #Coding #AI