local LLM inferencing on a smartphone with NO Internet.
Author: Yitz
Uploaded: 2025-02-18
Views: 71
Description:
Local LLM inferencing on a smartphone with NO Internet, using ONNX Runtime and transformers.js with the Qwen 2.5 0.5B model from Hugging Face, via the https://ailocalhost.com chat UI.
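For reference, here's a minimal sketch of the kind of transformers.js setup described above, assuming the v3 package name "@huggingface/transformers" and the "onnx-community/Qwen2.5-0.5B-Instruct" ONNX export on Hugging Face (the actual app's model ID and options may differ):

```js
// Sketch: in-browser text generation with transformers.js on the WASM (CPU) backend.
import { pipeline } from "@huggingface/transformers";

// device: "wasm" forces the ONNX Runtime WebAssembly backend; dtype: "q4"
// picks a quantized export so the download stays small on a phone.
const generator = await pipeline(
  "text-generation",
  "onnx-community/Qwen2.5-0.5B-Instruct",   // assumed model ID
  { device: "wasm", dtype: "q4" }
);

// Chat-style prompt; the pipeline applies the model's chat template.
const messages = [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Say hello from a phone with no Internet." },
];

const output = await generator(messages, {
  max_new_tokens: 128,
  temperature: 0.7,
  top_p: 0.9,
  do_sample: true,
});

// With chat-style input, generated_text is the full message list,
// so the last entry is the assistant reply.
console.log(output[0].generated_text.at(-1).content);
```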
I could go on about how it's not working properly... I had it working earlier with WebGPU and streaming responses, and more models would load back then, but since I'm hardly a software developer I didn't keep versions saved, so all of that was lost and I'm back here. So right now only WASM (CPU) works, responses don't stream, many models that loaded earlier won't load anymore (I could paste a pile of errors too), and none of the transformers parameters are being passed properly to the pipeline calls. lol 😂 Well, well... it works well enough for a demo!
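For comparison, a hedged sketch of what the earlier WebGPU + streaming setup might have looked like. It assumes the TextStreamer helper from transformers.js v3; appendToChat is a hypothetical UI hook, not a library function:

```js
// Sketch: WebGPU backend with token streaming (assumptions noted above).
import { pipeline, TextStreamer } from "@huggingface/transformers";

const generator = await pipeline(
  "text-generation",
  "onnx-community/Qwen2.5-0.5B-Instruct",   // assumed model ID
  { device: "webgpu", dtype: "q4f16" }       // GPU backend instead of wasm
);

// Push tokens to the UI as they are produced instead of waiting for the full reply.
const streamer = new TextStreamer(generator.tokenizer, {
  skip_prompt: true,
  callback_function: (text) => appendToChat(text),  // hypothetical UI hook
});

await generator(
  [{ role: "user", content: "Stream a short greeting." }],
  { max_new_tokens: 64, temperature: 0.7, top_p: 0.9, streamer }
);
```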
The server routing, model selection, max tokens, temperature, top-p, and presence penalty are all working... so it's good for testing, and for showing off!
Plus all the other features work fine, so you can still use it with Ollama and OpenAI API servers with everything else working 💯 💯 💯 💯 (a rough sketch of that kind of request is below).
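Here's a rough sketch of pointing the same chat UI at an OpenAI-compatible server instead of in-browser inference, with the sampling parameters mentioned above. The LAN address and model tag are placeholders; Ollama exposes an OpenAI-compatible endpoint on port 11434 by default:

```js
// Sketch: OpenAI-style chat completion request to a local Ollama server.
const response = await fetch("http://192.168.1.50:11434/v1/chat/completions", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    model: "qwen2.5:0.5b",                  // placeholder model tag
    messages: [{ role: "user", content: "Hello from the phone!" }],
    max_tokens: 256,
    temperature: 0.7,
    top_p: 0.9,
    presence_penalty: 0,
  }),
});

const data = await response.json();
console.log(data.choices[0].message.content);
```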
ITQIX Technology Group