Spaces:
Running
Running
File size: 1,503 Bytes
ac63855 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 | ---
title: VibeThinker WebGPU Train + Hot-Swap LoRA
emoji: 🔥
colorFrom: red
colorTo: yellow
sdk: static
app_file: index.html
pinned: false
license: apache-2.0
models:
- WeiboAI/VibeThinker-3B
short_description: In-browser WebGPU LoRA training + hot-swap (VibeThinker-3B)
---
# VibeThinker-3B · in-browser WebGPU training + hot-swap LoRA
A static Space that runs **VibeThinker-3B** entirely in your browser tab on a
custom **WebGPU int4 runtime** — no server, no upload, nothing leaves the page.
What it shows off:
- **WebGPU kernels** — int4 GEMV/GEMM, fused RMSNorm+QKV+RoPE, paged attention,
f32 accumulation in f16 paths. The whole forward + backward stack is WGSL.
- **In-browser training** — fine-tune a LoRA adapter in the tab with a *real*
backward pass + AdamW over the frozen int4 base (gradient checkpointing,
f32 master weights). The guided demo teaches made-up facts in ~30s.
- **Hot-swap LoRA** — every trained adapter is saved locally (IndexedDB) and
hot-swaps into inference instantly; export to `.safetensors` or re-load later.
The model weights are streamed once from
[`WeiboAI/VibeThinker-3B`](https://huggingface.co/WeiboAI/VibeThinker-3B) on the
Hub and cached in your browser. A Space does **not** have to be tied to a model;
the model above is linked for discovery only.
> Requires a WebGPU-capable browser (recent Chrome/Edge; Safari Technology
> Preview). ~6 GB weights, one-time download.
Source: <https://github.com/maceip/vibethinker-webgpu-lora>
|