xiaomoguhzz
/

VisionEncoder

@@ -32,7 +32,7 @@ The repo is organized into three top-level folders.
 | `ckpts/4b_stock` | 4B stock baseline (raw Qwen3.5 ViT, skips declip), checkpoint-505, 9.5G |
 | `ckpts/4b_v9_1` | 4B V9.1 (V-JEPA 2.1 video self-distill), checkpoint-505, 9.5G |
-Download either and feed it straight to evaluation (see the GitHub README, step 7) to skip declip + S1 + S2.
 ## `legacy/` — historical assets (~368G)

 | `ckpts/4b_stock` | 4B stock baseline (raw Qwen3.5 ViT, skips declip), checkpoint-505, 9.5G |
 | `ckpts/4b_v9_1` | 4B V9.1 (V-JEPA 2.1 video self-distill), checkpoint-505, 9.5G |
+Download either and feed it straight to evaluation (see the GitHub README, section 4 — MLLM evaluation) to skip declip + S1 + S2.
 ## `legacy/` — historical assets (~368G)