Update model card.
Browse files
README.md
CHANGED
|
@@ -13,15 +13,16 @@ pipeline_tag: text-to-motion
|
|
| 13 |
# KV-Control (T-Concat v4 backbone)
|
| 14 |
|
| 15 |
Sparse-keyframe, multi-joint controllable text-to-motion generation. The
|
| 16 |
-
repository at [github.com/
|
| 17 |
contains the full training and inference code.
|
| 18 |
|
| 19 |
## What is here
|
| 20 |
|
| 21 |
| Path | Content | Size |
|
| 22 |
|---|---|---|
|
| 23 |
-
| `base_t_concat_v4/model/net_best_fid.tar` | Pre-trained T-Concat v4 masked-transformer base (the paper main backbone) | 168 MB |
|
| 24 |
-
| `kv_control/model/
|
|
|
|
| 25 |
| `vqvae/net_best_fid.pth` | Part-aware VQ-VAE tokenizer (128 codes × 6 parts) | 236 MB |
|
| 26 |
| `vqvae/skeleton_partition.json` | Skeleton partition for the part-aware VQ | 1 KB |
|
| 27 |
| `stats/{mean,std}.npy` | Normalization stats matching the released VQ | 4 KB |
|
|
@@ -35,13 +36,33 @@ contains the full training and inference code.
|
|
| 35 |
## How to use
|
| 36 |
|
| 37 |
```bash
|
| 38 |
-
git clone https://github.com/
|
| 39 |
cd KV-Control
|
| 40 |
bash scripts/download_checkpoints.sh # populates checkpoints/, aux/ → glove/, body_models/
|
| 41 |
```
|
| 42 |
|
| 43 |
Refer to the GitHub README for installation and quick-start commands.
|
| 44 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 45 |
## Licenses
|
| 46 |
|
| 47 |
* Our weights (`base_t_concat_v4`, `kv_control`, `vqvae`, `stats`) — **MIT**.
|
|
|
|
| 13 |
# KV-Control (T-Concat v4 backbone)
|
| 14 |
|
| 15 |
Sparse-keyframe, multi-joint controllable text-to-motion generation. The
|
| 16 |
+
repository at [github.com/CHDTevior/KV-Control](https://github.com/CHDTevior/KV-Control)
|
| 17 |
contains the full training and inference code.
|
| 18 |
|
| 19 |
## What is here
|
| 20 |
|
| 21 |
| Path | Content | Size |
|
| 22 |
|---|---|---|
|
| 23 |
+
| `base_t_concat_v4/model/net_best_fid.tar` | Pre-trained T-Concat v4 masked-transformer base (the paper main backbone, Ep 400) | 168 MB |
|
| 24 |
+
| `kv_control/model/net_best_top3.tar` | **Cross multi-joint** KV-Control adapter — paper Tab 4 multi-joint block (`net_best_top3` @ Ep 6000, control=cross) | 520 MB |
|
| 25 |
+
| `kv_control_trajectory/model/net_best_kps.tar` | **Single-joint pelvis** KV-Control adapter — paper Tab 4 headline row (`net_best_kps` @ Ep 6000, control=trajectory) | 520 MB |
|
| 26 |
| `vqvae/net_best_fid.pth` | Part-aware VQ-VAE tokenizer (128 codes × 6 parts) | 236 MB |
|
| 27 |
| `vqvae/skeleton_partition.json` | Skeleton partition for the part-aware VQ | 1 KB |
|
| 28 |
| `stats/{mean,std}.npy` | Normalization stats matching the released VQ | 4 KB |
|
|
|
|
| 36 |
## How to use
|
| 37 |
|
| 38 |
```bash
|
| 39 |
+
git clone https://github.com/CHDTevior/KV-Control.git
|
| 40 |
cd KV-Control
|
| 41 |
bash scripts/download_checkpoints.sh # populates checkpoints/, aux/ → glove/, body_models/
|
| 42 |
```
|
| 43 |
|
| 44 |
Refer to the GitHub README for installation and quick-start commands.
|
| 45 |
|
| 46 |
+
## Checkpoint provenance & expected metrics
|
| 47 |
+
|
| 48 |
+
Both released KV-Control adapters are evaluated with the paper **M3 hybrid**
|
| 49 |
+
protocol on the HumanML3D `test` split (Stage-1 dynamic TTT `each_iter=35
|
| 50 |
+
--ttt_dynamic` T=10; Stage-2 600-step embedding opt; `cfg=3.25`,
|
| 51 |
+
`--cond_drop_prob 0.0 --pred_num_batch 16 --seed 3407`):
|
| 52 |
+
|
| 53 |
+
| Checkpoint | `--control` | Paper row | Expected (5r mean) |
|
| 54 |
+
|---|---|---|---|
|
| 55 |
+
| `kv_control/model/net_best_top3.tar` | `cross` | Tab 4 multi-joint | KPS ≈ **0.80 cm** (best 0.71) |
|
| 56 |
+
| `kv_control_trajectory/model/net_best_kps.tar` | `trajectory` | Tab 4 headline | KPS ≈ **0.40 cm**, FID ≈ 0.065, Top-3 ≈ 0.799 |
|
| 57 |
+
|
| 58 |
+
The single-joint pelvis row is the paper headline; the cross checkpoint is the
|
| 59 |
+
multi-joint result. They come from two separate fine-tuning runs (pelvis vs
|
| 60 |
+
cross), both on the same frozen `base_t_concat_v4` backbone. See the GitHub
|
| 61 |
+
README §3 for the exact reproduction commands. `scripts/sanity_check_equivalence.py`
|
| 62 |
+
regenerates one designed trajectory and reports KPS (≈ 1.7 cm on that
|
| 63 |
+
hand-crafted 6-joint sample); it is an install smoke test, **not** a benchmark
|
| 64 |
+
or an external-reference diff.
|
| 65 |
+
|
| 66 |
## Licenses
|
| 67 |
|
| 68 |
* Our weights (`base_t_concat_v4`, `kv_control`, `vqvae`, `stats`) — **MIT**.
|