Tevior commited on
Commit
acc4e78
·
verified ·
1 Parent(s): a96fbaa

Update model card.

Browse files
Files changed (1) hide show
  1. README.md +25 -4
README.md CHANGED
@@ -13,15 +13,16 @@ pipeline_tag: text-to-motion
13
  # KV-Control (T-Concat v4 backbone)
14
 
15
  Sparse-keyframe, multi-joint controllable text-to-motion generation. The
16
- repository at [github.com/Tevior/KV-Control](https://github.com/Tevior/KV-Control)
17
  contains the full training and inference code.
18
 
19
  ## What is here
20
 
21
  | Path | Content | Size |
22
  |---|---|---|
23
- | `base_t_concat_v4/model/net_best_fid.tar` | Pre-trained T-Concat v4 masked-transformer base (the paper main backbone) | 168 MB |
24
- | `kv_control/model/net_best_kps.tar` | KV-Control adapter trained on the base above | 520 MB |
 
25
  | `vqvae/net_best_fid.pth` | Part-aware VQ-VAE tokenizer (128 codes × 6 parts) | 236 MB |
26
  | `vqvae/skeleton_partition.json` | Skeleton partition for the part-aware VQ | 1 KB |
27
  | `stats/{mean,std}.npy` | Normalization stats matching the released VQ | 4 KB |
@@ -35,13 +36,33 @@ contains the full training and inference code.
35
  ## How to use
36
 
37
  ```bash
38
- git clone https://github.com/Tevior/KV-Control.git
39
  cd KV-Control
40
  bash scripts/download_checkpoints.sh # populates checkpoints/, aux/ → glove/, body_models/
41
  ```
42
 
43
  Refer to the GitHub README for installation and quick-start commands.
44
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
45
  ## Licenses
46
 
47
  * Our weights (`base_t_concat_v4`, `kv_control`, `vqvae`, `stats`) — **MIT**.
 
13
  # KV-Control (T-Concat v4 backbone)
14
 
15
  Sparse-keyframe, multi-joint controllable text-to-motion generation. The
16
+ repository at [github.com/CHDTevior/KV-Control](https://github.com/CHDTevior/KV-Control)
17
  contains the full training and inference code.
18
 
19
  ## What is here
20
 
21
  | Path | Content | Size |
22
  |---|---|---|
23
+ | `base_t_concat_v4/model/net_best_fid.tar` | Pre-trained T-Concat v4 masked-transformer base (the paper main backbone, Ep 400) | 168 MB |
24
+ | `kv_control/model/net_best_top3.tar` | **Cross multi-joint** KV-Control adapter — paper Tab 4 multi-joint block (`net_best_top3` @ Ep 6000, control=cross) | 520 MB |
25
+ | `kv_control_trajectory/model/net_best_kps.tar` | **Single-joint pelvis** KV-Control adapter — paper Tab 4 headline row (`net_best_kps` @ Ep 6000, control=trajectory) | 520 MB |
26
  | `vqvae/net_best_fid.pth` | Part-aware VQ-VAE tokenizer (128 codes × 6 parts) | 236 MB |
27
  | `vqvae/skeleton_partition.json` | Skeleton partition for the part-aware VQ | 1 KB |
28
  | `stats/{mean,std}.npy` | Normalization stats matching the released VQ | 4 KB |
 
36
  ## How to use
37
 
38
  ```bash
39
+ git clone https://github.com/CHDTevior/KV-Control.git
40
  cd KV-Control
41
  bash scripts/download_checkpoints.sh # populates checkpoints/, aux/ → glove/, body_models/
42
  ```
43
 
44
  Refer to the GitHub README for installation and quick-start commands.
45
 
46
+ ## Checkpoint provenance & expected metrics
47
+
48
+ Both released KV-Control adapters are evaluated with the paper **M3 hybrid**
49
+ protocol on the HumanML3D `test` split (Stage-1 dynamic TTT `each_iter=35
50
+ --ttt_dynamic` T=10; Stage-2 600-step embedding opt; `cfg=3.25`,
51
+ `--cond_drop_prob 0.0 --pred_num_batch 16 --seed 3407`):
52
+
53
+ | Checkpoint | `--control` | Paper row | Expected (5r mean) |
54
+ |---|---|---|---|
55
+ | `kv_control/model/net_best_top3.tar` | `cross` | Tab 4 multi-joint | KPS ≈ **0.80 cm** (best 0.71) |
56
+ | `kv_control_trajectory/model/net_best_kps.tar` | `trajectory` | Tab 4 headline | KPS ≈ **0.40 cm**, FID ≈ 0.065, Top-3 ≈ 0.799 |
57
+
58
+ The single-joint pelvis row is the paper headline; the cross checkpoint is the
59
+ multi-joint result. They come from two separate fine-tuning runs (pelvis vs
60
+ cross), both on the same frozen `base_t_concat_v4` backbone. See the GitHub
61
+ README §3 for the exact reproduction commands. `scripts/sanity_check_equivalence.py`
62
+ regenerates one designed trajectory and reports KPS (≈ 1.7 cm on that
63
+ hand-crafted 6-joint sample); it is an install smoke test, **not** a benchmark
64
+ or an external-reference diff.
65
+
66
  ## Licenses
67
 
68
  * Our weights (`base_t_concat_v4`, `kv_control`, `vqvae`, `stats`) — **MIT**.