Ben commited on
Commit
bc89e3d
·
1 Parent(s): 7ed4dd8

Update v2 release assets

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -16,16 +16,16 @@ models:
16
 
17
  # Representation over Routing Demo
18
 
19
- Interactive demo for the preprint **"Representation over Routing: Overcoming Surrogate Hacking in Multi-Timescale PPO"**.
20
 
21
- This Space visualizes the four pretrained ablation-stage agents from the associated model repository. The default selection shows **Stage 4: Target Decoupling**, the proposed method, performing a LunarLander rollout with a fixed deterministic demo seed.
22
 
23
  ## Links
24
 
25
  * **Paper:** https://arxiv.org/abs/2604.13517
26
  * **Model Weights:** https://huggingface.co/ben-dlwlrma/Representation-Over-Routing
27
- * **GitHub Repository:** https://github.com/ben-dlwlrma/Representation-Over-Routing
28
 
29
  ## Notes
30
 
31
- The paper experiments were conducted on `LunarLander-v2`. This hosted demo uses `LunarLander-v3` for compatibility with current Gymnasium releases, while keeping the same actor architecture and pretrained weights.
 
16
 
17
  # Representation over Routing Demo
18
 
19
+ Interactive demo for the preprint **"Representation over Routing: Diagnosing Temporal Routing Pathologies in Multi-Timescale PPO"**.
20
 
21
+ This Space visualizes four pretrained diagnostic-stage agents from the associated model repository. The default selection shows **Stage 4: Target Decoupling**, a structural separation variant, performing a LunarLander rollout with a fixed deterministic demo seed.
22
 
23
  ## Links
24
 
25
  * **Paper:** https://arxiv.org/abs/2604.13517
26
  * **Model Weights:** https://huggingface.co/ben-dlwlrma/Representation-Over-Routing
27
+ * **GitHub Code Repository:** https://github.com/ben-dlwlrma/Representation-Over-Routing
28
 
29
  ## Notes
30
 
31
+ The paper experiments were conducted on `LunarLander-v2`. This hosted demo uses `LunarLander-v3` for compatibility with current Gymnasium releases, while keeping the same actor architecture and pretrained weight format. The Space is a qualitative visualization, not an additional benchmark result.