LinguDistill: Recovering Linguistic Ability in Vision- Language Models via Selective Cross-Modal Distillation Paper • 2604.00829 • Published Apr 1 • 8
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published about 1 month ago • 203
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published about 1 month ago • 112
LinguDistill: Recovering Linguistic Ability in Vision- Language Models via Selective Cross-Modal Distillation Paper • 2604.00829 • Published Apr 1 • 8
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated about 1 month ago • 278k • 2.82k
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 6 days ago • 774k • • 347