GenericAgent: A Token-Efficient Self-Evolving LLM Agent via Contextual Information Density Maximization (V1.0) Paper • 2604.17091 • Published 7 days ago • 11
MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval Paper • 2604.18584 • Published 5 days ago • 14
The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation Paper • 2604.16830 • Published 7 days ago • 13
Concrete Jungle: Towards Concreteness Paved Contrastive Negative Mining for Compositional Understanding Paper • 2604.13313 • Published 11 days ago • 12
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence Paper • 2604.18292 • Published 5 days ago • 78
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published 5 days ago • 84
Crowded in B-Space: Calibrating Shared Directions for LoRA Merging Paper • 2604.16826 • Published 7 days ago • 18
SkillFlow: Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents Paper • 2604.17308 • Published 6 days ago • 22
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents Paper • 2604.18543 • Published 5 days ago • 26
WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models Paper • 2604.18224 • Published 5 days ago • 22
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification Paper • 2604.14258 • Published 10 days ago • 23
ArtifactNet: Detecting AI-Generated Music via Forensic Residual Physics Paper • 2604.16254 • Published 8 days ago • 3
Universal statistical signatures of evolution in artificial intelligence architectures Paper • 2604.10571 • Published 13 days ago • 4
The Amazing Agent Race: Strong Tool Users, Weak Navigators Paper • 2604.10261 • Published 8 days ago • 7
Can Large Language Models Reinvent Foundational Algorithms? Paper • 2604.05716 • Published 18 days ago • 7