S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models Paper • 2604.01168 • Published 25 days ago • 7
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published 25 days ago • 42
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper • 2604.04323 • Published 21 days ago • 41
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 16 days ago • 76
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 13 days ago • 98
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents Paper • 2604.06132 • Published 20 days ago • 117
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 18 days ago • 285
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 11 days ago • 63
Article PangolinGuard: Fine-Tuning ModernBERT as a Lightweight Approach to AI Guardrails Mar 23, 2025 • 13