To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models Paper • 2602.12566 • Published Feb 13 • 1
LiveClawBench: Benchmarking LLM Agents on Complex, Real-World Assistant Tasks Paper • 2604.13072 • Published Mar 20
MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling Paper • 2602.03359 • Published Feb 3 • 10
Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding Paper • 2509.06923 • Published Sep 8, 2025 • 22 • 2
Keyword-Centric Prompting for One-Shot Event Detection with Self-Generated Rationale Enhancements Paper • 2508.07598 • Published Aug 11, 2025
Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding Paper • 2509.06923 • Published Sep 8, 2025 • 22
Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding Paper • 2509.06923 • Published Sep 8, 2025 • 22