Seed2.0 Model Card: Towards Intelligence Frontier for Real-World Complexity Paper • 2607.00248 • Published 4 days ago • 23
MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training Paper • 2606.30406 • Published 5 days ago • 11
VideoSearch-R1: Iterative Video Retrieval and Reasoning via Soft Query Refinement Paper • 2607.00446 • Published 3 days ago • 16
Ko-WideSearch: A Korean Breadth-Search Benchmark for Exhaustive Set Enumeration by Web Agents Paper • 2606.27595 • Published 9 days ago • 8
TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents Paper • 2606.28480 • Published 8 days ago • 47
Agentic Abstention: Do Agents Know When to Stop Instead of Act? Paper • 2606.28733 • Published 7 days ago • 140
Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning Paper • 2606.31825 • Published 4 days ago • 13
Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning Paper • 2606.31825 • Published 4 days ago • 13
SingGuard: A Policy-Adaptive Multimodal LLM Guardrail with Dynamic Reasoning Paper • 2606.22873 • Published 12 days ago • 15
Ko-WideSearch: A Korean Breadth-Search Benchmark for Exhaustive Set Enumeration by Web Agents Paper • 2606.27595 • Published 9 days ago • 8
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 11 days ago • 144
OpenBioRQ: Unsolved Biomedical Research Questions for Agents Paper • 2606.21959 • Published 14 days ago • 4
OpenBioRQ: Unsolved Biomedical Research Questions for Agents Paper • 2606.21959 • Published 14 days ago • 4