OmniGUI: Benchmarking GUI Agents in Omni-Modal Smartphone Environments Paper • 2605.18758 • Published Apr 3 • 13
OmniGUI: Benchmarking GUI Agents in Omni-Modal Smartphone Environments Paper • 2605.18758 • Published Apr 3 • 13
DreamPolish: Domain Score Distillation With Progressive Geometry Generation Paper • 2411.01602 • Published Nov 3, 2024 • 11
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1, 2025 • 255
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1, 2025 • 255
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents Paper • 2503.01935 • Published Mar 3, 2025 • 30 • 3
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper • 2501.12380 • Published Jan 21, 2025 • 84