gui agent
updated
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Paper
• 2412.04454
• Published
• 71
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
Paper
• 2506.03143
• Published
• 53
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary
Reinforcement Learning
Paper
• 2505.12370
• Published
UIShift: Enhancing VLM-based GUI Agents through Self-supervised
Reinforcement Learning
Paper
• 2505.12493
• Published
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on
Mobile Devices
Paper
• 2406.08451
• Published
• 26
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection
Behavior
Paper
• 2506.08012
• Published
• 7
ShowUI-Aloha: Human-Taught GUI Agent
Paper
• 2601.07181
• Published
• 3
OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent
Paper
• 2601.07779
• Published
• 28
Step-GUI Technical Report
Paper
• 2512.15431
• Published
• 132
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents
Paper
• 2512.22047
• Published
• 30
MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Paper
• 2512.19432
• Published
• 13