osunlp/Mind2Web
Viewer
•
Updated
•
253
•
3.08k
•
123
Towards Generalist Agents for the Web (NeurIPS'23 Spotlight)
Note Original Mind2Web Dataset
Note Multimodal version of the Mind2Web Dataset
Note An Online benchmark of Mind2Web-level web tasks
Note First LLM-based web agent Ecologically valid eval
Note SeeAct: First generalist web agent with visual perception
Visualize AI agent performance with tables and interactive plots
Note Leaderboard of Online-Mind2Web