yale-nlp/Qwen3-VL-8B-Anchor-Windows
770k
•
Updated
•
11
Natural Language Processing at Yale
ANCHOR: Branch-Point Data Generation for GUI Agents
Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers