ViewFusion: Structured Spatial Thinking Chains for Multi-View Reasoning
Paper
• 2603.06024 • Published
• 4
Official resources for ViewFusion: Structured Spatial Thinking Chains for Multi-View Reasoning.
ViewFusion is a framework for multi-view reasoning that introduces structured spatial thinking chains to improve cross-view understanding and spatial reasoning.
This repository provides resources related to the paper, such as model weights, datasets, or other project materials.
If you find this work useful, please cite:
@misc{tao2026viewfusionstructuredspatialthinking,
title={ViewFusion: Structured Spatial Thinking Chains for Multi-View Reasoning},
author={Xingjian Tao and Yiwei Wang and Yujun Cai and Yifan Song and Jing Tang},
year={2026},
eprint={2603.06024},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2603.06024},
}
Base model
Qwen/Qwen3-VL-4B-Instruct