view reply Hopefully they can involve NEO-Unify in discussions of their paper, haha~ It would be deserved.
Qwen/Qwen3-Embedding-0.6B Feature Extraction • 0.6B • Updated 13 days ago • 5.83M • • 1.01k
google/siglip2-giant-opt-patch16-256 Zero-Shot Image Classification • 2B • Updated Feb 21, 2025 • 14.3k • 4
facebook/dinov3-vitl16-pretrain-lvd1689m Image Feature Extraction • 0.3B • Updated Aug 19, 2025 • 691k • 246
Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Paper • 2601.19834 • Published Jan 27 • 25
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published Dec 30, 2025 • 52
In Pursuit of Pixel Supervision for Visual Pre-training Paper • 2512.15715 • Published Dec 17, 2025 • 11