Steer3D ziqima/Edit3D-Bench Viewer • Updated Dec 16, 2025 • 247 • 330 • 2 ziqima/Steer3D-Data Preview • Updated Dec 16, 2025 • 2.63k ziqima/Steer3D Image-to-3D • Updated Dec 16, 2025 • 2
TWIN [CVPR 2026] Datasets and models from the paper "Same or Not? Enhancing Visual Perception in Vision-Language Models" glab-caltech/TWIN-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated Dec 30, 2025 • 6 • 2 glab-caltech/TWIN-InternVL3_5-1B Image-Text-to-Text • 1B • Updated Dec 30, 2025 • 3 • 1 glab-caltech/FGVQA Viewer • Updated Dec 30, 2025 • 12k • 46 • 2 glab-caltech/TWIN Viewer • Updated Mar 30 • 562k • 128 • 3
ConverSeg aadarsh99/ConverSeg-Net-3B Updated Feb 16 aadarsh99/ConverSeg-Training-Data Viewer • Updated 21 days ago • 70.9k • 3.44k • 1 aadarsh99/ConverSeg Viewer • Updated Feb 16 • 1.69k • 188 • 1
VALOR [ICLR 2026] Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" glab-caltech/VALOR-8B 8B • Updated Dec 11, 2025 • 36 glab-caltech/VALOR-GroundingDINO Object Detection • Updated Dec 11, 2025 No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers Paper • 2512.08889 • Published Dec 9, 2025 • 1
No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers Paper • 2512.08889 • Published Dec 9, 2025 • 1
ConverSeg aadarsh99/ConverSeg-Net-3B Updated Feb 16 aadarsh99/ConverSeg-Training-Data Viewer • Updated 21 days ago • 70.9k • 3.44k • 1 aadarsh99/ConverSeg Viewer • Updated Feb 16 • 1.69k • 188 • 1
Steer3D ziqima/Edit3D-Bench Viewer • Updated Dec 16, 2025 • 247 • 330 • 2 ziqima/Steer3D-Data Preview • Updated Dec 16, 2025 • 2.63k ziqima/Steer3D Image-to-3D • Updated Dec 16, 2025 • 2
TWIN [CVPR 2026] Datasets and models from the paper "Same or Not? Enhancing Visual Perception in Vision-Language Models" glab-caltech/TWIN-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated Dec 30, 2025 • 6 • 2 glab-caltech/TWIN-InternVL3_5-1B Image-Text-to-Text • 1B • Updated Dec 30, 2025 • 3 • 1 glab-caltech/FGVQA Viewer • Updated Dec 30, 2025 • 12k • 46 • 2 glab-caltech/TWIN Viewer • Updated Mar 30 • 562k • 128 • 3
VALOR [ICLR 2026] Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" glab-caltech/VALOR-8B 8B • Updated Dec 11, 2025 • 36 glab-caltech/VALOR-GroundingDINO Object Detection • Updated Dec 11, 2025 No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers Paper • 2512.08889 • Published Dec 9, 2025 • 1
No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers Paper • 2512.08889 • Published Dec 9, 2025 • 1