Vision-Language moonshotai/Kimi-VL-A3B-Thinking Image-Text-to-Text • 16B • Updated 12 days ago • 80.3k • 445 OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 5.04k • 48 LifuWang/DistillT5 0.1B • Updated Apr 11, 2025 • 94 • 29
OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 5.04k • 48
Vision-Language moonshotai/Kimi-VL-A3B-Thinking Image-Text-to-Text • 16B • Updated 12 days ago • 80.3k • 445 OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 5.04k • 48 LifuWang/DistillT5 0.1B • Updated Apr 11, 2025 • 94 • 29
OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 5.04k • 48