Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
AI-Insight 's Collections
💡HF Papers Live 1: Reinforcement Learning
💡HF Papers Live 2: Code Bench
💡HF Papers Live 3: AI for Science
💡HF Papers Live 4: Multi Modal models
💡HF Papers Live 5: Omni-Modal models
💡HF Papers Live 6: OCR

💡HF Papers Live 4: Multi Modal models

updated Dec 3, 2025
Upvote
-

  • internlm/Intern-S1

    Image-Text-to-Text • 241B • Updated 2 days ago • 63.7k • 257

  • Intern-S1: A Scientific Multimodal Foundation Model

    Paper • 2508.15763 • Published Aug 21, 2025 • 273

  • MiniCPM-V: A GPT-4V Level MLLM on Your Phone

    Paper • 2408.01800 • Published Aug 3, 2024 • 93

  • openbmb/MiniCPM-V-4_5

    Image-Text-to-Text • 9B • Updated 21 days ago • 103k • 1.08k

  • openbmb/MiniCPM-V-4

    Image-Text-to-Text • Updated Sep 15, 2025 • 103k • 462

  • zai-org/GLM-4.5V

    Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 45.6k • • 710

  • zai-org/GLM-4.1V-9B-Thinking

    Image-Text-to-Text • 10B • Updated Oct 25, 2025 • 425k • 774

  • GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

    Paper • 2507.01006 • Published Jul 1, 2025 • 252

  • Ovis2.5 Technical Report

    Paper • 2508.11737 • Published Aug 15, 2025 • 113

  • AIDC-AI/Ovis2.5-2B

    Image-Text-to-Text • 3B • Updated Feb 13 • 114k • 200

  • stepfun-ai/step3

    Image-Text-to-Text • 321B • Updated Jan 29 • 103k • 166
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs