Cooper's Collections
image-qa

updated 28 days ago

  • Mayfull/LRV-Instruction

    Viewer • Updated Oct 11, 2025 • 181k • 129 • 1

  • nvidia/Nemotron-VLM-Dataset-v2

    Viewer • Updated Dec 18, 2025 • 4.58M • 2.39k • 89

  • array/SAT

    Preview • Updated Feb 16 • 529 • 13

  • WildVision/wildvision-internal-data

    Viewer • Updated Aug 21, 2024 • 155k • 2.38k • 5

  • PhoenixZ/OmniAlign-V

    Updated Mar 1, 2025 • 112 • 7

  • PhoenixZ/OmniAlign-V-DPO

    Viewer • Updated Mar 1, 2025 • 133k • 109 • 6

  • allenai/pixmo-cap-qa

    Viewer • Updated Dec 5, 2024 • 272k • 205 • 10

  • moonshotai/WorldVQA

    Viewer • Updated Feb 4 • 3k • 1.15k • 66

  • YangyiYY/LVLM_NLF

    Preview • Updated Nov 17, 2023 • 101 • 12

  • pufanyi/MIMICIT

    Viewer • Updated Mar 28, 2024 • 5.1M • 40 • 48

  • MMInstruction/M3IT

    Updated Nov 24, 2023 • 4.23k • 136

  • openlamm/Ch3Ef

    Updated Sep 28, 2024 • 86 • 3

  • AntGroup-MI/Osprey-724K

    Preview • Updated Feb 5, 2024 • 73 • 15

  • dutta18/Physical-Reasoning-VQA-45K

    Viewer • Updated 28 days ago • 64.9k • 557