Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
OpenGVLab 's Collections
InternVideo3
InternVideo-Next
Vlaser
NaViL
InternVL3.5-Flash
InternVL3.5-Core
InternVL3.5
ScaleCUA
SDLM
Docopilot
ZeroGUI
InternVL3
VisualPRM
Mono-InternVL
PIIP
VideoChat-R1
InternVideo2.5
VideoMAE-v2
VideoChat-Flash
InternVL2.5
InternVL2.5-MPO
InternVL2.0
InternVL1.5
InternVL1.0
V2PE
InternVL Adaptation
InternVideo2
VideoChat
VideoMamba
InternVid
OmniCorpus
All-Seeing Project
InternImage
PVT v2
InternVL Data

InternVideo3

updated about 5 hours ago

InternVideo3 enhances long-horizon multimodal tasks through Multimodal Contextual Reasoning and efficient attention mechanisms

Upvote
1

  • InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning

    Paper • 2606.12195 • Published 15 days ago • 23

  • yanziang/InternVideo3_Dataset

    Viewer • Updated 14 days ago • 380k • 187 • 2

  • yanziang/InternVideo3-8B-Instruct

    Video-Text-to-Text • 9B • Updated 14 days ago • 537 • 6
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs