alecccdd's Collections

Vision Tasks

updated 6 days ago

  • BAAI/seggpt-vit-large

    0.4B • Updated Feb 22, 2024 • 22.4k • 5

  • q-future/one-align

    Zero-Shot Image Classification • Updated May 14, 2024 • 195k • 43

  • Qwen-VL Object-Detection ✨

    Running on Zero • 16

    Compare Qwen-VL models for object detection.


  • Joycaption Watermark Detection 🔥

    Running • 37

    Watermark detection


  • SAM3 VLM-FO1 👁

    Running on Zero • Featured • 39

    Complex text label detection using SAM3 with VLM-FO1


  • SnJake/Ref2Font

    Text-to-Image • Updated 22 days ago • 105 • 33

  • facebook/DepthLM

    Image-Text-to-Text • Updated 27 days ago • 108 • 36

  • DA-2 ⚽

    Running on Zero • Featured • 47

    Official demo of DA^2: Depth Anything in Any Direction


  • DepthLM: Metric Depth From Vision Language Models

    Paper • 2509.25413 • Published Sep 29, 2025 • 7