Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
eshmoideas
's Collections
ANY GEN ++
MM GEN+RET
INDAI 2047
Indic AI
Entertainment
multimodel Features
Audio DATA Preparation
Voice
Q - Physics Featuring
Vidio DATA Preparation
Mechanicals
BIOLOGY
NLP
Vidio Featuring
Image Featuring
Deep Learning
DATA Analysis
Machine Learning
DATA Preparation
TIME
Pre Training
EVOLUTION TEST - TRAINING
Design
PHYSICS
IMAGE
WORLD
MEDICAL
Autonomous Vehicles
Physical AI
Nature
PHYSICS - RADIO
Speech
Simulation
Musics
Featureing
Quantum Computing
Diffusers
ROLES
ALGORITMS
DEV
Pers FINANCE
Creative
Hardware
CUDA
RAG
User
OCR
Cyber Security
Multi Model
3D - Animation
Image - Vidio
Robotics
Others
Quantum
Game
Twin
Math
Code
Audio
Science
Vision
General
Web
Research
Light
STEM
Bio Intelligence
Data sets
Emotional Intelligence
AR/VR/XR
Training
Data
Trading
MEDIA
Vision
updated
Feb 21
Upvote
-
apple/FastVLM-7B-int4
1B
•
Updated
Sep 3, 2025
•
67
•
31
apple/FastVLM-1.5B
Text Generation
•
2B
•
Updated
Sep 3, 2025
•
2.96k
•
80
apple/FastVLM-1.5B-int8
0.5B
•
Updated
Sep 3, 2025
•
212
•
20
apple/MobileCLIP2-L-14
Updated
Oct 9, 2025
•
36
•
4
apple/MobileCLIP2-S4
Updated
Oct 9, 2025
•
65
•
14
apple/MobileCLIP2-S2
Updated
Oct 9, 2025
•
70
•
15
apple/MobileCLIP2-B
Updated
Oct 9, 2025
•
56
•
3
apple/coreml-depth-anything-v2-small
Depth Estimation
•
Updated
Jun 24, 2024
•
1.04k
•
96
apple/coreml-FastViT-T8
Image Classification
•
Updated
Jun 13, 2024
•
28
•
17
apple/DFN5B-CLIP-ViT-H-14-378
Updated
Feb 28, 2025
•
12M
•
108
ByteDance-Seed/VINCIE-3B
Image-to-Image
•
Updated
Sep 9, 2025
•
14
•
43
ByteDance-Seed/Tar-TA-Tok
Updated
Jul 2, 2025
•
7
ByteDance-Seed/Tar-7B
Any-to-Any
•
9B
•
Updated
Jul 2, 2025
•
68
•
40
ByteDance-Seed/SeedVR-7B
Video-to-Video
•
Updated
Jun 20, 2025
•
60
•
10
ByteDance-Seed/UI-TARS-1.5-7B
Image-Text-to-Text
•
8B
•
Updated
Apr 18, 2025
•
19.4k
•
538
ByteDance-Seed/UI-TARS-72B-SFT
Image-Text-to-Text
•
73B
•
Updated
Jan 25, 2025
•
230
•
25
ByteDance-Seed/UI-TARS-2B-SFT
Image-Text-to-Text
•
2B
•
Updated
Jan 25, 2025
•
1.77k
•
37
ByteDance-Seed/UI-TARS-7B-DPO
Image-Text-to-Text
•
Updated
Jan 25, 2025
•
1.01k
•
227
nvidia/Liver_Scan_Pi0_Cosmos_Rel
Updated
Sep 16, 2025
•
1
nvidia/MambaVision-L-1K
Image Classification
•
0.2B
•
Updated
Mar 27, 2025
•
110
•
6
decart-ai/Lucy-Edit-Dev-ComfyUI
Updated
Nov 7, 2025
•
14
decart-ai/Lucy-Edit-Dev
Video-to-Video
•
Updated
Nov 20, 2025
•
800
•
336
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text
•
236B
•
Updated
Nov 26, 2025
•
92k
•
•
391
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text
•
236B
•
Updated
Nov 26, 2025
•
1.39M
•
•
383
bytedance-research/LVFace
Image Feature Extraction
•
Updated
Aug 21, 2025
•
414
•
28
Tesslate/Synthia-S1-27b-Q4_K_M-GGUF
27B
•
Updated
Apr 3, 2025
•
82
•
6
nvidia/NV-Segment-CT
Image Segmentation
•
Updated
Apr 1
•
139
•
17
nvidia/nemotron-graphic-elements-v1
Object Detection
•
Updated
Mar 11
•
15
•
22
nvidia/nemotron-table-structure-v1
Object Detection
•
Updated
Mar 11
•
89
•
31
nvidia/Cosmos-Embed1-448p
1B
•
Updated
Mar 13
•
5.68k
•
8
google/paligemma2-28b-mix-448-jax
Image-Text-to-Text
•
Updated
Feb 7, 2025
•
2
google/paligemma2-3b-pt-448-keras
Image-Text-to-Text
•
Updated
Dec 11, 2024
•
268
google/paligemma-3b-ft-coco35l-224
Image-Text-to-Text
•
3B
•
Updated
Jul 19, 2024
•
168
•
1
google/pix2struct-ai2d-base
Visual Question Answering
•
0.3B
•
Updated
Dec 24, 2023
•
1.42k
•
43
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
•
Updated
Dec 10, 2025
•
1.66M
•
733
microsoft/Magma-8B
Robotics
•
9B
•
Updated
Dec 10, 2025
•
1.03k
•
415
microsoft/VITRA-VLA-3B
Robotics
•
Updated
Dec 9, 2025
•
18
•
14
microsoft/udop-large-512-300k
Image-Text-to-Text
•
0.7B
•
Updated
Dec 2, 2025
•
65
•
34
microsoft/kosmos-2.5
Image-Text-to-Text
•
Updated
Aug 28, 2025
•
84.5k
•
270
microsoft/GUI-Actor-2B-Qwen2-VL
Image-Text-to-Text
•
2B
•
Updated
Aug 9, 2025
•
298
•
20
osunlp/UGround-V1-72B
Image-Text-to-Text
•
73B
•
Updated
Jan 23, 2025
•
18
•
4
microsoft/Florence-2-large-ft
Image-Text-to-Text
•
0.8B
•
Updated
Aug 4, 2025
•
32.9k
•
384
microsoft/trocr-base-handwritten
Image-to-Text
•
0.3B
•
Updated
Feb 11, 2025
•
153k
•
493
microsoft/trocr-small-printed
Image-to-Text
•
61.4M
•
Updated
May 27, 2024
•
32.6k
•
48
microsoft/trocr-small-handwritten
Image-to-Text
•
Updated
May 27, 2024
•
57.5k
•
63
microsoft/trocr-large-handwritten
Image-to-Text
•
Updated
May 27, 2024
•
179k
•
160
deepmind/vision-perceiver-fourier
Image Classification
•
Updated
Sep 24, 2023
•
631
•
1
Upvote
-
Share collection
View history
Collection guide
Browse collections