Analyze images to detect objects, points, keypoints, or text
Text-to-3D and Image-to-3D Generation
The agent using over 9000 vision models from the HF Hub.
Generate any application by Vibe Coding it
Generate object masks on images with SAM2
Segment and caption objects in images and videos
Detect objects in images or videos
State-of-the-art Zero-shot Object Detection