Running Agents Featured 36 QwenScope 🔥 36 Explore and steer Qwen3 model features with interactive heatmaps
Running on Zero Agents Featured 49 RF-DETR Realtime Webcam Demo 🎯 49 Segment objects in live webcam and uploaded media
TIPSv2 Collection TIPSv2 foundational vision-language models. Webpage: https://gdm-tipsv2.github.io/ • 9 items • Updated Apr 14 • 36
RDP LoRA: Geometry-Driven Identification for Parameter-Efficient Adaptation in Large Language Models Paper • 2604.19321 • Published Apr 21 • 8
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 127k • • 2.88k
SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated Apr 10 • 17
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published Jan 8 • 59
Running on Zero Agents Featured 955 FLUX.2 [dev] 💻 955 Generate or edit images from text and optional photos