5 49

Daniel Serrano

dnlserrano

https://dnlserrano.dev

AI & ML interests

computer vision, biometrics, face, facial recognition, deepfakes, pad, mad, age, bias

Recent Activity

liked a model 3 days ago

CohereLabs/North-Mini-Code-1.0

liked a model 3 months ago

microsoft/Phi-4-reasoning-vision-15B

liked a Space 4 months ago

baohuynhbk14/Qwen3-VL-Multimodal-Search-DEMO

View all activity

Organizations

None yet

liked a model 3 days ago

CohereLabs/North-Mini-Code-1.0

Text Generation • 30B • Updated 7 days ago • 19.6k • 470

liked a model 3 months ago

microsoft/Phi-4-reasoning-vision-15B

Image-Text-to-Text • 15B • Updated Mar 18 • 13.3k • 172

liked a Space 4 months ago

Qwen3-VL Multimodal Search Engine

🔥

Cross-modal text-image search powered by Qwen3-VL

liked 7 models 4 months ago

nvidia/omni-embed-nemotron-3b

Qwen/Qwen3-VL-Embedding-2B

nvidia/C-RADIOv4-H

Image Feature Extraction • 0.7B • Updated Jan 30 • 27.8k • 77

liked a Space 6 months ago

Qwen Image 2512

👀

371

Rewrite image prompts into detailed English descriptions

liked a Space 9 months ago

FastVLM WebGPU

🍎

446

Real-time video captioning powered by FastVLM

upvoted 2 collections 9 months ago

FastVLM

Collection

Efficient Vision Encoding for Vision Language Models • 8 items • Updated Mar 2 • 114

MobileCLIP2

Collection

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated Apr 23 • 64

liked a model 10 months ago

apple/MobileCLIP-S2

Updated Feb 28, 2025 • 96 • 16

liked 2 models 11 months ago

google/gemma-3n-E4B

Image-Text-to-Text • 8B • Updated Jul 14, 2025 • 3.48k • 141

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 207k • • 2.51k

liked a model 12 months ago

google/videoprism-base-f16r288

Video Classification • Updated Jul 29, 2025 • 10.9k • 107

liked 2 models about 1 year ago

Wan-AI/Wan2.1-VACE-14B

Image-to-Video • Updated May 19, 2025 • 5.38k • 499

ByteDance/ContentV-8B

Text-to-Video • Updated Jun 24, 2025 • 22 • 57