R
Romanoffalex
AI & ML interests
None yet
Recent Activity
updated a collection 5 days ago
Best small models liked a model 8 days ago
huihui-ai/Huihui4-8B-A4B-v2 liked a model 13 days ago
datalab-to/chandra-ocr-2Organizations
None yet
CV models
-
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text β’ 33B β’ Updated β’ 108k β’ 482 -
baidu/ERNIE-4.5-VL-28B-A3B-Base-PT
Image-Text-to-Text β’ 29B β’ Updated β’ 97 β’ 38 -
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction β’ 7B β’ Updated β’ 12.4k β’ 226 -
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text β’ 30B β’ Updated β’ 2.51k β’ 537
Upscalers
video gan
Graphic gan
- Running on ZeroAgentsFeatured942
OminiControl
π942Generate custom images from a reference photo and text
- Running on ZeroAgentsFeatured2.08k
PuLID-FLUX
π€2.08kGenerate custom images from text and a reference photo
- RunningAgents661
PR Puppet Sora
π661Generate AI videos from text prompts
-
genmo/mochi-1-preview
Text-to-Video β’ Updated β’ 9.25k β’ β’ 1.32k
llm ru
- PausedAgents47
Saiga 13b Q4_1 llama.cpp Retrieval QA
π47Upload files and ask questions based on their content
-
Deci/DeciLM-7B
Text Generation β’ 7B β’ Updated β’ 555 β’ 226 - Running92
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
π92Evaluate multilingual models using FineTasks
Codex model
Audio
-
stabilityai/stable-audio-open-1.0
Text-to-Audio β’ Updated β’ 22k β’ 1.45k -
laion/emonet-face-binary
Preview β’ Updated β’ 41 β’ 3 -
laion/emonet-face-hq
Viewer β’ Updated β’ 2.5k β’ 91 β’ 2 - Paused240
Omnilingual ASR Media Transcription
π240Transcribe audio/video files into text instantly
3d
dataset
Llms alfa test
- Running on ZeroAgents1.12k
OOTDiffusion
π₯Ό1.12kHigh-quality virtual try-on ~ Your cyber fitting room
-
stabilityai/stable-diffusion-3-medium
Text-to-Image β’ Updated β’ 3.85k β’ β’ 4.95k -
rain1011/pyramid-flow-sd3
Text-to-Video β’ Updated β’ 839 -
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation β’ 50B β’ Updated β’ 37.1k β’ 233
Best small models
Codex model
CV models
-
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text β’ 33B β’ Updated β’ 108k β’ 482 -
baidu/ERNIE-4.5-VL-28B-A3B-Base-PT
Image-Text-to-Text β’ 29B β’ Updated β’ 97 β’ 38 -
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction β’ 7B β’ Updated β’ 12.4k β’ 226 -
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text β’ 30B β’ Updated β’ 2.51k β’ 537
Audio
-
stabilityai/stable-audio-open-1.0
Text-to-Audio β’ Updated β’ 22k β’ 1.45k -
laion/emonet-face-binary
Preview β’ Updated β’ 41 β’ 3 -
laion/emonet-face-hq
Viewer β’ Updated β’ 2.5k β’ 91 β’ 2 - Paused240
Omnilingual ASR Media Transcription
π240Transcribe audio/video files into text instantly
Upscalers
3d
video gan
dataset
Graphic gan
- Running on ZeroAgentsFeatured942
OminiControl
π942Generate custom images from a reference photo and text
- Running on ZeroAgentsFeatured2.08k
PuLID-FLUX
π€2.08kGenerate custom images from text and a reference photo
- RunningAgents661
PR Puppet Sora
π661Generate AI videos from text prompts
-
genmo/mochi-1-preview
Text-to-Video β’ Updated β’ 9.25k β’ β’ 1.32k
Llms alfa test
- Running on ZeroAgents1.12k
OOTDiffusion
π₯Ό1.12kHigh-quality virtual try-on ~ Your cyber fitting room
-
stabilityai/stable-diffusion-3-medium
Text-to-Image β’ Updated β’ 3.85k β’ β’ 4.95k -
rain1011/pyramid-flow-sd3
Text-to-Video β’ Updated β’ 839 -
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation β’ 50B β’ Updated β’ 37.1k β’ 233
llm ru
- PausedAgents47
Saiga 13b Q4_1 llama.cpp Retrieval QA
π47Upload files and ask questions based on their content
-
Deci/DeciLM-7B
Text Generation β’ 7B β’ Updated β’ 555 β’ 226 - Running92
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
π92Evaluate multilingual models using FineTasks