From Pixels to Words -- Towards Native Vision-Language Primitives at Scale
Haiwen Diao
Paranioar
AI & ML interests
Vision-and-Language, Parameter-efficient Transfer Learning, Multi-modal Large Language Model
Recent Activity
liked a model 1 day ago
sensenova/SenseNova-U1-8B-MoT-SFT liked a model 1 day ago
sensenova/SenseNova-U1-8B-MoT upvoted a collection 1 day ago
SenseNova-U1