OneVL series vision-language models
Xiaomi Research
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Xiaomi-GUI-0 Technical Report
Discrete-WAM: Unified Discrete Vision-Action Token Editing for World-Policy Learning
models 21
xiaomi-research/OneVL_mlp_NAVSIM
14B • Updated • 31
xiaomi-research/Baseline_cot_NAVSIM
570k • Updated • 4
xiaomi-research/Baseline_answer_NAVSIM
570k • Updated • 6
xiaomi-research/OneVL_visual_decoder_pt_ar1
Image-Text-to-Text • 5B • Updated • 12
xiaomi-research/OneVL_visual_decoder_pt
Image-Text-to-Text • 5B • Updated • 37
xiaomi-research/OneVL_ROADWork
Image-Text-to-Text • 14B • Updated • 7 • 1
xiaomi-research/OneVL_NAVSIM
Image-Text-to-Text • 14B • Updated • 220
xiaomi-research/OneVL_Impromptu
Image-Text-to-Text • 14B • Updated • 29
xiaomi-research/OneVL_AlpamayoR1
Image-Text-to-Text • 14B • Updated • 67
xiaomi-research/TTS-PRISM-7B
Audio Classification • 8B • Updated • 29