AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing
Organization Card
Embark with pragmatic innovation.
Venture boldly into the unknown.
Challenge the AGI with deep thinking.
Ignite every curiosity with creative spark.
Ask Mi Anything!
models 24
XiaomiMiMo/MiMo-Audio-Tokenizer
1B • Updated • 2.88k • 38
XiaomiMiMo/MiMo-Audio-7B-Instruct
Any-to-Any • 8B • Updated • 17.6k • 160
XiaomiMiMo/MiMo-Audio-7B-Base
Any-to-Any • 8B • Updated • 160 • 55
XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash
Text Generation • 554B • Updated • 46.8k • 140
XiaomiMiMo/MiMo-V2.5-Base
311B • Updated • 213 • 30
XiaomiMiMo/MiMo-V2.5
311B • Updated • 218k • 340
XiaomiMiMo/MiMo-V2.5-Pro-Base
Text Generation • 1T • Updated • 239 • 40
XiaomiMiMo/MiMo-V2.5-Pro
Text Generation • 1T • Updated • 101k • • 688
XiaomiMiMo/MiMo-V2.5-ASR
Automatic Speech Recognition • 8B • Updated • 2.52k • 101
XiaomiMiMo/MiMo-V2-Flash-Base
Text Generation • 310B • Updated • 134 • 50