XiaomiMiMo/MiMo-V2-Flash
Text Generation • 310B • Updated • 170k • 619
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing
MiMo-V2-Flash Technical Report