XiaomiMiMo/MiMo-V2-Flash
Text Generation
•
Updated
•
353k
•
•
629
None defined yet.
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing
MiMo-V2-Flash Technical Report