XiaomiMiMo/MiMo-V2-Flash • Text Generation • 253k • 624
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing
MiMo-V2-Flash Technical Report