Base Model for TransMLA
mengfanxu
fxmeng
AI & ML interests
None yet
Recent Activity
upvoted a paper about 19 hours ago
GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding submitted a paper about 22 hours ago
GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding upvoted a paper 8 days ago
MISA: Mixture of Indexer Sparse Attention for Long-Context LLM InferenceOrganizations
None yet