Base Model for TransMLA
mengfanxu
fxmeng
AI & ML interests
None yet
Recent Activity
submitted a paper 1 day ago
GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding
upvoted a paper 9 days ago
MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference
Organizations
None yet