Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
5
12
127
mengfanxu
fxmeng
Follow
zuoke's profile picture
21world's profile picture
dawn0704's profile picture
19 followers
·
32 following
https://fxmeng.github.io
fxmeng
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding
submitted
a paper
1 day ago
GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding
upvoted
a
paper
9 days ago
MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference
View all activity
Organizations
None yet
fxmeng
's datasets
12
Sort: Recently updated
fxmeng/transmla_pretrain_100m_tokens
Viewer
•
Updated
Jul 5, 2025
•
100k
•
10
fxmeng/transmla_pretrain_1B_tokens
Viewer
•
Updated
Jul 5, 2025
•
1.14M
•
15
fxmeng/transmla_pretrain_6B_tokens
Viewer
•
Updated
Jul 5, 2025
•
5.94M
•
2.12k
fxmeng/pissa-dataset
Viewer
•
Updated
Jan 8, 2025
•
844k
•
2.05k
•
3
fxmeng/big-bench-hard-continue-finetuning
Viewer
•
Updated
Dec 19, 2024
•
10.3k
•
36
•
1
fxmeng/commonsense_filtered
Viewer
•
Updated
Dec 11, 2024
•
170k
•
79
•
1
fxmeng/MetaMath-GSM240K
Viewer
•
Updated
Nov 14, 2024
•
240k
•
116
•
1
fxmeng/MetaMath-MATH155K
Viewer
•
Updated
Nov 14, 2024
•
155k
•
257
fxmeng/CodeFeedback-Python105K
Viewer
•
Updated
Nov 14, 2024
•
105k
•
85
•
6
fxmeng/llava_finetune_336x336
Preview
•
Updated
Apr 26, 2024
•
19
fxmeng/llava_pretrain_336x336
Preview
•
Updated
Apr 26, 2024
•
18
fxmeng/WizardLM_evol_instruct_V2_143k
Viewer
•
Updated
Apr 16, 2024
•
143k
•
13
•
2