Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
7
1
xz
mxz
Follow
0 followers
·
3 following
AI & ML interests
NLP ML RL
Organizations
None yet
mxz
's models
7
Sort: Recently updated
mxz/qwen-R1-3B
3B
•
Updated
Mar 4, 2025
•
3
mxz/qwen-R1-1.5B
2B
•
Updated
Mar 4, 2025
•
3
mxz/qwen-R1-0.5b
0.5B
•
Updated
Mar 3, 2025
•
4
mxz/llama3-8b-dpo
Text Generation
•
8B
•
Updated
Jul 28, 2024
•
1
mxz/llama3-8b-ppo
Text Generation
•
8B
•
Updated
Jul 28, 2024
•
3
mxz/llama3-8b-sft
Text Generation
•
8B
•
Updated
Jul 28, 2024
•
2
mxz/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jul 17, 2024