Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xz's picture
7 1

xz

mxz
·

AI & ML interests

NLP ML RL

Organizations

None yet

models 7

mxz/qwen-R1-3B

3B • Updated Mar 4, 2025 • 5

mxz/qwen-R1-1.5B

2B • Updated Mar 4, 2025 • 5

mxz/qwen-R1-0.5b

0.5B • Updated Mar 3, 2025 • 6

mxz/llama3-8b-dpo

Text Generation • 8B • Updated Jul 28, 2024 • 6

mxz/llama3-8b-ppo

Text Generation • 8B • Updated Jul 28, 2024 • 9

mxz/llama3-8b-sft

Text Generation • 8B • Updated Jul 28, 2024 • 10

mxz/ppo-LunarLander-v2

Reinforcement Learning • Updated Jul 17, 2024 • 1

datasets 4

mxz/awesome-dpo

Viewer • Updated Jul 28, 2024 • 302k • 13

mxz/CValues

Viewer • Updated Jul 26, 2024 • 146k • 21

mxz/CValues_DPO

Viewer • Updated Jul 26, 2024 • 146k • 9

mxz/alpaca_en_zh_ruozhiba_gpt4-data

Viewer • Updated Jul 26, 2024 • 190k • 16
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs