Hugging Face

liyaxuan

lllyx

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
upvoted a paper about 8 hours ago
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction
upvoted a paper about 8 hours ago
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Organizations

None yet

lllyx's collections (1)

Rethinking OPD
This collection includes the models used in the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe".
  • lllyx/Qwen3-1.7B-SFT

    Text Generation • 2B • Updated 7 days ago • 696 • 1
  • Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

    Paper • 2604.13016 • Published 27 days ago • 91
  • lllyx/Qwen3-4B-Base-GRPO

    Text Generation • 4B • Updated 7 days ago • 142 • 1