Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
20
2
liyaxuan
lllyx
Follow
0 followers
·
4 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 21 hours ago
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
upvoted
a
paper
about 21 hours ago
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction
upvoted
a
paper
about 21 hours ago
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning
View all activity
Organizations
None yet
lllyx
's models
2
Sort: Recently updated
lllyx/Qwen3-1.7B-SFT
Text Generation
•
2B
•
Updated
8 days ago
•
696
•
1
lllyx/Qwen3-4B-Base-GRPO
Text Generation
•
4B
•
Updated
8 days ago
•
142
•
1