Hugging Face

Bo Wang

Musicode
AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago
Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections
submitted a paper about 9 hours ago
The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping
upvoted a paper about 11 hours ago
The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Organizations

None yet

upvoted a paper about 9 hours ago

Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections

Paper • 2507.00018 • Published Jun 15, 2025 • 1
upvoted a paper about 11 hours ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 2 days ago • 80
upvoted a collection 4 days ago

MOSS-VL

Collection
2 items • Updated 6 days ago • 48
upvoted a paper about 1 month ago

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published Mar 5 • 56
upvoted a paper 6 months ago

Sparser Block-Sparse Attention via Token Permutation

Paper • 2510.21270 • Published Oct 24, 2025 • 25
upvoted a paper 11 months ago

REARANK: Reasoning Re-ranking Agent via Reinforcement Learning

Paper • 2505.20046 • Published May 26, 2025 • 18