Hwang yechan PRO
SoonOk
AI & ML interests
AI, ML, Reinforcement Learning, Deep RL, Deep Learning
Recent Activity
upvoted a paper 26 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
upvoted a paper about 1 month ago
OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
updated a model about 2 months ago
SoonOk/AuxKTO