Training GUI agents with augmented reasoning data and a tailored post-training recipe
Rui Yang PRO
Ray2333
AI & ML interests
Deep Reinforcement Learning
Recent Activity
updated a dataset 3 days ago
Ray2333/Judge_data_plus published a dataset 3 days ago
Ray2333/Judge_data_plus authored a paper 9 days ago
Orchard: An Open-Source Agentic Modeling Framework