Recent Activity
Papers
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
$\pi_{\texttt{RL}}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora
Reinforcement Learning • 8B • Updated • 279
RLinf/RLinf-OpenVLAOFT-ManiSkill-Base-Main
8B • Updated • 275
RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning • 8B • Updated • 71
RLinf/RLinf-OpenVLAOFT-LIBERO-130
Reinforcement Learning • 8B • Updated • 64 • 2
Models 68
RLinf/WideSeek-R1-4b
Text Generation • 4B • Updated • 10
RLinf/RLinf-Pi05-GSEnv-PutCubeOnPlate-V0-SFT
4B • Updated
RLinf/RLinf-OpenVLAOFT-RoboTwin-RL-move_can_pot
8B • Updated • 8
RLinf/RLinf-OpenVLAOFT-RoboTwin-RL-lift_pot
8B • Updated • 7
RLinf/RLinf-OpenVLAOFT-RoboTwin-SFT-move_can_pot
8B • Updated • 12
RLinf/RLinf-OpenVLAOFT-RoboTwin-SFT-handover_block
8B • Updated • 15
RLinf/RLinf-OpenVLAOFT-RoboTwin-SFT-lift_pot
8B • Updated • 15
RLinf/RLinf-OpenVLAOFT-RoboTwin-RL-handover_block
Updated
RLinf/RLinf-OpenSora-LIBERO-Object
1B • Updated • 4
RLinf/RLinf-OpenSora-LIBERO-Spatial
1B • Updated • 13