penfever/rl__24GPU_base__exp_rpt_pymethods2test-large__r2egym-nl2bash-stack__40-0 Viewer • Updated about 2 hours ago • 12.4k
laion/rl__24GPU_base__exp_rpt_pymethods2test-large__r2egym-nl2bash-stack__40-0-20-8B 8B • Updated about 2 hours ago
laion/tasrep-a1mfc-gfistaqc-dev1-scaff-maxeps-swes-r2eg-32b__Qwen3-32B Text Generation • 33B • Updated about 5 hours ago
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning Paper • 2605.00347 • Published 5 days ago • 11
laion/tasrep-a1mfc-gfistaqc-dev1-scaff-maxeps-swes-r2eg-32b-4000__Qwen3-32B 33B • Updated 1 day ago • 12