phuongntc/qwen3_06b_grpo_noSFT_multievalsumviet2_VR_dropR Text Generation • Updated about 15 hours ago
phuongntc/qwen3_06b_grpo_noSFT_multievalsumviet2_VR_DropC Text Generation • Updated about 15 hours ago
phuongntc/qwen3_06b_grpo_noSFT_multievalsumviet2_VR_DropF Text Generation • Updated about 16 hours ago
phuongntc/qwen3_06b_grpo_noSFT_multievalsumviet2_nopenalty Text Generation • Updated 22 days ago • 16
phuongntc/qwen3_0.6b_ppo_penalty_multievalsumviet2_fix1000 Text Generation • 0.6B • Updated 26 days ago • 18
phuongntc/qwen3_0.6b_ppo_penalty_multievalsumviet2_final Text Generation • 0.6B • Updated 27 days ago • 21