koutch/qwen_qwen3-instruct-4b_train_grpo_v2_train_code Text Generation • 4B • Updated about 6 hours ago
koutch/qwenb_falcon_qwen3-8b_train_grpo_v1_2.json Text Generation • 8B • Updated about 22 hours ago • 35
koutch/qwen_falcon_qwen3-instruct-4b_train_grpo_v1_2.json Text Generation • 4B • Updated about 23 hours ago • 30
koutch/qwenb_falcon_6.json_train_dpo_v1_2.json Text Generation • 8B • Updated about 23 hours ago • 34