agurung/Qwen2.5-7B-Instruct-1M-NRL-NCP-GRPO-PPL-UNBOUNDED Text Generation • 8B • Updated Jul 28, 2025 • 3
agurung/Qwen2.5-7B-Instruct-1M-NRL-NCP-GRPO-NLL-PIECEWISE-REWARD_20ep Text Generation • 8B • Updated Jul 25, 2025 • 3
agurung/Qwen2.5-7B-Instruct-1M-NRL-NCP-GRPO-NLL-PIECEWISE-REWARD Text Generation • 8B • Updated Jul 22, 2025 • 4
agurung/Qwen2.5-7B-Instruct-1M-NRL-NCP-GRPO-NLL-UNBOUNDED Text Generation • 8B • Updated Jul 15, 2025 • 8
agurung/character_sheet_entailment_model___mistral7bv2 Text Generation • 7B • Updated Oct 1, 2024 • 7