thwannbe/Llama-3.1-8B-Instruct-GSM8K-RLVR-Distill-Persona-Mixed Text Generation • 8B • Updated about 18 hours ago
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated about 18 hours ago
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Persona-Mixed Text Generation • 8B • Updated about 19 hours ago
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Sft-Persona-Mixed Text Generation • 8B • Updated 5 days ago • 22
thwannbe/Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill Text Generation • 8B • Updated 5 days ago • 19