thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated 1 day ago • 16
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Persona-Mixed Text Generation • 8B • Updated 1 day ago • 12
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Sft-Persona-Mixed Text Generation • 8B • Updated 5 days ago • 22
thwannbe/Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill Text Generation • 8B • Updated 5 days ago • 19