DistilQwen Collection Two students, one methodology. 30B teacher → 1.7B and 0.6B via proof-weighted distillation + legal SFT. Six models, Apache 2.0. • 9 items • Updated 1 day ago
DistilQwen Collection Two students, one methodology. 30B teacher → 1.7B and 0.6B via proof-weighted distillation + legal SFT. Six models, Apache 2.0. • 9 items • Updated 1 day ago
DistilQwen Collection Two students, one methodology. 30B teacher → 1.7B and 0.6B via proof-weighted distillation + legal SFT. Six models, Apache 2.0. • 9 items • Updated 1 day ago
reaperdoesntknow/Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT-GGUF Text Generation • 0.8B • Updated 5 days ago • 198
reaperdoesntknow/Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT-GGUF Text Generation • 0.8B • Updated 5 days ago • 198
reaperdoesntknow/Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT Text Generation • 0.8B • Updated 5 days ago • 33 • 1
reaperdoesntknow/Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT Text Generation • 0.8B • Updated 5 days ago • 33 • 1