llmfan46/MiniMax-M2.7-ultra-uncensored-heretic-GGUF Text Generation • 229B • Updated 1 day ago • 3.52k • 6
llmfan46/MiniMax-M2.7-BF16-ultra-uncensored-heretic Text Generation • 229B • Updated 1 day ago • 166 • 4
Fixed Chat Templates for Qwen 3.5 & 3.6 Collection Rewritten Jinja templates fixing 5 bugs in official Qwen 3.5/3.6. Works in LM Studio, llama.cpp, MLX, vLLM. • 1 item • Updated 17 days ago • 3
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 5 days ago • 140
SpecDrift Collection Models released as a part of Attention-Drift Paper, trained for deployment on production • 2 items • Updated 7 days ago • 2
Gemma 4 Assistant GGUF Collection Gemma 4 MTP assistant drafters as GGUF (F16/Q8_0/Q5_K_M/Q4_K_M/Q4_K_S). Speculative-decoding heads for the atomic-llama-cpp-turboquant fork. • 4 items • Updated 10 days ago • 10
llmfan46/Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-NVFP4-Experts-Only Image-Text-to-Text • 35B • Updated 9 days ago • 3.5k • 3