inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4 Text Generation • Updated 5 days ago • 171
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 6 days ago • 183
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 Text Generation • 235B • Updated 6 days ago • 178
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated 6 days ago • 161
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-noise Image-Text-to-Text • 32B • Updated 6 days ago • 129
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-hybrid Image-Text-to-Text • 32B • Updated 6 days ago • 125
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-heuristic Image-Text-to-Text • 32B • Updated 6 days ago • 156
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-noise Image-Text-to-Text • 30B • Updated 6 days ago • 130
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-hybrid Image-Text-to-Text • 30B • Updated 6 days ago • 115
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-heuristic Image-Text-to-Text • 30B • Updated 6 days ago • 106
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-noise Image-Text-to-Text • 28B • Updated 6 days ago • 112
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated 6 days ago • 289
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-heuristic Image-Text-to-Text • 28B • Updated 6 days ago • 118
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-noise Image-Text-to-Text • 26B • Updated 6 days ago • 120
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-hybrid Image-Text-to-Text • 26B • Updated 6 days ago • 124
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-heuristic Image-Text-to-Text • 26B • Updated 6 days ago • 115
inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-noise Image-Text-to-Text • 24B • Updated 6 days ago • 109
inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-hybrid Image-Text-to-Text • 24B • Updated 6 days ago • 146
inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-heuristic Image-Text-to-Text • 24B • Updated 6 days ago • 453