RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 19 days ago • 26.9k • 9
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_hybrid 20B • Updated 11 days ago • 26
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_noise 20B • Updated 11 days ago • 20
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_heuristic 20B • Updated 11 days ago • 20
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_hybrid 22B • Updated 11 days ago • 18
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_noise 22B • Updated 11 days ago • 20
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_heuristic 22B • Updated 11 days ago • 17
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_hybrid 23B • Updated 11 days ago • 21
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_noise 23B • Updated 11 days ago • 18
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_heuristic 23B • Updated 11 days ago • 24
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_hybrid 25B • Updated 11 days ago • 22
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_noise 25B • Updated 11 days ago • 18
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_heuristic 25B • Updated 11 days ago • 22
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_hybrid 26B • Updated 10 days ago • 27
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_noise 26B • Updated 10 days ago • 23
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_heuristic 27B • Updated 10 days ago • 21