inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-hybrid-per-tensor 6B • Updated Apr 22 • 31
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-heuristic-per-tensor 6B • Updated Apr 22 • 30
inference-optimization/Llama-3.1-8B-Instruct-5-bits-mode-heuristic-per-tensor 5B • Updated Apr 22 • 33
inference-optimization/Llama-3.2-3B-Instruct-7-bits-mode-heuristic-per-tensor 3B • Updated Apr 22 • 14
inference-optimization/Llama-3.2-3B-Instruct-6.5-bits-mode-hybrid-per-tensor 3B • Updated Apr 22 • 14
inference-optimization/Llama-3.2-3B-Instruct-6.5-bits-mode-heuristic-per-tensor 3B • Updated Apr 22 • 14
inference-optimization/Llama-3.2-3B-Instruct-6-bits-mode-heuristic-per-tensor 3B • Updated Apr 22 • 14
inference-optimization/Llama-3.2-3B-Instruct-5.5-bits-mode-hybrid-per-tensor 3B • Updated Apr 22 • 13
inference-optimization/Llama-3.2-3B-Instruct-5.5-bits-mode-heuristic-per-tensor 3B • Updated Apr 22 • 13
inference-optimization/Llama-3.2-3B-Instruct-5-bits-mode-heuristic-per-tensor 3B • Updated Apr 22 • 11
inference-optimization/Llama-3.2-1B-Instruct-7-bits-mode-heuristic-per-tensor 1B • Updated Apr 22 • 13
inference-optimization/Llama-3.2-1B-Instruct-6.5-bits-mode-hybrid-per-tensor 1B • Updated Apr 22 • 15
inference-optimization/Llama-3.2-1B-Instruct-6.5-bits-mode-heuristic-per-tensor 1B • Updated Apr 22 • 14
inference-optimization/Llama-3.2-1B-Instruct-6-bits-mode-heuristic-per-tensor 1B • Updated Apr 22 • 13