nm-testing/Qwen3-Next-80B-A3B-Instruct-NVFP4
Updated
•
969
•
2
nm-testing/Llama-3.2-1B-Instruct-quip-w4a16
0.8B
•
Updated
•
1.76k
nm-testing/Llama-3.2-1B-Instruct-group-activations
1B
•
Updated
•
2
nm-testing/qwen3-80b-fp8-dynamic
80B
•
Updated
•
1
nm-testing/gemma-3-4b-it-s_q-W4A8-G512
5B
•
Updated
nm-testing/llama3.3-70B-speculators.09-10-2025-eagle3
2B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-quipv-w4a16
0.7B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-quip
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2-online
0.7B
•
Updated
nm-testing/Qwen3-Coder-30B-A3B-Instruct-W4A16-awq
5B
•
Updated
•
526
•
3
nm-testing/TinyLlama-1.1B-Chat-v1.0-MXFP4A16
0.6B
•
Updated
•
1
nm-testing/llama4-scout-17b-eagle3-dummy-drafter
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2R4-w4a16
0.7B
•
Updated
•
1.78k
nm-testing/Llama-3.1-8B-Instruct-quip-w4a16
2B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3-FP8_asym-attn
8B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3
8B
•
Updated
nm-testing/gemma-3n-2b-quantized.w4a16-test
4B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-FP8-Dynamic
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-FP8-Dynamic
0.8B
•
Updated
nm-testing/gpt-oss-20b-BF16-linearized
nm-testing/gpt-oss-20b-BF16-W4A16-G128
Updated
nm-testing/gptoss-NVFP4A16
21B
•
Updated
•
6
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-hadamard-w4a16
0.7B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-hadamard-w4a16
0.7B
•
Updated
•
1
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-eye-w4a16
0.7B
•
Updated
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-eye-w4a16
0.7B
•
Updated
•
3
nm-testing/Meta-Llama-3-8B-Instruct-quip-w4a16
nm-testing/gemma-3n-E2B-it-W4A16-G128
4B
•
Updated
•
1
nm-testing/block-quantization-fp8-qwen3-0.6B
0.8B
•
Updated
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
•
1.0B
•
Updated
•
1