nm-testing/Qwen3-30B-A3B-Fp8
31B
•
Updated
nm-testing/Llama-2-7b-hf-weight-input-quant-compressed
24.4M
•
Updated
•
17
nm-testing/Llama-2-7b-hf-weight-input-quant-uncompressed
24.4M
•
Updated
nm-testing/Qwen3-30B-A3B-awq-w4a16-g128-sym
5B
•
Updated
•
1
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-0602-v2
nm-testing/Llama-3.1-8B-Instruct-NVFP4A16-0602
5B
•
Updated
•
1
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-0602
5B
•
Updated
•
13
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-0531-v3
5B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4himBHs0531-v3
5B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-0531-v2
5B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-0531
5B
•
Updated
nm-testing/gemma-3-4b-it-quantized.w8a8_previous
29B
•
Updated
nm-testing/gemma-3-4b-it-quantized.w8a8_temp
5B
•
Updated
nm-testing/Mistral-Small-3.1-24B-Instruct-2503-W4A16-G128
4B
•
Updated
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-updated-v2
5B
•
Updated
nm-testing/Llama-3.1-8B-Instruct-NVFP4A16-temp2
5B
•
Updated
nm-testing/Llama-3.1-8B-Instruct-NVFP4-v4-temp
5B
•
Updated
nm-testing/Llama-3.1-8B-Instruct-NVFP4A16-temp
5B
•
Updated
•
6
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-v5
0.7B
•
Updated
nm-testing/DeepSeek-V2-Lite-W8A8-Dynamic-Per-Token
16B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-Updated4
0.7B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-Updated3
0.7B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-Updated2
0.7B
•
Updated
nm-testing/Devstral-Small-2505-FP8-dynamic
Text Generation
•
24B
•
Updated
•
80
•
1
nm-testing/Mixtral-8x7B-Instruct-v0.1-W8A8-updated-smoothquant
47B
•
Updated
nm-testing/Llama-4-Maverick-17B-128E-Instruct-for-quant
402B
•
Updated
•
8
nm-testing/Sparse-Llama-3.1-8B-2of4-tldr
Text Generation
•
5B
•
Updated
nm-testing/DeepSeek-Coder-V2-Lite-Instruct-W8A8-smoothquant
16B
•
Updated
nm-testing/DeepSeek-Coder-V2-Lite-Instruct-W8A8-No-smoothquant
16B
•
Updated
•
3
nm-testing/Mixtral-8x7B-Instruct-v0.1-W8A8-No-Smoothquant
47B
•
Updated