NM Testing

company

Recent Activity

nm-autobot updated a model about 18 hours ago

nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Static-Asym-e2e

nm-autobot updated a model about 18 hours ago

nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Asym-e2e

nm-autobot updated a model about 19 hours ago

nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-e2e

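nm-testing's models (511)

Each entry below gives the model name, followed by its task (where shown), parameter count, last update date, download count, and like count (where shown).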

nm-testing/TinyLlama-1.1B-Chat-v1.0-FP4

0.7B • Updated May 9, 2025 • 93 • 1

nm-testing/Llama-3_1-Nemotron-Ultra-253B-v1-FP8-dynamic

253B • Updated Apr 21, 2025 • 17 • 2

nm-testing/Mistral-Small-3.1-24B-Instruct-2503-FP8

Image-Text-to-Text • Updated Apr 17, 2025 • 8 • 4

nm-testing/DeepSeek-Coder-V2-Lite-Instruct-quantized.w8a8

16B • Updated Apr 16, 2025 • 4

nm-testing/l4-scout-int4-debug

20B • Updated Apr 16, 2025 • 3

nm-testing/pixtral-12b-FP8-dynamic

Image-Text-to-Text • Updated Apr 11, 2025 • 179 • 1

nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-Asym-Updated-ActOrder

0.3B • Updated Apr 10, 2025 • 4.92k

nm-testing/TinyLlama-1.1B-Chat-v1.0-awq-group128-asym256

0.3B • Updated Apr 10, 2025 • 3

nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-Asym-Updated-Channel

0.3B • Updated Apr 9, 2025 • 2

nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-Asym-Updated

0.3B • Updated Apr 9, 2025 • 1

nm-testing/Llama-2-7b-hf-gsm8k-quant_w4a16_sym-uncompressed

7B • Updated Mar 27, 2025 • 2

nm-testing/Llama-2-7b-hf-gsm8k-quant_w4a16_sym-compressed

1B • Updated Mar 27, 2025 • 3

nm-testing/Llama-2-7b-hf-gsm8k-gptq_w4a16_sym-uncompressed

7B • Updated Mar 27, 2025 • 2

nm-testing/Llama-2-7b-hf-gsm8k-gptq_w4a16_sym-compressed

1B • Updated Mar 27, 2025 • 2

nm-testing/Llama-2-7b-hf-gsm8k-awq_w4a16_sym-uncompressed

7B • Updated Mar 27, 2025 • 5

nm-testing/Llama-2-7b-hf-gsm8k-awq_w4a16_sym-compressed

1B • Updated Mar 27, 2025 • 1

nm-testing/Llama-2-7b-hf-gsm8k-awq_gptq_sym-uncompressed

7B • Updated Mar 27, 2025 • 2

nm-testing/Llama-2-7b-hf-gsm8k-awq_gptq_sym-compressed

1B • Updated Mar 27, 2025 • 2

nm-testing/Mixtral-8x7B-Instruct-v0.1-FP8-Dynamic

47B • Updated Mar 26, 2025 • 2

nm-testing/Llama-3.1-8B-Instruct-W4A16-G128-shared-pipeline

2B • Updated Mar 16, 2025 • 5

nm-testing/Qwen2-VL-2B-Instruct-FP8-dynamic-cli

2B • Updated Mar 14, 2025 • 2

nm-testing/Qwen2-VL-2B-Instruct-FP8_DYNAMIC

Image-Text-to-Text • 2B • Updated Mar 14, 2025 • 3

nm-testing/whisper-large-v3-quantized.w4a16

0.3B • Updated Mar 13, 2025 • 2

nm-testing/whisper-large-v3-quantized.w8a8_sq

2B • Updated Mar 13, 2025 • 3

nm-testing/whisper-large-v3-quantized.w8a8

2B • Updated Mar 12, 2025 • 2

nm-testing/llama2.c-stories110M-gsm8k-fp8_dynamic-compressed

0.1B • Updated Mar 12, 2025 • 10.2k

nm-testing/llama2.c-stories110M-gsm8k-recipe_w4a16_actorder_weight-compressed

60.5M • Updated Mar 12, 2025 • 10.4k

nm-testing/Llama-3.2-1B-Instruct-W4A16-uncompressed-mse-hadamard

5B • Updated Mar 11, 2025 • 3

nm-testing/llama2.c-stories15M

Text Generation • 24.4M • Updated Mar 10, 2025 • 7.63k

nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-kv_cache-qkv_proj

8B • Updated Mar 10, 2025 • 3