nm-testing/llama2.c-stories110M-gsm8k-recipe_w4a16_actorder_weight-compressed 60.5M • Updated Mar 12, 2025 • 502
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-kv_cache-qkv_proj 8B • Updated Mar 10, 2025 • 1
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-q_proj 8B • Updated Mar 10, 2025 • 1
nm-testing/llama2.c-stories42M-gsm8k-quantized-only-uncompressed 58.2M • Updated Feb 12, 2025 • 1.94k