Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
inference-optimization
's Collections
test-models
Granite 4 Small and Tiny Quantized Models
NVIDIA-Nemotron-3-Nano-30B-A3B Quantized Models
Qwen3-Next-80B-A3B Quantized Models
Mixed Precision Models
KV Cache Quantization
test-models
updated
2 days ago
Upvote
-
inference-optimization/test_tencentbac_fastmtp
Updated
2 days ago
•
5
inference-optimization/test_qwen3_next_mtp
Updated
2 days ago
•
5
Upvote
-
Share collection
View history
Collection guide
Browse collections