Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Inference Optimization

community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

krishnateja95  updated a model 1 day ago
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic
krishnateja95  published a model 1 day ago
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic
kylesayrs  updated a model 1 day ago
inference-optimization/Kimi-K2-Instruct-0905-BF16-FP8-BLOCK
View all activity

Alexandre Marques's profile picture Megan Flynn's profile picture Dipika's profile picture Krishna Teja Chitty-Venkata's profile picture Helen Zhao's profile picture Fynn Schmitt-Ulms's profile picture Neural Magic Research's profile picture Chibueze Ukachi's profile picture Eldar Kurtić's profile picture Rahul Tuli's profile picture Kyle Sayers's profile picture Brian Dellabetta's profile picture Linghao Kong's profile picture Michael Goin's profile picture

inference-optimization 's models 97

inference-optimization/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Head

33B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Tensor

71B • Updated Dec 4, 2025

inference-optimization/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Head

71B • Updated Dec 4, 2025

inference-optimization/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

71B • Updated Dec 4, 2025 • 4

inference-optimization/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head

71B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor

8B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

8B • Updated Dec 4, 2025 • 3
  • Previous
  • 1
  • 2
  • 3
  • 4
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs