aws-neuron/optimum-neuron-cache
License: apache-2.0
Branch: main
4 contributors
History: 14068 commits
Latest commit: dacorvo (HF Staff), "Synchronizing local compiler cache.", f4a925d, verified, less than a minute ago
Name | Size | Last commit | Updated
inference-cache-config | - | Update inference-cache-config/trn1/llama4.json | 26 days ago
neuronxcc-2.19.8089.0+8ab9f450 | - | Synchronizing local compiler cache. | 3 months ago
neuronxcc-2.20.9961.0+0acef03a | - | Synchronizing local compiler cache. | 3 months ago
neuronxcc-2.21.18209.0+043b1bf7 | - | Synchronizing local compiler cache. | 27 days ago
neuronxcc-2.21.33363.0+82129205 | - | Synchronizing local compiler cache. | less than a minute ago
neuronxcc-2.22.12471.0+b4a00d10 | - | Synchronizing local compiler cache. | 27 days ago
neuronxcc-2.23.6484.0+3b612583 | - | Synchronizing local compiler cache. | 2 days ago
.gitattributes | 2.07 MB | Synchronizing local compiler cache. | about 21 hours ago
README.md | 1.27 kB (Safe) | Add SageMaker deployment instructions | almost 2 years ago