Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
26
Follow
AWS Inferentia and Trainium
143
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
591
main
optimum-neuron-cache
/
inference-cache-config
/
trn1
9.76 kB
4 contributors
History:
3 commits
dacorvo
HF Staff
Update inference-cache-config/trn1/mixtral.json
8343560
verified
14 days ago
granite.json
1.41 kB
clean-up Trainium 1 cached configurations
15 days ago
llama3.json
2.82 kB
clean-up Trainium 1 cached configurations
15 days ago
llama4.json
1.04 kB
clean-up Trainium 1 cached configurations
15 days ago
mixtral.json
760 Bytes
Update inference-cache-config/trn1/mixtral.json
14 days ago
phi4.json
601 Bytes
clean-up Trainium 1 cached configurations
15 days ago
qwen3-moe.json
575 Bytes
clean-up Trainium 1 cached configurations
15 days ago
qwen3.json
2.13 kB
clean-up Trainium 1 cached configurations
15 days ago
smollm3.json
430 Bytes
clean-up Trainium 1 cached configurations
15 days ago