amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-cpu • Text Generation • Updated Jan 30
Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA • Collection • ONNX Runtime generate() API-based models quantized by Quark and optimized for the Ryzen AI Strix Point NPU • 8 items • Updated Jun 16
Open LLM Leaderboard 🏆 13.7k • Track, rank and evaluate open LLMs and chatbots