Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 26 days ago • 133
view article Article How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio Aug 14 • 25
Running 3.6k The Ultra-Scale Playbook 🌌 3.6k The ultimate guide to training LLM on large GPU Clusters
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21 • 25.7k • • 1.26k
dbmdz/bert-large-cased-finetuned-conll03-english Token Classification • 0.3B • Updated Sep 6, 2023 • 1.63M • • 92