view article Article Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models tiiuae • Jul 4, 2025 • 11
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance tiiuae • May 21, 2025 • 39
view article Article Falcon-Arabic: A Breakthrough in Arabic Language Models tiiuae • May 21, 2025 • 39
view article Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. tiiuae • May 15, 2025 • 36
view article Article Welcome Falcon Mamba: The first strong attention-free 7B model +4 JingweiZuo, yellowvm, DhiyaEddine, IChahed, ybelkada, Gkunsch • Aug 12, 2024 • 113
view article Article Welcome Llama 3 - Meta's new open LLM +3 philschmid, osanseviero, pcuenq, ybelkada, lvwerra • Apr 18, 2024 • 295
view article Article GaLore: Advancing Large Model Training on Consumer-grade Hardware +7 Titus-von-Koeller, jiaweizhao, mdouglas, hiyouga, ybelkada, muellerzr, amyeroberts, smangrul, BenjaminB • Mar 20, 2024 • 32
view article Article Quanto: a PyTorch quantization backend for Optimum +1 dacorvo, ybelkada, marcsun13 • Mar 18, 2024 • 45
view article Article Fine-Tuning Gemma Models in Hugging Face +2 svaibhav, alanwaketan, ybelkada, ArthurZ • Feb 23, 2024 • 46
view article Article Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face +5 lewtun, philschmid, osanseviero, pcuenq, olivierdehaene, lvwerra, ybelkada • Dec 11, 2023 • 14
view article Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.12k
view article Article Overview of natively supported quantization schemes in 🤗 Transformers +3 ybelkada, marcsun13, IlyasMoutawwakil, clefourrier, fxmarty • Sep 12, 2023 • 13
view article Article Making LLMs lighter with AutoGPTQ and transformers +4 marcsun13, fxmarty, PanEa, qwopqwop, ybelkada, TheBloke • Aug 23, 2023 • 64
view article Article The Falcon has landed in the Hugging Face ecosystem +6 lvwerra, ybelkada, smangrul, lewtun, olivierdehaene, pcuenq, philschmid, osanseviero • Jun 5, 2023 • 17
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 ybelkada, timdettmers, artidoro, sgugger, smangrul • May 24, 2023 • 180
view article Article Introducing RWKV - An RNN with the advantages of a transformer +2 BlinkDL, Hazzzardous, sgugger, ybelkada • May 15, 2023 • 25