The Smol Training Playbook 📚 • The secrets to building world-class LLMs • Featured • 2.38k
Article • Exploring Environments Hub: Your Language Model needs better (open) environments to learn • Sep 4 • 28
meta-llama/Llama-3.1-8B-Instruct Text Generation • 8B • Updated Sep 25, 2024 • 5.13M • 4.99k
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 1 day ago • 61
DiLoCo: Distributed Low-Communication Training of Language Models Paper • 2311.08105 • Published Nov 14, 2023 • 16
Contra (Bottleneck T5) Collection Text autoencoders capable of embedding and generating text in a fixed-size latent space, useful for embeddings and latent space text editing. • 4 items • Updated Oct 3, 2023 • 28