OpenCulture Collection A multilingual dataset of public domain books and newspapers. β’ 25 items β’ Updated Mar 2 β’ 133
Running on CPU Upgrade Featured 3.1k The Smol Training Playbook π 3.1k The secrets to building world-class LLMs
view article Article Releasing Common Corpus: the largest public domain dataset for training LLMs Mar 20, 2024 β’ 32
Runtime error Featured 141 smolagents LLM leaderboard π 141 A leaderboard for LLMs powering smolagents