view article Article Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era By frimelle and 1 other • Aug 20 • 15
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face Jul 29 • 190
Comma v0.1 Artifacts Collection A collection of artifacts related to Comma v0.1—a 7B parameter LLM trained on public domain and openly licensed text • 3 items • Updated Jun 6 • 4
Common Pile v0.1 Filtered Data Collection An LLM pre-training dataset produced by filtering and deduplicating the raw text collected in the Common Pile v0.1 • 31 items • Updated Jun 6 • 19
Common Pile v0.1 Raw Data Collection 8TB of public domain and openly licensed text • 30 items • Updated Aug 14 • 20
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6 • 33
BitNet Collection 🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated May 1 • 53
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • Jan 20 • 52
Community Projects Collection Datasets, models, and spaces created by the community • 19 items • Updated Sep 23 • 1
NanoBEIR 🍺 Collection A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 22
Positional Datasets Collection Datasets where each row is a chess position • 4 items • Updated 22 days ago • 8
Rated Games Dataset Collection Datasets where each row is a rated chess game • 10 items • Updated Jul 10 • 8