Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ottehack 's Collections
Pretrain Data
Pretrain Data Utils
Common Reasoning
Science Data
Coder SFT Data
Coder SFT Data (Long-COT )
Coder DPO
Math SFT Data
Math RL Data
WebPage Related
Funny Questions (Long-COT)
Coder Models
Reasoning Model

Pretrain Data Utils

updated Aug 13, 2025
Upvote
-

  • mlfoundations/fasttext-oh-eli5

    Updated Aug 1, 2024 • 29

  • hkust-nlp/preselect-fasttext-classifier

    Text Classification • Updated Mar 6, 2025 • 46 • 8

  • HuggingFaceFW/fineweb-edu-classifier

    Text Classification • 0.1B • Updated Nov 17, 2024 • 28.4k • • 203
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs