Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lewtun 's Collections
β€” Awesome RL datasets πŸ“ˆ β€”
β€” Long-context post-training 🧢 β€”
H4
Awesome RLHF
Mistral 7B + UltraChat + Arithmo checkpoints
Hub tools
Gemma RLAIF

β€” Awesome RL datasets πŸ“ˆ β€”

updated Sep 23
Upvote
-

  • ScaleAI/SWE-bench_Pro

    Viewer β€’ Updated Sep 25 β€’ 731 β€’ 11.6k β€’ 33

  • agentica-org/DeepScaleR-Preview-Dataset

    Viewer β€’ Updated Feb 10 β€’ 40.3k β€’ 5.28k β€’ 171

  • open-r1/DAPO-Math-17k-Processed

    Viewer β€’ Updated Apr 10 β€’ 34.8k β€’ 3.93k β€’ 40
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs