lewtun's Collections
Awesome RL datasets
Long-context post-training 🧶
H4
Awesome RLHF
Mistral 7B + UltraChat + Arithmo checkpoints
Hub tools
Gemma RLAIF
Awesome RL datasets
Updated Sep 23
ScaleAI/SWE-bench_Pro • Updated Sep 25 • 731 rows • 11.6k downloads • 33 likes
agentica-org/DeepScaleR-Preview-Dataset • Updated Feb 10 • 40.3k rows • 5.28k downloads • 171 likes
open-r1/DAPO-Math-17k-Processed • Updated Apr 10 • 34.8k rows • 3.93k downloads • 40 likes