— Awesome RL datasets 📈 — - a lewtun Collection

lewtun 's Collections

— Awesome RL datasets 📈 —

— Long-context post-training 🧶 —

H4

Mistral 7B + UltraChat + Arithmo checkpoints

— Awesome RL datasets 📈 —

updated Sep 23