models and datasets used in the SparkleRL&SPARKLE project
-
sparkle-reasoning/SparkleRL-7B-Stage2-mix
Text Generation • 8B • Updated -
sparkle-reasoning/SparkleRL-7B-Stage2-hard
Text Generation • 8B • Updated -
sparkle-reasoning/SparkleRL-7B-Stage2-aug
Text Generation • 8B • Updated • 1 • 3 -
sparkle-reasoning/SparkleRL-7B-Stage1
Text Generation • 8B • Updated • 3 • 2