SVRL/verl-scalable-0908-8k_Qwen3-8B-Base_general-reasoner-fineweb-filter400-webinstruct Updated Sep 9
SVRL/verl-scalable-0831_general-reasoner-fineweb-webinstruct_Qwen3-4B-Base-step640 4B • Updated Sep 2 • 2
SVRL/verl-scalable-0827_batch128_ppomini32_general-reasoner-megamath_Qwen3-4B-Base-step600 4B • Updated Aug 30 • 1
SVRL/verl-scalable-0707_temp0.7_Qwen3-14B-RL-Base-200k-600step-0707_webinstruct-verified Updated Jul 11
SVRL/verl-scalable-0707_temp1.0_Qwen3-14B-RL-Base-200k-600step-0707_webinstruct-verified Updated Jul 10