lindsaybordier/Qwen3-0.6B-DPO_argilla_ultrafeedback-binarized-preferences_keywords-filtered • Text Generation • 0.6B • Updated May 25