This repository hosts a [Janus-Pro 1B] trained by GCPO. The reward model is Geneval.
The training code is available at GCPO
Files info
Base model