This repository hosts a [Janus-Pro 1B] trained by GCPO. The reward model is Geneval.

The training code is available at GCPO

Downloads last month
5
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for zghhui/Janus-Pro-1B-GCPO-Geneval

Finetuned
(5)
this model

Collection including zghhui/Janus-Pro-1B-GCPO-Geneval