merve/Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v
