Model Summary

UnifiedReward-Edit-qwen3vl-4b is a unified reward model for both Text-to-Image and Image-to-Image generation. For the image editing reward task, it supports three evaluation modes:

  1. Pairwise Rank: directly judge which of two edited images is better.

  2. Pairwise Score: assign a separate score to each image in a pair.

  3. Pointwise Score: rate a single image on two axes: instruction-following and overall image quality.
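
How the three modes relate can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `PointwiseScore`, `rank_from_pair_scores`, and `combine_pointwise`, the tie margin, and the weighted average are all hypothetical and do not come from the model card. The model's actual prompts and output format are defined by the inference code in UnifiedReward-Edit/.

```python
from dataclasses import dataclass
from typing import Literal

# Hypothetical verdict type for a pairwise comparison (not the model's output format).
Preference = Literal["first", "second", "tie"]


@dataclass(frozen=True)
class PointwiseScore:
    """Two-axis rating of a single edited image (field names are assumptions)."""
    instruction_following: float
    image_quality: float


def rank_from_pair_scores(score_first: float, score_second: float,
                          margin: float = 0.0) -> Preference:
    """Reduce a Pairwise Score result to a Pairwise Rank style verdict.

    `margin` is an assumed tolerance: score gaps no larger than it count as a tie.
    """
    if score_first - score_second > margin:
        return "first"
    if score_second - score_first > margin:
        return "second"
    return "tie"


def combine_pointwise(score: PointwiseScore,
                      weight_instruction: float = 0.5) -> float:
    """Collapse the two pointwise axes into one scalar reward.

    The weighted average is an illustrative choice, not the authors' method.
    """
    if not 0.0 <= weight_instruction <= 1.0:
        raise ValueError("weight_instruction must lie in [0, 1]")
    return (weight_instruction * score.instruction_following
            + (1.0 - weight_instruction) * score.image_quality)
```

For example, `rank_from_pair_scores(4.0, 2.5)` prefers the first image, and `combine_pointwise(PointwiseScore(4.0, 2.0))` averages the two axes with equal weight. Whether a scalar reward or the separate axes are more useful depends on the downstream training setup.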

The image editing reward inference code is available in the UnifiedReward-Edit/ directory, while the T2I inference code is unchanged from previous models. The editing training data is preprocessed from EditScore, EditReward, and Pico-Nano-Banana. We sincerely appreciate all contributors!

For further details, please refer to the following resources:

Citation

@article{unifiedreward,
  title={Unified reward model for multimodal understanding and generation},
  author={Wang, Yibin and Zang, Yuhang and Li, Hao and Jin, Cheng and Wang, Jiaqi},
  journal={arXiv preprint arXiv:2503.05236},
  year={2025}
}

Model size: 4B params (Safetensors), tensor type BF16