Update README.md
Browse files
README.md
CHANGED
|
@@ -15,15 +15,12 @@ base_model:
|
|
| 15 |
|
| 16 |
|
| 17 |
# Unified-Reward-7B
|
| 18 |
-
We are actively gathering feedback from the community to improve our models. **We welcome your input and encourage you to stay updated through our repository
|
| 19 |
-
|
| 20 |
-
|
| 21 |
-
[2025/4/15] 🔥🔥 We updated the `UnifiedReward-7B` to enhance its generalization and performance, incorporating valuable feedback from the community.
|
| 22 |
|
| 23 |
|
| 24 |
## Model Summary
|
| 25 |
|
| 26 |
-
`Unified-Reward-7b` is the first unified reward model for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
|
| 27 |
|
| 28 |
For further details, please refer to the following resources:
|
| 29 |
- 📰 Paper: https://arxiv.org/pdf/2503.05236
|
|
|
|
| 15 |
|
| 16 |
|
| 17 |
# Unified-Reward-7B
|
| 18 |
+
We are actively gathering feedback from the community to improve our models. **We welcome your input and encourage you to stay updated through our repository**!
|
|
|
|
|
|
|
|
|
|
| 19 |
|
| 20 |
|
| 21 |
## Model Summary
|
| 22 |
|
| 23 |
+
`Unified-Reward-7b` is the first unified reward model for multimodal understanding and generation assessment based on [LLaVA-OneVision-7b](https://huggingface.co/lmms-lab/llava-onevision-qwen2-7b-ov), enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
|
| 24 |
|
| 25 |
For further details, please refer to the following resources:
|
| 26 |
- 📰 Paper: https://arxiv.org/pdf/2503.05236
|