CodeGoat24
/

UnifiedReward-7b

Model card Files Files and versions

CodeGoat24 commited on 7 days ago

Commit

7d8a9fc

·

verified ·

1 Parent(s): e3a5ddd

Update README.md

Files changed (1) hide show

README.md +2 -5

README.md CHANGED Viewed

@@ -15,15 +15,12 @@ base_model:
 # Unified-Reward-7B
-We are actively gathering feedback from the community to improve our models. **We welcome your input and encourage you to stay updated through our repository**!!
-[2025/4/15] 🔥🔥 We updated the `UnifiedReward-7B` to enhance its generalization and performance, incorporating valuable feedback from the community.
 ## Model Summary
-`Unified-Reward-7b` is the first unified reward model for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
 For further details, please refer to the following resources:
 - 📰 Paper: https://arxiv.org/pdf/2503.05236

 # Unified-Reward-7B
+We are actively gathering feedback from the community to improve our models. **We welcome your input and encourage you to stay updated through our repository**!
 ## Model Summary
+`Unified-Reward-7b` is the first unified reward model for multimodal understanding and generation assessment based on [LLaVA-OneVision-7b](https://huggingface.co/lmms-lab/llava-onevision-qwen2-7b-ov), enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
 For further details, please refer to the following resources:
 - 📰 Paper: https://arxiv.org/pdf/2503.05236