kevinpro commited on
Commit
acd3d88
·
verified ·
1 Parent(s): 926beb4

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md CHANGED
@@ -1,3 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
1
  # R-PRM: Reasoning-Driven Process Reward Modeling
2
 
3
  <p align="center">
 
1
+ ---
2
+ license: apache-2.0
3
+ language: zh
4
+ tags:
5
+ - reinforcement-learning
6
+ - reward-model
7
+ - dpo
8
+ model_name: R-PRM-7B-DPO
9
+ pipeline_tag: text-generation
10
+ ---
11
+
12
  # R-PRM: Reasoning-Driven Process Reward Modeling
13
 
14
  <p align="center">