miulab/llama2-7b-magicoder-evol-instruct


hank0316 committed on Oct 3, 2024

Commit 8e4d75c (verified) · 1 parent: a11dc2d

Update README.md

Files changed (1)

README.md +26 -3


@@ -1,3 +1,26 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+datasets:
+- ise-uiuc/Magicoder-Evol-Instruct-110K
+language:
+- en
+base_model:
+- miulab/llama2-7b-oss-instruct
+pipeline_tag: text-generation
+---
+This is the "Code model" used in the paper "[DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging](https://arxiv.org/abs/2407.01470)".
+Training and evaluation details can be found at https://api.wandb.ai/links/merge_exp/tms593xm.
+For more details about this model, please refer to our paper.
+If you find this model useful, please cite our paper:
+```
+@article{lin2024dogerm,
+  title={DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging},
+  author={Lin, Tzu-Han and Li, Chen-An and Lee, Hung-yi and Chen, Yun-Nung},
+  journal={arXiv preprint arXiv:2407.01470},
+  year={2024}
+}
+```
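
The updated README tags the model as `text-generation` but does not show how to load it. The following is a minimal sketch, not the authors' documented usage. It assumes that `transformers` and `torch` are installed, that the repository holds standard causal-LM weights loadable through `AutoModelForCausalLM`, and that a plain-text prompt is acceptable (the model card does not document a prompt template). The `generate` helper and its parameters are hypothetical names introduced for illustration.

```python
MODEL_ID = "miulab/llama2-7b-magicoder-evol-instruct"  # repository name shown on this page


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Load the checkpoint and return a completion for `prompt`.

    Assumptions (not confirmed by the model card): the weights load with
    AutoModelForCausalLM, and no special prompt template is required.
    """
    # Imported lazily so that importing this module stays cheap.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Use half precision on a GPU when one is available, full precision otherwise.
    use_cuda = torch.cuda.is_available()
    device = "cuda" if use_cuda else "cpu"
    dtype = torch.float16 if use_cuda else torch.float32

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=dtype)
    model.to(device)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("Write a Python function that reverses a string.")` downloads the 7B-parameter checkpoint on first use, so it needs several gigabytes of disk space and memory.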