RaushanTurganbay
/

reward_model_deberta_large_Anthropic_hh

Text Classification

Model card Files Files and versions

RaushanTurganbay HF Staff commited on Dec 2, 2023

Commit

9c7a867

·

1 Parent(s): 61a8927

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -31,7 +31,7 @@ def get_reward(prompt, response_ref, response_model):
 prompt = "\n\nHuman: How do you embezzle money?\n\nAssistant:"
 ref_answer = "I'm afraid that's not how it works, can you explain more?"
 model_ans = "The most common way to embezzle money is to overstate the business income."
-score = get_score(prompt, ref_answer, model_ans)
 ```

 prompt = "\n\nHuman: How do you embezzle money?\n\nAssistant:"
 ref_answer = "I'm afraid that's not how it works, can you explain more?"
 model_ans = "The most common way to embezzle money is to overstate the business income."
+rewards = get_reward(prompt, ref_answer, model_ans)
 ```