NousResearch
/

Nous-Hermes-2-Mixtral-8x7B-DPO

Text Generation

text-generation-inference

Model card Files Files and versions

teknium commited on Jan 15, 2024

Commit

2eb41a4

·

verified ·

1 Parent(s): 51b7f98

Update README.md

Files changed (1) hide show

README.md +4 -10

README.md CHANGED Viewed

@@ -38,7 +38,7 @@ This is the SFT + DPO version of Mixtral Hermes 2, we will also be providing an
     - GPT4All
     - AGIEval
     - BigBench
-    - TruthfulQA
 3. [Prompt Format](#prompt-format)
 4. [Inference Example Code](#inference-code)
 5. [Quantized Models](#quantized-models)
@@ -131,14 +131,6 @@ BigBench:
 ```
 Average: 49.70
-TruthfulQA:
-```
-|    Task     |Version|Metric|Value |   |Stderr|
-|-------------|------:|------|-----:|---|-----:|
-|truthfulqa_mc|      1|mc1   |0.4162|±  |0.0173|
-|             |       |mc2   |0.5783|±  |0.0151|
-```
 ## GPT4All
@@ -148,9 +140,11 @@ TruthfulQA:
 ## BigBench Reasoning Test
-## TruthfulQA:
 # Prompt Format

     - GPT4All
     - AGIEval
     - BigBench
+    - Comparison to Mixtral-Instruct
 3. [Prompt Format](#prompt-format)
 4. [Inference Example Code](#inference-code)
 5. [Quantized Models](#quantized-models)
 ```
 Average: 49.70
 ## GPT4All
 ## BigBench Reasoning Test
+## Comparison to Mixtral Instruct:
+Our benchmarks show gains in many benchmarks against Mixtral Instruct v0.1, on average, beating the flagship Mixtral model.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/TuB0kC6rLmCCkiGLKB2_j.png)
 # Prompt Format