Update README.md
README.md
CHANGED
|
@@ -30,7 +30,14 @@ GOVERNING TERMS: Use of this model is governed by the [NVIDIA Open Model License
|
|
| 30 |
|
| 31 |
## Arena Hard LeaderBoard
|
| 32 |
|
| 33 |
-
As of 18 Mar 2025, augmenting models with the Feedback-Edit Inference Time Scaling (ITS) approach
| 34 |
|
| 35 |
| Model | Arena Hard (95% CI) |
|
| 36 |
|:-----------------------------|:----------------|
|
|
|
|
| 30 |
|
| 31 |
## Arena Hard LeaderBoard
|
| 32 |
|
| 33 |
+
As of 18 Mar 2025, augmenting models with the Feedback-Edit Inference Time Scaling (ITS) approach leads to the highest performance on Arena Hard.
|
| 34 |
+
|
| 35 |
+
The Feedback-Edit Inference Time Scaling system comprises the following models:
|
| 36 |
+
|
| 37 |
+
1. [Llama-3.3-Nemotron-70B-Feedback](https://huggingface.co/nvidia/Llama-3.3-Nemotron-70B-Feedback)
|
| 38 |
+
2. [Llama-3.3-Nemotron-70B-Edit](https://huggingface.co/nvidia/Llama-3.3-Nemotron-70B-Edit)
|
| 39 |
+
3. [Llama-3.3-Nemotron-70B-Select](https://huggingface.co/nvidia/Llama-3.3-Nemotron-70B-Select)
|
| 40 |
+
|
| 41 |
|
| 42 |
| Model | Arena Hard (95% CI) |
|
| 43 |
|:-----------------------------|:----------------|
|