stephen-flood's Collections
Benchmarks
lighteval/legal_summarization • 26.9k • 313 • 25
lighteval/synthetic_reasoning • 33k • 800 • 7
lighteval/synthetic_reasoning_natural • 22k • 76 • 15
lighteval/GPT3_unscramble • 50k • 32 • 1
lighteval/aimo_progress_prize_1 • 10 • 14
Qwen/Qwen2.5-Math-RM-72B • Text Classification • 73B • 32.6k • 81
Jofthomas/hermes-function-calling-thinking-V1 • 3.57k • 1.05k • 71
NousResearch/hermes-function-calling-v1 • 11.6k • 2.01k • 356
open-web-math/open-web-math • 6.32M • 13.7k • 321