STEM Idavidrein/gpqa Benchmark • Updated Mar 5 • 1.25k • 104k • 467 lmms-lab/HLE-Verified Preview • Updated Feb 28 • 5.15k • 5 skylenage-ai/HLE-Verified Viewer • Updated Feb 27 • 2.5k • 27.5k • 18
STEM Idavidrein/gpqa Benchmark • Updated Mar 5 • 1.25k • 104k • 467 lmms-lab/HLE-Verified Preview • Updated Feb 28 • 5.15k • 5 skylenage-ai/HLE-Verified Viewer • Updated Feb 27 • 2.5k • 27.5k • 18