Leaderboards - a sdiazlor Collection

sdiazlor 's Collections

Instruction Models

Computer Vision Models

Data Related Tools

Leaderboards

updated Jul 14, 2025

Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions

Running

Agents

20

InferBench

🥇

20

A cost/quality/speed Leaderboard for Inference Providers!
Running on CPU Upgrade

7.6k

MTEB Leaderboard

📊

7.6k

Embedding Leaderboard
Running on CPU Upgrade

14.1k

Open LLM Leaderboard

🏆

14.1k

Track, rank and evaluate open LLMs and chatbots
Running

4.96k

Arena Leaderboard

🏆

4.96k

View the LMArena leaderboard in full‑screen
Runtime error

Agents

76

La Leaderboard

🌸

76

Evaluate open LLMs in the languages of LATAM and Spain.
Running

Agents

112

Judge Arena

💻

112

View and compare open‑source AI model rankings with ELO scores
Running

Agents

Featured

590

LLM-Perf Leaderboard

🏆

590

Compare LLM hardware performance and find the best model
Running

Agents

209

Vidore Leaderboard

🥇

209

Browse and compare visual document retrieval model scores
Running on CPU Upgrade

Agents

1.02k

Open VLM Leaderboard

🌎

1.02k

VLMEvalKit Evaluation Results Collection
Build error

Agents

Featured

85

SEED-Bench Leaderboard

🏆

85

Submit model evaluation results to leaderboard
Runtime error

Agents

23

MM-UPD Leaderboard

🥇

23

Submit and evaluate model results on MM-UPD benchmarks
Paused

Agents

24

MMBench Leaderboard

🚀

24

Explore MMBench Leaderboard data