InferBench
π₯
17
A cost/quality/speed Leaderboard for Inference Providers!
Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions
A cost/quality/speed Leaderboard for Inference Providers!
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
View LMArena model leaderboard
Evaluate open LLMs in the languages of LATAM and Spain.
View and compare openβsource AI model rankings with ELO scores
Explore LLM performance across hardware configurations
Compare and rank visual document retrieval models across different benchmarks
VLMEvalKit Evaluation Results Collection
Submit model evaluation results to leaderboard
Submit and evaluate model results on MM-UPD benchmarks
Explore MMBench Leaderboard data