InferBench
π₯
17
A cost/quality/speed Leaderboard for Inference Providers!
Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions
A cost/quality/speed Leaderboard for Inference Providers!
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
View the latest LMArena model leaderboard
Evaluate open LLMs in the languages of LATAM and Spain.
View and compare openβsource AI model rankings with ELO scores
Explore LLM performance across hardware configurations
Explore and compare visual document retrieval benchmark results
VLMEvalKit Evaluation Results Collection
Submit model evaluation results to leaderboard
Submit and evaluate model results on MM-UPD benchmarks
Explore MMBench Leaderboard data