Multimodal benchmarks that test various aspects of LLMs, VLMs, LMMs
-
Multimodal Clembench
π3Explore and compare multimodal models with interactive leaderboards and plots
-
SEED-Bench Leaderboard
π85Submit model evaluation results to leaderboard
-
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Paper β’ 2311.16502 β’ Published β’ 37 -
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
Paper β’ 2409.02813 β’ Published β’ 31