pinned
Running
23
RISEBench Gallery
👀
A Gallery of Generation Results on RISEBench
None defined yet.
A Gallery of Generation Results on RISEBench
A Leaderboard for LMM spatial understanding capabilities
VLMEvalKit Subjectivce Benchmark Results
Compass Academic Leaderboard Full Version
A Leaderboard that demonstrates LMM reasoning capabilities
Compass Academic Leaderboard
VLMEvalKit Evaluation Results Collection
View and filter MMBench leaderboard data
VLMEvalKit Eval Results in video understanding benchmark
CompassJudger Subjective Evaluation Learderboard
JudgerBench Leaderboard
Display a web page
Evaluate code snippets across multiple languages
Explore and interact with AI assistant capabilities
Display a web page