Post
492
introducing: VLM vibe eval ๐ชญ
visionLMsftw/VLMVibeEval
vision LMs are saturated over benchmarks, so we built vibe eval ๐ฌ
> compare different models with refreshed in-the-wild examples in different categories ๐ค
> submit your favorite model for eval
no numbers -- just vibes!
vision LMs are saturated over benchmarks, so we built vibe eval ๐ฌ
> compare different models with refreshed in-the-wild examples in different categories ๐ค
> submit your favorite model for eval
no numbers -- just vibes!