CLINIC: Evaluating Multilingual Trustworthiness in Language Models for Healthcare Paper • 2512.11437 • Published 18 days ago • 3
$\left|\,\circlearrowright\,\text{BUS}\,\right|$: A Large and Diverse Multimodal Benchmark for evaluating the ability of Vision-Language Models to understand Rebus Puzzles Paper • 2511.01340 • Published Nov 3 • 12
Leveraging Large Language Models for Predictive Analysis of Human Misery Paper • 2508.12669 • Published Aug 18 • 14
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens Paper • 2506.17218 • Published Jun 20 • 29