- MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models
  Paper • 2502.00698 • Published • 18
- DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
  Paper • 2502.01142 • Published • 10
- ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning
  Paper • 2502.01100 • Published • 10
- The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles
  Paper • 2502.01081 • Published • 8
Zhitong Gao
ZhitongGao
AI & ML interests
None yet
Recent Activity
Updated a collection about 10 hours ago: Vlm
Organizations
None yet
Collections
1
Datasets
None public yet