Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning Paper • 2410.10735 • Published Oct 14, 2024
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published 8 days ago