ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention Paper β’ 2507.01004 β’ Published 5 days ago β’ 7
Small Models Struggle to Learn from Strong Reasoners Paper β’ 2502.12143 β’ Published Feb 17 β’ 38
Evaluating Vision-Language Models as Evaluators in Path Planning Paper β’ 2411.18711 β’ Published Nov 27, 2024
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Paper β’ 2503.10582 β’ Published Mar 13 β’ 23
Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators Paper β’ 2503.19877 β’ Published Mar 25 β’ 1
VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge Paper β’ 2504.10342 β’ Published Apr 14 β’ 11
Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time Paper β’ 2504.12329 β’ Published Apr 12
The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think Paper β’ 2505.10185 β’ Published May 15 β’ 25
VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation Paper β’ 2506.03930 β’ Published Jun 4 β’ 24
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper β’ 2507.00432 β’ Published 6 days ago β’ 54
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper β’ 2507.00432 β’ Published 6 days ago β’ 54
MARBLE Collection Data and model collection for MARBLE: https://github.com/a43992899/MARBLE/ β’ 8 items β’ Updated 5 days ago β’ 1
MARBLE Collection Data and model collection for MARBLE: https://github.com/a43992899/MARBLE/ β’ 8 items β’ Updated 5 days ago β’ 1
OAgents: An Empirical Study of Building Effective Agents Paper β’ 2506.15741 β’ Published 19 days ago β’ 35