SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning Paper • 2505.16186 • Published 3 days ago • 5
Multimodal Reasoning Collection A collection for Multimodal Reasoning Models and Benchmarks. • 5 items • Updated 2 days ago
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Paper • 2505.15778 • Published 3 days ago • 10
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models Paper • 2310.03903 • Published Oct 5, 2023 • 1
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models Paper • 2504.13367 • Published Apr 17 • 24
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 22
Neuro-Symbolic Procedural Planning with Commonsense Prompting Paper • 2206.02928 • Published Jun 6, 2022