Submitted by ambean 22 Clinical knowledge in LLMs does not translate to human interactions · 11 authors 3
Submitted by lgy0404 20 LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects · 18 authors 4
Submitted by judge 14 SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning · 8 authors 2
Submitted by QizhiPei 14 CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges · 9 authors 4
Submitted by cloudcatcher2 9 Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency · 7 authors 2
Submitted by akhaliq 8 Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory · 5 authors 2
Submitted by iofu728 8 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention · 11 authors 2
Submitted by AaronZ345 6 Versatile Framework for Song Generation with Prompt-based Control · 11 authors 2
Submitted by soujanyaporia 5 NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks · 8 authors 2
Submitted by renqiux0302 5 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving · 13 authors 2
Submitted by FocusV857 3 ICL CIPHERS: Quantifying "Learning" in In-Context Learning via Substitution Ciphers · 5 authors 2
Submitted by observerw 3 ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development · 6 authors 2