NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings Paper • 2509.04011 • Published 8 days ago • 27
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks Paper • 2509.01396 • Published 11 days ago • 53
Robix: A Unified Model for Robot Interaction, Reasoning and Planning Paper • 2509.01106 • Published 11 days ago • 45
Provable Benefits of In-Tool Learning for Large Language Models Paper • 2508.20755 • Published 15 days ago • 11
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper • 2508.14460 • Published 23 days ago • 80
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments Paper • 2508.08791 • Published about 1 month ago • 16