view article Article Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero β’ Jul 22, 2024 β’ 6
Emerging Properties in Unified Multimodal Pretraining Paper β’ 2505.14683 β’ Published 26 days ago β’ 130
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others β’ May 12 β’ 437
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs By davidberenstein1957 and 1 other β’ May 7 β’ 35
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper β’ 2505.02567 β’ Published May 5 β’ 75
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper β’ 2504.20571 β’ Published Apr 29 β’ 94
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities Paper β’ 2504.20734 β’ Published Apr 29 β’ 62
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 289
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Paper β’ 2503.16365 β’ Published Mar 20 β’ 41
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper β’ 2503.11576 β’ Published Mar 14 β’ 108
view article Article Manus AI: The Best Autonomous AI Agent Redefining Automation and Productivity By LLMhacker β’ Mar 6 β’ 171
view article Article Trace & Evaluate your Agent with Arize Phoenix By m-ric and 2 others β’ Feb 28 β’ 40
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google By ariG23498 and 2 others β’ Feb 19 β’ 70
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5, 2024 β’ 259
view article Article Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios By pratikbhavsar and 1 other β’ Feb 12 β’ 22
view article Article Open-source DeepResearch β Freeing our search agents By m-ric and 4 others β’ Feb 4 β’ 1.26k