Daniel Schmidt

danschmidt88

AI & ML interests

None yet

Organizations

None yet

danschmidt88's activity

upvoted an article 19 days ago

Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics

Article • Jul 22, 2024 • 6

upvoted a paper 23 days ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 26 days ago • 130

upvoted 2 articles about 1 month ago

Vision Language Models (Better, Faster, Stronger)

Article • and 4 others • May 12 • 437

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

Article • and 1 other • May 7 • 35

upvoted 4 papers about 1 month ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5 • 75

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Paper • 2504.20734 • Published Apr 29 • 62

upvoted an article 2 months ago

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Article • Mar 17 • 289

upvoted an article 3 months ago

Open R1: Update #4

Article • and 3 others • Mar 26 • 48

upvoted 2 papers 3 months ago

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Paper • 2503.16365 • Published Mar 20 • 41

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 108

upvoted 2 articles 3 months ago

Manus AI: The Best Autonomous AI Agent Redefining Automation and Productivity

Article • Mar 6 • 171

Trace & Evaluate your Agent with Arize Phoenix

Article • and 2 others • Feb 28 • 40

upvoted 4 articles 4 months ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Article • and 2 others • Feb 19 • 70

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Article • Jul 5, 2024 • 259

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

Article • and 1 other • Feb 12 • 22

Open-source DeepResearch – Freeing our search agents

Article • and 4 others • Feb 4 • 1.26k