AdinaY posted an update 3 days ago
Dolphin 🔥 A multimodal document image parsing model from ByteDance, built on an analyze-then-parse paradigm.

ByteDance/Dolphin

✨ MIT licensed
✨ Handles text, tables, figures & formulas via:
- Reading-order layout analysis
- Parallel parsing with smart prompts
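
To make the analyze-then-parse idea concrete, here is a rough sketch of the two stages in plain Python. It is not Dolphin's actual API: `analyze_layout`, `parse_element`, `fake_model`, and the prompt strings are all hypothetical stand-ins, and the "page" is just a list of (type, content) pairs instead of an image.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass
class Element:
    order: int   # position in reading order
    kind: str    # "text", "table", "figure" or "formula"
    region: str  # stand-in for a cropped image region

# Hypothetical type-specific prompts (not the model's real prompts).
PROMPTS = {
    "text": "Read the text in this region.",
    "table": "Parse the table in this region.",
    "figure": "Describe the figure in this region.",
    "formula": "Transcribe the formula in this region.",
}


def fake_model(prompt: str, region: str) -> str:
    """Placeholder for a model call; it simply echoes the region."""
    return region


def analyze_layout(page):
    """Stage 1 (stub): return the page elements in reading order."""
    return [Element(i, kind, region) for i, (kind, region) in enumerate(page)]


def parse_element(el: Element) -> str:
    """Stage 2 (stub): parse one element with the prompt for its type."""
    return f"[{el.kind}] {fake_model(PROMPTS[el.kind], el.region)}"


def parse_page(page):
    elements = analyze_layout(page)
    # Elements are parsed concurrently; map() returns results in input
    # order, so the reading order from stage 1 is preserved.
    with ThreadPoolExecutor() as pool:
        return list(pool.map(parse_element, elements))


result = parse_page([("text", "Intro"), ("table", "T1"), ("formula", "E=mc^2")])
```

The point of the split is that layout analysis runs once per page, while each element can then be parsed independently with a prompt suited to its type.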
