view post Post 12955 deepseek-ai/DeepSeek-OCR is out! 🔥 my take ⤵️ > pretty insane it can parse and re-render charts in HTML> it uses CLIP and SAM features concatenated, so better grounding> very efficient per vision tokens/performance ratio> covers 100 languages See translation
Weekly Releases (Jun 05, 2026) Comfy-Org/Ideogram-4 Updated 9 days ago • 147 jdopensource/JoyAI-Echo Text-to-Video • Updated 7 days ago • 9.24k • 138 litert-community/gemma-4-12B-it-litert-lm Updated 11 days ago • 24.5k • 30 google/gemma-4-12B-it-qat-q4_0-unquantized Any-to-Any • 12B • Updated 9 days ago • 20k • 46
Weekly Releases (May 29, 2026) Comfy-Org/PixelDiT Updated 12 days ago • 42.2k • 91 spiritbuun/buun-Qwen3.6-chat_template Updated 17 days ago • 45 avaturn-live/avtr-1 Image-to-Video • Updated 14 days ago • 383 • 32 Kwai-Keye/Keye-VL-2.0-30B-A3B Image-Text-to-Text • 31B • Updated 4 days ago • 4.96k • 112
Weekly Releases (Jun 05, 2026) Comfy-Org/Ideogram-4 Updated 9 days ago • 147 jdopensource/JoyAI-Echo Text-to-Video • Updated 7 days ago • 9.24k • 138 litert-community/gemma-4-12B-it-litert-lm Updated 11 days ago • 24.5k • 30 google/gemma-4-12B-it-qat-q4_0-unquantized Any-to-Any • 12B • Updated 9 days ago • 20k • 46
Weekly Releases (May 29, 2026) Comfy-Org/PixelDiT Updated 12 days ago • 42.2k • 91 spiritbuun/buun-Qwen3.6-chat_template Updated 17 days ago • 45 avaturn-live/avtr-1 Image-to-Video • Updated 14 days ago • 383 • 32 Kwai-Keye/Keye-VL-2.0-30B-A3B Image-Text-to-Text • 31B • Updated 4 days ago • 4.96k • 112