view post Post 237 we're all sleeping on this OCR model rednote-hilab/dots.ocr 🔥dots.ocr is a new 3B model with sota performance, support for 100 languages & allowing commercial use! 🤯single e2e model to extract image, convert tables, formula, and more into markdown 📝try it MohamedRashad/Dots-OCR See translation Reply
view post Post 413 massive releases and tons of Flux 1. Krea LoRas past week! here's some of the picks, find more models in collection 🫡 merve/releases-august-2-6890c14248203522b7d0267fLLMs 💬> Tencent dropped tencent/Hunyuan-7B-Instruct > Qwen released Qwen/Qwen3-Coder-30B-A3B-Instruct, 30B MoE with 3B params for coding (OS)vision/multimodal> RedNote released rednote-hilab/dots.ocr - 3B OCR model (OS) > Cohere released CohereLabs/command-a-vision-07-2025 - 112B (dense!) VLM for 6 languages > StepFun-AI shipped stepfun-ai/step3 - 321B MoE VLM (OS)> Skywork shipped Skywork/Skywork-UniPic-1.5B - new any-to-any model (image+text → image+text) (OS) See translation 🔥 1 1 + Reply
MetaCLIP Collection MetaCLIP & MetaCLIP2 OpenCLIP and timm models. All models are dual timm + OpenCLIP (or just timm for specific vit encoders). • 24 items • Updated 4 days ago • 2
MetaCLIP Collection MetaCLIP & MetaCLIP2 OpenCLIP and timm models. All models are dual timm + OpenCLIP (or just timm for specific vit encoders). • 24 items • Updated 4 days ago • 2