Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation Paper • 2512.23705 • Published Dec 29, 2025 • 45
view article Article Unleash ML Power on iOS: Apple Silicon Optimization Secrets fguzman82 • Jul 18, 2024 • 7
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation Paper • 2411.19331 • Published Nov 28, 2024 • 5
view article Article AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models imomayiz • Sep 16, 2025 • 19
YanoljaNEXT-Rosetta Collection Translation Model for JSON-Structured Data • 3 items • Updated Sep 3, 2025 • 9
MobileCLIP2 Collection MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated 22 days ago • 61
FastVLM Collection Efficient Vision Encoding for Vision Language Models • 8 items • Updated Mar 2 • 112
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 634
view article Article Open-Source Handwritten Signature Detection Model samuellimabraz • Mar 14, 2025 • 121