Towards Widening The Distillation Bottleneck for Reasoning Models Paper • 2503.01461 • Published Mar 3
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System Paper • 2303.00501 • Published Mar 1, 2023 • 1
ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding Paper • 2312.11112 • Published Dec 18, 2023
PNT-Edge: Towards Robust Edge Detection with Noisy Labels by Learning Pixel-level Noise Transitions Paper • 2307.14070 • Published Jul 26, 2023
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting Paper • 2211.10772 • Published Nov 19, 2022
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation Paper • 2412.18928 • Published Dec 25, 2024
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published 20 days ago • 73
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25 • 61