HyperCLOVA X SEED • Collection • HyperCLOVA X SEED is NAVER's lightweight open-source lineup with a strong focus on Korean language performance • 3 items • Updated 23 days ago • 24
ProLIP • Collection • Official ProLIP weights, Probabilistic Language-Image Pre-Training (ICLR 2025) • 7 items • Updated 29 days ago • 9
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation • Paper • 2411.19067 • Published Nov 28, 2024 • 7
Cosmos Tokenizer • Collection • A suite of image and video tokenizers • 13 items • Updated 7 days ago • 40
Unified Speech-Text Pretraining for Spoken Dialog Modeling • Paper • 2402.05706 • Published Feb 8, 2024 • 6
RDNet • Collection • DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs [ECCV 2024] • 9 items • Updated Oct 16, 2024 • 3
rope-vit • Collection • Rotary Position Embedding for Vision Transformer [ECCV 2024] • 22 items • Updated Oct 16, 2024 • 3
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs • Paper • 2403.19588 • Published Mar 28, 2024 • 2