haisong's picture

haisong

rabbit19731

·

AI & ML interests

None yet

Organizations

upvoted a paper 9 months ago

Ovis2.5 Technical Report

Paper • 2508.11737 • Published Aug 15, 2025 • 115

upvoted a collection 9 months ago

Ovis2.5

Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19, 2025 • 58

upvoted a paper 11 months ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29, 2025 • 62

upvoted 2 papers about 1 year ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5, 2025 • 82

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 111

upvoted an article about 1 year ago

Article

SigLIP 2: A better multilingual vision language encoder

+1

ariG23498, merve, qubvel-hf

•

Feb 21, 2025

• 213

upvoted a collection over 1 year ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25, 2025 • 67

upvoted 2 papers almost 2 years ago

Parrot: Multilingual Visual Instruction Tuning

Paper • 2406.02539 • Published Jun 4, 2024 • 36

Ovis: Structural Embedding Alignment for Multimodal Large Language Model

Paper • 2405.20797 • Published May 31, 2024 • 33