Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OpenGVLab 's Collections
InternVL3
VisualPRM
PIIP
VideoChat-R1
InternVideo2.5
VideoMAE-v2
VideoChat-Flash
InternVL2.5
InternVL2.5-MPO
InternVL2.0
InternVL1.5
InternVL1.0
V2PE
InternVL Adaptation
Mono-InternVL
InternVideo2
VideoChat
VideoMamba
InternVid
OmniCorpus
All-Seeing Project
InternImage
PVT v2
InternVL Data

Mono-InternVL

updated 24 days ago

A Pioneering Monolithic MLLM

Upvote
6

  • Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

    Paper • 2410.08202 • Published Oct 10, 2024 • 4

    Note CVPR 2025


  • OpenGVLab/Mono-InternVL-2B

    Image-Text-to-Text • Updated Mar 12 • 6.96k • 33

  • OpenGVLab/Mono-InternVL-2B-S1-1

    Image-Text-to-Text • Updated Mar 12 • 18

  • OpenGVLab/Mono-InternVL-2B-S1-2

    Image-Text-to-Text • Updated Mar 12 • 6

  • OpenGVLab/Mono-InternVL-2B-S1-3

    Image-Text-to-Text • Updated Mar 12 • 16

  • OpenGVLab/Mono-InternVL-2B-Synthetic-Data

    Updated Feb 15 • 28 • 1
Upvote
6
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs