Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lonjoy 's Collections
LLMs
MM-LLMs
RAG
PEFT
MOE-LLMs

MM-LLMs

updated Sep 9, 2024
Upvote
-

  • MM-LLMs: Recent Advances in MultiModal Large Language Models

    Paper • 2401.13601 • Published Jan 24, 2024 • 49

  • Orion-14B: Open-source Multilingual Large Language Models

    Paper • 2401.12246 • Published Jan 20, 2024 • 14

  • Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model

    Paper • 2405.09215 • Published May 15, 2024 • 23

  • AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability

    Paper • 2405.14129 • Published May 23, 2024 • 14

  • ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

    Paper • 2405.15738 • Published May 24, 2024 • 47

  • Improved Baselines with Visual Instruction Tuning

    Paper • 2310.03744 • Published Oct 5, 2023 • 37

  • MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

    Paper • 2402.03766 • Published Feb 6, 2024 • 15
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs