Multimodal Models

updated about 6 hours ago

  • AXERA-TECH/lcm-lora-sdv1-5

    Updated Jun 23 • 9 • 1

  • AXERA-TECH/InternVL3-2B

    Visual Question Answering • Updated 8 days ago • 17 • 2

  • AXERA-TECH/Qwen2.5-VL-3B-Instruct

    Image-Text-to-Text • Updated 6 days ago • 20

  • AXERA-TECH/InternVL3-1B

    Image-Text-to-Text • Updated Jun 28 • 9

  • AXERA-TECH/SmolVLM2-500M-Video-Instruct

    Visual Question Answering • Updated 29 days ago • 8 • 2

  • AXERA-TECH/InternVL2_5-1B-MPO

    Image-Text-to-Text • Updated 4 days ago • 7

  • AXERA-TECH/InternVL2_5-1B

    Image-Text-to-Text • Updated Apr 4 • 6 • 1

  • AXERA-TECH/Janus-Pro-1B

    Visual Question Answering • Updated Apr 14 • 5 • 2

  • AXERA-TECH/SmolVLM-256M-Instruct

    Updated Apr 4 • 14 • 2

  • AXERA-TECH/YOLO-World-V2

    Object Detection • Updated Mar 23 • 6

  • AXERA-TECH/LivePortrait

    Image-to-Video • Updated Jun 21 • 2 • 4

  • AXERA-TECH/cnclip

    Updated 8 days ago • 6 • 1

  • AXERA-TECH/clip

    Updated 8 days ago • 5

  • AXERA-TECH/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • Updated 6 days ago • 5