Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deepakkumar07 's Collections
vision-llm
tamil-dataset
document-parser
text-to-speech
voice-to-text
Transformers model
csv-dataset

vision-llm

updated Sep 5
Upvote
-

  • Running
    111

    Vision Papers

    đŸ’ģ
    111

    All paper summaries read by Merve


  • Runtime error
    20

    Ovis2 1B

    đŸĻĢ
    20

    Small model can do big things.


  • AIDC-AI/Ovis2-8B-GPTQ-Int4

    Image-Text-to-Text â€ĸ 9B â€ĸ Updated Mar 25 â€ĸ 295 â€ĸ 3

  • AIDC-AI/Ovis2-1B

    Image-Text-to-Text â€ĸ 1B â€ĸ Updated Aug 15 â€ĸ 325 â€ĸ 97

  • Runtime error
    12

    Ovis2 8B

    đŸĻĢ
    12

    Ovis2-8B


  • lambdalabs/Llama-3.3-70B-Instruct-AWQ-4bit

    71B â€ĸ Updated Dec 10, 2024 â€ĸ 12.4k â€ĸ 4

  • microsoft/GUI-Actor-7B-Qwen2-VL

    Image-Text-to-Text â€ĸ 8B â€ĸ Updated Aug 9 â€ĸ 65 â€ĸ 39

  • lambdalabs/sd-image-variations-diffusers

    Image-to-Image â€ĸ Updated Feb 8, 2023 â€ĸ 2.58k â€ĸ 454

  • vikhyatk/moondream2

    Image-Text-to-Text â€ĸ 2B â€ĸ Updated Sep 23 â€ĸ 1.96M â€ĸ 1.35k

  • OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview

    Image-Text-to-Text â€ĸ 0.4B â€ĸ Updated Aug 29 â€ĸ 45.8k â€ĸ 82
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs