Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
HuggingFaceTB 's Collections
🧠 SmolLM3
SmolLM3 pretraining datasets
SmolLM3 evaluation datasets
Dolma LongAttn Graded
Reasoning datasets
SmolLM2
SmolVLM2 📺 Smallest video LM ever 🤏🏻
📚 LLM pretraining datasets
SmolVLM
🧩 SmolLM2 Intermediate Checkpoints
The Ultimate Collection of Code Classifiers
SmolVLM 256M & 500M
📐 FineMath
💻 Local SmolLMs
🪐 SmolLM
Instruct datasets
🌌 Cosmopedia
Find textbooks in FineWeb with a classifier
FineWeb clustering & synthetic generations
Other: Stanford, OpenStax, khanAcademy, wikihow...
FW generation prompts
Wikipedia Science topics
Wikipedia textbooks
SFT Experiments
Decay mixture experiments
models

SmolVLM

updated May 5

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm

Upvote
38

  • HuggingFaceTB/SmolVLM-Instruct

    Image-Text-to-Text • 2B • Updated Apr 8 • 109k • 524

  • HuggingFaceTB/SmolVLM-Base

    Image-Text-to-Text • 2B • Updated Nov 28, 2024 • 8.19k • 82

  • HuggingFaceTB/SmolVLM-Synthetic

    Image-Text-to-Text • 2B • Updated Nov 26, 2024 • 79 • 12

  • HuggingFaceTB/SmolVLM-Instruct-DPO

    Image-Text-to-Text • Updated Nov 26, 2024 • 19 • 21

  • Running on Zero
    142
    142

    SmolVLM

    📊

    Generate answers by combining text and images

Upvote
38
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs