Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
TIGER-Lab 's Collections
VisCoder
One-Shot-CFT
Pixel-Reasoner
MoCha
General-Reasoner
VL-Rethinker
Vamba
TheoremExplain
ABC
VisualWebInstruct
PixelWorld
AceCoder
CritiqueFineTuning
MAmmoTH-VL
ScholarCopilot
VISTA
OmniEdit
MEGA-Bench
VLM2Vec
TIGERScore
MAmmoTH
UniIR
ImagenHub
Science
StructLM
ConsistI2V
Mantis
MAmmoTH2
VideoScore
Long-Context

VisualWebInstruct

updated May 5

Scaling up MM data

Upvote
1

  • TIGER-Lab/VisualWebInstruct-Recall

    Viewer • Updated Mar 16 • 361k • 765 • 3

  • TIGER-Lab/VisualWebInstruct-Seed

    Viewer • Updated Mar 16 • 60.3k • 68 • 16

  • TIGER-Lab/VisualWebInstruct

    Viewer • Updated Apr 10 • 1.91M • 1.3k • 34

  • VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

    Paper • 2503.10582 • Published Mar 13 • 23

  • TIGER-Lab/MAmmoTH-VL2

    Image-Text-to-Text • 8B • Updated May 7 • 62 • 12

  • Runtime error
    2
    2

    MAmmoTH-VL2

    🐠

    Strong Vision Language Model trained with VisualWebInstruct

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs