Salesforce/xgen-mm-vid-phi3-mini-r-v1.5-128tokens-8frames Image-Text-to-Text • Updated about 19 hours ago • 573 • 10
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions Paper • 2411.07461 • Published Nov 12, 2024 • 22
Salesforce/blip2-itm-vit-g-coco Zero-Shot Image Classification • Updated about 18 hours ago • 1.31k • 1
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98
Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 Image-Text-to-Text • Updated about 19 hours ago • 7.79k • 47
Salesforce/xgen-mm-phi3-mini-instruct-dpo-r-v1.5 Image-Text-to-Text • Updated about 19 hours ago • 96 • 17
Salesforce/xgen-mm-phi3-mini-instruct-singleimg-r-v1.5 Image-Text-to-Text • Updated about 19 hours ago • 102 • 15
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98 • 7
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98 • 7
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98 • 7