7 8 10

Manli Shu

Manli

azshue

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

Salesforce/ProVision-10M

updated a model 5 months ago

Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5

new activity 5 months ago

Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5:Dataset link doesn't work?

View all activity

Organizations

Manli's activity

liked a dataset about 2 months ago

Salesforce/ProVision-10M

Viewer • Updated 1 day ago • 24.5M • 736 • 14

updated a model 5 months ago

Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5

Image-Text-to-Text • Updated 1 day ago • 7.85k • 47

New activity in Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 5 months ago

Dataset link doesn't work?

#1 opened 6 months ago by

dibmvt

Extremely high GPU requirements for both basic (demo.ipynb) and batch (batch_inference.ipynb) notebooks

#3 opened 6 months ago by

dwb2023

upvoted a paper 6 months ago

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Paper • 2408.12590 • Published Aug 22, 2024 • 36

New activity in Salesforce/xgen-mm-phi3-mini-base-r-v1 6 months ago

Link model to paper

#1 opened 6 months ago by

nielsr

New activity in Salesforce/xgen-mm-phi3-mini-instruct-r-v1 6 months ago

Link model to paper

#12 opened 6 months ago by

nielsr

liked 4 models 6 months ago

authored a paper 6 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 98

New activity in Salesforce/xgen-mm-phi3-mini-base-r-v1.5 6 months ago

Upload examples.

#2 opened 6 months ago by

an-yan

Update README.md

#1 opened 6 months ago by

an-yan

upvoted a paper 6 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 98

upvoted a collection 6 months ago

🍃 MINT-1T

Collection

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 58

liked a dataset 6 months ago

mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21, 2024 • 623M • 148k • 81

liked a dataset 7 months ago

TIGER-Lab/VisualWebInstruct

Viewer • Updated Jan 2 • 60.3k • 336 • 16

authored 2 papers 8 months ago

Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Paper • 2209.07511 • Published Sep 15, 2022

Coercing LLMs to do and reveal (almost) anything

Paper • 2402.14020 • Published Feb 21, 2024 • 13