L-Hongbin
's Collections
MutiModal_Dataset
updated
Updated
•
19.2k
•
109
Updated
•
10.4k
•
128
WildVision/wildvision-chat
Viewer
•
Updated
•
45.2k
•
1.08k
•
21
Viewer
•
Updated
•
12.4M
•
1.47k
•
162
lmms-lab/LLaVA-Video-178K
Viewer
•
Updated
•
1.63M
•
40.6k
•
159
Viewer
•
Updated
•
7.29M
•
168
•
45
Viewer
•
Updated
•
1.66M
•
16
VILA-U: a Unified Foundation Model Integrating Visual Understanding and
Generation
Paper
•
2409.04429
•
Published
Viewer
•
Updated
•
235M
•
19.2k
•
37
Viewer
•
Updated
•
9.81M
•
620
•
49
JefferyZhan/Language-prompted-Localization-Dataset
Preview
•
Updated
•
64
•
4
Viewer
•
Updated
•
392
•
44
•
12
mlfoundations/MINT-1T-HTML
Viewer
•
Updated
•
623M
•
37.1k
•
89
DINO-X: A Unified Vision Model for Open-World Object Detection and
Understanding
Paper
•
2411.14347
•
Published
•
15
Preview
•
Updated
•
44
•
50
Viewer
•
Updated
•
72.5k
•
112
•
8
Viewer
•
Updated
•
10.9M
•
116
•
9
Viewer
•
Updated
•
2.18M
•
26
•
2
Viewer
•
Updated
•
110k
•
43
•
3
Salesforce/blip3-grounding-50m
Viewer
•
Updated
•
52.4M
•
549
•
23
Intelligent-Internet/II-Thought-RL-v0
Viewer
•
Updated
•
342k
•
266
•
53
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and
Verifiable Mathematical Dataset for Advancing Reasoning
Paper
•
2504.11456
•
Published
•
13
Viewer
•
Updated
•
217M
•
39.1k
•
97