Kaining Ying

Kaining

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

stdstu123/Yume-5B-720P

Updated 3 days ago • 56 • 41

upvoted a paper 2 days ago

Yume-1.5: A Text-Controlled Interactive World Generation Model

Paper • 2512.22096 • Published 5 days ago • 53

upvoted a collection 5 days ago

Qwen3-VL

37 items • Updated about 18 hours ago • 552

upvoted a paper 30 days ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 217

liked a Space about 1 month ago

Visualize Dataset

Visualize OWAMcap files

liked 2 datasets about 2 months ago

open-world-agents/D2E-Original

Viewer • Updated 12 days ago • 473 • 670 • 2

open-world-agents/vpt-owamcap

Updated Jul 10, 2025 • 4.36k • 1

updated a collection about 2 months ago

MeViS

MeViS: A Multi-Modal Dataset for Referring Motion Expression Video Segmentation • 2 items • Updated Nov 14, 2025

updated a Space about 2 months ago

README

published a Space about 2 months ago

README

upvoted a paper about 2 months ago

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 141

commented a paper 2 months ago

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Paper • 2510.23691 • Published Oct 27, 2025 • 53

New activity in FudanCVL/MOSEv2 3 months ago

Incorrect dataset size?

#1 opened 3 months ago by

updated 2 collections 3 months ago

MOVE

Motion-Guided Few-Shot Video Object Segmentation • 2 items • Updated Sep 28, 2025

OmniAVS

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation • 3 items • Updated Sep 28, 2025