1 9 3

Guan

Guan123

guankaisi

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Breaking Down Video LLM Benchmarks: Knowledge, Spatial Perception, or True Temporal Understanding?

updated a dataset 25 days ago

Aimind-dataset-share/youcook

updated a dataset 26 days ago

Aimind-dataset-share/data4yuyue

View all activity

Organizations

Guan123's activity

upvoted a paper 9 days ago

Breaking Down Video LLM Benchmarks: Knowledge, Spatial Perception, or True Temporal Understanding?

Paper • 2505.14321 • Published 19 days ago • 10

updated a dataset 25 days ago

Aimind-dataset-share/youcook

Viewer • Updated 25 days ago • 452 • 34

updated a dataset 26 days ago

Aimind-dataset-share/data4yuyue

Updated 26 days ago • 19

published a dataset 26 days ago

Aimind-dataset-share/data4yuyue

Updated 26 days ago • 19

published a dataset 27 days ago

Aimind-dataset-share/youcook

Viewer • Updated 25 days ago • 452 • 34

liked a Space about 2 months ago

729

MMAudio — generating synchronized audio from video/text

🔊

Generate audio from video or text prompts

upvoted a paper about 2 months ago

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Paper • 2503.23377 • Published Mar 30 • 57

authored a paper 2 months ago

BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain

Paper • 2409.20075 • Published Sep 30, 2024 • 2

upvoted a paper 3 months ago

BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain

Paper • 2409.20075 • Published Sep 30, 2024 • 2

commented a paper 3 months ago

ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering

Paper • 2503.16867 • Published Mar 21 • 11 •

authored a paper 3 months ago

ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering

Paper • 2503.16867 • Published Mar 21 • 11

upvoted 2 papers 3 months ago

ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering

Paper • 2503.16867 • Published Mar 21 • 11

Atom of Thoughts for Markov LLM Test-Time Scaling

Paper • 2502.12018 • Published Feb 17 • 17

upvoted a paper 5 months ago

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 67

upvoted a paper 6 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

updated a model 7 months ago

Guan123/baichuan_7b_ecommerce

Updated Oct 27, 2024 • 6

updated a dataset 8 months ago

Guan123/worthybuying

Preview • Updated Oct 11, 2024 • 50 • 3

liked a dataset 8 months ago

Guan123/worthybuying

Preview • Updated Oct 11, 2024 • 50 • 3

upvoted a paper 8 months ago

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3, 2024 • 55

upvoted a paper about 1 year ago

RA-DIT: Retrieval-Augmented Dual Instruction Tuning

Paper • 2310.01352 • Published Oct 2, 2023 • 7