Kaixiong Gong's picture

2 2 3

Kaixiong Gong

kxgong

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

tencent/Hunyuan3D-2.1

authored a paper 4 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

upvoted a paper 4 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

View all activity

Organizations

authored a paper 4 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 80

authored a paper 8 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

authored 3 papers over 1 year ago

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Paper • 2401.14405 • Published Jan 25, 2024 • 13

Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors

Paper • 2312.04963 • Published Dec 7, 2023 • 17

OneLLM: One Framework to Align All Modalities with Language

Paper • 2312.03700 • Published Dec 6, 2023 • 24

authored a paper almost 2 years ago

Meta-Transformer: A Unified Framework for Multimodal Learning

Paper • 2307.10802 • Published Jul 20, 2023 • 44