Kaixiong Gong

kxgong

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
tencent/Hunyuan3D-2.1
authored a paper 4 months ago
Video-R1: Reinforcing Video Reasoning in MLLMs
upvoted a paper 4 months ago
Video-R1: Reinforcing Video Reasoning in MLLMs

Organizations

Test organization, AV-Odyssey Bench, Video-R1

authored a paper 4 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 80
authored a paper 8 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24
authored 3 papers over 1 year ago

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Paper • 2401.14405 • Published Jan 25, 2024 • 13

Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors

Paper • 2312.04963 • Published Dec 7, 2023 • 17

OneLLM: One Framework to Align All Modalities with Language

Paper • 2312.03700 • Published Dec 6, 2023 • 24
authored a paper almost 2 years ago

Meta-Transformer: A Unified Framework for Multimodal Learning

Paper • 2307.10802 • Published Jul 20, 2023 • 44