7 10 28

Yixiao Ge

yxgeee

https://geyixiao.com/

AI & ML interests

Computer Vision, Foundation Models

Recent Activity

authored a paper 22 days ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

upvoted a paper 23 days ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

upvoted a paper about 1 month ago

Aligning Latent Spaces with Flow Priors

View all activity

Organizations

authored a paper 22 days ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published 29 days ago • 27

upvoted a paper 23 days ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published 29 days ago • 27

upvoted a paper about 1 month ago

Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published Jun 5 • 25

liked a model about 1 month ago

TencentARC/TokLIP

Updated Jun 5 • 5 • 8

upvoted a paper about 1 month ago

AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation

Paper • 2506.03126 • Published Jun 3 • 22

authored a paper about 2 months ago

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Paper • 2505.21374 • Published May 27 • 27

upvoted a paper about 2 months ago

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Paper • 2505.21374 • Published May 27 • 27

authored a paper 4 months ago

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published Apr 1 • 70

upvoted a paper 4 months ago

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published Apr 1 • 70

authored a paper 4 months ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 39

upvoted a paper 4 months ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 39

authored a paper 4 months ago

GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers

Paper • 2503.19480 • Published Mar 25 • 16

upvoted a paper 4 months ago

GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers

Paper • 2503.19480 • Published Mar 25 • 16

authored 2 papers 7 months ago

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Paper • 2412.04432 • Published Dec 5, 2024 • 16

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published Dec 5, 2024 • 23

authored a paper 10 months ago

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

Paper • 2409.04410 • Published Sep 6, 2024 • 26

liked a Space 12 months ago

YOLO-World-Image

🚀

liked a dataset about 1 year ago

TencentARC/StoryStream

Preview • Updated Jul 17, 2024 • 132 • 27

authored 2 papers about 1 year ago

SEED-Story: Multimodal Long Story Generation with Large Language Model

Paper • 2407.08683 • Published Jul 11, 2024 • 26

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Paper • 2406.12275 • Published Jun 18, 2024 • 32

Yixiao Ge

AI & ML interests

Recent Activity

Organizations

yxgeee's activity

YOLO-World-Image