Zijian Zhou's picture

Zijian Zhou PRO

franciszzj

·

https://sites.google.com/view/zijian-zhou/home

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

Qwen/Qwen3-32B

liked a dataset 7 days ago

BestWishYsh/OpenS2V-Eval

liked a model 8 days ago

black-forest-labs/FLUX.1-Kontext-dev

View all activity

Organizations

None yet

upvoted a paper 20 days ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published 22 days ago • 62

upvoted a paper 22 days ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published 24 days ago • 252

upvoted 2 papers about 1 month ago

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2 • 46

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Paper • 2505.17952 • Published May 23 • 21

upvoted 2 papers about 2 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 213

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 147

upvoted a collection 2 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated 3 days ago • 292

upvoted a paper 2 months ago

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30 • 47

upvoted 2 papers 3 months ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 136

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published Mar 30 • 95

upvoted a collection 4 months ago

FLUX.1

A collection of our FLUX.1 models and LoRAs. • 9 items • Updated 14 days ago • 146

upvoted 3 papers 4 months ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published Mar 10 • 53

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 45

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 63

upvoted 2 papers 5 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 146

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 196

upvoted a paper 6 months ago

Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation

Paper • 2501.04144 • Published Jan 7 • 19

upvoted a collection 6 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Apr 28 • 220

upvoted a paper 7 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

upvoted a collection 7 months ago

AI Paper of the Day

A collection of papers that I think are interesting, one added each day • 405 items • Updated about 18 hours ago • 52