Zedong Wang

ZedongWangAI

https://jacky1128.github.io

AI & ML interests

Computer Vision, Multi-task Learning, Multi-modal Learning.

ZedongWangAI's activity

upvoted a paper about 11 hours ago

Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation

Paper • 2504.17207 • Published 3 days ago • 23

upvoted 2 papers about 23 hours ago

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Paper • 2504.17789 • Published 2 days ago • 11

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Paper • 2504.17432 • Published 3 days ago • 32

upvoted 2 papers 23 days ago

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published 25 days ago • 64

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

Paper • 2504.02542 • Published 24 days ago • 42

upvoted 2 papers 24 days ago

Scaling Language-Free Visual Representation Learning

Paper • 2504.01017 • Published 25 days ago • 29

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published 25 days ago • 83

upvoted a paper 2 months ago

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Paper • 2502.10458 • Published Feb 12 • 35

upvoted 2 papers 7 months ago

OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning

Paper • 2209.04851 • Published Sep 11, 2022 • 2

Switch EMA: A Free Lunch for Better Flatness and Sharpness

Paper • 2402.09240 • Published Feb 14, 2024 • 3

upvoted a collection 7 months ago

Representation Learning & Generation

Collection

8 items • Updated 24 days ago • 1

upvoted 2 papers 7 months ago

SemiReward: A General Reward Model for Semi-supervised Learning

Paper • 2310.03013 • Published Oct 4, 2023 • 2

Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning

Paper • 2410.06373 • Published Oct 8, 2024 • 34

upvoted a paper over 1 year ago

Efficient Multi-order Gated Aggregation Network

Paper • 2211.03295 • Published Nov 7, 2022 • 3