1 4 1

Xichen Pan

xcpan

AI & ML interests

None yet

Recent Activity

authored a paper 12 days ago

Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

authored a paper 13 days ago

PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop

authored a paper 13 days ago

Transfer between Modalities with MetaQueries

View all activity

Organizations

xcpan's activity

authored a paper 12 days ago

Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

Paper • 2505.10046 • Published 13 days ago • 9

authored 3 papers 13 days ago

PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop

Paper • 2503.09595 • Published Mar 12

Transfer between Modalities with MetaQueries

Paper • 2504.06256 • Published Apr 8

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published 14 days ago • 85

updated a model 25 days ago

umd-vt-nyu/diffclipllama1b

Updated 25 days ago • 3

published a model 25 days ago

umd-vt-nyu/diffclipllama1b

Updated 25 days ago • 3

updated a model about 1 month ago

pllm-jt/pllm_ckpt

Updated Apr 26

published a model about 1 month ago

umd-vt-nyu/diffclip1e4_cos_llama1b_8

Updated Apr 25

updated a model about 1 month ago

umd-vt-nyu/ar1e4_cos_llama1b_bsz64_16

Updated Apr 23

published a model about 1 month ago

umd-vt-nyu/ar1e4_cos_llama1b_bsz64_16

Updated Apr 23

updated a dataset about 2 months ago

nyu-visionx/pyramid_flow_ft_results

Viewer • Updated Mar 30 • 8.42k • 9

published a dataset about 2 months ago

nyu-visionx/pyramid_flow_ft_results

Viewer • Updated Mar 30 • 8.42k • 9

updated a model about 2 months ago

nyu-visionx/pyramid_flow_ft_ckpt

Updated Mar 30

updated a model 2 months ago

umd-vt-nyu/flow_siglip2_512_sana_512_1e4_64token_2ndlast_sstk_16

Updated 7 days ago

published 2 models 2 months ago