David Leon's picture

1 2

David Leon

DavidLeon

https://www.linkedin.com/in/daweileng/

daweileng

AI & ML interests

AIGC & LMM

Recent Activity

commented on a paper 3 days ago

FG-CLIP: Fine-Grained Visual and Textual Alignment

upvoted a paper 3 days ago

FG-CLIP: Fine-Grained Visual and Textual Alignment

authored a paper about 2 months ago

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

View all activity

Organizations

None yet

DavidLeon's activity

commented a paper 3 days ago

FG-CLIP: Fine-Grained Visual and Textual Alignment

Paper • 2505.05071 • Published 4 days ago • 16 •

upvoted a paper 3 days ago

FG-CLIP: Fine-Grained Visual and Textual Alignment

Paper • 2505.05071 • Published 4 days ago • 16

authored a paper about 2 months ago

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Paper • 2502.14377 • Published Feb 20 • 12

authored 3 papers 7 months ago

Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities

Paper • 2309.00952 • Published Sep 2, 2023

FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

Paper • 2408.08189 • Published Aug 15, 2024 • 17

Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task

Paper • 2409.04005 • Published Sep 6, 2024 • 19

upvoted a paper 8 months ago

Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task

Paper • 2409.04005 • Published Sep 6, 2024 • 19

replied to HugoLaurencon's post about 1 year ago

Would you share the total training cost info? as traing of IDEFICS2-8B used "approximately 1.5 billion images and 225 billion text tokens" which is quite huge for a 8B sized LMM model