2 72 20

sdtana

roxani_17

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Group Downsampling with Equivariant Anti-aliasing

upvoted a paper 12 days ago

You Do Not Fully Utilize Transformer's Representation Capacity

upvoted a paper 12 days ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

View all activity

Organizations

sdtana's activity

upvoted a paper 3 days ago

Group Downsampling with Equivariant Anti-aliasing

Paper • 2504.17258 • Published 9 days ago • 7

upvoted 2 papers 12 days ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published Feb 13 • 38

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25 • 57

upvoted a paper 13 days ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 161

upvoted 2 papers 14 days ago

Gaussian Mixture Flow Matching Models

Paper • 2504.05304 • Published 25 days ago • 12

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Paper • 2504.10483 • Published 18 days ago • 20

upvoted a paper 17 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 18 days ago • 251

upvoted a paper 19 days ago

PixelFlow: Pixel-Space Generative Models with Flow

Paper • 2504.07963 • Published 22 days ago • 19

upvoted an article 26 days ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

• 237

upvoted a paper about 1 month ago

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 49

upvoted a paper 2 months ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published Feb 27 • 28

upvoted 5 papers 3 months ago

upvoted 4 papers 4 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 91

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 84

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 54

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Paper • 2412.16112 • Published Dec 20, 2024 • 23